Simple tools for data cleaning in R
Multifunctional app to find duplicates, empty folders, similar images
An AI-powered data science team of agents
Analytics for developers; set up analytics in 30 seconds
The open source mesh processing system
FDUPES is a program for identifying or deleting duplicate files
Import public NYC taxi and for-hire vehicle (Uber, Lyft)
A Windows batch script that cleans temporary files from your PC
CSV Lint plug-in for Notepad++ for syntax highlighting
Clean Jupyter notebooks of outputs, metadata, and empty cells
This is a multi-use bash script for Linux systems
ExtractThinker is a Document Intelligence library for LLMs
Image polygonal annotation with Python
Basic-to-intermediate Python data science guide
Data and tools for generating and inspecting OLMo pre-training data
The basic implementation for chunk upload with multiple providers
Links to everything you'd ever want to learn about data engineering
Converts books written in Markdown to HTML, LaTeX/PDF and EPUB
Scan and remove junk files, caches, logs, and more
A robust JavaScript library for capturing keyboard input
A natural language interface for computers
Java dataframe and visualization library
Big Model Application Development Practice 1
Master the essential skills needed to recognize and solve problems
Haskell code prettifier