🔒 Consolidating and extending hosts files from several well-curated sources. Optionally pick extensions for porn, social media, and other categories....
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agent...
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted...
GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI....
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
📝 A text file containing 479k English words for all your dictionary/word-based projects, e.g. auto-completion / autosuggestion...
Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development...
A pure-Python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files...
This repository holds the device support files for iOS, and I will update it regularly.
CLI tool and Python library that converts the output of popular command-line tools, file-types, and common strings to JSON, YAML, or Dictionaries. This allows pipi...
Reads key-value pairs from a .env file and can set them as environment variables. It helps in developing applications following the 12-factor principles....
Keras code and weights files for popular deep learning models.
Very efficient backup system based on the git packfile format, providing fast incremental saves and global deduplication (among and within files, including virtual...
PDF GPT allows you to chat with the contents of your PDF file by using GPT capabilities. The most effective open source solution to turn your PDF files into a chatbo...
🐸 Identify anything. pyWhat easily lets you identify emails, IP addresses, and more. Feed it a .pcap file or some text and it'll tell you what it is! 🧙‍♀️...
pipreqs - Generate pip requirements.txt file based on imports of any project. Looking for maintainers to move this project forward....
📂 🐇 🎩 See what a program does before deciding whether you really want it to happen (NO LONGER MAINTAINED)...
Securely and anonymously share files, host websites, and chat with friends using the Tor network
A suite of utilities for converting to and working with CSV, the king of tabular file formats.
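A GUI tool for extracting hard-coded subtitles (hardsubs) from videos and generating srt files. No third-party API is required; text recognition runs locally. A deep-learning-based video subtitle extraction framework covering subtitle region detection and subtitle content extraction....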
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)...
PathPicker accepts a wide range of input -- output from git commands, grep results, searches -- pretty much anything. After parsing the input, PathPicker presents...
Parse Redis dump.rdb files, Analyze Memory, and Export Data to JSON
The FLARE team's open-source tool to identify capabilities in executable files.
Nginx UI lets you access and modify Nginx configuration files without the CLI.
📀 Unlimited Google Drive Storage by splitting binary files into base64
Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM...
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG,...
FileCodeBox (File Express Cabinet): share text and files anonymously with a passcode, and pick up files the way you would collect a parcel...
Python tool for converting files and office documents to Markdown.
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs....
[NO LONGER MAINTAINED] Command-line utility for auto-generating subtitles for any video file
Official Repository: Telegram bot which can download direct links, torrents, nzb, google drive, telegram document, any file/folder from rclone supported clouds, al...
borb is a library for reading, creating and manipulating PDF files in Python.
An advanced web directory & file scanning tool that will be more powerful than DirBuster, Dirsearch, cansina, and Yu Jian...
oletools - Python tools to analyze MS OLE2 files (Structured Storage, Compound File Binary Format) and MS Office documents, for malware analysis, forensics and deb...
Compresses linked and inline JavaScript or CSS into a single cached file.
JSFinder is a tool for quickly extracting URLs and subdomains from JS files on a website.
Pack up to 3MB of data into a tweetable PNG polyglot file.
A semantic diff utility and library for tree-like files such as JSON, JSON5, XML, HTML, YAML, and CSV....
A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents....
Download media files from a Telegram conversation/chat/channel, up to 2 GiB per file