Top Python Frameworks & Libraries for files 53

🔒 Consolidating and extending hosts files from several well-curated sources. Optionally pick extensions for porn, social media, and other categories....

27973
2308
Python

AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agent...

20427
2138
Python

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted...

17969
4474
Python

GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI....

15426
2286
Python

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

28090
1894
Python

A formatter for Python files

13878
898
Python

:memo: A text file containing 479k English words for all your dictionary/word-based projects e.g: auto-completion / autosuggestion...

11154
1899
Python

q - Run SQL directly on delimited files and multi-file sqlite databases

10268
424
Python

Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development...

10984
1163
Python

A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files...

8972
1460
Python

This repository holds the device support files for the iOS, and I will update it regularly.

8178
1167
Python

CLI tool and python library that converts the output of popular command-line tools, file-types, and common strings to JSON, YAML, or Dictionaries. This allows pipi...

8135
218
Python

Reads key-value pairs from a .env file and can set them as environment variables. It helps in developing applications following the 12-factor principles....

8059
452
Python

Keras code and weights files for popular deep learning models.

7329
2450
Python

Very efficient backup system based on the git packfile format, providing fast incremental saves and global deduplication (among and within files, including virtual...

7224
422
Python

PDF GPT allows you to chat with the contents of your PDF file by using GPT capabilities. The most effective open source solution to turn your pdf files in a chatbo...

7114
850
Python

🐸 Identify anything. pyWhat easily lets you identify emails, IP addresses, and more. Feed it a .pcap file or some text and it'll tell you what it is! 🧙‍♀️...

6865
371
Python

Preview GitHub README.md files locally before committing them.

6634
431
Python

pipreqs - Generate pip requirements.txt file based on imports of any project. Looking for maintainers to move this project forward....

7166
408
Python

:open_file_folder: :rabbit2: :tophat: See what a program does before deciding whether you really want it to happen (NO LONGER MAINTAINED)...

6349
163
Python

Securely and anonymously share files, host websites, and chat with friends using the Tor network

6503
665
Python

A suite of utilities for converting to and working with CSV, the king of tabular file formats.

6163
608
Python

视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files....

7059
750
Python

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)...

6900
744
Python

Find duplicate files

5961
431
Python

PathPicker accepts a wide range of input -- output from git commands, grep results, searches -- pretty much anything. After parsing the input, PathPicker presents...

5161
278
Python

A User-Focused Photo & File Management System

5808
391
Python

Parse Redis dump.rdb files, Analyze Memory, and Export Data to JSON

5126
743
Python

Simple Python style checker in one Python file

5069
750
Python

Scanning APK file for URIs, endpoints & secrets.

5122
501
Python

The FLARE team's open-source tool to identify capabilities in executable files.

5170
582
Python

16-bit CPU for Excel, and related files

4509
386
Python

Nginx UI allows you to access and modify the nginx configurations files without cli.

4447
274
Python

📀 Unlimited Google Drive Storage by splitting binary files into base64

4355
280
Python

Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM...

37743
3636
Python

A library to manipulate font files from Python.

4488
465
Python

Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG,...

5557
377
Python

文件快递柜-匿名口令分享文本,文件,像拿快递一样取文件(FileCodeBox - File Express Cabinet - Anonymous Passcode Sharing Text, Files, Like Taking Express Delivery for Files)...

6466
743
Python

Python tool for converting files and office documents to Markdown.

53633
2667
Python

File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs....

5994
298
Python

[NO LONGER MAINTAINED] Command-line utility for auto-generating subtitles for any video file

4160
1648
Python

Beancount: Double-Entry Accounting from Text Files.

3921
319
Python

Picard is a cross-platform music tagger powered by the MusicBrainz database

4037
399
Python

A python script that finds endpoints in JavaScript files

3780
608
Python

Official Repository: Telegram bot which can download direct links, torrents, nzb, google drive, telegram document, any file/folder from rclone supported clouds, al...

3525
4816
Python

Parse files for optimal RAG

3483
336
Python

borb is a library for reading, creating and manipulating PDF files in python.

3430
146
Python

An advanced web directory & file scanning tool that will be more powerful than DirBuster, Dirsearch, cansina, and Yu Jian.一个高级web目录、文件扫描工具,功能将会强于DirBuster、Dirsearc...

3193
550
Python

File upload vulnerability scanner and exploitation tool.

3150
512
Python

oletools - python tools to analyze MS OLE2 files (Structured Storage, Compound File Binary Format) and MS Office documents, for malware analysis, forensics and deb...

3043
571
Python

File support for asyncio

2925
154
Python

Compresses linked and inline javascript or CSS into a single cached file.

2842
603
Python

JSFinder is a tool for quickly extracting URLs and subdomains from JS files on a website.

2683
407
Python

Radically simplified static file serving for Python web apps

2571
152
Python

Pack up to 3MB of data into a tweetable PNG polyglot file.

2553
158
Python

Version controlled file system

2507
154
Python

A semantic diff utility and library for tree-like files such as JSON, JSON5, XML, HTML, YAML, and CSV....

2385
47
Python

A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents....

2227
372
Python

Download media files from a telegram conversation/chat/channel up to 2GiB per file

2203
370
Python

Detect file content types with deep learning

8585
442
Python