Top Python Frameworks & Libraries for files 53

🔒 Consolidating and extending hosts files from several well-curated sources. Optionally pick extensions for porn, social media, and other categories....

27528
2276
Python

AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agent...

19358
2062
Python

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted...

17308
4330
Python

GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI....

15384
2287
Python

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

18330
1234
Python

A formatter for Python files

13847
895
Python

:memo: A text file containing 479k English words for all your dictionary/word-based projects e.g: auto-completion / autosuggestion...

10986
1879
Python

q - Run SQL directly on delimited files and multi-file sqlite databases

10240
424
Python

Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development...

10798
1146
Python

A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files...

8748
1442
Python

This repository holds the device support files for the iOS, and I will update it regularly.

8151
1163
Python

CLI tool and python library that converts the output of popular command-line tools, file-types, and common strings to JSON, YAML, or Dictionaries. This allows pipi...

8055
214
Python

Reads key-value pairs from a .env file and can set them as environment variables. It helps in developing applications following the 12-factor principles....

7904
443
Python

Keras code and weights files for popular deep learning models.

7320
2453
Python

Very efficient backup system based on the git packfile format, providing fast incremental saves and global deduplication (among and within files, including virtual...

7191
422
Python

PDF GPT allows you to chat with the contents of your PDF file by using GPT capabilities. The most effective open source solution to turn your pdf files in a chatbo...

7054
849
Python

🐸 Identify anything. pyWhat easily lets you identify emails, IP addresses, and more. Feed it a .pcap file or some text and it'll tell you what it is! 🧙‍♀️...

6785
368
Python

Preview GitHub README.md files locally before committing them.

6595
431
Python

pipreqs - Generate pip requirements.txt file based on imports of any project. Looking for maintainers to move this project forward....

7038
405
Python

:open_file_folder: :rabbit2: :tophat: See what a program does before deciding whether you really want it to happen (NO LONGER MAINTAINED)...

6343
163
Python

Securely and anonymously share files, host websites, and chat with friends using the Tor network

6422
664
Python

A suite of utilities for converting to and working with CSV, the king of tabular file formats.

6092
607
Python

视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files....

6659
716
Python

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)...

6361
706
Python

Find duplicate files

5747
425
Python

PathPicker accepts a wide range of input -- output from git commands, grep results, searches -- pretty much anything. After parsing the input, PathPicker presents...

5149
279
Python

A User-Focused Photo & File Management System

5635
385
Python

Parse Redis dump.rdb files, Analyze Memory, and Export Data to JSON

5126
743
Python

Simple Python style checker in one Python file

5069
750
Python

Scanning APK file for URIs, endpoints & secrets.

5122
501
Python

The FLARE team's open-source tool to identify capabilities in executable files.

4999
567
Python

16-bit CPU for Excel, and related files

4509
386
Python

Nginx UI allows you to access and modify the nginx configurations files without cli.

4444
276
Python

📀 Unlimited Google Drive Storage by splitting binary files into base64

4355
280
Python

Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM...

37352
3625
Python

A library to manipulate font files from Python.

4488
465
Python

Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG,...

5307
351
Python

文件快递柜-匿名口令分享文本,文件,像拿快递一样取文件(FileCodeBox - File Express Cabinet - Anonymous Passcode Sharing Text, Files, Like Taking Express Delivery for Files)...

5455
645
Python

Python tool for converting files and office documents to Markdown.

38446
1773
HTML

File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs....

5624
279
Python

[NO LONGER MAINTAINED] Command-line utility for auto-generating subtitles for any video file

4160
1648
Python

Beancount: Double-Entry Accounting from Text Files.

3921
319
Python

MusicBrainz Picard audio file tagger

3930
396
Python

A python script that finds endpoints in JavaScript files

3780
608
Python

Official Repository: Telegram bot which can download direct links, torrents, nzb, google drive, telegram document, any file/folder from rclone supported clouds, al...

3525
4816
Python

Parse files for optimal RAG

3483
336
Python

borb is a library for reading, creating and manipulating PDF files in python.

3430
146
Python

An advanced web directory & file scanning tool that will be more powerful than DirBuster, Dirsearch, cansina, and Yu Jian.一个高级web目录、文件扫描工具,功能将会强于DirBuster、Dirsearc...

3193
550
Python

File upload vulnerability scanner and exploitation tool.

3150
512
Python

oletools - python tools to analyze MS OLE2 files (Structured Storage, Compound File Binary Format) and MS Office documents, for malware analysis, forensics and deb...

2991
568
Python

File support for asyncio

2925
154
Python

Compresses linked and inline javascript or CSS into a single cached file.

2827
601
Python

JSFinder is a tool for quickly extracting URLs and subdomains from JS files on a website.

2683
407
Python

Radically simplified static file serving for Python web apps

2571
152
Python

Pack up to 3MB of data into a tweetable PNG polyglot file.

2553
158
Python

Version controlled file system

2507
154
Python

A semantic diff utility and library for tree-like files such as JSON, JSON5, XML, HTML, YAML, and CSV....

2385
47
Python

A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents....

2227
372
Python

Download media files from a telegram conversation/chat/channel up to 2GiB per file

2203
370
Python