Python

Top Python Frameworks & Libraries for text processing

中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向...

75159
14938
Python

Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more....

9405
1168
Python

Natural Language Processing Best Practices & Examples

6424
915
Python

Python library for processing Chinese text

6569
1371
Python

Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/...

2388
371
Python

Multilingual text (NLP) processing toolkit

2351
339
Python

Documents, papers and codes related to Natural Language Processing, including Topic Model, Word Embedding, Named Entity Recognition, Text Classificatin, Text Gene...

1412
365
Python

A PyTorch-based knowledge distillation toolkit for natural language processing

1667
247
Python

Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CASL project: http://casl-pro...

746
113
Python

Unified Multilingual Robustness Evaluation Toolkit for Natural Language Processing

647
95
Python

TensorFlow implementation of Neural Variational Inference for Text Processing

537
76
Python

Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, wo...

670
94
Python

:snake: Syntax, working with Shell commands, Files, Text Processing, and more...

575
191
Python

A Sublime Text package for the programming language Processing

447
59
Python

Simple SQL-like syntax on top of Perl text processing.

411
14
Python

Chinese text normalization for speech processing

694
148
Python

Constants used in Chinese text processing

373
46
Python

LeakScraper is an efficient set of tools to process and visualize huge text files containing credentials. Theses tools are designed to help penetration testers and...

425
82
Python

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-proc...

11755
1189
Python

Models, data loaders and abstractions for language processing, powered by PyTorch

3550
814
Python

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

1387
125
Python

This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR model....

896
39
Python

An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing and batching to deliver high...

852
63
Python

Library for Textless Spoken Language Processing

548
54
Python

Text Normalization & Inverse Text Normalization

619
88
Python

AI-powered assistant to help you with your daily tasks, powered by Llama 3.2. It can recognize your voice, process natural language, and perform various actions ba...

375
23
Python

AI-powered assistant to help you with your daily tasks, powered by Llama 3.2. It can recognize your voice, process natural language, and perform various actions ba...

484
41
Python

Gradio WebUI for audio processing, powered by Whisper (OpenAI-Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer(RVC), zero-shot Voice Cloning (...

3352
251
Python

Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has state of the art retrieval p...

1186
93
Python

✨ AsrTools: Smart Voice-to-Text Tool | Efficient Batch Processing | User-Friendly Interface | No GPU Required | Supports SRT/TXT Output | Turn your audio into accu...

2458
235
Python

Easily deployable and scalable backend server that efficiently converts various document formats (pdf, docx, pptx, html, images, etc) into Markdown. With support f...

640
64
Python

An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing and batching to deliver high...

864
63
Python

NeMo text processing for ASR and TTS

349
119
Python

Self-host the powerful Chatterbox TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), predefined voices, voice...

408
103
Python