Top Python Frameworks & Libraries for text processing

中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向...

Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more....

Natural Language Processing Best Practices & Examples

Python library for processing Chinese text

Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/...

Multilingual text (NLP) processing toolkit

Documents, papers and codes related to Natural Language Processing, including Topic Model, Word Embedding, Named Entity Recognition, Text Classificatin, Text Gene...

Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CASL project: http://casl-pro...

TensorFlow implementation of Neural Variational Inference for Text Processing

A PyTorch-based knowledge distillation toolkit for natural language processing

:snake: Syntax, working with Shell commands, Files, Text Processing, and more...

A Sublime Text package for the programming language Processing

Simple SQL-like syntax on top of Perl text processing.

Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, wo...

Constants used in Chinese text processing

LeakScraper is an efficient set of tools to process and visualize huge text files containing credentials. Theses tools are designed to help penetration testers and...

Chinese text normalization for speech processing

PyTorch deep learning models for text processing

Twitter text processing library (auto linking and extraction of usernames, lists and hashtags).

Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) &...

Text Mining and Topic Modeling Toolkit for Python with parallel processing power

Text pre-processing library for deep learning (Keras, tensorflow).

🌿 An easy-to-use Japanese Text Processing tool, which makes it possible to switch tokenizers with small changes of code....

Twitter text processing library (auto linking and extraction of usernames, lists and hashtags). Based on the Java implementation by Matt Sanford...

A tiny library for Python text normalisation. Useful for ad-hoc text processing.

The module extracts text from image using the tesseract-OCR engine. Generally, text present in the images are blur or are of uneven sizes. The image is pre-process...

Stringlifier is on Opensource ML Library for detecting random strings in raw text. It can be used in sanitising logs, detecting accidentally exposed credentials an...

Platform for few-shot natural language processing: Text Classification, Sequene Labeling.

Python library to extract text from PDF, and default to OCR when text extraction fails.

Pre-process arabic text (remove diacritics, punctuations and repeating characters)

Detect handwritten words in a text-line (classic image processing method).