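Chinese and English sensitive-word lists, language detection, Chinese and international mobile/landline number location and carrier lookup, gender inference from names, mobile number extraction, ID card number extraction, email extraction, Chinese and Japanese name databases, Chinese abbreviation dictionary, character-decomposition dictionary, word sentiment scores, stop words, reactionary word list, violence and terrorism word list, traditional/simplified Chinese conversion, English imitation of Chinese pronunciation, Wang Feng lyrics generator, occupation name lexicon, synonym lexicon, antonym lexicon, negation word lexicon, car brand lexicon, car parts lexicon, continuous English text segmentation, various Chinese word...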
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, an...
macOS development environment setup: Easy-to-understand instructions with automated setup scripts for developer tools like Vim, Sublime Text, Bash, iTerm, Python...
Common data analysis and machine learning tasks using Python
A library of extension and helper modules for Python's data analysis and machine learning libraries.
Kats, a kit to analyze time series data, a lightweight, easy-to-use, generalizable, and extendable framework to perform time series analysis, from understanding th...
StreamAlert is a serverless, realtime data analysis framework which empowers you to ingest, analyze, and alert on data from any environment, using datasources and...
DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data....
Data scraper for Facebook Pages, and also code accompanying the blog post How to Scrape Data From Facebook Page Posts for Statistical Analysis...
PyPika is a Python SQL query builder that exposes the full richness of the SQL language using a syntax that reflects the resulting query. PyPika excels at all sort...
[NOT MAINTAINED] Light-weight Python OLAP framework for multi-dimensional data analysis
A python module for scientific analysis of 3D data based on VTK and Numpy
A Python data analysis library that is optimized for humans instead of machines.
A Python tool that automatically cleans data sets and readies them for analysis.
This is a database of 300,000+ symbols containing Equities, ETFs, Funds, Indices, Currencies, Cryptocurrencies and Money Markets....
Fully-fledged Fundamental Analysis package capable of collecting 20 years of Company Profiles, Financial Statements, Ratios and Stock Data of 20,000+ companies....
🕷️ A `.git` folder exploiting tool that is able to restore the entire Git repository, including stash, common branches and common tags....
Dealing with all unstructured data, such as reverse image search, audio search, molecular search, video analysis, question and answer systems, NLP, etc....
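A standalone, lightweight web server for building and sharing graphs created in IPython. Built for data science and data analysis practitioners. Aiming at building an interact...
(Finished) Detailed notes for the Geek Time course "Data Analysis in Practice: 45 Lectures", including markdown, images, mind maps, code, and data. The code can be read and tested directly!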
Data ETL & Analysis on the dataset 'Baby Names from Social Security Card Applications - National Data'....
Deep probabilistic analysis of single-cell and spatial omics data
Chat with your database (SQL, CSV, pandas, polars, MongoDB, NoSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI)...
One-line data quality profiling and exploratory data analysis for Pandas and Spark DataFrames.
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents....
LLM based data scientist, AI native data application. AI-driven infinite thinking redefines BI.
A unified framework for privacy-preserving data analysis and machine learning
:crown: Multivariate exploratory data analysis in Python — PCA, CA, MCA, MFA, FAMD, GPA
A Python toolkit/library for reality-centric machine/deep learning and data mining on partially-observed time series, including SOTA neural network models for scie...
Python library for analysis of time series data including dimensionality reduction, clustering, and Markov model estimation...
(Finished) Geek Time "Data Analysis in Practice: 45 Lectures" detailed notes, containing markdown, images, mind maps, code, and data; the code can be read and tested directly...