Stanford NLP Python library for many human languages
Stanford CoreNLP, a Java suite of core NLP tools
Document (PDF, Word, PPTX ...) extraction and parse API
Syntax tree editor for rapid annotation of existing text
Modest natural-language processing
The Classical Language Toolkit
A simple tool for reading in poorly redacted documents
Python tool for converting files and office documents to Markdown
PostgreSQL extension for BM25 relevance-ranked full-text search
Qwen3-TTS is an open-source series of TTS models
Han Language Processing
LLM-based reinforcement learning audio editing model
NLTK Source
Industrial-level controllable zero-shot text-to-speech system
Discourse Network Analyzer (DNA)
The most accurate natural language detection library for Rust
Text mining using tidy tools
The most accurate natural language detection library for Python
GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image Generation
Open source annotation tool for machine learning practitioners
General natural language facilities for Node.js
Open source semantic search and text analytics for large document sets
A Pioneering Open-Source Alternative to GPT-4o
Open-source Python 3 tool for recognizing layouts, tables, and math
Metaprogramming library to analyze and transform Java source code