Instagram OSINT tool for gathering profile data and public posts
A high-quality tool for convert PDF to Markdown and JSON
PDF Parser for AI-ready data. Automate PDF accessibility
ContextGem: Effortless LLM extraction from documents
A machine learning software for extracting information
A distributed job server
Fast and efficient unstructured data extraction
lightweight Go package to parse, analyze and extract metadata
A tool to simulate Amazon EC2 instance metadata
Python & command-line tool to gather text on the Web
Tool to help you collect, organize, annotate, cite, and share research
Download pictures (or videos) along with their captions
A versatile toolkit for PDF manipulation
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
A library for interacting with the nhentai API
Cross platform GUI tool for downloading videos from Bilibili sites
Assist in organizing your piles of documents
Open source OSINT tool for gathering data on emails, phones, and IPs
CLI tool to extract (meta)data from PDF and manipulate PDF files
ExtractThinker is a Document Intelligence library for LLMs
A self-hostable bookmark-everything app
Movie metadata scraper and organizer for media libraries and NFO
Document content and metadata extraction microservice
Copybara: A tool for transforming and moving code between repositories
Coomer downloader