Instagram OSINT tool for gathering profile data and public posts
A high-quality tool for convert PDF to Markdown and JSON
ContextGem: Effortless LLM extraction from documents
Fast and efficient unstructured data extraction
A distributed job server
A machine learning software for extracting information
A tool to simulate Amazon EC2 instance metadata
lightweight Go package to parse, analyze and extract metadata
Python & command-line tool to gather text on the Web
Download pictures (or videos) along with their captions
A versatile toolkit for PDF manipulation
A library for interacting with the nhentai API
Assist in organizing your piles of documents
CLI tool to extract (meta)data from PDF and manipulate PDF files
Open source OSINT tool for gathering data on emails, phones, and IPs
Cross platform GUI tool for downloading videos from Bilibili sites
Copybara: A tool for transforming and moving code between repositories
ExtractThinker is a Document Intelligence library for LLMs
A self-hostable bookmark-everything app
Movie metadata scraper and organizer for media libraries and NFO
Document content and metadata extraction microservice
Coomer downloader
This is a public repository containing scrapers
Magical shell history
A utility for downloading TV and radio programmes from BBC iPlayer