521 projects for "text database" with 1 filter applied:

  • Run your private office with the ONLYOFFICE Icon
    Run your private office with the ONLYOFFICE

    Secure office and productivity apps

    A Comprehensive Alternative to Office 365 for Business
    Learn More
  • File Synchronization, File Replication and File Archiving software solutions. Icon
    File Synchronization, File Replication and File Archiving software solutions.

    SIMPLIFY CRITICAL FILE TRANSFERS

    SureSync is a file replication and synchronization application that provides one-way and multi-way processing in both scheduled and real-time modes.
    Learn More
  • 1
    OOO Extras

    OOO Extras

    Extra tools for OpenOffice under weak copyleft or other licenses

    A space to store classic OOo dependencies that cannot be easily redistributed in Apache OpenOffice's Source tree. Initially this was meant for copyleft tarballs only but it is also pretty handy to mirror other file dependencies.
    Downloads: 27 This Week
    Last Update:
    See Project
  • 2
    yt-fts

    yt-fts

    Search all of YouTube from the command line

    yt-fts, short for YouTube Full Text Search, is an open-source command-line tool that enables users to search the spoken content of YouTube videos by indexing their subtitles. The program automatically downloads subtitles from a specified YouTube channel using the yt-dlp utility and stores them in a local SQLite database. Once indexed, users can perform full-text searches across all transcripts to quickly locate keywords or phrases mentioned within the videos. ...
    Downloads: 16 This Week
    Last Update:
    See Project
  • 3
    DB Browser for SQLite

    DB Browser for SQLite

    The DB Browser for SQLite

    DB Browser for SQLite (DB4S) is a high quality, visual, open source tool to create, design, and edit database files compatible with SQLite. DB4S is for users and developers who want to create, search, and edit databases. DB4S uses a familiar spreadsheet-like interface, and complicated SQL commands do not have to be learned. This program is not a visual shell for the sqlite command line tool, and does not require familiarity with SQL commands.
    Downloads: 123 This Week
    Last Update:
    See Project
  • 4
    Aix-DB

    Aix-DB

    Based on the LangChain/LangGraph framework

    Aix-DB is an open-source intelligent data analysis platform that combines large language models with database technologies to enable conversational data exploration. The system is designed as a ChatBI solution that allows users to query datasets using natural language and receive structured insights, charts, and visualizations automatically. Built on frameworks such as LangChain and LangGraph, Aix-DB integrates retrieval-augmented generation and Text-to-SQL capabilities to convert user questions into executable database queries. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • Waitlist App and Paging System Icon
    Waitlist App and Paging System

    We make it cheaper and easier to manage your waitlist, order backlog, and just about any other waiting scenario.

    Streamline your customer flow with our SMS-powered waitlist, reservations, and queue management app for restaurants, health care providers, and many other businesses.
    Free Trial
  • 5
    BrowserAI

    BrowserAI

    Run local LLMs like llama, deepseek, kokoro etc. inside your browser

    BrowserAI is a cutting-edge platform that allows users to run large language models (LLMs) directly in their web browser without the need for a server. It leverages WebGPU for accelerated performance and supports offline functionality, making it a highly efficient and privacy-conscious solution. The platform provides a developer-friendly SDK with pre-configured popular models, and it allows for seamless switching between MLC and Transformer engines. Additionally, it supports features such as...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 6
    TailwindSQL

    TailwindSQL

    TailwindSQL lets you write SQL queries using Tailwind-style class name

    TailwindSQL is an experimental library that lets developers write SQL queries using Tailwind-style class names, essentially turning database queries into declarative class attributes that can be used directly in frameworks like React Server Components. Inspired by the simplicity and utility of Tailwind CSS, TailwindSQL allows developers to describe database queries using a familiar syntax such as db-table-column-where-field-value, collapsing traditional SQL into readable, utility-driven structures that map directly to data fetches. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Korvus

    Korvus

    Korvus is a search SDK that unifies the entire RAG pipeline

    Korvus is an open-source retrieval-augmented generation (RAG) pipeline designed to run entirely inside PostgreSQL, allowing developers to build AI search and knowledge systems directly within a database environment. The project consolidates the typical steps of a RAG pipeline—including embedding generation, document retrieval, reranking, and text generation—into a single query executed within the Postgres ecosystem. By leveraging PostgresML and vector extensions such as pgvector, Korvus eliminates the need for external microservices typically used for AI search architectures, reducing both system complexity and latency. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    MyScaleDB

    MyScaleDB

    A @ClickHouse fork that supports high-performance vector search

    MyScaleDB is an open-source SQL vector database designed for building large-scale AI and machine learning applications that require both analytical queries and semantic vector search. The system is built on top of the ClickHouse database engine and extends it with specialized indexing and search capabilities optimized for vector embeddings. This design allows developers to store structured data, unstructured text, and high-dimensional vector embeddings within a single database platform. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    AtomicServer

    AtomicServer

    An open source headless CMS / real-time database

    Create, share, fetch, and model Atomic Data! AtomicServer is a lightweight, yet powerful CMS / Graph Database. Atomic Data is a modular specification for sharing, modifying, and modeling graph data. It combines the ease of use of JSON, the connectivity of RDF (linked data), and the reliability of type-safety. Atomic Data uses links to connect pieces of data, and therefore makes it easier to connect datasets to each other - even when these datasets exist on separate machines.
    Downloads: 7 This Week
    Last Update:
    See Project
  • Powerful Business Process Automation Icon
    Powerful Business Process Automation

    With ThinkAutomation, you get an open-ended studio to build any and every automated workflow you could ever need.

    When a message is received ThinkAutomation automatically executes one or more Automations. Automations are created using an easy to use drag-and-drop interface to run simple or complex tasks. Automations can perform many business process Actions, including: updating company databases, CRM systems and cloud services, sending outgoing emails, Teams & SMS messages, document processing, custom scripting, integration and much more. Over 100 built-in actions are included, plus ThinkAutomation is extensible with Custom Actions.  
    Learn More
  • 10
    PageIndex

    PageIndex

    Document Index for Vectorless, Reasoning-based RAG

    PageIndex is an innovative open-source framework that reimagines retrieval-augmented generation (RAG) by eliminating conventional vector similarity search and instead building hierarchical semantic indexes that mirror a document’s natural structure. Rather than chunking text and embedding it into a vector database, PageIndex constructs a tree-structured index — similar to a detailed, AI-enhanced table of contents — that a large language model can traverse to locate the most relevant sections of long documents. This reasoning-driven retrieval aligns more naturally with how humans explore complex texts, improving relevance and traceability, especially in professional domains like financial reports, legal contracts, and technical manuals. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    news-please

    news-please

    Python tool for crawling and extracting structured data from news site

    ...Developers can use the software either as a standalone command line application or integrate it into their own Python applications through its library interface. Extracted article data can be stored in different formats and systems, including JSON files or database-backed storage solutions.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    Voyage

    Voyage

    Voyager - The Missing Laravel Admin

    Voyager is a Laravel Admin Package that includes BREAD(CRUD) operations, a media manager, menu builder, and much more. Voyager can save you so much time and it will make building applications even more fun. Voyager's admin interface allows you to create CRUD or BREAD (Browse, Read, Edit, Add, and Delete) functionality to your posts, pages, or any other table in your database. Voyager has a fully functional media manager which allows you to view/edit/delete files from your storage. All files...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    LOTUS

    LOTUS

    AI-Powered Data Processing: Use LOTUS to process all of your datasets

    ...It offers a Python interface with a Pandas-like API, making it familiar for data scientists and engineers already working with data analysis libraries. The core concept of the framework is the use of semantic operators, which extend traditional relational database operations to support reasoning over text and other unstructured data. These operators allow tasks such as semantic filtering, ranking, clustering, and summarization to be expressed directly within data processing pipelines. The LOTUS engine automatically optimizes how language models are used during execution, which can significantly improve performance and reduce computational cost.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 14
    Relaticle

    Relaticle

    The Next-Generation Open-Source CRM Platform written with Laravel

    Relaticle is a self-hosted knowledge graph and note linking platform designed to help users organize, connect, and explore ideas in a visual and intuitive way. It blends the simplicity of a note-taking system with the power of a graph database, allowing users to create nodes for concepts, link them through relationships, and navigate their thoughts like a map instead of a linear document. The interface lets you write plain text notes and tag or connect them dynamically, making it easier to uncover patterns and connections over time instead of losing insights in a long, unstructured list. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 15
    RuoYi AI

    RuoYi AI

    Enterprise AI platform for building, deploying, and managing apps

    RuoYi AI is a full-stack enterprise-oriented AI development platform designed to help developers rapidly build, deploy, and manage intelligent applications using modern large language models and AI ecosystems. It provides a unified framework for integrating multiple AI models from different providers, allowing teams to switch or combine models through a consistent interface without vendor lock-in. RuoYi AI includes built-in support for retrieval-augmented generation, enabling organizations...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 16
    Canopy

    Canopy

    Retrieval Augmented Generation (RAG) framework

    Canopy is an open-source retrieval-augmented generation (RAG) framework developed by Pinecone to simplify the process of building applications that combine large language models with external knowledge sources. The system provides a complete pipeline for transforming raw text data into searchable embeddings, storing them in a vector database, and retrieving relevant context for language model responses. It is designed to handle many of the complex components required for a RAG workflow, including document chunking, embedding generation, prompt construction, and chat history management. Developers can use Canopy to quickly build chat systems that answer questions using their own data instead of relying solely on the pretrained knowledge of the language model. ...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 17
    LangChain Extract

    LangChain Extract

    Did you say you like data?

    LangChain Extract is an open-source reference application designed to demonstrate how large language models can be used to extract structured data from unstructured text and document files. The project implements a lightweight web service that allows developers to define extraction schemas and apply them to various sources such as plain text, HTML, or PDF documents. Built using FastAPI and the LangChain framework, the application exposes a REST API that can process documents and return...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    OP Vault

    OP Vault

    Give ChatGPT long-term memory using the OP Stack

    OP Vault is an open-source system designed to give large language models long-term memory by enabling them to interact with a custom knowledge base built from user-provided documents. It combines a backend written in Go with a React frontend, allowing users to upload files such as PDFs, text documents, and books to create a searchable repository of information. The system uses vector databases like Pinecone alongside OpenAI models to index and retrieve relevant content, enabling precise...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    AgentBench

    AgentBench

    A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

    AgentBench is an open-source benchmark designed to evaluate the capabilities of large language models when used as autonomous agents. Unlike traditional language model benchmarks that focus on static text tasks, AgentBench measures how models perform in interactive environments that require planning, reasoning, and decision-making. The benchmark includes multiple environments that simulate realistic scenarios such as web interaction, database querying, and problem solving tasks. These environments require agents to interpret instructions, take actions, and adapt their strategies based on feedback from the environment. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Dash Data Agent

    Dash Data Agent

    Self-learning data agent that grounds its answers in layers of content

    Dash is a self-learning data agent built by the Agno AI community that generates grounded answers to English queries over structured data by synthesizing SQL and reasoning based on six layers of context, improving automatically with each run. It sidesteps common limitations of simple text-to-SQL agents by incorporating multiple context layers — including schema structure, human annotations, known query patterns, institutional knowledge from docs, machine-discovered error patterns, and live runtime context — to generate SQL queries that are both technically correct and semantically meaningful. The system then executes those queries against a database and interprets the results, returning human-friendly insights not just raw rows, while learning from errors and successes to reduce repeated mistakes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    xMarkup Text Transformation Utility
    xMarkup is a text transformation utility for batch-processing of a set of ANSI/UTF-8 text files. All Win-32 and POSIX/UNIX platforms.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Apache OpenOffice

    Apache OpenOffice

    The free and Open Source productivity suite

    Free alternative for Office productivity tools: Apache OpenOffice - formerly known as OpenOffice.org - is an open-source office productivity software suite containing word processor, spreadsheet, presentation, graphics, formula editor, and database management applications. OpenOffice is available in many languages, works on all common computers, stores data in ODF - the international open standard format - and is able to read and write files in other formats, included the format used by the...
    Leader badge
    Downloads: 227,016 This Week
    Last Update:
    See Project
  • 23
    Microsoft Works format import library
    libwps is a Microsoft Works file format import filter based on top of the librevenge (see https://sourceforge.net/p/libwpd/wiki/librevenge/ ). Currently, libwps can import all word processing Works formats since about 1995 with some success. It may also be able to import some basic database and spreadsheet files.
    Leader badge
    Downloads: 380 This Week
    Last Update:
    See Project
  • 24
    RefDB is a reference database and bibliography tool for SGML, XML, and LaTeX documents, sort of a Reference Manager or BibTeX for markup languages. It is portable and known to run on Linux, Free/NetBSD, OSX, Solaris, and Windows/Cygwin.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 25

    RecordEditor

    Editor for Fixed Width, Csv and Existing Xml files.

    The RecordEditor is a Data File editor for Flat Files (delimited and fixed field position). It supports Unix / PC / Legacy (e.g. Mainframe) file formats, both Text and binary files. The Editor uses a Record-Layout description to format the files. This is ideal for Fixed width (Text or Binary) files, Cobol Data Files, Mainframe files and complicated Csv files. Cobol Copybooks can be used to format Cobol Data files. As well as an editor, The following utilities are supplied * Formatted...
    Leader badge
    Downloads: 63 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB