2019 projects for "data file" with 1 filter applied:

  • 1
    Form-Data

    A module to create readable `"multipart/form-data"` streams

    A library to create readable "multipart/form-data" streams. Can be used to submit forms and file uploads to other web applications.
    Downloads: 9 This Week
  • 2
    Syncthing

    Open Source Continuous File Synchronization

    Syncthing is an open source continuous file synchronization program that synchronizes files between two or more computers in real time. It is designed first and foremost to keep users’ data safe, protected against both data loss and attackers who would unlawfully try to access it. It stores data nowhere except on your computers and uses encryption to secure it.
    Downloads: 87 This Week
  • 3
    Kedro

    A Python framework for creating reproducible, maintainable code

    Kedro is an open source Python framework for creating maintainable and modular data science code. It provides the scaffolding to build more complex data and machine-learning pipelines. It also focuses on reducing the time spent on the tedious "plumbing" required to maintain data science code, leaving more time to solve new problems. It standardises team workflows; the modular structure of Kedro facilitates a higher level of collaboration when teams solve problems...
    Downloads: 9 This Week
  • 4
    Cloud Storage FUSE

    A user-space file system for interacting with Google Cloud Storage

    Cloud Storage FUSE is an open-source user-space file system adapter that allows Google Cloud Storage buckets to be mounted and accessed as if they were local file systems on a machine. This approach enables applications to interact with cloud storage using standard file system semantics, eliminating the need to rewrite code to use object storage APIs directly. The tool is particularly valuable in data-intensive workflows such as machine learning, where large datasets can be accessed on demand without requiring full local downloads. ...
    Downloads: 11 This Week
  • 5
    Data Crow

    The ultimate cataloguer

    Data Crow comes with standard cataloguing modules for movies & video (DivX, Xvid, DVD, Blu-ray, etc.), books (and eBooks), images, board games, comic books, games & software, and music (MP3 and other music files). Besides these modules, which you can change to fit your requirements, you can create new modules (want to catalogue your stamps, equipment, or anything else?). The GUI is skinnable. Reporting (using JasperReports and their community edition JasperSoft Developer Studio), loan...
    Downloads: 273 This Week
  • 6
    Curl

    Command line tool and library for transferring data with URLs

    Curl is a command line tool and library for transferring data specified with URL syntax. It supports protocols including HTTP, HTTPS, FTP, FTPS, GOPHER, TFTP, SCP, SFTP, SMB, TELNET, and DICT, along with SSL certificates, cookies, user+password authentication, and much more. Curl is used for many different things. It's used in command lines or scripts for transferring data. It's also used in just about every device you can think of: mobile phones and tablets, television sets, printers, routers, media players and other...
    Downloads: 52 This Week
  • 7
    yq

    Portable command-line YAML processor

    yq is a portable and lightweight command-line YAML processor. It can be likened to projects like jq (a command-line JSON processor) or sed but for YAML files. yq is able to do quite a number of things. It can deep read a YAML file with a given path expression, deeply compare YAML files, update a YAML file given a path expression or script file, and so much more. It can also merge several YAML files while offering plenty of options for overriding and appending. yq is written in portable...
    Downloads: 53 This Week
  • 8
    Humanizer Skill

    Claude Code skill that removes signs of AI-generated writing from text

    Humanizer Skill is a utility library focused on transforming technical or machine-oriented text into expressions that are more natural, readable, and “human-friendly.” It provides a suite of algorithms that convert timestamps, identifiers, file sizes, code tokens, and structured data into phrases that resemble typical human phrasing rather than compact machine output. For example, date and time values can be expressed as relative terms (“two hours ago”), and file sizes can be shown in intuitive units instead of raw bytes. It also includes functions for transforming camelCase, snake_case, or PascalCase identifiers into spaced and capitalized representations suitable for user interfaces, reports, or documentation. ...
    Downloads: 132 This Week
  • 9
    v2ray-rules-dat

    V2Ray routing rules file enhanced version, which can replace V2Ray

    v2ray-rules-dat is a repository that compiles and distributes enhanced rule data (domain lists, geo-IP/geo-domain data, block/proxy/detect lists) intended for use with tools like V2Ray, Xray-core, and similar network/proxy frameworks. The dataset serves as an alternative or supplement to the official geoip/geosite data files, often providing more up-to-date, community-curated entries, which enables better routing, blocking, or traffic management when using those proxy tools. The repository is...
    Downloads: 48 This Week
  • 10
    OmniTools

    Self-hosted collection of powerful web-based tools for everyday tasks

    ...It’s designed to replace the random assortment of “free online tools” people use for quick tasks, while avoiding ads, tracking, and the need to upload sensitive files to unknown servers. A key design choice is that file processing happens entirely on the client side, meaning your data stays in your browser instead of being sent to the backend. The tool catalog spans both technical and non-technical needs, including image, video, audio, PDF, text, date/time, math, and data format utilities like JSON/CSV/XML helpers. It’s also packaged for straightforward self-hosting, with a lightweight Docker image and simple run commands, so it can be deployed quickly on a homelab or internal network.
    Downloads: 19 This Week
  • 11
    fess

    Open source enterprise search server for websites, files, and data

    Fess is an open source enterprise search server designed to provide powerful full-text search capabilities across multiple data sources. It enables organizations to quickly deploy a scalable search environment without requiring deep knowledge of underlying search technologies. Fess is built on top of OpenSearch and offers an integrated solution for crawling, indexing, and searching documents from websites, file systems, and various data stores. Fess includes a built-in crawler that can collect content from sources such as databases, CSV files, and shared storage, making it suitable for centralized knowledge discovery. ...
    Downloads: 14 This Week
  • 12
    Aix-DB

    Based on the LangChain/LangGraph framework

    ...The platform supports multiple types of data sources and provides an end-to-end pipeline that includes intent recognition, SQL generation, database execution, and visual presentation of results. Its architecture includes multiple layers such as a web interface, API gateway, AI service layer, and data storage layer that support relational databases, vector stores, graph databases, and file systems.
    Downloads: 2 This Week
  • 13
    GeoIP

    This project automatically generates GeoIP files in multiple formats

    ...The repo provides automated, periodic releases (e.g. weekly or on schedule) and also offers a CLI tool so users can regenerate or customize geo data in the format they need — for example, producing a .dat file for V2Ray / Xray-core, or a MaxMind-compatible .mmdb.
    Downloads: 23 This Week
  • 14
    diskover-community

    Open source file indexing & storage analytics powered by Elasticsearch

    Diskover Community Edition is an open source file system indexing and storage analytics platform designed to help organizations understand and manage large volumes of file data. It crawls file systems and indexes metadata using Elasticsearch, enabling fast search, analysis, and organization of files stored across different storage systems. It allows administrators and users to explore file structures, monitor storage usage, and gain insights into how data is distributed across infrastructure. ...
    Downloads: 0 This Week
  • 15
    Preswald

    Python tool for browser-based interactive data apps in one file

    Preswald is an open source Python-based framework and static-site generator designed for building interactive data applications that run entirely in the browser. It packages application logic, data processing, and user interface components into a single self-contained output, enabling easy sharing and deployment without requiring local dependencies. Preswald leverages a WebAssembly runtime along with technologies like Pyodide and DuckDB to execute Python code directly in the browser...
    Downloads: 4 This Week
  • 16
    DynamicalSystems.jl

    Award winning software library for nonlinear dynamics timeseries

    DynamicalSystems.jl is an award-winning Julia software library for nonlinear dynamics and nonlinear time series analysis. To install DynamicalSystems.jl, run `import Pkg; Pkg.add("DynamicalSystems")`. To learn how to use it and see its contents, visit the documentation, which you can either find online or build locally by running the `docs/make.jl` file. DynamicalSystems.jl is part of JuliaDynamics, an organization dedicated to creating high-quality scientific software. All implemented algorithms...
    Downloads: 8 This Week
  • 17
    Borg

    Deduplicating archiver with compression and encryption

    BorgBackup (Borg for short) is a deduplicating backup program that also supports compression and authenticated encryption. It provides a secure and efficient way to back up data, and is well suited to daily backups and to backups to not fully trusted targets.
    Downloads: 11 This Week
  • 18
    Uncloud

    A lightweight tool for deploying and managing containerised apps

    Uncloud is a self-hosted personal cloud and file synchronization platform that gives individuals full control over their data without relying on centralized third-party cloud providers. Designed to replace services like Dropbox or Google Drive for people who want privacy and ownership, uncloud lets you sync files across multiple devices—such as laptops, phones, or home servers—while keeping all contents under your own infrastructure.
    Downloads: 5 This Week
  • 19
    Rapid YAML

    Library to parse and emit YAML, and do it fast

    Or ryml, for short. ryml is a C++ library to parse and emit YAML, and do it fast, on everything from x64 to bare-metal chips without an operating system. (If you want your programs to use a YAML tree as a configuration tree with override facilities, take a look at c4conf.) ryml parses both read-only and in-situ source buffers; the resulting data nodes hold only views to sub-ranges of the source buffer. No string copies or duplications are done, and no virtual functions are used. The...
    Downloads: 9 This Week
  • 20
    Sync Server

    Secure, open-source platform for file storage, sharing, collaboration

    Sync Server is the core backend of a secure, open-source file storage, sharing, collaboration, and synchronization platform designed to give users full control over their data while supporting modern collaboration needs. It provides a sleek web interface where teams or individuals can upload, organize, and share files with fine-grained access permissions, and its security-minded design includes things like multi-factor authentication and role-based controls to help protect sensitive documents. ...
    Downloads: 8 This Week
  • 21
    RustDesk Server PRO

    Some scripts for RustDesk Server Pro are hosted here

    ...It encompasses the components required to broker, relay, and supervise connections while adding administrative layers suitable for regulated or large environments. Organizations can standardize how clients authenticate, how sessions are permitted and audited, and how data paths are routed under diverse network constraints. The server emphasizes reliability for interactive use cases like remote control, file transfer, and support workflows, even when clients sit behind strict NAT or firewall rules. It aims to balance performance with privacy by letting companies retain custody over metadata and configuration within their own perimeter. ...
    Downloads: 48 This Week
  • 22
    Tox

    A New Kind of Instant Messaging

    ...In today's world, where digital surveillance is rampant, Tox offers users a communication software alternative that's free from prying eyes and ears and is, quite literally, free and without advertising. Tox comes with all the great features you'd expect from an instant messenger application, including voice calls, video calls, file sharing and screen sharing. Everything done on Tox is encrypted using open-source libraries, so you can be assured of completely private conversations. It also has no central servers that can be raided, shut down, or forced to turn over data. There are no corporate interests and no hidden agendas, just simple, secure and easy-to-use messaging.
    Downloads: 21 This Week
  • 23
    NeuralNote

    Audio Plugin for Audio to MIDI transcription using deep learning

    NeuralNote is an open-source audio software tool designed to convert recorded audio into MIDI data using modern machine learning techniques. The software functions as an audio plugin that can be used inside digital audio workstations as well as a standalone application for music production and analysis. Its main purpose is to perform audio-to-MIDI transcription, allowing musicians to record a performance and automatically transform it into editable MIDI notes. NeuralNote supports polyphonic...
    Downloads: 84 This Week
  • 24
    syncthing-android

    Wrapper of syncthing for Android

    Syncthing is a continuous file synchronization program. It synchronizes files between two or more computers in real time, safely protected from prying eyes. Your data is your data alone and you deserve to choose where it is stored, whether it is shared with some third party, and how it’s transmitted over the internet. None of your data is ever stored anywhere else other than on your computers.
    Downloads: 19 This Week
  • 25
    FDUPES

    FDUPES is a program for identifying or deleting duplicate files

    ...Because it operates directly on file content rather than just filenames, fdupes can accurately detect true copies and guide cleaning operations in data cleanup or migration tasks. It’s a simple, efficient, and widely used utility on Unix-like systems, appreciated by administrators, developers, and power users.
    Downloads: 7 This Week