Showing 12 open source projects for "audio to text"

View related business solutions
  • CloudZero: The Cloud Cost Optimization Platform Icon
    CloudZero: The Cloud Cost Optimization Platform

    CloudZero automates the collection, allocation, and analysis of your infrastructure and AI spend to uncover waste and improve unit economics.

    CloudZero is the leader in proactive cloud cost efficiency. We enable engineers to build cost-efficient software without slowing down innovation. CloudZero's next-generation cloud cost optimization platform automates the collection, allocation, and analysis of cloud costs to uncover savings opportunities and improve unit economics. We are the only platform that enables companies to understand 100% of their operational cloud spend and take an engineering-led approach to optimizing that spend. CloudZero is used by industry leaders worldwide, such as Coinbase, Klaviyo, Miro, Nubank, and Rapid7.
    Learn More
  • Intelligent predictive dialer for outbound call center Icon
    Intelligent predictive dialer for outbound call center

    Solution for lead generation, Political Campaigning, Telemarketing and Phone Polling

    Our autodialer software can massively increase productivity by dropping busy, unanswered and disconnected line, passing calls answered by real people back to your agents, and leaving messages on answering machines.
    Learn More
  • 1
    KrillinAI

    KrillinAI

    Video translation and dubbing tool powered by LLMs

    ...It integrates several stages of the pipeline: video acquisition (either from local files or remote via download tools), speech recognition (ASR), subtitle segmentation and alignment, machine translation (with context-aware translation to preserve semantics), and voice cloning + text-to-speech (TTS) to produce dubbed audio tracks. KrillinAI supports both landscape and portrait videos, which makes it suitable for a wide range of platforms — from YouTube to TikTok or other vertical-video sites — and ensures correct formatting and layout for the final video. The tool offers “one-click” workflows and desktop versions, lowering the barrier for users who may not be familiar with video editing or audio processing pipelines.
    Downloads: 25 This Week
    Last Update:
    See Project
  • 2
    hfapigo

    hfapigo

    Unofficial (Golang) Go bindings for the Hugging Face Inference API

    (Golang) Go bindings for the Hugging Face Inference API. Directly call any model available in the Model Hub. An API key is required for authorized access. To get one, create a Hugging Face profile.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 3
    Scriberr

    Scriberr

    Self-hosted AI audio transcription

    Scriberr is a self-hosted AI-powered transcription platform designed to convert audio and video into highly accurate text while prioritizing privacy and local processing. Unlike cloud-based transcription services, Scriberr runs entirely on the user’s machine, ensuring that sensitive recordings are never sent to third-party servers and remain fully under user control. It leverages modern speech recognition models such as Whisper and other advanced architectures to deliver precise transcripts with word-level timing and speaker identification. ...
    Downloads: 20 This Week
    Last Update:
    See Project
  • 4
    Whishper

    Whishper

    Transcribe any audio to text, translate and edit subtitles 100% locall

    Open-source, local-first audio transcription and subtitling suite with a simple web UI. Thanks to open-source technologies, Whishper can run 100% offline. Your data never leaves your computer. Whishper allows you to translate your transcriptions to and from more than 60 languages thanks to Argos Translate and LibreTranslate. Download the transcriptions in many formats (json, txt, vtt, srt). Easily edit your subtitles right in the Web-UI.
    Downloads: 21 This Week
    Last Update:
    See Project
  • No-Nonsense Code-to-Cloud Security for Devs | Aikido Icon
    No-Nonsense Code-to-Cloud Security for Devs | Aikido

    Connect your GitHub, GitLab, Bitbucket or Azure DevOps account to start scanning your repos for free.

    Aikido provides a unified security platform for developers, combining 12 powerful scans like SAST, DAST, and CSPM. AI-driven AutoFix and AutoTriage streamline vulnerability management, while runtime protection blocks attacks.
    Learn More
  • 5
    Peer Calls

    Peer Calls

    Group peer to peer video calls for everyone written in Go

    Peer Calls is a self-hosted, open-source WebRTC-based video and audio calling platform for group communication. Designed for simplicity and privacy, it allows anyone to run their own video conferencing service without relying on third-party providers. Peer Calls supports multi-user rooms, screen sharing, and chat, all delivered via a clean web interface. It’s great for small teams, communities, and educational groups seeking secure and customizable alternatives to mainstream conferencing tools.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Git Large File Storage

    Git Large File Storage

    Git extension for versioning large files

    An open source Git extension for versioning large files. Git Large File Storage (LFS) replaces large files such as audio samples, videos, datasets, and graphics with text pointers inside Git, while storing the file contents on a remote server like GitHub.com or GitHub Enterprise. Download and install the Git command line extension. Once downloaded and installed, set up Git LFS for your user account. In each Git repository where you want to use Git LFS, select the file types you'd like Git LFS to manage (or directly edit your .gitattributes). ...
    Downloads: 18 This Week
    Last Update:
    See Project
  • 7
    Ebitengine

    Ebitengine

    A dead simple 2D game engine for Go

    Ebitengine (formerly known as Ebiten) is a lightweight, open-source 2D game engine built for the Go programming language. It is designed to be simple and easy to use, allowing developers to build games quickly with a clean and minimal API. Ebitengine supports cross-platform deployment, including desktop, mobile, web, and select console platforms. The engine provides essential features such as 2D graphics rendering, input handling, and audio playback. Developers can work with transformations,...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 8
    Acontext

    Acontext

    Context data platform for building observable, self-learning AI agents

    Acontext is a cloud-native context data platform designed to support the development and operation of advanced AI agents. It provides a unified system to store and manage contexts, multimodal messages, artifacts, and task workflows, enabling developers to engineer context effectively for their agent products. The platform observes agent tasks and user feedback in real time, offering robust observability into workflows and helping teams understand how agents perform over time. Acontext also...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    Chanify

    Chanify

    Chanify is a safe and simple notification tools

    Chanify is a safe and simple notification tools. For developers, system administrators, and everyone can push notifications with API.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Composable, Open Source Payments Platform Icon
    Composable, Open Source Payments Platform

    Build or enhance your payments stack, while maintaining control with an open-source, full-stack and modular infrastructure.

    Juspay's Payments Orchestration Platform offers a comprehensive product suite for businesses, including open-source payment orchestration, global payouts, seamless authentication, payment tokenization, fraud & risk management, end-to-end reconciliation, unified payment analytics & more. The company’s offerings also include end-to-end white label payment gateway solutions & real-time payments infrastructure for banks. These solutions help businesses achieve superior conversion rates, reduce fraud, optimize costs, and deliver seamless customer experiences at scale.
    Learn More
  • 10
    VideoSrt

    VideoSrt

    Windows-GUI

    ...Open source software tool that can recognize video speech and automatically generate subtitle SRT files. It is suitable for business scenarios that quickly and batch generate Chinese/English subtitles and text files for media (video/audio). Recognize video/audio speech to generate subtitle files (support Chinese-English translation, bilingual subtitles) Extract speech text from video/audio. Batch translation, filter processing/encoding SRT subtitle files. Using the Alibaba Cloud speech recognition interface, the accuracy is high, and the standard Mandarin/English recognition rate is over 95%. ...
    Downloads: 34 This Week
    Last Update:
    See Project
  • 11
    Pixel

    Pixel

    A hand-crafted 2D game library in Go

    ...See the requirements for the list of libraries necessary for compilation. Fast 2D graphics, sprites, and primitive shapes with immediate mode style IMDraw (circles, rectangles, lines, ...) Optimized drawing with Batch. Text drawing with text package. Audio through a separate Beep library. Simple and convenient API. Drawing a sprite to a window is as simple as sprite.Draw(window, matrix) Wanna know where the center of a window is? window.Bounds().Center() Full documentation and tutorial. Works on Linux, macOS and Windows. Window creation and manipulation (resizing, fullscreen, multiple windows, ...) ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 12
    go-rock-libraries-showcases

    go-rock-libraries-showcases

    Comprehensive tutorial that integrates blogs, examples, audio, video

    ...It is designed for learners who have already finished Go, and want to deepen their skills by exploring real-world libraries that are popular and highly regarded in the Go ecosystem. The course uses a “three-in-one” approach: blog posts, code examples, and audio/video lectures are combined to explain each library in practical context. The README explains that the motivation is the sheer number of powerful libraries that many developers do not have time to evaluate, so the course highlights a subset of “rock star” libraries and shows how to use them effectively. The basic information section lists the instructor, prerequisites (basic Go ability, preferably completion of the fundamentals course), and the development environment: Go 1.2–1.4, Sublime Text 3, and macOS 10.9–10.10.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB