Showing 30 open source projects for "listen"

View related business solutions
  • Next-generation security awareness training. Built for AI email phishing, vishing, smishing, and deepfakes. Icon
    Next-generation security awareness training. Built for AI email phishing, vishing, smishing, and deepfakes.

    Track your GenAI risk, run multichannel deepfake simulations, and engage employees with incredible security training.

    Assess how your company's digital footprint can be leveraged by cybercriminals. Identify the most at-risk individuals using thousands of public data points and take steps to proactively defend them.
    Learn More
  • End-To-End Document Management Software Icon
    End-To-End Document Management Software

    UnForm is ideal for businesses focusing on distribution, manufacturing ERP solutions, and general accounting.

    UnForm® is a platform-independent software product that creates, delivers, stores and retrieves graphically enhanced documents from ERP application printing. A complete, end-to-end document management solution, UnForm interfaces at the point of printing to produce documents in various formats for printing and electronic delivery.
    Learn More
  • 1
    Speech Note

    Speech Note

    Speech Note Linux app. Note taking, reading and translating

    Speech Note is a Linux desktop and Sailfish OS application for taking, reading, and translating notes with integrated offline speech technology. It combines speech-to-text, text-to-speech, and machine translation in a single interface, allowing users to dictate notes, listen back to them, and translate them without ever sending data to the cloud. All processing is done locally, which means audio, text, and translations never leave the device, emphasizing strong privacy guarantees. The application supports multiple STT engines such as Coqui STT (DeepSpeech fork), Vosk, whisper.cpp, Faster Whisper, and april-asr, giving users flexibility in accuracy, speed, and hardware requirements. ...
    Downloads: 26 This Week
    Last Update:
    See Project
  • 2
    Bot Framework Web Chat

    Bot Framework Web Chat

    A highly-customizable web-based client for Azure Bot Services

    ...This repo is part of the Microsoft Bot Framework, a comprehensive framework for building enterprise-grade conversational AI experiences. Create a bot with the ability to speak, listen, understand, and learn from your users with Azure Cognitive Services. Benefit from open source SDK and tools to build, test, and connect bots that interact naturally with users, wherever they are. Build secure, global, scalable solutions that integrate with your existing IT ecosystem. Create an AI experience that can extend your brand and keep you in control of your own data.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 3
    Agili Hacker Podcast

    Agili Hacker Podcast

    AI tool that turns Hacker News posts into daily podcast updates

    ...This creates a hands-free way to stay updated on tech, startups, and developer discussions without reading long threads. Hacker Podcast combines content aggregation, natural language processing, and text-to-speech to deliver clear and digestible updates. Users can listen through web interfaces or podcast platforms, while also accessing written summaries for deeper reading. Built with modern web technologies, the project focuses on automation, speed, and accessibility. It supports continuous updates, allowing listeners to receive fresh insights daily. As an open-source tool, it also encourages community contributions and customization for developers who want to adapt or extend its workflow for similar AI-driven content pipelines.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 4
    Open Vision Agents by Stream

    Open Vision Agents by Stream

    Build Vision Agents quickly with any model or video provider

    Open Vision Agents by Stream is an open source framework from Stream for building real time, multimodal AI agents that watch, listen, and respond to live video streams. It focuses on combining video understanding models, such as YOLO and Roboflow based detectors, with real time large language models like OpenAI Realtime and Gemini Live to create interactive experiences. The framework uses Stream’s ultra low latency edge network so agents can join sessions quickly and maintain very low audio and video latency while processing frames and generating responses. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Learn More
  • 5
    ChatTTS_colab

    ChatTTS_colab

    One-click deployment (including offline integration package)

    ...It provides an integrated offline bundle and scripts for Windows and macOS so users can run ChatTTS locally without wrestling with complex environment setup. The repository includes Colab notebooks that launch a Gradio-based web UI and expose streaming TTS, making it possible to listen to generated audio as it is produced. A distinctive feature is the “voice gacha” system, which batch-generates many distinct voice timbres and allows users to save the ones they like into a curated voice library. It has first-class support for long-form audio generation, making it suitable for audiobooks, podcasts, or long narration tasks. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    Node.js Telegram Bot API

    Node.js Telegram Bot API

    Telegram Bot API for NodeJS

    ...Depending on the properties of the Message, one of these events may ALSO be emitted, text, audio, document, photo, sticker, video, voice, contact, location, new_chat_members, left_chat_member, new_chat_title, new_chat_photo, delete_chat_photo, group_chat_created, game, pinned_message, poll, dice, migrate_from_chat_id, migrate_to_chat_id, channel_chat_created, supergroup_chat_created, successful_payment, invoice, video_note, etc. Its much better to listen a specific event rather than on message in order to stay safe from the content. Bot must be enabled on inline mode for receive some messages.T elegram only supports HTTPS connections to WebHooks. Therefore, in order to set a WebHook, you will need a SSL certificate. The library makes it easy to get started sending files. By default, you may provide a file-path and the library will handle reading it for you.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Eva AI

    Eva AI

    Eva is an A.I. assistant that helps users multi-task.

    ...It also has the purpose of helping people with disabilities use the computer with a greater ease. Eva can open and close system related and non-system related applications, search content on web applications, set timers, and take screenshots. Tell Eva "Listen" or "Hey listen" followed by a command. For more instructions, check the instruction manual included in the application. [Update] * 🆕 Removed paged memory cleanup * 🆕 Re-added physical model switch-up * 🆕 Added automatic microphone audio level maximisation * 🆕 Re-calibrated the * 🐞 Re-added the wake word engine reset mechanism * 🐞 Fixed UI related issues regarding threading * 🐞 Fixed thread synchronisation bugs
    Downloads: 6 This Week
    Last Update:
    See Project
  • 8

    TTS Reader Pro

    A Wordpress plugin for read articles with text-to-speech (tts)

    A Wordpress plugin that adds a button to read articles with text-to-speech (tts).
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    SpeakFlow-TTS

    SpeakFlow-TTS

    Multilingual Text-to-Speech (TTS)

    ...Key features of SpeakFlow: Multilingual support: Choose from a wide range of languages and voices (Ukrainian, English, German, Russian, Polish, French, Italian, Spanish, Portuguese, and more). Simple and intuitive interface: Designed for quick and convenient audio generation. Audio Playback: Instantly listen to and download the generated text in MP3 format.
    Downloads: 0 This Week
    Last Update:
    See Project
  • ShareCRM is an AI-powered enterprise CRM platform designed to connect data and teams across the entire customer lifecycle. Icon
    ShareCRM is an AI-powered enterprise CRM platform designed to connect data and teams across the entire customer lifecycle.

    Trusted by 6000+ Large and Medium Enterprises

    ShareCRM is an AI-powered, customizable and affordable enterprise CRM solution to seamlessly integrate and empower every aspect of your business.
    Learn More
  • 10
    Amica

    Amica

    Amica is an open source interface for interactive communication

    Amica is an open source interface for interacting with fully animated 3D characters that combine voice chat, vision, and an emotion engine into a single experience. It lets you hold natural conversations with AI characters that can see, listen, and speak, while expressing emotional states through facial expressions and body language. Users can import VRM character models, adjust their appearance, tune the voice to match the character, and define behavior using different large language models and TTS backends. Under the hood, Amica leverages modern web and desktop technologies: three.js and three-vrm for 3D rendering, Transformers.js for running models in the browser, Whisper and Silero VAD for speech recognition and voice-activity detection, and a variety of LLM backends such as llama.cpp servers, ChatGPT-compatible APIs, Ollama, KoboldCpp, and others. ...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 11
    Yodd's AI Chat

    Yodd's AI Chat

    This app uses the OpenAISwift library, ChatGPTSwift library and OpenAI

    ...With Yodd AI Chat, you can also generate images to accompany your messages, adding a new level of creativity and personalization to your conversations. Plus, you can save, listen to, and delete messages.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    TalkingBot

    TalkingBot

    A virtual parrot for Android platform that can listen and talk to you.

    A virtual personality for Android platform that can listen and talk to you (like a parrot) via the Google synthesizer. Idea born for fun and totally experimental, but feel free to modify it and, if you like, to help me improve it. This initial version only understands the Italian language! I'm sorry.. Please support this project by donating to this fundraiser: ------------------------------------------- Give birth to new travel bicycles!!!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Botkit

    Botkit

    Tool for building chat bots, apps and custom integrations

    ...Include Botkit into your Node application and boot up a controller that will define your bot's behaviors. In this case, we're setting up a bot to use with the Bot Framework Emulator. Tell the bot to listen for users saying "hello," and use `bot.reply` to send an immediate response. Start a conversation, then queue up multiple messages to send, including a prompt sent using `convo.ask()` which allows your bot to capture user input and use it. Botkit is just one part of a bigger set of developer tools and SDKs that encompass the Microsoft Bot Framework. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 14
    VoiceOver

    VoiceOver

    VoiceOver is a web application that allows you to transcribe audio

    VoiceOver is a web application that allows you to transcribe English audio and listen to it in another voice. Choose a source, an audio file (.wav) in English only. Transcribe audio, several algorithms will take care of it. Listen to the generated transcription, a man or a woman, it's up to you!
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    English2Cards

    English2Cards

    English Learning software

    English2Cards is a smart English learning program designed to improve your listening & speaking skills without Internet connection using great learning files. The program repeatedly reviews the educational cards at different times so that you can remember the new words and phrases that you have learned and be able to use them in real conversations easily and without thinking. During learning you can remember the words in the text and also get the translation, pronunciation and examples of...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Olivia

    Olivia

    Your new best friend powered by an artificial neural network

    ...Its goal is to provide a free and open-source alternative to big services like DialogFlow. You can chat with her by speaking (STT) or writing, she replies with a text message but you can enable her voice (TTS). Olivia can listen to you by saying “Hey Olivia” or clicking on the central button. She speaks to reply to you unless you've disabled her voice. Olivia respects your privacy. All the data used by Olivia is saved in your client. Olivia is organized in modules to facilitate the addition of new capabilities. These modules can be written in Go to execute multiple tasks. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    insofts player
    insofts-player Free media player, with which you can easily and conveniently view video and listen to audio files in various formats, without installing additional codecs. View streaming video, audio.   Constantly updating the online media library Additional features: sound recording, uart protocol support, speech to text
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    FM2TXT

    FM2TXT

    RtlSdr listen to radio, recognize audio, and writes text file log

    Just log your favorite FM station speech to a text file using rtl-sdr dongle and speech recognition. Cross-platform tool. Follow the README on the download page for Windows installation. https://sourceforge.net/projects/fm2txt-rtlsdr/files/ If you prefer GitHub source, not SF: https://github.com/randaller/fm2txt For those, who want to recognize from soundcard, not from rtl-sdr (this allows to transcribe NFM etc): https://github.com/randaller/souncard2txt
    Downloads: 3 This Week
    Last Update:
    See Project
  • 19

    Jambo Text to Speech Plus

    Text to Speech is a user-friendly and reliable piece of software that

    "Jambo - Text to Speech is a user-friendly and reliable piece of software that was created to help you convert written messages into audio, so you can listen to them on the go, on a media player" 3.5/5 Review by: Softpedia More Details / Features: http://www.vysilhq.com/text-to-speech.html
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20

    S.T.A.N Voice Recognition Software

    a software like J.A.R.V.I.S!!

    Hello everyone! I am presenting a software named S.T.A.N, a voice recognition software which recognizes your voice and performs actions like from opening to facebook to renaming, copying a file, creating a folder and many more! This software has over 50+ commands to perform by just listening to your voice! so just download it and check it out. Your comments and feedback is valuable to us. And if you like, please like our facebook page and mail us!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    InproTK

    InproTK

    An Incremental Spoken Dialogue Processing Toolkit

    InproTK is an Incremental Spoken Dialogue Processing Toolkit, that is, a toolkit to help you build dialogue systems that listen and talk incrementally, allowing for advanced interactional behaviour. Please see our Wiki for more information: http://sourceforge.net/p/inprotk/wiki/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    My ZZZ Knowledge Micro Web Base

    This is a base which stores knowledge in the form of nested sets

    ...The access to knowledge base is realized through a "server" to which you can connect an unlimited number of "clients" as one computer can run multiple servers that are located in different folders and listen to different ports. Each server can create and access the unlimited knowledge bases and databases associated with other servers located on the Internet or Intranet. Knowledge bases managed by artificial intelligence language, such as executable code can be written as in external files and in the elements of the base. The Knowledge Base is powerful enough, it can be used for indexing web contents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Audivolv

    Audivolv

    Artificial intelligence evolves musical instruments played with mouse

    ...You can also write the Java code into its window. Uses Javassist internal compiler on the new Java code assembled by renaming variables to overlap eachother and reordering, copying, and removing blocks of code. Listen to the recorded music in files section of sourceforge or try it for yourself. This branched into my other projects: GigaLineCompile, JSelfModify, JSoundCard, and Human AI Net as I got more into the AI and systems side of things and explored vision research as a better user interface to network minds together through the Internet. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Open Exchange (OpEx)

    Open Exchange (OpEx)

    The open source Algorithmic Trading System

    OpEx is an application suite that includes the main building blocks of commercial electronic trading systems. All OpEx applications run on distributed system architectures.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25

    Text2Speech

    Converts text to speech.

    Type in text in the simple textbox and listen to the words come out of your computer's speakers.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB