Alternatives to Voci

Compare Voci alternatives for your business or organization using the curated list below. SourceForge ranks the best alternatives to Voci in 2026. Compare features, ratings, user reviews, pricing, and more from Voci competitors and alternatives in order to make an informed decision for your business.

  • 1
    Google Cloud Speech-to-Text
    Google Cloud’s Speech API processes more than 1 billion voice minutes per month with close to human levels of understanding for many commonly spoken languages. Powered by the best of Google's AI research and technology, Google Cloud's Speech-to-Text API helps you accurately transcribe speech into text in 73 languages and 137 different local variants. Leverage Google’s most advanced deep learning neural network algorithms for automatic speech recognition (ASR) and deploy ASR wherever you need it, whether in the cloud with the API, on-premises with Speech-to-Text On-Prem, or locally on any device with Speech On-Device.
    Leader badge
    Compare vs. Voci View Software
    Visit Website
  • 2
    QEval

    QEval

    Etech Global Services

    QEval is contact center quality assurance software that automates quality monitoring across 100% of voice, chat, and email interactions. Most call center QA teams manually sample 1 to 5% of calls. QEval replaces that with AI-powered speech analytics, automated quality scoring, and real-time compliance monitoring. Core functionality: call monitoring and evaluation, agent performance management, sentiment analysis, keyword detection, customer experience analytics, coaching workflows, gamification, and 110+ dashboards with predictive analytics. Compliance monitoring covers PCI, HIPAA, and GDPR with 98% accuracy and real-time alerts. QEval's speech analytics engine is trained on 138M+ interactions with 94% classification accuracy. The platform deploys in 30 days, not the 90 to 120 days typical of call center quality monitoring software. ISO 27001, SOC 2, PCI-DSS certified. Built by Etech Global Services for Fortune 500 contact centers in healthcare, telecom, retail, banking, and BPO.
    Leader badge
    Partner badge
    Compare vs. Voci View Software
    Visit Website
  • 3
    Twilio Voice
    Create a scalable voice experience with the API that connects millions globally. With Twilio Voice, you can build unique phone call experiences with one API, to create, receive, control and monitor calls with just a few lines of code. Create an engaging voice experience that you can quickly scale and modify with a wide array of customization options and resources, like our Voice SDK. Then, add on features like Interactive Voice Response (IVR), recording transcriptions, and speech recognition to create an experience that your customers will appreciate. Whether you're looking to set up global conferencing or alerts & notifications, Twilio has the support you need for building with Voice. Find docs, code samples, helper libraries, and developer tools such as Twilio Runtime and our visual workflow builder, Studio.
    Starting Price: $0.0085 per min
  • 4
    Speechmatics

    Speechmatics

    Speechmatics

    Best-in-Market Speech-to-Text & Voice AI for Enterprises. Speechmatics delivers industry-leading Speech-to-Text and Voice AI for enterprises needing unrivaled accuracy, security, and flexibility. Our enterprise-grade APIs provide real-time and batch transcription with exceptional precision—across the widest range of languages, dialects, and accents. Powered by Foundational Speech Technology, Speechmatics supports mission-critical voice applications in media, contact centers, finance, healthcare, and more. With on-prem, cloud, and hybrid deployment, businesses maintain full control over data security while unlocking voice insights. Trusted by global leaders, Speechmatics is the top choice for best-in-class transcription and voice intelligence. 🔹 Unmatched Accuracy – Superior transcription across languages & accents 🔹 Flexible Deployment – Cloud, on-prem, and hybrid 🔹 Enterprise-Grade Security – Full data control 🔹 Real-Time & Batch Processing – Scalable transcription
    Starting Price: $0 per month
  • 5
    Eleveo

    Eleveo

    Eleveo

    Global, award winning contact center compliance & workforce optimization solutions. Compliance recording can protect your company from theft, litigation & fines. Sleep well knowing Elevēo has you covered for everything from voice calls to land mobile radios. Remove, or anonymize details from data collected to stay in compliance. Archive datasets based on configurable rules and automatic categorization. Supervise your teams voice interactions with customers in real-time. Detailed audit logging for every system action with simplified extracts for compliance reviews. Your support, sales & back-office business transactions are critical. Protect your interests by recording everything into a single place with automated categorization by type, source, and customer where any interaction can be easily reviewed. We have been recording voice calls for decades. All over the world our solutions are trusted.
  • 6
    Rev

    Rev

    Rev

    Rev provides premium on-demand, manual and automated transcription, closed caption, and foreign subtitling services. With 170,000+ customers, Rev's clients span from global enterprises to freelance journalists. Rev processes more audio and video than any other provider and has the ability to scale to fit any customer's needs. Pricing is simple starting at just $0.25 per audio/video minute for automated speech-to-text services and $1.25/min for manual with 99% accuracy. Rev also offers Rev.ai which is a speech recognition engine that's available to companies that want it.
    Starting Price: $1.25 per minute
  • 7
    Call Center Studio

    Call Center Studio

    Call Center Studio

    Elevate your customer service with AI powered contact center software. Call Center Studio contact center cloud software provides a range of capabilities to help businesses manage their inbound and outbound contact center operations. In terms of inbound calls, the software typically includes features such as automatic call distribution, interactive voice response, and call routing to ensure that calls are directed to the appropriate agent or team. The software often includes predictive, preview, and progressive dialing for outbound calls to help agents efficiently reach out to customers. Call Center Studio offers real-time monitoring and analytics to help managers track performance in one screen. If you are frustrated with the high cost, complexity and difficulty of the conventional systems, our cloud based software is just perfect for you! Meet our user-friendly product! 💰 Pay-as-you-go ✔️ No hardware. No software. No maintenance 👨‍💻 Easy to use 🔗Smooth integration
  • 8
    Deepgram

    Deepgram

    Deepgram

    Deploy accurate speech recognition at scale while continuously improving model performance by labeling data and training from a single console. We deliver state-of-the-art speech recognition and understanding at scale. We do it by providing cutting-edge model training and data-labeling alongside flexible deployment options. Our platform recognizes multiple languages, accents, and words, dynamically tuning to the needs of your business with every training session. The fastest, most accurate, most reliable, most scalable speech transcription, with understanding — rebuilt just for enterprise. We’ve reinvented ASR with 100% deep learning that allows companies to continuously improve accuracy. Stop waiting for the big tech players to improve their software and forcing your developers to manually boost accuracy with keywords in every API call. Start training your speech model and reaping the benefits in weeks, not months or years.
    Starting Price: $0
  • 9
    NeoSound

    NeoSound

    NeoSound Intelligence

    NeoSound Intelligence is an AI tech company that turns emotions into actionable insights in order to create a world with better conversations between organizations and consumers. ​We intend to make all conversations better between consumers and organizations. By providing AI-powered speech analytics tools, we help call center companies to optimize their customer communication. Turn calls into revenue. Optimise customer communication by listening to customer calls automatically. NeoSound tools turn phone conversations into meaningful actionable insights to make customer communication better. NeoSound tools do not only speech-to-text translation. Smart algorithms do acoustics and intonation analysis. The machine listens to how people speak not only what they say. That is why our trained machines can easily address your company-specific needs. NeoSound offers a unique combination of speech-to-text semantic analytics and acoustic analysis of intonation.
  • 10
    CallMiner Eureka
    CallMiner Eureka leverages Artificial Intelligence (AI) and Machine Learning (ML) to analyze every customer interaction, across all channels, and automatically uncover actionable intelligence. We are continuously improving and expanding CallMiner Eureka products in an effort to provide our customers with the right, most updated tools to maximize ROI. Analytics workbench, discovery, category and scoring configuration. Agent/supervisor portal, direct performance feedback. Real-time monitoring & alerting, agent next-best-action, API/message driven. Audio capture for efficient speech analytics. PCI and sensitive data redaction from audio and transcripts. Data extraction, audio / contact / data ingestion, app development. Brings speech analytics data story to life. Elevate the customer experience. Communicate via your customer’s preferred channels. Power your business with customer insights. Optimize outcomes.
  • 11
    Gemini Audio
    Gemini Audio is a set of advanced real-time audio models built on Gemini's architecture, designed to enable natural, fluid voice interaction and expressive audio generation through simple language prompts. It supports conversational experiences where users can speak, listen, and interact with AI in a seamless loop, combining understanding, reasoning, and response generation in audio form. It is capable of both analyzing and generating audio, allowing applications such as speech-to-text transcription, translation, speaker identification, emotion detection, and detailed audio content analysis. They are optimized for low-latency, real-time use cases, making them suitable for live assistants, voice agents, and interactive systems that require continuous, multi-turn dialogue. Gemini Audio also integrates advanced capabilities like function calling, enabling the model to trigger external tools and incorporate real-time data into responses.
    Starting Price: Free
  • 12
    Verbio

    Verbio

    Verbio

    Increase security and user experience in daily interactions with the unique potential of voice. An innovative language agnostic, cost-effective and reliable alternative to seamlessly verify and identify users in real-time. Voice biometrics allows to automatically recognize any person through the characteristics of their voice and it can smartly substitute traditional authentication methods (cards, passwords, signature, fingerprint, etc) in security access control, user verification for digital transactions or for fraud prevention and detection. With an easy and cost-effective solution, authentication through voice biometrics brings an innovative and safe experience to users, with a risk-free and remote access. Biometric Authentication and Identification through voice has never been so secure and fast with different operational uttering models for each type of client and advanced anti-spoofing methodologies.
  • 13
    MAI-Transcribe-1
    MAI-Transcribe-1 is a state-of-the-art speech-to-text model developed by Microsoft and available through Azure AI Foundry, designed to deliver high-accuracy transcription for real-world audio across enterprise and developer use cases. It supports 25 major languages and is optimized to handle diverse accents, dialects, and speaking styles, maintaining consistent performance even in challenging conditions such as background noise, low-quality recordings, or overlapping speech. It is built by Microsoft’s AI Superintelligence team with a dual focus on accuracy and efficiency, enabling fast batch transcription and scalable deployment for production environments. MAI-Transcribe-1 powers a wide range of applications, including meeting transcription, live captions, accessibility tools, call center analytics, and voice-driven agents, making it a foundational component for voice-enabled systems.
    Starting Price: Free
  • 14
    talvala surveillance
    Talvala is a speech analytics company. We use Baidu’s Deep Speech technology and machine learning for compliance surveillance and human/machine interfaces. We develop speech-based monitoring applications and human machine interfaces (“HMI”) for a wide variety of clients. We believe that the time is ripe for voice-based HMIs! Talvala Surveillance is our compliance monitoring product and combines an advanced speech-to-text transcription engine with alerts generation for a revolutionary 2-in-1 surveillance speech analytics solution. Our R&D Unit develops customized human/machine interfaces for clients in the field of robotics or internet-of-things and looking to take human voice as an input.
    Starting Price: $30000.00/year
  • 15
    Speech2Structure
    When treating a patient, doctors spend on average two-thirds of their time documenting the treatment and far less time on examinations or patient interviews. To allow doctors to spend more time with their patients, Averbis is working on Speech2Structure – a software solution where the documentation is recorded live by voice and structured on-the-fly. Speech2Structure can correctly recognize and resolve many linguistic variations such as negations, suspected diagnoses, diagnoses that have taken place, etc. when recognizing diagnoses. Pathological laboratory values or microbiology results are also converted into corresponding diagnoses. The recorded medications can also provide clues to diagnoses.
  • 16
    Level AI

    Level AI

    Level AI

    Level AI is the leading customer experience intelligence platform that helps enterprises analyze conversations, improve agent performance, and automate customer support across voice and chat. Built for modern CX & contact centers, Level AI combines conversation analytics, automated quality assurance, real-time agent coaching, and AI virtual agents in a unified platform trained on real customer interactions. By analyzing 100% of conversations, Level AI uncovers root causes of customer issues, identifies operational bottlenecks, and surfaces insights that help CX leaders improve service quality and resolution rates. Organizations use Level AI to automate quality monitoring, deploy AI support agents, coach human agents in real time, and turn conversations into actionable insights. The platform integrates with leading contact center systems to help enterprises scale support operations, improve customer satisfaction, and reduce costs through AI-driven automation.
  • 17
    MOJO-CX

    MOJO-CX

    MOJO-CX

    Make sure you are never one of them by ensuring your compliance is watertight with customizable voice analysis triggers. Over 53% of UK consumers show at least one characteristic of vulnerability, so we’ve made it easier to spot them and notify the best person in your organization. A massive 91% of customers reported poorer CX from contact centers in the second half of 2021. Focus on the things that drive uplift faster and understand what agents need to say to drive more positive outcomes for customers. Set custom rules that allow you to immediately alert the appropriate person for every critical moment, based on any data points within the platform. Even the ones that you provide. Easily keep track of how well every conversation has gone based on the metrics that matter to you, giving you a clear view of agent performance across each interaction.
    Starting Price: $7,171.51 per month
  • 18
    SpokenData

    SpokenData

    ReplayWell

    Let the automatic speech-to-text technology transcribe your data. Or transcribe your data yourself or buy professional transcript. Use our on-line time synchonous editor to surf your data and transcripts. Download transcripts in many formats. Manage your team of transcribers using tags and categories. Help them with transcription by automatic voice-to-text technology. Integrate SpokenData into your application via our REST API. We adapt the voice-to-text on your data domain to maximize the transcript accuracy and lower your labor costs. Enable speech technologies in your applications through integrating SpokenData using our REST API. We are ready to process huge amounts of your data. You get API fitting your needs. Just contact our support team. We customize the voice-to-text on your data and purpose to maximize the transcript accuracy. Suitable for: web/mobile app developers, media monitoring agencies, audio/video archive business.
  • 19
    Inspeech

    Inspeech

    Inconcert

    Inspeech is an AI-powered speech analytics solution designed for contact centers that automatically analyzes 100% of customer interactions across voice and digital channels to improve service quality and generate actionable business insights. It leverages artificial intelligence trained on millions of real customer experience operations to interpret conversations in more than 20 languages, processing inputs from channels such as calls, chat, WhatsApp, email, and social media. It includes a highly reliable speech-to-text engine that transcribes large volumes of calls in real time, enabling organizations to quickly identify patterns, opportunities, and areas for improvement. Users can customize quality evaluation criteria by defining specific concepts, keywords, or behaviors to detect, allowing analysis to align with business priorities and compliance requirements. Inspeech also provides real-time monitoring tools that evaluate agent performance through metrics.
  • 20
    MediaSpeech

    MediaSpeech

    ChapsVision

    Exploit the richness of speech, a source of information and interaction. Based on deep neural learning, MediaSpeech by ChapsVision offers a fine and precise transcription of your audios and videos. If the digital undoubtedly occupies a growing place in the Customer Relationship, the telephone remains essential. The analysis of agent-customer conversations is essential for the proper consideration of the reasons for calling but also allows access to a wealth of strategic information, from the evaluation of satisfaction to the detection of trends, in going through competition monitoring through their unsolicited mentions. The regulatory inflation of the last decade requires constant reinforcement of the compliance function, both in human and technological terms. The obligation to take into account telephone communications calls for new means, in particular the ability to process voice flows to detect sensitive elements or to reconstruct a given transaction.
  • 21
    RocketWhisper

    RocketWhisper

    Mojosoft Co., Ltd.

    RocketWhisper is a powerful desktop speech recognition and transcription application that runs 100% offline on your computer. Your voice data never leaves your machine - complete privacy guaranteed. Powered by OpenAI's Whisper engine with NVIDIA GPU (CUDA) acceleration, RocketWhisper delivers fast and accurate speech-to-text conversion for professionals, content creators, and anyone who works with voice and text. Key Features: - 100% offline processing - voice data never leaves your PC - OpenAI Whisper engine for high-accuracy speech recognition - NVIDIA CUDA GPU acceleration - up to 10x faster than CPU - Real-time voice-to-text input with global hotkey (Push-to-Talk with Right Alt) - Batch transcription of multiple audio/video files (MP3, WAV, M4A, MP4, MKV, AVI, etc.) - SRT/VTT subtitle export for video content - AI text formatting with LLM integration (OpenAI, Anthropic, Google Gemini, Grok, local LLM)
    Starting Price: $32 one-time
  • 22
    RapportCMS
    RapportCMS is our competitive advantage vs our competitors. We are focused on the intersection between telephony, interaction management and the people who handle the calls. This approach ensures that we make ‘human technology’ designed by and for contact center practitioners. We know that world-class call center technology must be equally adept at addressing what happens after the agent says hello as to how the call is routed to the desktop. As one of the leading Contact Centres in the AUNZ market, we had over 10 years of building, refining and improving our technology before then releasing it to market as a SAAS solution. While most providers have built solutions from a telephony perspective, we recognize what happens after the agent says hello is of equal importance to what happened before.
  • 23
    VoxSci

    VoxSci

    VoxSciences

    Listening to voice messages can be terribly inefficient and laborious. VoxSciences™ provides a paradigm shift by transcribing voice messages into text messages. This gives voice messages a quantum leap to join email, SMS and IM on an equal basis with all the inherent advantages such as textural search. Our VERBS (Virtual Engine for Recognition of Basic Speech) engine converts voice messages into text messages and delivers them either as an email, SMS or via an API interface. Voicemail to text (SMS) is ideal for personal or corporate voicemail systems. Our XML API is typically used when a particularly high volumes of voice message transcription is required often by larger companies for Voice of The Customer analysis, comment lines, network or PABX operators and affiliates. Voice of the Customer is a market research technique that produces a detailed set of customer wants and needs. It involves the analysis of feedback from various sources such as email, web and IVR surveys.
  • 24
    Azure AI Speech
    Build voice-enabled apps confidently and quickly with the Speech SDK. Transcribe speech to text with high accuracy, produce natural-sounding text-to-speech voices, translate spoken audio, and use speaker recognition during conversations. Create custom models tailored to your app with Speech studio. Get state-of-the-art speech to text, lifelike text to speech, and award-winning speaker recognition. Your data stays yours, your speech input is not logged during processing. Create custom voices, add specific words to your base vocabulary, or build your own models. Run Speech anywhere, in the cloud or at the edge in containers. Quickly and accurately transcribe audio in more than 92 languages and variants. Gain customer insights with call center transcription, improve experiences with voice-enabled assistants, capture key discussions in meetings and more. Use text to speech to create apps and services that speak conversationally, choosing from more than 215 voices, and 60 languages.
  • 25
    Verint Speech Analytics
    Speech analytics solution to help businesses extract valuable insights from phone calls. Speech Analytics: lower costs and improve CX. Transcribe and analyze millions of calls to discover customer insights and improve contact center performance in the cloud. Nothing can tell you more about your business than analyzing your customer calls. Call recordings are a gold mine of rich insights about customer satisfaction, customer churn, competitive intelligence, service issues, agent performance and campaign effectiveness. However, the sheer volume of phone calls exceeds the contact center’s ability to manually review and analyze them. Manual review can process only a fraction of calls using unsophisticated analysis, there has to be a better way. Verint Speech Analytics can transcribe and analyze 100 percent of your recorded calls to help surface valuable intelligence. At Verint, we use our unparalleled experience and expertise to continually drive innovation and improve accuracy.
  • 26
    Yactraq

    Yactraq

    Yactraq

    Yactraq is the industry value leader in speech analytics software. Our customers typically realize benefits across two broad functional areas. Marketing teams looking to extend their Voice-of-the-Customer (VoC) capabilities beyond the feedback form and social media now want to mine sales and customer service phone calls as part of their omni-channel capability. Contact Center Quality Management teams typically use speech analytics / audio mining as a way of leveraging AI / Machine Learning to evaluate the performance of their call agents. Yactraq offers customized free trials based on a clients own data so they can experience the value of our software before deciding to buy. Our products are cost-effectively priced to suit the needs of end customers as well as partners in the Business Process Outsourcing (BPO), Contact Center as a Service (CCAS), Voice-of-the-Customer (VoC), CRM Software and Network Service Provider businesses.
  • 27
    SpeechText.AI

    SpeechText.AI

    SpeechText.AI

    Transcribe audio and video into text. Get accurate transcriptions of podcasts with domain-specific speech recognition. SpeechText.AI is a powerful artificial intelligence software for speech to text conversion and audio transcription. Upload audio or video files. AI transcription software supports various file formats and transcribes from speech to text in any language. Select domain. Select industry domain and audio type from predefined categories to improve the recognition accuracy of domain-specific words. Transcribe. Our speech transcription engine uses state-of-the-art deep neural network models to convert from audio to text with close to human accuracy. Edit & Export. Search, modify and verify audio transcriptions using interactive editing tools. Export your content in different formats. Why SpeechText.AI? Set of amazing features to help you transcribe audio and video in seconds. Speech recognition. Powerful speech-to-text tech.
    Starting Price: $19 one-time payment
  • 28
    VoiceBase

    VoiceBase

    VoiceBase

    Our customers discover new ways to lower call center costs, maximize revenue, and minimize compliance risk with our flexible, scalable solutions. Using AI, Natural Language Processing, and Intelligence Tools, we turn raw unstructured call data into structured, rich data for analysis. Make better business decisions from every sales, service, or marketing conversation. Voice Analytics software to transcribe contact center calls and organize the data for actionable insights. Automatically transcribe recordings with natural language processing (NLP). Analyze, inspect and categorize calls with our industry-leading query solution. Automatically detect and redact sensitive data PCI / PII data from the audio and transcript. Includes 40 paralinguistic metrics such as silence, overtalk, dynamism & sentiment. Detect and predict complex behavior with high accuracy using machine learning. Analyze chat, email, CRM, and support data for a complete view of customer interactions.
  • 29
    Picovoice

    Picovoice

    Picovoice

    Picovoice is the first and only ubiquitous on-device voice AI platform. Picovoice offers speech-to-text, voice search, wake word, Speech-to-Intent (intent detection) and voice activity detection engines. Its stack can run on anything from embedded devices to web browsers, providing an immersive experience not achievable by any Big Tech.
    Starting Price: Free
  • 30
    Azure Speaker Recognition
    A Speech service feature that verifies and identifies speakers. Enable frictionless, secure customer experiences: Improve the customer experience by streamlining verification processes. Use voice to verify individuals for secure, frictionless customer engagements in a wide range of solutions, from web applications to call centers. Speaker verification can use either passphrases or free-form voice input. Improve the customer experience by streamlining verification processes. Use voice to verify individuals for secure, frictionless customer engagements in a wide range of solutions, from web applications to call centers. Speaker verification can use either passphrases or free-form voice input. Unlock value from scenarios with multiple speakers: Determine a speaker’s identity from within a group of enrolled speakers. Speaker identification enables you to attribute speech to individual speakers, support multiuser voice recognition for personalized interactions, and more.
  • 31
    Rev.ai

    Rev.ai

    Rev.ai

    Rev.ai was built by leading speech recognition experts from millions of hours of accurate human-transcribed content. We began in 2011 with Rev.com, providing human transcription services. We are now the world's largest transcription vendor, with over 35,000 contractors who transcribe millions of minutes of audio each month. In 2017 we launched Temi, an automated speech-to-text transcription and editing service. Temi has already transcribed 20 million minutes of content and was named the best transcription service by Wirecutter. Today our best-in-class speech engine is available to everyone as Rev.ai. We're helping companies get the most out of their audio and video content by making it searchable and accessible.
  • 32
    Rubidium

    Rubidium

    Rubidium

    Rubidium enables leading companies to embed voice commands and text to speech in their products. Voice Trigger is an “always on” engine that continuously listens and wakes up when you say the proper “magic word”. Voice Trigger identification uses a sophisticated miniature footprint Automatic Speech Recognition (ASR) engine to run in the background and distinguish between the trigger phrase and the rest of the speech, sounds and noise. Automated Speech Recognition (ASR) easily and safely controls any set of functions through voice commands. For example: call acceptance and rejection, device setup and installation procedure (pairing, calibration, interconnection, etc.), voice dialing, music streaming control and music selection. Rubidium technology is now embedded in over 50 million consumer products with customers and partners including leading global brands such as RIM (Blackberry), GN Netcom (Jabra), Panasonic, Uniden, CSR, Mattel, General Motors, Electrolux and many others.
  • 33
    Yandex SpeechKit
    Speech technologies based on machine learning to create voice assistants, automate call centers, monitor service quality, and perform other tasks. Leverage the advanced technology behind the wildly successful Alice voice assistant, now ready for use in your business. In a fraction of a second, SpeechKit accurately recognizes speech, allowing our clients' voice assistants to communicate quickly and easily. Choose the right version for you, the full version creates a smart voice assistant while the adaptive version gives your brand a unique voice in just a month. A solution for the most demanding customers who need to control speech processing and synthesis within their own infrastructure. SpeechKit’s ML models can now be deployed to your infrastructure. We offer both hybrid options and 100% on-premise deployments for sensitive traffic. The service can recognize audio in MP3, LPCM, and OggOpus formats.
    Starting Price: $0.000020 per unit
  • 34
    Wynyard Voice Frequency Analytics
    There is a lot of unstructured data in various formats such as call records, recorded conversations, unclear voices, etc. To identify the relevant data and recognize the voices, a powerful tool is required. Wynyard Voice Frequency Analytics (VFA) is an analyzing tool that helps in identifying the person behind an unclaimed voice or decoding the speech in a readable format from an unclear voice. It is a web application that recognizes the identity of the speaker. The application is beneficial for the law enforcement and Government bodies to prevent crimes. Wynyard VFA works on the simple concept of matching the suspected voice with the ones available in the database and recognizing the owner of that voice. The advanced and superior technology used in the application ensures accurate results. The application can also be used to identify keywords or phrases from a conversation and convert the speech into readable text.
  • 35
    VoxSigma

    VoxSigma

    Vocapia

    The VoxSigma software suite is offered as a Web service via a REST API over HTTPS, always providing customers access to our latest systems thereby quickly benefiting from regular advances and take advantage of additional features offered by the online environment. Our speech-to-text service is available 24/7/365 with failover servers and geographic redundancy. Automatic on-the-fly adaptation allows the user to provide texts related to the audio document being processed, what can be considered topic/domain adaptation. These accompanying texts serve to increase the lexical coverage of the speech-to-text system and to adapt the language model to the specific domain of the audio document with the aim of improving the transcription accuracy.
  • 36
    Alibaba Cloud Intelligent Speech Interaction
    Intelligent Speech Interaction is developed based on state-of-the-art technologies such as speech recognition, speech synthesis, and natural language understanding. Enterprises can integrate Intelligent Speech Interaction into their products to enable them to listen, understand, and converse with users, providing users with an immersive human-computer interaction experience. Intelligent Speech Interaction is currently available in Mandarin Chinese, Cantonese Chinese, English, Japanese, Korean, French and Indonesian, and please stay tuned for other languages. Intelligent Speech Interaction is suitable for various scenarios, including intelligent Q&A, intelligent quality inspection, real-time subtitling for speeches, and transcription of audio recordings. Intelligent Speech Interaction has been successfully applied in many industries such as finance, insurance, eCommerce and smart home.
    Starting Price: $1.40 per hour
  • 37
    Fusion Speech
    Back-end speech recognition is the most significant technology development in the dictation and transcription industries. Without physician training, or changes in practice patterns, Fusion Speech® powered by Nuance’s SpeechMagic™ harnesses this powerful technology for facility-wide deployment in nearly every medical specialty. Capture dictation with Fusion Voice®, process the dictation through Fusion Speech, and boost transcription productivity in Fusion Text®. The Fusion modules drive cost savings in reoccurring labor and outsourcing fees. This is the speech recognition solution you have envisioned. Other speech recognition has provided cute gimmicks but fell short in offering a sustainable business application. Fusion Speech provides the tools you require to truly deploy speech recognition that returns measurable and tangible results for your investments.
  • 38
    Transkriptor

    Transkriptor

    Transkriptor

    Automatically transcribe audio, and turn your audio or video to text. Upload your file and convert your audio to text with Transkriptor. Transkriptor’s powerful artificial intelligence generates online transcriptions within few minutes. Transkriptor is used by many professionals or students. Transkriptor is the best assistant for interview transcription, lecture transcription and video transcription. Transkriptor creates editable TXT, word or SRT files. You can download your transcriptions within seconds or you can use Transkriptor’s online editor for easy and quick editing. Sign up today and be more productive in school, work, and life. Even though Transkriptor is one of the most powerful artificial intelligence solutions, it is extremely easy to use. Transkriptor is an online speech-to-text converter and no installation required. Simply upload your file and start.
    Starting Price: $9.99 per month
  • 39
    aiOla

    aiOla

    aiOla

    aiOla is a deep tech Conversational, Voice, and Speech AI lab with an enterprise-level automatic speech recognition (ASR) foundation model, Text-to-speech (TTS) technology and Natural Language Understanding (NLU). It’s designed to help enterprises and developers adapt speech technologies to any process, whether through seamless API integration or an intuitive in-house app. aiOla is revolutionizing enterprise operations with enterprise level Conversational AI. We specialize in speech-to-text and text-to-speech AI that deliver unmatched accuracy (95%), specialized in specific jargon, in any language, accent, vertical, or acoustic environment. From empowering frontline workers with hands-free workflows to enabling voice AI agents with enterprise-grade ASR and TTS, aiOla seamlessly integrates into workflows, internal apps and products.
  • 40
    AccuSpeechMobile

    AccuSpeechMobile

    AccuSpeechMobile

    AccuSpeechMobile's modern, robust speech recognition is optimized for mobile devices in over 40 languages. Designed for industry workflows, cutting edge noise abatement technology delivers outstanding recognition in noisy environments. A speaker-independent voice engine works for all users out-of-the-box, without the need to voice train or maintain voice files for each user. AccuSpeechMobile is a 100% device-based solution. No voice server or middleware is required and no changes are needed to the backend system (WMS, ERP, EAM, CMMS). Cloud or network connection is not required to use the full functionality of device-based data collection. AccuSpeechMobile fully supports multi-modal capabilities so that users can hear spoken information and speak commands in tandem with the use of intelligent scanners. The ability to reference additional information on the device screen is also always available in conjunction with speech-to-text and text-to-speech commands.
  • 41
    Observe.AI

    Observe.AI

    Observe.AI

    Observe.AI powers end-to-end quality management with the most accurate speech analytics for Contact Centers. With its Voice AI Platform, support teams analyze 100% of voice calls for quality and compliance, automate agent evaluations, and improve coaching. Analyze Calls for 100% compliance and call quality monitoring, so you'll never miss an opportunity or risk. Evaluate Agents with automated agent evaluations, and build trust with accurate data while fixing broken processes. Coach Teams with targeted coaching, know what training programs drive change, and replicate what top supervisors and trainers do best. Observe.AI uses the industry’s most accurate conversation intelligence engine to analyze customer interactions across every channel, giving you visibility into what's happening in your business. We incorporate these insights into evaluation and coaching workflows to improve agent and seller performance at scale—in one seamless platform.
  • 42
    SpeechMotion
    Document a patient encounter with full or partial dictation, voice recognition, or on-the-go with a customized solution tailored to your unique environment. Solving common documentation issues, like lowering costs and integrating workflows, begins with choosing a solution designed to meet your evolving needs. Improve workflow efficiencies and physician adoption for a rapid return on investment with a partner committed to your long-term success. A leading, national provider of US-based transcription, speech recognition, voice capture and advanced documentation technologies, SpeechMotion partners with healthcare facilities and the organizations supporting them to create a customized documentation solution tailored to support both long and short-term goals. SpeechMotion provides the flexible options healthcare facilities need to quickly and efficiently document a complete patient story, all under one product and service umbrella.
  • 43
    SpeechWrite

    SpeechWrite

    SpeechWrite

    SpeechWrite specializes in a range of cloud dictation and voice recognition agile workflow solutions designed to meet the flexible working needs of the modern-day professional. Scalable and future-proofed solutions to suit all types of organizations. Our industry-leading range of digital dictation and transcription solutions link authors and transcribers facilitating efficient communication. Individual and organizational workflow settings enhance flexibility to ensure you receive your written dictations quickly and efficiently when in the office or on the move. Use your most powerful tool, your voice, and put it to work. Our practical technology, sophisticated yet simple, allows you to enhance your working environment and simply work smarter. We listen, learn and collaborate to support you through every stage of the process while also offering professional guidance and support along the way.
  • 44
    SoundHound

    SoundHound

    SoundHound AI

    We believe every brand should have a voice and every person should be able to interact naturally with the products around them, by simply talking. At SoundHound Inc., we’re working together with our strategic partners to build a more accessible and connected world. We build custom voice assistants for companies wanting to keep their brand, users, and data. Built on the foundation of proprietary Speech-to-Meaning® and Deep Meaning Understanding® technologies, the Houndify platform provides conversational intelligence unmatched by others in the industry. Houndify everything! Voice-enable the world with conversational intelligence. Create a voice AI platform that exceeds human capabilities and brings value and delight via an ecosystem of billions of products enhanced by innovation and monetization opportunities. Headquartered in the heart of Silicon Valley, we are a global company with 9 offices in key markets and teams in 16 countries.
  • 45
    Contact Cubed

    Contact Cubed

    Contact Cubed

    We are a speech analytics company that unlocks the hidden insights that are buried within your call recordings. Our automated AI driven platform keeps the spotlight on 100% of your customer interactions. Stop flying blind - see what's hiding in your calls by scheduling a demo today. Our seamlessly integrated solution analyzes 100% of your calls using our proprietary speech & voice analytics platform. Your internal processes & goals with the power of industry specific C.I and cutting edge A.I. is our recipe for your success. Whether you're looking to increase conversion, improve NPS or just simply create call efficiency, we are your all in on solution. Every industry from collections, insurance, sales through banking has its own nuances, language, normalcy and we cover them all. Our dedication to optimizing the call center management experience solves it from every angle from simplest to complex.
  • 46
    Call Journey

    Call Journey

    Call Journey

    It’s no secret that voice communications remain the number one method used by customers to interact with organizations. Voice is our most natural interface, but at the same time, it is the hardest from which to capture insights. Take the contact center as an example: contact centers are experts at measuring the data around calls, yet rely on post-call surveys to measure the quality of the call - the actual content of the conversation is largely ignored. Post-call surveys cannot capture or understand the depth of what is really being said. The amount of seconds processed via our VoiceAI ecosystem in the last 12 months. Saved in fines by improving risk and compliance process. Saved in fines by improving risk and compliance process. Improved sales offering and customer profiling by getting deeper insights into customer journey. Boost in conversion by spotting verbal trends that lead to a sale.
  • 47
    Marsview

    Marsview

    Marsview

    Marsview APIs are trusted by thousands of developers and CX teams who are integrating conversation intelligence in voice, video, and chat-driven applications. Together we can shape the future of conversation in the digital world. Let's jointly move your business forward by leading innovation to deliver world-class conversational intelligence and analytics to our customers. Intelligent virtual agents execute tasks and handle questions with a human-like conversational experience. Automatically detect intents to provide in-call assistance, on-screen actions, call disposition, and summarize call notes. Automatically generate actionable insights from 100% of customer interactions across all channels. Marsview's full suite of language, speech, vision, and empathy APIs help you to rapidly deploy customized AI solutions at scale with high confidence. Return the best matching responses to questions or the next best actions.
    Starting Price: $9.99 per month
  • 48
    Soniox

    Soniox

    Soniox

    Soniox develops highly accurate foundational speech models that transcribe, translate, and understand speech as it happens, and also provides the developer platform that makes it easy to integrate real-time voice intelligence into any application. Soniox Speech-to-Text API allows you to transcribe speech in 60+ languages in real-time with high accuracy - built for large scale. Soniox also provides regional data residency and is SOC 2 Type 2, GDPR and HIPAA compliant.
    Starting Price: $0.10/hour of audio
  • 49
    Diktamen

    Diktamen

    Diktamen

    Diktamen is a cloud-based digital dictation and transcription platform designed to streamline voice capture, task management, and workflow automation across professional sectors. The solution enables users to dictate audio from any location, via mobile, desktop, or dedicated devices, and securely transmit that audio for transcription, speech recognition, and task assignment. It supports industry-specific workflows (notably in legal and healthcare), allows integration with existing systems, and features centralized management for submissions, status tracking, and BI reporting with AI-driven forecasting. Clients benefit from cost reduction in dictation infrastructure, efficient transcription turnaround through outsourced partner networks, real-time task routing, and a flexible SaaS deployment model with minimal local installation or maintenance. Diktamen holds ISO 27001 certification and adheres to GDPR for data security and compliance.
  • 50
    Stratifyd

    Stratifyd

    Stratifyd

    You don't know what you don't know. Stratifyd is designed to find the trends and anomalies that point to changes you need to make in your customer, product, and employee experiences. Capture and connect voice of the customer data from any source— first- and third-party data, social, chat, speech, reviews, and more, into a single secured and trusted Experience Analytics Platform with our expansive library of experience, operational, behavioral, public data, and open API sources. Quickly drill down to the moments that matter by harnessing the power of Smart AI. Take charge of a 24/7 stream of customer experience, behavioral, and operational data to reveal and predict key topics, anomalies, sentiment, and trends, no data science or coding required. Show customers you’re truly listening to their experience. Reduce churn, drive loyalty, and improve efficiency by automatically acting on the insights that matter.   
    Starting Price: $1000 per year