Accurately convert voice to text in over 125 languages and variants by applying powerful machine learning models with an easy-to-use API.
New customers get $300 in free credits to spend on Speech-to-Text. All customers get 60 free minutes per month for transcribing and analyzing audio, not charged against their credits.
Cloud data warehouse to power your data-driven innovation
BigQuery is a serverless and cost-effective enterprise data warehouse that works across clouds and scales with your data.
BigQuery Studio provides a single, unified interface for data practitioners of all coding skill levels, simplifying analytics workflows from data ingestion and preparation through data exploration and visualization to ML model creation and use. It also lets you use simple SQL to access Vertex AI foundational models directly inside BigQuery for text processing tasks such as sentiment analysis, entity extraction, and more, without having to deal with specialized models.
Polymeter is a MIDI sequencer for music that's in multiple prime meters (2, 3, 5, 7, 11, etc.) simultaneously. Each track has its own loop length, and when the lengths differ, the tracks "slip" (or shift phase) relative to each other. The resulting interference pattern is sufficiently intricate that variations similar to the embellishments of a live performer can be generated algorithmically.
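The phase slipping described above can be illustrated with a minimal sketch. This is not Polymeter's code: the function names and the example loop lengths are hypothetical, and it only assumes that each track loops independently over a whole number of steps. Under that assumption, the tracks realign at their loop starts after the least common multiple of the loop lengths.

```python
from math import lcm  # available in Python 3.9+


def combined_period(loop_lengths):
    """Steps until every track is back at the start of its loop at once."""
    return lcm(*loop_lengths)


def phase_table(loop_lengths, steps):
    """Position of each track inside its own loop, for each global step."""
    return [[step % length for length in loop_lengths] for step in range(steps)]


if __name__ == "__main__":
    loops = [2, 3, 5]  # hypothetical track lengths, in steps
    for step, phases in enumerate(phase_table(loops, 8)):
        print(step, phases)
    print("tracks realign every", combined_period(loops), "steps")
```

Because the example lengths are distinct primes, the realignment period is simply their product, which is why a few short loops can produce a long, non-repeating-sounding pattern.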
An application to automatically extract text from comic books.
cbrTekStraktor is an application to automatically extract text from the text bubbles or speech balloons present in comic book reader files (CBR). Its prime goal is to perform analysis on the texts of comic books. cbrTekStraktor can, however, also be used for scanlation or similar purposes.
The application also lets you manually define text areas in CBR files. It includes a simple graphical editor for further processing of the extracted text.
The text extraction is achieved by a combination of statistical and graphical processing operations. ...
...It's a great way to add a subtitle to a video that doesn't have any subtitle support. It can also be used with native and online streaming videos, such as Netflix, Hulu, Amazon Prime videos, Google Play Movies, etc.
PRIME (Parallel Reduced Interface Multi-threaded Engine) is a 3D engine whose goal is to provide a significantly minimized API while supporting massive multi-threading to exploit multi-core systems.
OneTimePIM is a comprehensive Product Information Management System designed to streamline the import and distribution of product data.
A single source of truth for all of your product information with easy ways to distribute that data to wherever it needs to go, including the most powerful e-commerce connectors in the industry.
Enabling Green Video Streaming Over Internet of Things
This project presents a novel research vision for green video streaming over the Internet of Multimedia (IoM), which is an enhancement to the Internet of Things (IoT). Its prime objective is to enable video streaming as part of the realization of IoT with a reduced overall ecological footprint, in terms of energy consumption and CO2 emissions, while maintaining video fidelity as a tradeoff against the energy and carbon footprints. The solution consists of a novel architecture for IoM and a framework that comprises compressive-sensing-based video encoding and communication protocols for green communication. ...