Search Results for "speech & image processing based project in matlab" - Page 2

Showing 97 open source projects for "speech & image processing based project in matlab"

  • 1
    Advanced AI explainability for PyTorch

    Advanced AI Explainability for computer vision

    pytorch-grad-cam is an open-source library that provides advanced explainable AI techniques for interpreting the predictions of deep learning models used in computer vision. The project implements Grad-CAM and several related visualization methods that highlight the regions of an image that most strongly influence a neural network’s decision. These visualization techniques allow developers and researchers to better understand how convolutional neural networks and transformer-based vision...
    Downloads: 0 This Week
  • 2
    Roadmap To Learn Generative AI In 2025

    Basic Machine Learning Natural Language Processing Roadmap

    Roadmap To Learn Generative AI In 2025 is a curated learning path focused on contemporary generative AI — covering large language models (LLMs), diffusion-based image generation, prompt engineering, multi-modal AI, fine-tuning techniques, and the practical considerations for deploying generative models. It’s aimed at learners and developers who already have some programming or ML basics and wish to specialize in generative AI, offering a modern, structured plan that reflects the state of the...
    Downloads: 0 This Week
  • 3
    Perfect Pixel

    Refine and quantize messy AI pixel art into clean, perfect pixels

    perfectPixel is a workflow tool for turning messy “pixel-style” images, especially those produced by generative models, into truly grid-aligned pixel art that reads cleanly at any scale. It tackles a common problem with AI pixel art: edges that look pixelated at first glance but are not actually aligned to a coherent pixel grid, which causes shimmer, blur, and uneven block sizes when you zoom in. The tool analyzes an image to infer the intended grid size, then refines and quantizes the...
    Downloads: 1 This Week
  • 4
    ProStack

    ProStack - a platform for image processing and analysis

    ProStack - a platform for image processing and analysis. It implements various image processing methods as separate modules that can be joined into a complex image processing scenario through a graphical user interface. RPMs are available at https://build.opensuse.org/project/repositories/home:mackoel:compbio
    Downloads: 0 This Week
  • 5
    PyDenseCRF

    Python wrapper to Philipp Krähenbühl's dense (fully connected) CRFs

    PyDenseCRF is a Python library that provides a wrapper around the implementation of fully connected Conditional Random Fields (CRFs) developed by Philipp Krähenbühl and Vladlen Koltun. The project allows developers and researchers to integrate Dense CRF inference into Python-based machine learning pipelines, particularly for computer vision tasks such as image segmentation and labeling. Conditional Random Fields are probabilistic graphical models used to model contextual relationships...
    Downloads: 0 This Week
  • 6
    unwarp

    Increase image resolution by eliminating atmospheric distortion

    Unwarp is an open-source tool that enhances image resolution by eliminating scintillations caused by atmospheric turbulence and similar distortion phenomena. The software processes a series of images of the same subject, aligning and stacking them using advanced feature selection algorithms and phase correlation approaches. The core technique matches features between images, applies triangulation across the entire frame, and warps each pixel to its optimal position. The resulting aligned...
    Downloads: 0 This Week
  • 7
    SigPack

    SigPack - A signal processing library using Armadillo

    SigPack is a C++ signal processing library using the Armadillo library as a base. The API will be familiar to those who have used IT++ and Octave/Matlab.
    Downloads: 6 This Week
  • 8
    GeoTools, the Java GIS toolkit

    Toolkit for working with and mapping geospatial data

    GeoTools is an open source (LGPL) Java code library which provides standards compliant methods for the manipulation of geospatial data. GeoTools is an Open Source Geospatial Foundation project. The GeoTools library data structures are based on Open Geospatial Consortium (OGC) specifications.
    Downloads: 130 This Week
  • 9
    cleanvideo-cli

    CLI tool for removing watermarks from AI-generated videos using frame-

    cleanvideo-cli is a command-line tool designed to remove visible watermarks from AI-generated videos. It works by analyzing video frames and reconstructing the underlying pixels in watermark regions, without cropping or blurring the original content. This project is intended for developers, researchers, and creators who need a lightweight utility for cleaning preview or draft videos before further processing. Note: This tool does not bypass platform restrictions and should be used...
    Downloads: 1 This Week
  • 10
    Free AI Watermark Remover - FreeRepair

    AI-powered tool to quickly remove watermarks from images flawlessly

    AI Watermark Remover (Free And Open-Source) & Make Blurry Images Clearer Or Larger Tool - FreeRepair, Simulation IOPaint Based On The Django Of Python With No Sign-Up. As a free, open-source, AI-powered tool, FreeRepair makes it easy to remove watermarks, logos, text or clutter from images, and blurry images can be made clearer or larger. No installation, no internet connection, it works out of the box, safe and secure, unlimited.
    Downloads: 4 This Week
  • 11
    Glint Translator
    Glint Translator is a high-performance Windows application for real-time in-game and voice translation without interrupting gameplay. It supports 240+ languages using DeepL, Google, OpenAI, Azure, and Google Gemini models. The interface is available in 18 languages. Features • 3 Translation Modes: Fluent (parallel), Area (overlay), Full Screen (smart detection) • Speaker detection with color-coding • Glint AI custom terminology control • Game-based profile system • Advanced...
    Downloads: 31 This Week
  • 12
    EmotiVoice

    Multi-Voice and Prompt-Controlled TTS Engine

    EmotiVoice is a multi-voice, prompt-controlled text-to-speech engine designed to generate highly expressive speech across thousands of voices. It supports both English and Chinese and ships with over 2,000 preset voices, making it suitable for everything from characters and virtual anchors to narration and dialogue. The core idea is prompt-based emotional and style control: you can ask the engine to speak “happy,” “sad,” “excited,” or with other high-level style prompts that shape prosody,...
    Downloads: 2 This Week
  • 13
    SAGA GIS
    SAGA - System for Automated Geoscientific Analyses - is a Geographic Information System (GIS) software with immense capabilities for geodata processing and analysis. SAGA is programmed in the object oriented C++ language and supports the implementation of new functions with a very effective Application Programming Interface (API). Functions are organised as modules in framework independent Module Libraries and can be accessed via SAGA’s Graphical User Interface (GUI) or various scripting...
    Downloads: 8,072 This Week
  • 14
    ekho

    Chinese text-to-speech engine

    ekho is a project with relatively sparse documentation, but from the repository it appears to be a small-scale tool for audio processing and playback, possibly with features for speech synthesis or manipulation. The repo includes scripts and configuration files suggesting interactions with media/audio handling libraries. Because of limited README detail, it seems targeted at users comfortable reading and modifying code, rather than end users expecting polished UIs. ...
    Downloads: 3 This Week
  • 15
    mfbdjvu

    Easy conversion of pgm and ppm to (MASK+FG+BG)-djvu.

    MFBdjvu is a simple project for easily converting pgm and ppm to (MASK+FG+BG)-djvu. It uses djvulibre for all technical work and compression. The breakdown of the image into components is done using DjVuL and DjVuL wiki. MFBdjvu is based on simpledjvu.
    Downloads: 0 This Week
  • 16
    Amiga Memories

    A walk along memory lane

    Amiga Memories is a project (started & released in 2013) that aims to make video programmes that can be published on the internet. The images and sound produced by Amiga Memories are 100% automatically generated. The generator itself is implemented in Squirrel, the 3D rendering is done on GameStart 3D. An Amiga Memories video is mostly based on a narrative.
    Downloads: 0 This Week
  • 17
    Real-ESRGAN

    Real-ESRGAN aims at developing Practical Algorithms

    Real-ESRGAN is a highly popular open-source project that provides practical algorithms for general image and video restoration using deep learning-based super-resolution techniques. It extends the original Enhanced Super-Resolution Generative Adversarial Network (ESRGAN) approach by training on synthetic degradations to make results more robust on real-world images, effectively enhancing resolution, reducing noise/artifacts, and reconstructing fine detail in low-quality imagery. The...
    Downloads: 220 This Week
  • 18
    MaryTTS

    An open-source, multilingual text-to-speech synthesis system

    MaryTTS is an open-source, multilingual Text-to-Speech Synthesis platform written in Java. It was originally developed as a collaborative project of DFKI’s Language Technology Lab and the Institute of Phonetics at Saarland University. It is now maintained by the Multimodal Speech Processing Group in the Cluster of Excellence MMCI and DFKI. As of version 5.2, MaryTTS supports German, British and American English, French, Italian, Luxembourgish, Russian, Swedish, Telugu, and Turkish; more languages are in preparation. ...
    Downloads: 15 This Week
  • 19
    NaveGo

    NaveGo: an open source MATLAB/GNU Octave toolbox for processing integr

    NaveGo is an open source MATLAB/GNU Octave toolbox designed for processing integrated navigation systems, simulating inertial sensors and GNSS receivers, and profiling inertial sensors using methods like Allan variance—providing a community-driven simulation framework for navigation system design and analysis. I am reaching out to share an important update regarding the NaveGo project. Due to a shift in both my professional career and personal interests away from navigation systems, I have...
    Downloads: 0 This Week
  • 20
    ReViSP

    ReViSP, a 3D volume rendering MATLAB tool for multicellular spheroids

    Reconstruction and Visualization from a Single Projection (ReViSP) tool: a 3D volume rendering method we developed to reconstruct the 3D shape of multicellular spheroids, besides estimating the volume by counting the voxels (3D pixels) fully included in the 3D surface. ReViSP is written in MATLAB (The MathWorks, Inc., Massachusetts, USA) and the source code is freely provided. Requirements for running ReViSP from the source code: MATLAB 2020a and Image Processing Toolbox 11.1 or later versions. Please, when using this software, cite these articles: (a) F. Piccinini, et al., Cancer multicellular spheroids: Volume assessment from a single 2D projection. ...
    Downloads: 2 This Week
  • 21
    SVoice (Speech Voice Separation)

    We provide a PyTorch implementation of the paper Voice Separation

    SVoice is a PyTorch-based implementation of Facebook Research’s study on speaker voice separation as described in the paper “Voice Separation with an Unknown Number of Multiple Speakers.” This project presents a deep learning framework capable of separating mixed audio sequences where several people speak simultaneously, without prior knowledge of how many speakers are present. The model employs gated neural networks with recurrent processing blocks that disentangle voices over multiple...
    Downloads: 0 This Week
  • 22
    Music Source Separation

    Separate audio recordings into individual sources

    Music Source Separation is a PyTorch-based open-source implementation for the task of separating a music (or audio) recording into its constituent sources — for example isolating vocals, instruments, bass, accompaniment, or background from a mixed track. It aims to give users the ability to take any existing song and decompose it into separate stems (vocals, accompaniment, etc.), or to train custom separation models on their own datasets (e.g. for speech enhancement, instrument isolation, or...
    Downloads: 2 This Week
  • 23
    CNN for Image Retrieval
    cnn-for-image-retrieval is a research-oriented project that demonstrates the use of convolutional neural networks (CNNs) for image retrieval tasks. The repository provides implementations of CNN-based methods to extract feature representations from images and use them for similarity-based retrieval. It focuses on applying deep learning techniques to improve upon traditional handcrafted descriptors by learning features directly from data. The code includes training and evaluation scripts that...
    Downloads: 0 This Week
  • 24
    Makani

    Makani aimed to develop a commercial-scale airborne wind turbine

    Makani was an ambitious Google X project that sought to harness wind energy using airborne wind turbines — autonomous kites capable of generating power while flying in crosswind patterns. This open-source repository contains the complete software stack that powered Makani’s research and flight systems, including the flight simulator, autopilot controller, avionics firmware, visualization tools, and ground control software. The software enables simulation, control, and analysis of the Makani...
    Downloads: 0 This Week
  • 25
    This project contains an open-source implementation of an OPT platform. The 3D-printable components are provided to build your customized sample stage. We offer our open source MATLAB software with a GUI for OPT imaging. We developed a generalized automated workflow that includes a two-step registration approach for correcting the center of rotation and provides accurate, high-quality 3D reconstruction. *Here we provide the raw tomography videos recorded using the zOPT hardware. *For...
    Downloads: 1 This Week