Search Results for "speech & image processing based project in matlab" - Page 2

Sort By:

Showing 97 open source projects for "speech & image processing based project in matlab"

View related business solutions

ServiceDesk Plus, a world-class IT and enterprise service management platform
Design, automate, deliver, and manage critical IT and business services

Best in class online service desk software. Offer your customers world-class services with ServiceDesk Plus Cloud, the easy-to-use SaaS service desk software from ManageEngine, the IT management division of Zoho. Track and manage IT tickets efficiently, resolve issues faster, and ensure end-user satisfaction with the cloud-based IT ticketing system used by over 100,000 IT service desks worldwide. Manage the complete life cycle of IT incidents, problems, changes, and projects with out of the box ITIL workflows. Create support SLAs, define escalation levels, and ensure compliance. Automate ticket dispatch, categorization, classification, and assignment based on predefined business rules, and set up notifications and alerts for timely ticket resolution. Reduce walk ins and unnecessary tickets by giving your users more control. Enable end users to access IT services through your service catalog in the self-service portal. Help users create and track tickets and search for solutions.

Learn More
Download the most trusted enterprise browser
Chrome Enterprise brings enterprise controls and easy integrations to the browser users already know and love.

Chrome Enterprise is ideal for businesses of all sizes, IT professionals, and organizations looking for a secure, scalable, and easily managed browser solution that supports remote work, data protection, and streamlined enterprise operations.

Learn More
1

Advanced AI explainability for PyTorch

Advanced AI Explainability for computer vision

pytorch-grad-cam is an open-source library that provides advanced explainable AI techniques for interpreting the predictions of deep learning models used in computer vision. The project implements Grad-CAM and several related visualization methods that highlight the regions of an image that most strongly influence a neural network’s decision. These visualization techniques allow developers and researchers to better understand how convolutional neural networks and transformer-based vision...

Downloads: 0 This Week

Last Update: 2026-03-29
See Project
2

Roadmap To Learn Generative AI In 2025

Basic Machine Learning Natural Language Processing Roadmap

Roadmap To Learn Generative AI In 2025 is a curated learning path focused on contemporary generative AI — covering large language models (LLMs), diffusion-based image generation, prompt engineering, multi-modal AI, fine-tuning techniques, and the practical considerations for deploying generative models. It’s aimed at learners and developers who already have some programming or ML basics and wish to specialize in generative AI, offering a modern, structured plan that reflects the state of the...

Downloads: 0 This Week

Last Update: 2025-12-02
See Project
3

Perfect Pixel

Refine and quantize messy AI pixel art into clean, perfect pixels

perfectPixel is a workflow tool for turning messy “pixel-style” images, especially those produced by generative models, into truly grid-aligned pixel art that reads cleanly at any scale. It tackles a common problem with AI pixel art: edges that look pixelated at first glance but are not actually aligned to a coherent pixel grid, which causes shimmer, blur, and uneven block sizes when you zoom in. The tool analyzes an image to infer the intended grid size, then refines and quantizes the...

Downloads: 1 This Week

Last Update: 2026-02-01
See Project
4

ProStack

ProStack - a platform for image processing and analysis

ProStack - a platform for image processing and analysis. It implements various image processing methods as separate modules, that can be joined in a complex image processing scenario by use of a graphical user interface. RPMs are available at https://build.opensuse.org/project/repositories/home:mackoel:compbio

2 Reviews

Downloads: 0 This Week

Last Update: 2025-10-29
See Project
The fastest way to host, scale and get paid on WordPress
For developers searching for a web hosting solution

Lightning-fast hosting, AI-assisted site management, and enterprise payments all in one platform designed for agencies and growth-focused businesses.

Learn More
5

PyDenseCRF

Python wrapper to Philipp Krähenbühl's dense (fully connected) CRFs

PyDenseCRF is a Python library that provides a wrapper around the implementation of fully connected Conditional Random Fields (CRFs) developed by Philipp Krähenbühl and Vladlen Koltun. The project allows developers and researchers to integrate Dense CRF inference into Python-based machine learning pipelines, particularly for computer vision tasks such as image segmentation and labeling. Conditional Random Fields are probabilistic graphical models used to model contextual relationships...

Downloads: 0 This Week

Last Update: 2026-03-12
See Project
6

unwarp

Increase image resolution by eliminating atmospheric distortion

Unwarp is an open-source tool that enhances image resolution by eliminating scintillations caused by atmospheric turbulence and similar distortion phenomena. The software processes a series of images of the same subject, aligning and stacking them using advanced feature selection algorithms and phase correlation approaches. The core technique matches features between images, applies triangulation across the entire frame, and warps each pixel to its optimal position. The resulting aligned...

Downloads: 0 This Week

Last Update: 2025-07-23
See Project
7

SigPack

SigPack - A signal processing library using Armadillo

SigPack is a C++ signal processing library using the Armadillo library as a base. The API will be familiar for those who has used IT++ and Octave/Matlab.

2 Reviews

Downloads: 6 This Week

Last Update: 2026-02-27
See Project
8

GeoTools, the Java GIS toolkit

Toolkit for working with and mapping geospatial data

GeoTools is an open source (LGPL) Java code library which provides standards compliant methods for the manipulation of geospatial data. GeoTools is an Open Source Geospatial Foundation project. The GeoTools library data structures are based on Open Geospatial Consortium (OGC) specifications.

38 Reviews

Downloads: 130 This Week

Last Update: 2026-03-19
See Project
9

cleanvideo-cli

CLI tool for removing watermarks from AI-generated videos using frame-

cleanvideo-cli is a command-line tool designed to remove visible watermarks from AI-generated videos. It works by analyzing video frames and reconstructing the underlying pixels in watermark regions, without cropping or blurring the original content. This project is intended for developers, researchers, and creators who need a lightweight utility for cleaning preview or draft videos before further processing. Note: This tool does not bypass platform restrictions and should be used...

Downloads: 1 This Week

Last Update: 2026-01-04
See Project
Accounting Software Built for Owners, and Their Clients
Make invoicing and billing painless for your small business with FreshBooks.

Balancing your books, client relationships, and business isn’t easy. FreshBooks gives you the info and time you need to focus on your big picture—your business, team, and clients.

Learn More
10

Free AI Watermark Remover - FreeRepair

AI-powered tool to quickly remove watermarks from images flawlessly

AI Watermark Remover (Free And Open-Source) & Make Blurry Images Clearer Or Larger Tool - FreeRepair, Simulation IOPaint Based On The Django Of Python With No Sign-Up. As a free, open-source, AI-powered tool, FreeRepair makes it easy to remove watermarks, logos, text or clutter from images, and blurry images can be made clearer or larger. No installation, no internet connection, it works out of the box, safe and secure, unlimited.

1 Review

Downloads: 4 This Week

Last Update: 2026-03-30
See Project
11

Glint Translator

Glint Translator is a high-performance Windows application for real-time in-game and voice translation without interrupting gameplay. It supports 240+ languages using DeepL, Google, OpenAI, Azure, and Google Gemini models. The interface is available in 18 languages. Features • 3 Translation Modes: Fluent (parallel), Area (overlay), Full Screen (smart detection) • Speaker detection with color-coding • Glint AI custom terminology control • Game-based profile system • Advanced...

1 Review

Downloads: 31 This Week

Last Update: 23 hours ago
See Project
12

EmotiVoice

Multi-Voice and Prompt-Controlled TTS Engine

EmotiVoice is a multi-voice, prompt-controlled text-to-speech engine designed to generate highly expressive speech across thousands of voices. It supports both English and Chinese and ships with over 2,000 preset voices, making it suitable for everything from characters and virtual anchors to narration and dialogue. The core idea is prompt-based emotional and style control: you can ask the engine to speak “happy,” “sad,” “excited,” or with other high-level style prompts that shape prosody,...

Downloads: 2 This Week

Last Update: 2025-11-30
See Project
13

SAGA GIS

SAGA - System for Automated Geoscientific Analyses - is a Geographic Information System (GIS) software with immense capabilities for geodata processing and analysis. SAGA is programmed in the object oriented C++ language and supports the implementation of new functions with a very effective Application Programming Interface (API). Functions are organised as modules in framework independent Module Libraries and can be accessed via SAGA’s Graphical User Interface (GUI) or various scripting...

42 Reviews

Downloads: 8,072 This Week

Last Update: 8 hours ago
See Project
14

ekho

Chinese text-to-speech engine

ekho is a project with relatively sparse documentation, but from the repository it appears to be a small-scale tool for audio processing and playback, possibly with features for speech synthesis or manipulation. The repo includes scripts and configuration files suggesting interactions with media/audio handling libraries. Because of limited README detail, it seems targeted at users comfortable reading and modifying code, rather than end users expecting polished UIs. ...

Downloads: 3 This Week

Last Update: 2025-11-28
See Project
15

mfbdjvu

Easy converting pgm and ppm to (MASK+FG+BG)-djvu.

MFBdjvu is a simple project for easy converting pgm and ppm to (MASK+FG+BG)-djvu. It uses djvulibre for all technichal work and compression. The breakdown of the image into components is done using DjVuL and DjVuL wiki. MFBdjvu based of simpledjvu.

Downloads: 0 This Week

Last Update: 2023-10-06
See Project
16

Amiga Memories

A walk along memory lane

Amiga Memories is a project (started & released in 2013) that aims to make video programmes that can be published on the internet. The images and sound produced by Amiga Memories are 100% automatically generated. The generator itself is implemented in Squirrel, the 3D rendering is done on GameStart 3D. An Amiga Memories video is mostly based on a narrative.

Downloads: 0 This Week

Last Update: 2023-03-22
See Project
17

Real-ESRGAN

Real-ESRGAN aims at developing Practical Algorithms

Real-ESRGAN is a highly popular open-source project that provides practical algorithms for general image and video restoration using deep learning-based super-resolution techniques. It extends the original Enhanced Super-Resolution Generative Adversarial Network (ESRGAN) approach by training on synthetic degradations to make results more robust on real-world images, effectively enhancing resolution, reducing noise/artifacts, and reconstructing fine detail in low-quality imagery. The...

Downloads: 220 This Week

Last Update: 2025-12-11
See Project
18

MaryTTS

An open-source, multilingual text-to-speech synthesis system

MaryTTS is an open-source, multilingual Text-to-Speech Synthesis platform written in Java. It was originally developed as a collaborative project of DFKI’s Language Technology Lab and the Institute of Phonetics at Saarland University. It is now maintained by the Multimodal Speech Processing Group in the Cluster of Excellence MMCI and DFKI. As of version 5.2, MaryTTS supports German, British and American English, French, Italian, Luxembourgish, Russian, Swedish, Telugu, and Turkish; more languages are in preparation. ...

Downloads: 15 This Week

Last Update: 2023-08-11
See Project
19

NaveGo

NaveGo: an open source MATLAB/GNU Octave toolbox for processing integr

NaveGo is an open source MATLAB/GNU Octave toolbox designed for processing integrated navigation systems, simulating inertial sensors and GNSS receivers, and profiling inertial sensors using methods like Allan variance—providing a community-driven simulation framework for navigation system design and analysis. I am reaching out to share an important update regarding the NaveGo project. Due to a shift in both my professional career and personal interests away from navigation systems, I have...

Downloads: 0 This Week

Last Update: 2025-09-08
See Project
20

ReViSP

ReViSP, a 3D volume rendering MATLAB tool for multicellular spheroids

Reconstruction and Visualization from a Single Projection (ReViSP) tool: a 3D volume rendering method we developed to reconstruct the 3D shape of multicellular spheroids, besides estimating the volume by counting the voxels (3D pixels) fully included in the 3D surface. ReViSP is written in MATLAB (The MathWorks, Inc., Massachusetts, USA) and the source code is freely provided. Requirements for running ReViSP from the source code: MATLAB 2020a and Image Processing Toolbox 11.1 or later versions. Please, when using this software, cite these articles: (a) F. Piccinini, et al., Cancer multicellular spheroids: Volume assessment from a single 2D projection. ...

Downloads: 2 This Week

Last Update: 2022-05-19
See Project
21

SVoice (Speech Voice Separation)

We provide a PyTorch implementation of the paper Voice Separation

SVoice is a PyTorch-based implementation of Facebook Research’s study on speaker voice separation as described in the paper “Voice Separation with an Unknown Number of Multiple Speakers.” This project presents a deep learning framework capable of separating mixed audio sequences where several people speak simultaneously, without prior knowledge of how many speakers are present. The model employs gated neural networks with recurrent processing blocks that disentangle voices over multiple...

Downloads: 0 This Week

Last Update: 4 days ago
See Project
22

Music Source Separation

Separate audio recordings into individual sources

Music Source Separation is a PyTorch-based open-source implementation for the task of separating a music (or audio) recording into its constituent sources — for example isolating vocals, instruments, bass, accompaniment, or background from a mixed track. It aims to give users the ability to take any existing song and decompose it into separate stems (vocals, accompaniment, etc.), or to train custom separation models on their own datasets (e.g. for speech enhancement, instrument isolation, or...

Downloads: 2 This Week

Last Update: 2025-12-02
See Project
23

CNN for Image Retrieval

cnn-for-image-retrieval is a research-oriented project that demonstrates the use of convolutional neural networks (CNNs) for image retrieval tasks. The repository provides implementations of CNN-based methods to extract feature representations from images and use them for similarity-based retrieval. It focuses on applying deep learning techniques to improve upon traditional handcrafted descriptors by learning features directly from data. The code includes training and evaluation scripts that...

Downloads: 0 This Week

Last Update: 3 days ago
See Project
24

Makani

Makani was developed a commercial-scale airborne wind turbine

Makani was an ambitious Google X project that sought to harness wind energy using airborne wind turbines — autonomous kites capable of generating power while flying in crosswind patterns. This open-source repository contains the complete software stack that powered Makani’s research and flight systems, including the flight simulator, autopilot controller, avionics firmware, visualization tools, and ground control software. The software enables simulation, control, and analysis of the Makani...

Downloads: 0 This Week

Last Update: 2 days ago
See Project
25

zOPT Tomography reconstruction

This project contains an open-source implementation of an OPT platform. The 3D-printable components are provided to built your customized sample stage. We offer our open source MATLAB software with an GUI for OPT imaging. We developed a generalized automated workflow including a two-step registration approach for correcting the center of rotation and provide accurate and high-quality 3D reconstruction. *Here we provide the raw tomography videos recorded using the zOPT hardware. *For...

Downloads: 1 This Week

Last Update: 2020-05-18
See Project