Showing 48 open source projects for "input-output model"

View related business solutions
  • A privacy-first API that predicts global consumer preferences Icon
    A privacy-first API that predicts global consumer preferences

    Qloo AI adds value to a wide range of Fortune 500 companies in the media, technology, CPG, hospitality, and automotive sectors.

    Through our API, we provide contextualized personalization and insights based on a deep understanding of consumer behavior and more than 575 million people, places, and things.
    Learn More
  • Atera IT Autopilot Icon
    Atera IT Autopilot

    Ensure operational efficiency at any scale with 24/7 autonomous IT support.

    IT Autopilot takes the pressure off your team by handling first-tier support across the channels your end users already live in — email, chat, Slack, Teams, and your Customer Portal. It doesn’t just respond to end-user queries, issues, and crises — it solves them.
    Learn More
  • 1
    PersonaPlex

    PersonaPlex

    PersonaPlex code

    ...PersonaPlex also supports persona and voice control, allowing developers to define the role and speaking style of the agent using text prompts and voice conditioning, making it suitable for applications like customized voice assistants, interactive character agents, or domain-specific conversational tools. Internally, it processes continuous audio streams in a hybrid input format so that speech understanding and generation occur jointly.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 2
    PersonaLive

    PersonaLive

    Expressive Portrait Image Animation for Live Streaming

    PersonaLive is an open-source diffusion-based portrait animation framework focused on generating expressive, long-duration animated sequences in real time, primarily for live streaming or interactive applications. It leverages deep generative models that condition on a static reference image and a driving input (such as motion or expression cues) to produce a seamless animated portrait sequence that can run indefinitely without segmentation artifacts. The framework prioritizes low-latency...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Moshi

    Moshi

    A speech-text foundation model for real time dialogue

    ...Moshi models two streams of audio: one corresponds to Moshi, and the other one to the user. At inference, the stream from the user is taken from the audio input, and the one for Moshi is sampled from the model's output. Along these two audio streams, Moshi predicts text tokens corresponding to its own speech, its inner monologue, which greatly improves the quality of its generation. A small Depth Transformer models inter codebook dependencies for a given time step, while a large, 7B parameter Temporal Transformer models the temporal dependencies.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 4
    Moonshine Voice

    Moonshine Voice

    Fast and accurate automatic speech recognition (ASR) for edge devices

    ...The project is designed to enable real-time voice applications such as live transcription, voice commands, and embedded speech interfaces without requiring heavy cloud infrastructure. Its architecture emphasizes low latency and flexible input handling, allowing audio streams of varying durations rather than relying on fixed processing windows. Moonshine supports multiple platforms including mobile, desktop, and embedded systems, and provides example projects to accelerate integration into real-world products. The toolkit also includes specialized model variants, including monolingual options that improve accuracy for specific languages. ...
    Downloads: 16 This Week
    Last Update:
    See Project
  • Tremendous is the global payouts platform for businesses sending gift cards and money at scale. Icon
    Tremendous is the global payouts platform for businesses sending gift cards and money at scale.

    Getting started is simple: add a funding method and place your first order in minutes.

    Trusted by 20,000+ leading organizations, Tremendous has delivered billions of rewards and enables businesses to reach recipients across 230+ countries and regions. Recipients have 2,500+ payout options to choose from, including gift cards, prepaid cards, cash transfers, and charitable donations.
    Learn More
  • 5
    TRELLIS 2

    TRELLIS 2

    Native and Compact Structured Latents for 3D Generation

    TRELLIS.2 is a cutting-edge open-source model and codebase for high-fidelity 3D asset generation from 2D images, developed to push forward the state of the art in image-to-3D generation. At its core is a novel sparse voxel structure called O-Voxel that jointly encodes both geometry and surface appearance, enabling reconstruction and generation of complex 3D shapes with arbitrary topology, open surfaces, and physically based rendering (PBR) textures. The system leverages a large...
    Downloads: 61 This Week
    Last Update:
    See Project
  • 6
    Real-ESRGAN GUI

    Real-ESRGAN GUI

    Cross-platform GUI for image upscaler Real-ESRGAN

    ...According to actual measurements, arm64the single-architecture performance is better than universal2the dual- architecture Mac on the Apple chip, so Apple chip users are advised to pack arm64single-architecture applications by themselves. Real-ESRGAN can only enlarge the input image with a fixed 2-4x magnification (related to the selected model). This functionality is achieved by downsampling using a conventional scaling algorithm after multiple calls to Real-ESRGAN. Split each frame of the GIF and record the duration, zoom in one by one and then merge. Drag an image file or directory to any position in the window, and its path can be automatically set as the input.
    Downloads: 119 This Week
    Last Update:
    See Project
  • 7
    CadQuery

    CadQuery

    A python parametric CAD scripting framework based on OCCT

    ...Build models with scripts that are as close as possible to how you’d describe the object to a human, using a standard, already established programming language. Create parametric models that can be very easily customized by end users. Output high-quality CAD formats like STEP and AMF in addition to traditional STL. Provide a non-proprietary, plain text model format that can be edited and executed with only a web browser. The scripts use a standard programming language, Python, and thus can benefit from the associated infrastructure. This includes many standard libraries and IDEs. ...
    Downloads: 52 This Week
    Last Update:
    See Project
  • 8
    Mesh R-CNN

    Mesh R-CNN

    code for Mesh R-CNN, ICCV 2019

    ...The system combines 2D detection from Mask R-CNN with 3D reasoning modules that output full mesh reconstructions aligned with the input image. It has been evaluated on datasets such as Pix3D, where it demonstrates state-of-the-art performance in reconstructing real-world object geometry.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Paint.NET

    Paint.NET

    Downloads for Paint.NET, such as installer EXEs and portable ZIPs

    ...Starting the app is nearly instantaneous, and every feature has been thoroughly optimized to take advantage of the latest multicore CPUs, GPUs, and NVMe SSDs. The use of DXGI Flip Model ensures low input latency and reduced power consumption. Whether you have a power-conscious laptop or a monstrous desktop with a gigantic GPU, you can expect it to start up immediately, respond quickly to every mouse click, and take full advantage of all of your hardware.
    Downloads: 118 This Week
    Last Update:
    See Project
  • Software for managing apps and accounts | WebCatalog Icon
    Software for managing apps and accounts | WebCatalog

    Tired of juggling countless browser tabs? WebCatalog Desktop turns your favorite web apps into dedicated desktop apps

    Turn websites into desktop apps with WebCatalog Desktop—your all-in-one tool to manage apps and accounts. Switch between multiple accounts, organize apps by workflow, and access a curated catalog of desktop apps for Mac and Windows.
    Learn More
  • 10
    MediaPipe

    MediaPipe

    Cross-platform, customizable ML solutions for live and streaming media

    ...MediaPipe Face Mesh is a face geometry solution that estimates 468 3D face landmarks in real-time even on mobile devices. It employs machine learning (ML) to infer the 3D surface geometry, requiring only a single camera input without the need for a dedicated depth sensor. Utilizing lightweight model architectures together with GPU acceleration throughout the pipeline, the solution delivers real-time performance-critical for live experiences. Human pose estimation from video plays a critical role in various applications such as quantifying physical exercises, sign language recognition, and full-body gesture control. ...
    Downloads: 70 This Week
    Last Update:
    See Project
  • 11
    screenshot-to-code

    screenshot-to-code

    Drop in a screenshot and convert it to clean code

    screenshot-to-code converts UI screenshots or design images into working front-end code, accelerating the path from concept to prototype. It uses modern vision-capable or code-generating models to infer layout structure, typography, and components, then outputs clean HTML/CSS (often Tailwind) or framework code. A web interface lets you upload images, tune options, and preview generated results, while a backend service orchestrates the model calls and post-processing. The tool focuses on...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 12
    Open Asset Import Library

    Open Asset Import Library

    The open-asset-importer-library repository

    ...Written in C++, it is available under a liberal BSD license. There is a C API as well as bindings to various other languages, including C#, Python and D. Assimp loads all input model formats into one straightforward data structure for further processing.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 13
    GoJS

    GoJS

    JavaScript diagramming library for interactive flowcharts & org charts

    Build interactive flowcharts or flow diagrams. Let your users build, modify, and save diagrams with JSON model output. Visualize state charts and other behavior diagrams. Create diagrams with live updates to monitor state, or interactive diagrams for planning. GoJS allows considerable customization of links and nodes to build all kinds of diagrams. Visualize flow, or connect pipes. Create genogram and medical diagrams, or editable family trees with collapsible levels. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 14
    Segmentation Models

    Segmentation Models

    Segmentation models with pretrained backbones. PyTorch

    ...Preparing your data the same way as during weights pre-training may give you better results (higher metric score and faster convergence). It is not necessary in case you train the whole model, not only the decoder. Pytorch Image Models (a.k.a. timm) has a lot of pretrained models and interface which allows using these models as encoders in smp, however, not all models are supported. Input channels parameter allows you to create models, which process tensors with an arbitrary number of channels.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    VCClient

    VCClient

    Software that uses AI to perform real-time voice conversion

    VCClient is a real-time voice conversion system that uses machine learning models to transform a speaker’s voice into another voice with minimal latency. It is designed for live applications such as streaming, gaming, and virtual communication, where immediate feedback is essential. The system supports multiple voice conversion models, including RVC and other neural network-based approaches, allowing users to switch between different voices or customize their output. It provides both a...
    Downloads: 20 This Week
    Last Update:
    See Project
  • 16
    lcd-image-converter

    lcd-image-converter

    Tool to create bitmaps and fonts for embedded applications.

    This program allows you to create bitmaps and fonts, and transform them to "C" source format for embedded applications. The transformation of the images to the source code is made by using templates. Therefore, by modifying the templates, you can change the format of the output within certain limits.
    Leader badge
    Downloads: 461 This Week
    Last Update:
    See Project
  • 17
    MLT Multimedia Framework
    A multimedia authoring and processing framework and a video playout server for television broadcasting.
    Downloads: 12 This Week
    Last Update:
    See Project
  • 18
    Competent Audio

    Competent Audio

    Machine graph audio engine for computer games

    ...It is written in C, but is designed for interoperability with other languages. Windows and Linux binaries for x86 and amd64 are available. CA uses a machine graph model with support for arbitrary numbers of machines, limited only by the available system resources: - Samplers play back audio clips. - Mixers combine audio signals and optionally perform signal processing. - Sinks send audio signals to an output device. Stereo and mono sound output is supported via a slightly customized version of libsoundio 2.0. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Warlock-Studio

    Warlock-Studio

    AI Suite for upscaling, interpolating & restoring images/videos

    v6.0. Warlock-Studio is a Windows application that uses Real-ESRGAN, BSRGAN, IRCNN, GFPGAN, RealESRNet, RealESRAnime and RIFE Artificial Intelligence models to upscale, restore faces, interpolate frames and reduce noise in images and videos. the application supports GPU acceleration (including multi-GPU setups) and offers batch processing for large workloads. It includes drag-and-drop handling for single or multiple files, optional pre-resize functions, and an automatic tiling system...
    Downloads: 27 This Week
    Last Update:
    See Project
  • 20
    Kisekae UltraKiss

    Kisekae UltraKiss

    Kisekae UltraKiss is a full featured integrated development environmen

    UltraKiss is a computer program that implements the Kisekae Set system, KiSS, a Japanese graphics system originally developed to facilitate costume changes on virtual dolls. UltraKiss was developed to help artists build their KiSS sets. It is a full featured viewer for all KiSS dolls, games, and visual applications. It is also a complete graphical development environment for creating KiSS applications. It fully implements the FKiSS event driven programming language up to and including...
    Downloads: 16 This Week
    Last Update:
    See Project
  • 21
    Stable Diffusion Rembg

    Stable Diffusion Rembg

    Removes backgrounds from pictures. Extension for webui

    This project is an extension for the Stable Diffusion Web UI that removes backgrounds from images directly inside the interface. It wraps popular background-removal models so creators can take a generated or uploaded image and isolate the subject with a single click. The workflow is designed to be non-destructive: you can preview, tweak thresholds, and export either a transparent PNG or a masked layer for further editing. Because it runs within the Web UI, you can chain it with other...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    SuperImage

    SuperImage

    Sharpen your low-resolution pictures with the power of AI upscaling

    ...By leveraging the power of your device's GPU, SuperImage is able to upscale and restore the details of your images without uploading them to the internet, keeping your data secure. SuperImage is a neural network-based image upscaling application for Android built with the MNN deep learning framework and Real-ESRGAN. The input image is processed in tiles on the device GPU, using a pre-trained Real-ESRGAN model. The tiles are then merged into the final high-resolution image. This application requires Vulkan or OpenCL support and Android 7 or above.
    Downloads: 35 This Week
    Last Update:
    See Project
  • 23

    abcCairo

    Extend abcm2ps to support direct generation of PNG, SVG and PDF files

    ...The Cairo graphics library is an open-source graphics library offering a similar 2D graphics model to PostScript. The Cairo library can write to a GTK canvas, allowing integration with programs using the GTK+ toolkit, or it can write to an image file in a choice of formats: PNG, SVG or PDF. This means that other programs can have access to the abcm2ps music-rendering capability without having to incorporate a PostScript interpreter.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    Modeling software including Archimate
    Openmodeling has currently 5 modules available: • Application portfolio management • ArchiMate 3.2 architecture (including The Open Group Model Exchange) • Demo conceptual modeling, both versions 3 and 4 • Efqm quality assessment • Business and system modeling Openmodeling facilitates modeling online controlled by a web interface using SVG. A wide range of modeling techniques is available, including Demo and Archimate diagrams. All functionality is web-based, including administrator...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 25
    TinyEngine

    TinyEngine

    Tiny OpenGL Wrapper / 3D Engine in C++

    TinyEngine is a small, self-contained OpenGL-based 3D engine written in C, designed for learning and prototyping graphics applications. It packs a lot of functionality into a compact codebase, making it ideal for exploring 3D rendering, game development, and shader programming without the overhead of larger engines. Despite its size, TinyEngine supports modern OpenGL features, texture mapping, lighting, and audio, providing a practical foundation for building lightweight 3D experiences.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB