Showing 250 open source projects for "dna sequence analysis"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 1
    Discourse Network Analyzer (DNA)

    Discourse Network Analyzer (DNA)

    Discourse Network Analyzer (DNA)

    The Java software Discourse Network Analyzer (DNA) is a qualitative content analysis tool with network export facilities. You import text files and annotate statements that persons or organizations make, and the program will return network matrices of actors connected by shared concepts.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 2

    proSeq

    ProSeq is a GUI program to edit and analyse DNA polymorphism datasets

    ProSeq ('Processor of sequences') is a package including GUI and command line programs to process and analyse DNA polymorphism data. It allows one to open and save sequence data in over a dozen file formats (and convert between these formats). It shows sequence alignments in a graphical window and allows the user to edit, manipulate and analyse sequences in various ways. ProSeq4 includes tools to analyse DNA polymorphism, scan for selection, do PCA and reconstruct phylogenies from sequence or SNP data. ...
    Downloads: 16 This Week
    Last Update:
    See Project
  • 3
    Plaso

    Plaso

    Super timeline all the things

    Plaso (Plaso Langar Að Safna Öllu), or "super timeline all the things," is a Python-based engine designed for automatic creation of timelines in digital forensic investigations. It processes various log files and artifacts to generate a chronological sequence of events, aiding analysts in understanding system activities.​
    Downloads: 12 This Week
    Last Update:
    See Project
  • 4
    Bowtie, an ultrafast, memory-efficient short read aligner for short DNA sequences (reads) from next-gen sequencers. Please cite: Langmead B, et al. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol 10:R25.
    Leader badge
    Downloads: 4,525 This Week
    Last Update:
    See Project
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 5
    ReachabilityAnalysis.jl

    ReachabilityAnalysis.jl

    Compute reachable states of dynamical systems

    Reachability analysis is concerned with computing rigorous approximations of the set of states reachable by a dynamical system. In the scope of this package are systems modeled by continuous or hybrid dynamical systems, where the dynamics change with discrete events. Systems are modeled by ordinary differential equations (ODEs) or semi-discrete partial differential equations (PDEs), with uncertain initial states, uncertain parameters or non-deterministic inputs.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6

    CpGtools

    Python package to analyze DNA methylation data

    CpGtools package provides a number of Python programs to annotate, QC, visualize, and analyze DNA methylation data generated from Illumina HumanMethylation450 BeadChip (450K) / MethylationEPIC BeadChip (850K) array or RRBS / WGBS.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Profile Data

    Profile Data

    Analyze computation-communication overlap in V3/R1

    profile-data is a repository that publishes profiling traces and metrics from DeepSeek’s training and inference infrastructure (especially during DeepSeek-V3 / R1 experiments). The profiling data targets insights into computation-communication overlap, pipeline scheduling (e.g. DualPipe), and how MoE / EP / parallelism strategies interact in real systems. The repository contains JSON trace files like train.json, prefill.json, decode.json, and associated assets. Users can load them into tools...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8

    AQUAD

    Software for Analysis of Qualitative Data

    AQUAD Eight is available in English, German, and Spanish with separate modules for the analysis of texts, audios, videos, graphic files, and a complementary module for exploratory statistical analysis with R. The modules follow the coding paradigm of qualitative analysis, offering for text analysis also functions for sequence analysis (Objective Hermeneutics) and word based quantitative analysis.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 9
    Python Zero to Hero for DevOps Engineers

    Python Zero to Hero for DevOps Engineers

    Learn Python from DevOps Engineer point of you

    Python Zero to Hero for DevOps Engineers is a structured “Python Zero to Hero for DevOps Engineers” course laid out as a day-by-day learning path. The repository is organized into Day-01 through Day-19 folders plus a small sample app, which makes it very easy to follow in sequence like a bootcamp. The curriculum starts with Python installation, environment setup, and writing your first script, then quickly moves into data types, strings, regular expressions, variables, and functions. It places a strong emphasis on DevOps-specific use cases: environment variables, command-line arguments, configuration handling, and automating log analysis or user management tasks are all explicitly woven into the exercises. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 10

    janda

    structural variant detector

    Janda is a tool to detect structural variants in whole-genome DNA sequence data. It identifies structural variants (deletions, duplications, translocations, and inversions) using anomalously mapped pair reads and realigning potential junction reads.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 11
    ArchCUT3-D

    ArchCUT3-D

    ArchCUT3-D: 3-D Analysis of Incised Archeological Surfaces

    Software developed to precisely evaluate the 3-D micromorphological characteristics of archaeological surface incisions. Analysis based on a continuous 3-D slice sequence from a chosen range within the incision path. Measurements and shapes of individual slices can be extracted. ArchCUT3-D’s interface is designed to provide a computational and mathematical analysis of engravings to a wide audience. Please cite Dubinsky, L., David, M., & Grosman, L. (2023). ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 12

    popoolation

    estimate natural variation and positive selection

    ...One of the main challenges in population genomics is to identify regions of intererest on a genome wide scale. We believe that PoPoolation will greatly aid this task by allowing a fast and user friendly analysis of NGS data from DNA pools. Documentation: https://sourceforge.net/p/popoolation/wiki/Main/
    Downloads: 16 This Week
    Last Update:
    See Project
  • 13

    UniversalMer2

    A CLI k-mer counting tool for multiple size of k at once.

    UniversalMer is a k-mer counting tool for multiple size of k at once. It is available for DNA, RNA, and protein sequences. The program counts and summarizes the exact frequency of all k-mers from 1-mer to a user-defined maximum length (kmax). This kmax can be specified as any length or can be automatically determined by the longest repeated patterns found in the input sequence.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    relax

    relax

    Molecular dynamics by NMR data analysis

    The software package 'relax' is designed for the study of molecular dynamics through the analysis of experimental NMR data. Organic molecules, proteins, RNA, DNA, sugars, and other biomolecules are all supported. It supports exponential curve fitting for the calculation of the R1 and R2 relaxation rates, calculation of the NOE, reduced spectral density mapping, the Lipari and Szabo model-free analysis, study of domain motions via the N-state model and frame order dynamics theories using anisotropic NMR parameters such as RDCs and PCSs, the investigation of stereochemistry in dynamic ensembles, and the analysis of relaxation dispersion data.
    Leader badge
    Downloads: 10 This Week
    Last Update:
    See Project
  • 15

    MetaEntropy

    R package for Virome Mutational Analysis

    Estimates Shannon entropy, per gene and per genomic position, associated with non-synonymous mutation frequencies in viral populations, such as wastewater samples. The package uses codon translations for functional insights. Each amino acid can be treated as an individual state, resulting in a 20-state entropy computation, or grouped into one of six physicochemical classes, adding further functional context. Provides normalized values (0-1 scale) to facilitate the direct comparison of...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    NGSEP

    NGSEP

    NGSEP (Next Generation Sequencing Experience Platform)

    NGSEP is an integrated framework for analysis of short and long DNA high throughput sequencing reads. The current version provides functionalities for both de-novo and reference guided analysis of sequencing data, including genome assembly, read mapping, variants detection and genotyping and de-novo analysis of data generated from reduced representation protocols. NGSEP also provides modules for analysis of genomic variation databases (VCF files), including functional annotation, filtering, format conversion, comparison, clustering, imputation, introgression analysis and different kinds of statistics. ...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 17

    RABV-GUI

    Interface for rabies virus sequence analysis and Genbank submission

    RABV-GUI is a graphical user interface designed for running RABV-GLUE, specifically to facilitate GenBank submissions for rabies sequences. In addition to supporting GenBank submissions, RABV-GUI has been updated with the latest rabies sequences and can perform database queries similar to those available on the RABV-GLUE website (http://rabv-glue.cvr.gla.ac.uk/).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    PFAAT is a Java-based multiple sequence alignment editor and viewer designed for protein family analysis. You can download PFAAT from http://pfaat.sourceforge.net/
    Downloads: 6 This Week
    Last Update:
    See Project
  • 20
    # Digital Visual Computer (DVC) The Digital Visual Computer (DVC) is an experimental computing platform where programs and data are represented visually as images. This project contains the specifications, tools, and examples for the DVC ecosystem. ## Introduction DVC explores the concept of screen-to-screen computation. Instead of text-based code, DVC uses a "color language" where sequences of colors represent instructions (opcodes) and data. The state of the computer's memory is...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    gendasm

    gendasm

    Generic Code-Seeking Diassembler with Fuzzy-Function Analyzer

    ...It allows you to enter known starting vectors for a given code image for the micro. It will disassemble the code and follow through branches to assist in the separation of code and data. Its included Fuzzy Function Analyzer companion uses a DNA Sequence Alignment Algorithm to locate similar code in multiple binaries to facilitate reverse-engineering and/or code recovery. The original purpose of the Fuzzy Function Analyzer was to assist in code recovery where the source code for the current binaries got lost, yet the source code for an old binary was retained. The Fuzzy Function Analyzer allows you to match up known functions between the two binaries so you can concentrate on disassembling and reverse engineering the parts that are different and recover the code for the current binary that got lost.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    kSNP

    kSNP4 does SNP discovery and SNP annotation from whole genomes

    kSNP4 identifies the pan-genome SNPs in a set of genome sequences, and estimates phylogenetic trees based upon those SNPs. SNP discovery is based on k-mer analysis, and requires no multiple sequence alignment or the selection of a reference genome, so kSNP4 can take 100's of microbial genomes as input. A SNP locus is defined by an oligo of length k surrounding a central SNP allele. kSNP4 can analyze both complete (finished) genomes and unfinished genomes in assembled contigs or raw, unassembled reads. ...
    Leader badge
    Downloads: 24 This Week
    Last Update:
    See Project
  • 23
    FastQC

    FastQC

    A quality control analysis tool for high throughput sequencing data

    FastQC is a quality control analysis tool designed to spot potential problems in high throughput sequencing datasets. Its goal is to provide a simple way by which to check the quality of raw sequence data coming from high throughput sequencing pipelines. It does this by running a modular set of analyses on one or more raw sequence files in fastq or bam format. It then produces a report summarizing the results, and highlighting any areas where the library may appear unusual. ...
    Downloads: 36 This Week
    Last Update:
    See Project
  • 24
    methylr

    methylr

    a single shiny solution from sequencer data to pathway analysis

    Here we introduce methylR, a complete pipeline for the analysis of both 450K and EPIC Illumina arrays which not only offers data visualization and normalization but also provide additional features such as the annotation of the genomic features resulting from the analysis, pairwise comparisons of DMCs with different graphical representation plus functional and pathway enrichment as downstream analysis, all packed in a minimal, elegant and intuitive graphical user interface which brings the analysis of array DNA methylation data.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 25
    quantitative

    quantitative

    Quantized transactions python3

    ...As an open-source educational resource, it’s designed for Python users interested in automatic trading, algorithmic strategies, and financial data analysis.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB