Search Results for "faculty evaluation system"

Showing 319 open source projects for "faculty evaluation system"

1

Quantitative Trading System

A comprehensive quantitative trading system with AI-powered analysis

Quantitative Trading System is a comprehensive quantitative trading platform that integrates artificial intelligence, financial data analysis, and automated strategy execution within a unified software system. The project is designed to provide an end-to-end infrastructure for building and operating algorithmic trading strategies in financial markets. It includes tools for collecting and processing market data from multiple sources, performing statistical and machine learning analysis, and...

Downloads: 1 This Week

Last Update: 2026-03-12
See Project
2

Gorse Recommender System Engine

An open source recommender system service written in Go

An open-source recommender system service written in Go. Recommend items from popular, latest, user-based, item-based, and collaborative filtering. Search for the best recommendation model automatically in the background. Support horizontal scaling in the recommendation stage after single node training. Support Redis, MySQL, Postgres, MongoDB, and ClickHouse as its storage backend. Expose RESTful APIs for data CRUD and recommendation requests. Analyze online recommendation performance from...

Downloads: 5 This Week

Last Update: 1 day ago
See Project
3

VLMEvalKit

Open-source evaluation toolkit of large multi-modality models (LMMs)

VLMEvalKit is an open-source evaluation toolkit designed for benchmarking large vision-language models that combine visual understanding with natural language reasoning. The toolkit provides a unified framework that allows researchers and developers to evaluate multimodal models across a wide range of datasets and standardized benchmarks with minimal setup. Instead of requiring complex data preparation pipelines or multiple repositories for each benchmark, the system enables evaluation through simple commands that automatically handle dataset loading, model inference, and metric computation. ...

Downloads: 0 This Week

Last Update: 2026-03-05
See Project
4

Typst

A new markup-based typesetting system that is powerful and easy

...Whether in the classroom, the faculty office, or at home. Typst runs in your browser, so everyone on the team can just start writing.

Downloads: 9 This Week

Last Update: 2025-12-12
See Project
5

Easy DataSet

A powerful tool for creating datasets for LLM fine-tuning

...The system includes automated question-generation capabilities, hierarchical label trees, and answer generation pipelines that use LLM APIs to produce coherent paired data with customizable templates. Beyond dataset creation, Easy-dataset also provides a built-in evaluation system with model testing and blind-test features, helping teams validate model performance using curated test sets.

Downloads: 12 This Week

Last Update: 2026-04-10
See Project
6

FIT Framework

An enterprise-level AI development framework

FIT Framework is an open-source infrastructure designed to support the development, training, and evaluation of machine learning and AI models through a modular and scalable architecture. It aims to streamline the lifecycle of AI systems by providing standardized components for data processing, model training, evaluation, and deployment. The framework is particularly useful for research and production environments where reproducibility and consistency are critical, as it enforces structured workflows and configurable pipelines. ...

Downloads: 7 This Week

Last Update: 2026-03-19
See Project
7

Hallucination Leaderboard

Leaderboard Comparing LLM Performance at Producing Hallucinations

...By focusing on hallucination rates rather than traditional metrics such as accuracy or fluency, the benchmark highlights an important aspect of AI system safety and trustworthiness. The leaderboard is regularly updated as new models are released and evaluation methods evolve.

Downloads: 0 This Week

Last Update: 2026-03-20
See Project
8

Opik

Debug, evaluate, and monitor your LLM apps, RAG systems, and agentic AI

Confidently evaluate, test, and monitor LLM applications. Opik is an open-source platform for evaluating, testing, and monitoring LLM applications. Built by Comet. Record, sort, search, and understand each step your LLM app takes to generate a response. Manually annotate, view, and compare LLM responses in a user-friendly table. Log traces during development and in production. Run experiments with different prompts and evaluate against a test set. Choose and run pre-configured evaluation...

Downloads: 5 This Week

Last Update: 10 hours ago
See Project
9

i-Educar

Launching the largest free educational software in Brazil

Accessible from anywhere and with single student registration available for the entire education network. Time-saving for everyone. Get current quantitative, financial and statistical data on all processes, at the time and place you want. Evaluation system and reports adapted to the different realities of the country, with numerical, conceptual or descriptive evaluation notes. Management of allocations, removals, substitutions, absences and delays, offering an integrated view of all professionals. Time frame management for analysis of demands and availability of professionals in the education network in each school period. ...

Downloads: 8 This Week

Last Update: 2025-07-01
See Project
10

Agent Behavior Monitoring

The open source post-building layer for agents

Agent Behavior Monitoring is an open-source framework designed to monitor, evaluate, and improve the behavior of AI agents operating in real or simulated environments. The system focuses on agent behavior monitoring by collecting interaction data and analyzing how agents perform across different scenarios and tasks. Developers can use the framework to observe agent actions in both online production environments and offline evaluation settings, making it useful for debugging and performance analysis. Judgeval transforms agent interaction trajectories into structured evaluation datasets that can be used for reinforcement learning, supervised fine-tuning, or other forms of post-training improvement. ...

Downloads: 4 This Week

Last Update: 2026-04-09
See Project
11

RecBole

A unified, comprehensive and efficient recommendation library

...We implement more than 100 commonly used recommendation algorithms and provide formatted copies of 28 recommendation datasets. We support a series of widely adopted evaluation protocols or settings for testing and comparing recommendation algorithms. RecBole is developed based on Python and PyTorch for reproducing and developing recommendation algorithms in a unified, comprehensive and efficient framework for research purposes. It can be installed from pip, conda and source, and is easy to use. We have implemented more than 100 recommender system models, covering four common recommender system categories in RecBole and eight toolkits of RecBole2.0, including General Recommendation, Sequential Recommendation, Context-aware Recommendation, and Knowledge-based Recommendation and sub-packages.

Downloads: 2 This Week

Last Update: 2025-02-23
See Project
12

DeepSeek-OCR 2

Visual Causal Flow

...The repository provides model code and inference scripts that let researchers and developers run and benchmark the system on both images and PDFs, with support for batch evaluation and optimized pipelines leveraging vLLM and transformers.

Downloads: 8 This Week

Last Update: 2026-02-03
See Project
13

Rogue

AI Agent Evaluator & Red Team Platform

Rogue is an open-source evaluation and red-team framework designed to test the reliability, safety, and policy compliance of AI agents. The platform automatically interacts with an AI agent by generating dynamic scenarios and multi-turn conversations that simulate real-world interactions. Instead of relying solely on static test scripts, Rogue uses an agent-as-a-judge architecture where one agent probes another agent to detect failures or unexpected behaviors.

Downloads: 17 This Week

Last Update: 2026-03-17
See Project
14

GrowthBook

Open source feature flagging and AB testing platform

GrowthBook is an open-source platform for feature flagging and AB testing built to give teams the power of a fully-featured experimentation system without building it entirely from scratch. It supports both self-hosted and cloud-hosted deployment models, giving organizations the flexibility to own their infrastructure or consume it as a managed service. The platform is designed for performance and scale: its SDKs are lightweight, supporting local evaluation to minimize latency, and it integrates deeply with existing data stacks so you can use your warehouse or analytics system as the source of truth. ...

Downloads: 5 This Week

Last Update: 2026-02-04
See Project
15
DeepSeek Math

Pushing the Limits of Mathematical Reasoning in Open Language Models

DeepSeek-Math is DeepSeek’s specialized model (or dataset + evaluation) focusing on mathematical reasoning, symbolic manipulation, proof steps, and advanced quantitative problem solving. The repository is likely to include fine-tuning routines or task datasets (e.g. MATH, GSM8K, ARB), demonstration notebooks, prompt templates, and evaluation results on math benchmarks. The goal is to push DeepSeek’s performance in domains that require rigorous symbolic steps, calculus, linear algebra, number theory, or multi-step derivations. ...

Downloads: 2 This Week

Last Update: 2025-10-03
See Project
16

autoresearch for AMD

AI agents running research on single-GPU nanochat training

autoresearch for AMD is a framework for autonomous scientific experimentation in machine learning, enabling AI agents to iteratively improve models through a continuous loop of hypothesis generation, experimentation, and evaluation. The system is built around a minimal structure that includes a data preparation module, a training script that can be modified, and a program specification that guides the agent’s decision-making process. During each iteration, the agent edits the training code, runs an experiment within a fixed time budget, evaluates performance metrics, and decides whether to retain or discard the changes. ...

Downloads: 0 This Week

Last Update: 2026-03-30
See Project
17

Kiln

Open source platform for managing, testing, and deploying AI apps

Kiln is an open source platform designed to help developers build, evaluate, and deploy AI-powered applications with greater structure and reliability. It provides a unified environment for managing prompts, datasets, and evaluation workflows, allowing teams to iterate on AI behavior in a controlled and measurable way. Kiln emphasizes reproducibility, enabling users to track changes to prompts and models while comparing outputs across different configurations. Kiln also supports systematic testing of AI systems by defining evaluation criteria and running experiments to assess performance over time. ...

Downloads: 0 This Week

Last Update: 2026-03-18
See Project
18

LangWatch

The platform for LLM evaluations and AI agent testing

LangWatch is an open-source observability and monitoring platform designed to help developers evaluate and improve applications built with large language models. The platform provides tools for tracking model interactions, analyzing prompt behavior, and identifying issues such as hallucinations, latency problems, or unexpected responses. By collecting telemetry data from AI applications, LangWatch allows developers to understand how their systems perform in real-world usage scenarios. The...

Downloads: 0 This Week

Last Update: 2026-04-08
See Project
19

Youtu-Agent

A simple yet powerful agent framework that delivers with models

Youtu-Agent is an open-source framework developed to simplify the creation, execution, and evaluation of autonomous AI agents. The system focuses on reducing the complexity traditionally involved in configuring large language model agents by providing a modular architecture that separates execution environments, tools, and context management. This structure allows developers to rapidly assemble agent systems capable of performing tasks such as research, file processing, and data analysis. ...

Downloads: 2 This Week

Last Update: 2026-03-10
See Project
20

Lmod

An Environment Module System based on Lua, Reads TCL Modules

Lmod is a program to manage the user environment under Unix (Linux, Mac OS X, ...). It is a new implementation of environment modules. Lmod is a Lua-based module system that easily handles the MODULEPATH Hierarchical problem. Environment Modules provide a convenient way to dynamically change the users’ environment through modulefiles. This includes easily adding or removing directories to the PATH environment variable. Module files for Library packages provide environment variables that...

Downloads: 7 This Week

Last Update: 2026-04-06
See Project
21

Auto-Deep-Research

Your Fully-Automated Personal AI Assistant

Auto-Deep-Research is a system designed to fully automate deep research workflows using language models, retrieval, planning, and multi-stage reasoning to produce structured research artifacts such as surveys, benchmarks, reports, and even prototypes without heavy human intervention. Users provide a research topic or multifaceted goal, and the system autonomously breaks the objective down into subtasks like literature collection, critical summarization, cross-comparison, citation extraction, metric evaluation, and structured writing. ...

Downloads: 0 This Week

Last Update: 2026-02-03
See Project
22

EvoAgentX

Self-evolving AI agent framework for automated workflows

EvoAgentX is an open source framework for building, evaluating, and continuously improving LLM-based agents and multi-agent workflows. It moves beyond static pipelines by introducing a self-evolving system where agents are automatically generated, tested, and optimised through iterative feedback. Developers can define goals in natural language, while the framework handles workflow creation, execution, and refinement. Its modular architecture supports layered components for agents, workflows, evaluation, and evolution, enabling flexible experimentation and scaling. ...

Downloads: 3 This Week

Last Update: 2026-03-19
See Project
23

autoresearch-macos

AI agents running research on single-GPU nanochat training

autoresearch-macos is a macOS-focused adaptation of autonomous research loop systems inspired by the autoresearch paradigm, enabling AI agents to iteratively improve machine learning models through self-directed experimentation. The system follows a structured loop in which an agent modifies a training script, executes a fixed-duration experiment, evaluates performance metrics, and decides whether to keep or revert changes. It is designed to operate efficiently within macOS environments,...

Downloads: 0 This Week

Last Update: 2026-03-30
See Project
24

Prompt flow

Build high-quality LLM apps

Prompt flow is a suite of development tools designed to streamline the end-to-end development cycle of LLM-based AI applications, from ideation, prototyping, testing, and evaluation to production deployment and monitoring. It makes prompt engineering much easier and enables you to build LLM apps with production quality.

Downloads: 0 This Week

Last Update: 2025-01-09
See Project
25

MaxKB

Open-source platform for building enterprise-grade agents

...It focuses on practical deployments such as customer support, internal knowledge bases, research assistants, and education, bundling tools for data ingestion, chunking, embedding, retrieval, and answer synthesis. The system exposes flexible tool-use (including MCP), supports multi-model backends, and provides dashboards for dataset management and evaluation. It’s backed by an active org that also builds adjacent ops tooling, and there’s a dedicated documentation repo for configuration and contribution. Community posts describe “self-host your ChatGPT-style assistant” positioning, with integrations and workflows to move from demo to production. ...

Downloads: 4 This Week

Last Update: 18 hours ago
See Project