CodeT5

CodeT5

Salesforce
StarCoder

StarCoder

BigCode
+
+

Related Products

  • Windsurf Editor
    168 Ratings
    Visit Website
  • Twilio
    1,380 Ratings
    Visit Website
  • Google Cloud Run
    341 Ratings
    Visit Website
  • Docmosis
    48 Ratings
    Visit Website
  • Checksum.ai
    1 Rating
    Visit Website
  • ZeroPath
    2 Ratings
    Visit Website
  • Google Cloud BigQuery
    2,008 Ratings
    Visit Website
  • Google AI Studio
    11 Ratings
    Visit Website
  • Vertex AI
    961 Ratings
    Visit Website
  • XpertCoding
    42 Ratings
    Visit Website

About

Code for CodeT5, a new code-aware pre-trained encoder-decoder model. Identifier-aware unified pre-trained encoder-decoder models for code understanding and generation. This is the official PyTorch implementation for the EMNLP 2021 paper from Salesforce Research. CodeT5-large-ntp-py is specially optimized for Python code generation tasks and employed as the foundation model for our CodeRL, yielding new SOTA results on the APPS Python competition-level program synthesis benchmark. This repo provides the code for reproducing the experiments in CodeT5. CodeT5 is a new pre-trained encoder-decoder model for programming languages, which is pre-trained on 8.35M functions in 8 programming languages (Python, Java, JavaScript, PHP, Ruby, Go, C, and C#). In total, it achieves state-of-the-art results on 14 sub-tasks in a code intelligence benchmark - CodeXGLUE. Generate code based on the natural language description.

About

StarCoder and StarCoderBase are Large Language Models for Code (Code LLMs) trained on permissively licensed data from GitHub, including from 80+ programming languages, Git commits, GitHub issues, and Jupyter notebooks. Similar to LLaMA, we trained a ~15B parameter model for 1 trillion tokens. We fine-tuned StarCoderBase model for 35B Python tokens, resulting in a new model that we call StarCoder. We found that StarCoderBase outperforms existing open Code LLMs on popular programming benchmarks and matches or surpasses closed models such as code-cushman-001 from OpenAI (the original Codex model that powered early versions of GitHub Copilot). With a context length of over 8,000 tokens, the StarCoder models can process more input than any other open LLM, enabling a wide range of interesting applications. For example, by prompting the StarCoder models with a series of dialogues, we enabled them to act as a technical assistant.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Developers and users interested in a solution to generate, summarize, and autocomplete code

Audience

Developers interested in an LLM for code generation

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Screenshots and Videos

Screenshots and Videos

Pricing

No information available.
Free Version
Free Trial

Pricing

Free
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

Salesforce
github.com/salesforce/CodeT5

Company Information

BigCode
Founded: 2023
huggingface.co/blog/starcoder

Alternatives

GLM-OCR

GLM-OCR

Z.ai

Alternatives

CodeGemma

CodeGemma

Google
Mu

Mu

Microsoft
CodeQwen

CodeQwen

Alibaba
DeepSeek Coder

DeepSeek Coder

DeepSeek
CodeQwen

CodeQwen

Alibaba
Mercury Coder

Mercury Coder

Inception Labs

Categories

Categories

Integrations

Python
C
C#
ChatGPT
CodeQwen
Git
GitHub
Go
Java
JavaScript
LM Studio
OpenAI
PHP
Ruby
Tabby
Taylor AI
Visual Studio Code

Integrations

Python
C
C#
ChatGPT
CodeQwen
Git
GitHub
Go
Java
JavaScript
LM Studio
OpenAI
PHP
Ruby
Tabby
Taylor AI
Visual Studio Code
Claim CodeT5 and update features and information
Claim CodeT5 and update features and information
Claim StarCoder and update features and information
Claim StarCoder and update features and information