LLM Collaboration for Code Generation
Updated Feb 17, 2026 - Python
Benchmark suite for evaluating LLMs and SLMs on coding and software-engineering tasks. Features HumanEval, MBPP, SWE-bench, and BigCodeBench with an interactive Streamlit UI. Supports cloud APIs (OpenAI, Anthropic, Google) and local models via Ollama. Tracks pass rates, latency, token usage, and costs.
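A minimal sketch of what one cloud-API evaluation pass might look like, assuming the OpenAI Python SDK (v1+); the task list, prompt wording, and the `passes` helper are illustrative placeholders, not the repository's actual interface:

```python
import time

from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# One HumanEval-style task: a function stub plus a hidden unit test (placeholder data).
TASKS = [
    {
        "prompt": 'def add(a, b):\n    """Return the sum of a and b."""\n',
        "test": "assert add(2, 3) == 5",
    },
]

def passes(candidate: str, test: str) -> bool:
    """Run the candidate plus its test. In-process exec() is for brevity only;
    a real harness isolates this step (see the subprocess sketch further down)."""
    try:
        exec(candidate + "\n" + test, {})
        return True
    except Exception:
        return False

n_passed, latencies, total_tokens = 0, [], 0
for task in TASKS:
    t0 = time.time()
    resp = client.chat.completions.create(
        model="gpt-4o-mini",
        messages=[{"role": "user",
                   "content": "Complete this function. Return only code.\n\n" + task["prompt"]}],
    )
    latencies.append(time.time() - t0)
    total_tokens += resp.usage.total_tokens          # token usage per call
    n_passed += passes(resp.choices[0].message.content, task["test"])

print(f"pass rate {n_passed / len(TASKS):.0%} | "
      f"avg latency {sum(latencies) / len(latencies):.2f}s | tokens {total_tokens}")
```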
🧪 Automated LLM coding benchmarks with Ollama - HumanEval & MBPP evaluation suite with safe execution, comprehensive logging, and detailed analysis tools
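A rough sketch of the local-model path with safer execution, assuming the `ollama` Python client and a running Ollama server; the model name, prompt, and `run_candidate` helper are hypothetical, not this project's actual API:

```python
import subprocess
import sys
import tempfile

import ollama

def run_candidate(code: str, test: str, timeout: float = 10.0) -> bool:
    """'Safe-ish' execution: run the candidate plus its test in a separate
    Python process with a hard timeout, instead of exec() in-process."""
    with tempfile.NamedTemporaryFile("w", suffix=".py", delete=False) as f:
        f.write(code + "\n\n" + test + "\n")
        path = f.name
    try:
        result = subprocess.run(
            [sys.executable, path], capture_output=True, timeout=timeout
        )
        return result.returncode == 0
    except subprocess.TimeoutExpired:
        return False

prompt = (
    "Write a Python function `is_palindrome(s)` that returns True if s reads "
    "the same forwards and backwards. Return only the code."
)
response = ollama.chat(
    model="qwen2.5-coder:7b",  # any locally pulled model works here
    messages=[{"role": "user", "content": prompt}],
)
candidate = response["message"]["content"]
print(run_candidate(candidate, "assert is_palindrome('racecar')"))
```

A subprocess with a timeout guards against infinite loops and crashes in generated code, though a production harness would typically add resource limits or containerization on top.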
Fine-tuning CodeT5 for Python code generation on the MBPP dataset. Features custom TensorFlow training loops, mixed precision, XLA optimization, and distributed multi-GPU strategies.
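A sketch of the training-loop ingredients named above (mixed precision, XLA, a distributed strategy), assuming TensorFlow 2.x with the Keras 2 API that Transformers' TF models target; the checkpoint name, hyperparameters, and batch layout are placeholders, and loading `Salesforce/codet5-base` into TF may require `from_pt=True` with PyTorch installed, since the published weights are PyTorch:

```python
import tensorflow as tf
from transformers import AutoTokenizer, TFAutoModelForSeq2SeqLM

tf.keras.mixed_precision.set_global_policy("mixed_float16")  # fp16 compute, fp32 master weights

strategy = tf.distribute.MirroredStrategy()  # data-parallel across all visible GPUs
with strategy.scope():
    tokenizer = AutoTokenizer.from_pretrained("Salesforce/codet5-base")
    model = TFAutoModelForSeq2SeqLM.from_pretrained("Salesforce/codet5-base", from_pt=True)
    optimizer = tf.keras.mixed_precision.LossScaleOptimizer(tf.keras.optimizers.Adam(2e-5))

@tf.function(jit_compile=True)  # XLA-compile the per-replica step
def train_step(batch):
    with tf.GradientTape() as tape:
        outputs = model(
            input_ids=batch["input_ids"],
            attention_mask=batch["attention_mask"],
            labels=batch["labels"],
            training=True,
        )
        loss = tf.reduce_mean(outputs.loss)
        scaled_loss = optimizer.get_scaled_loss(loss)  # scale to avoid fp16 underflow
    scaled_grads = tape.gradient(scaled_loss, model.trainable_variables)
    grads = optimizer.get_unscaled_gradients(scaled_grads)
    optimizer.apply_gradients(zip(grads, model.trainable_variables))
    return loss

# In the full loop, each MBPP (prompt, solution) pair would be tokenized into
# input_ids/labels, batched into a tf.data.Dataset, distributed with
# strategy.experimental_distribute_dataset, and fed through strategy.run(train_step, ...).
```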