🤘 awesome-semantic-segmentation
-
Updated
May 8, 2021
🤘 awesome-semantic-segmentation
Building a modern functional compiler from first principles. (http://dev.stephendiehl.com/fun/)
Python package for the evaluation of odometry and SLAM
Klipse is a JavaScript plugin for embedding interactive code snippets in tech blogs.
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
SuperCLUE: 中文通用大模型综合性基准 | A Benchmark for Foundation Models in Chinese
OpenCompass is an LLM evaluation platform, supporting a wide range of models (InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
Test your prompts, models, RAGs. Evaluate and compare LLM outputs, catch regressions, and improve prompt quality. LLM evals for OpenAI/Azure GPT, Anthropic Claude, VertexAI Gemini, Ollama, Local & private models like Mistral/Mixtral/Llama with CI/CD
A unified evaluation framework for large language models
UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured checks (covering language, code, embedding use-cases), perform root cause analysis on failure cases and give insights on how to resolve them.
🤗 Evaluate: A library for easily evaluating machine learning models and datasets.
☁️ 🚀 📊 📈 Evaluating state of the art in AI
Avalanche: an End-to-End Library for Continual Learning based on PyTorch.
(IROS 2020, ECCVW 2020) Official Python Implementation for "3D Multi-Object Tracking: A Baseline and New Evaluation Metrics"
An open-source visual programming environment for battle-testing prompts to LLMs.
Multi-class confusion matrix library in Python
Evaluation code for various unsupervised automated metrics for Natural Language Generation.
Short and sweet LISP editing
The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".
Add a description, image, and links to the evaluation topic page so that developers can more easily learn about it.
To associate your repository with the evaluation topic, visit your repo's landing page and select "manage topics."