METR
Nonprofit
Preliminary
Model Evaluation and Threat Research (METR) conducts frontier model capability evaluations to assess whether AI systems pose catastrophic risks. It works with governments and frontier labs to test dangerous capabilities.
Strong safety posture with established governance frameworks and active risk management.
The gold standard for frontier model capability evaluation. No commercial competitor operates at this level. Unique position as trusted independent evaluator.
Nonprofit model. Evaluation contracts are project-based, not recurring SaaS revenue. Depends on continued government and lab willingness to submit to evaluation.
Frontier lab and government evaluation contracts. Not a commercial product.
Evaluations & Benchmarking
Security Assessment
Security-relevant indicators for vendor evaluation
Dimension Breakdown
Social Impact & Safety Profile
Moderate
METR conducts frontier model capability evaluations for governments and AI labs, assessing whether systems pose catastrophic risks. Its role as an independent evaluator gives it unique access to test dangerous capabilities before deployment. Partnerships with governments and labs (OpenAI, Anthropic) validate its methodology.