Galileo AI

LLM evaluation platform focused on hallucination detection and data quality for safer AI applications.

HQ: 🇺🇸 US
Est.: 2021
Size: 11-50
Website: rungalileo.io
Score: 58.0 / 100
Evidence: 5 items
Confidence: medium

Strong safety posture with established governance frameworks and active risk management.

Strengths: Governance Maturity, Technical Safety, Risk Assessment, Regulatory Readiness, External Engagement
Focus Areas: LLM evaluation, hallucination detection, observability, data quality
Security Assessment

Security-relevant indicators for vendor evaluation

Security Posture: 59
TS-01 (dim: 65) · Red Teaming & Pre-deployment Testing: Adversarial testing before deployment
TS-05 (dim: 65) · Robustness & Adversarial Resilience: Resistance to adversarial attacks
RA-01 (dim: 52) · Sector-Specific Risk Assessment: Risk analysis for deployment context
RA-03 (dim: 52) · Dual-Use & Misuse Risk: Dangerous capability awareness
RA-07 (dim: 52) · Incident History & Track Record: Past incidents and response quality
EE-04 (dim: 66) · Vulnerability Disclosure Program: Bug bounty or CVE reporting process
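The page does not state how the Security Posture figure is derived. As a sketch only, the value 59 is consistent with a rounded unweighted mean of the six indicator values listed above (65, 65, 52, 52, 52, 66). The averaging rule and the function name `security_posture` are assumptions for illustration, not the screener's documented method.

```python
# Indicator values as shown in the Security Posture list ("dim:" figures).
indicator_scores = {
    "TS-01": 65,
    "TS-05": 65,
    "RA-01": 52,
    "RA-03": 52,
    "RA-07": 52,
    "EE-04": 66,
}


def security_posture(scores):
    """Assumed aggregation: unweighted mean, rounded to the nearest integer."""
    return round(sum(scores.values()) / len(scores))


posture = security_posture(indicator_scores)
print(posture)
```

Under this assumption the six values average to about 58.7, which rounds to the displayed 59.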
Incident History
Galileo AI incident records sourced from AIAAIC Repository and public reporting.
Integration: AIAAIC, OECD AI Incidents Monitor
Third-Party Audits
External audit reports, SOC 2 attestations, and ISO certifications verified where published.
Sources: Company filings, registry lookups
CVE & Disclosures
Known vulnerabilities and security advisories from NVD, GitHub Security Advisories, and vendor pages.
Sources: NVD, GHSA, vendor disclosure pages

Dimension Breakdown

GM · Governance Maturity (preliminary): 55
Published policies, corporate structure, safety mandate, whistleblowing, executive commitment.
TS · Technical Safety (preliminary): 65
Benchmarks, adversarial robustness, fine-tuning safety, watermarking, model cards, research output.
RA · Risk Assessment (preliminary): 52
Dangerous capability evaluations, thresholds, external testing, bug bounty, halt conditions.
RR · Regulatory Readiness (preliminary): 52
ISO 42001, EU AI Act compliance, GPAI obligations, international commitments, incident reporting.
EE · External Engagement (preliminary): 66
Survey participation, research support, transparency, behavior specs, open-source contributions.
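The aggregation formula behind the overall score is not given on this page. A minimal sketch, assuming equal weights: the five dimension scores above (55, 65, 52, 52, 66) average to exactly the 58.0 / 100 shown in the header. The function name `overall_score` and the equal weighting are assumptions for illustration.

```python
# Dimension scores as listed in the Dimension Breakdown.
dimension_scores = {
    "GM": 55,  # Governance Maturity
    "TS": 65,  # Technical Safety
    "RA": 52,  # Risk Assessment
    "RR": 52,  # Regulatory Readiness
    "EE": 66,  # External Engagement
}


def overall_score(scores):
    """Assumed aggregation: unweighted mean, rounded to one decimal place."""
    return round(sum(scores.values()) / len(scores), 1)


overall = overall_score(dimension_scores)
print(overall)
```

If the methodology actually applies per-dimension weights, the match with 58.0 would be coincidental, so treat this as a consistency check rather than a description of the scoring method.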

Social Impact & Safety Profile

Limited

Galileo AI provides evaluation and observability tooling for LLMs, with particular focus on hallucination detection, data quality scoring, and output monitoring. Their Guardrail Metrics help developers identify problematic model behavior before deployment.

hallucination detection, evaluation, data quality

Peer Comparison

AI Underwriting Company · A- · 70 · Governance Tooling
Gray Swan · B+ · 64.7 · Evaluations & Benchmarking
Haize Labs · B · 56.7 · Evaluations & Benchmarking
Goodfire · B · 56.3 · Interpretability

Data Sources & Methodology

Scoring methodology v0.1 · 40 indicators · 6 frameworks

Last assessment: 2026-03-23 · Confidence: medium · Evidence: 5 items

NIST AI RMF · EU AI Act · ISO 42001 · FLI AI Safety Index · MLCommons AILuminate · METR

Scores reflect publicly available information. A low score may indicate limited transparency rather than poor safety practices.