xAI

Builds Grok LLM with minimal published safety governance or social impact frameworks.

HQ🇺🇸 US
Est2023
Size201-500
EU AI ActGPAI Systemic
x.ai
Score
25.0 / 100
Evidence
3 items
Confidence
low

Early-stage safety posture - basic practices exist but significant gaps remain.

Weaknesses:Governance Maturity, Technical Safety, Risk Assessment, Regulatory Readiness, External Engagement
Focus Areas
foundation modelsAGI researchGrok

Strengths

No notable strengths identified

Risks

  • Regulatory score (15) - significant gap
  • Governance score (18) - significant gap
  • Risk score (25) - significant gap
  • Engagement score (30) - significant gap
  • Low evidence coverage (3 items)
Table of Contents

Security Assessment

Security-relevant indicators for vendor evaluation

Security Posture
30
TS-01dim: 35
Red Teaming & Pre-deployment Testing
Adversarial testing before deployment
TS-05dim: 35
Robustness & Adversarial Resilience
Resistance to adversarial attacks
RA-01dim: 25
Sector-Specific Risk Assessment
Risk analysis for deployment context
RA-03dim: 25
Dual-Use & Misuse Risk
Dangerous capability awareness
RA-07dim: 25
Incident History & Track Record
Past incidents and response quality
EE-04dim: 30
Vulnerability Disclosure Program
Bug bounty or CVE reporting process
Incident History
xAI incident records sourced from AIAAIC Repository and public reporting.
Integration: AIAAIC, OECD AI Incidents Monitor
Third-Party Audits
External audit reports, SOC 2 attestations, and ISO certifications verified where published.
Sources: Company filings, registry lookups
CVE & Disclosures
Known vulnerabilities and security advisories from NVD, GitHub Security Advisories, and vendor pages.
Sources: NVD, GHSA, vendor disclosure pages

Dimension Breakdown

GM
Governance Maturitypreliminary
Published policies, corporate structure, safety mandate, whistleblowing, executive commitment.
18
TS
Technical Safetypreliminary
Benchmarks, adversarial robustness, fine-tuning safety, watermarking, model cards, research output.
35
RA
Risk Assessmentpreliminary
Dangerous capability evaluations, thresholds, external testing, bug bounty, halt conditions.
25
RR
Regulatory Readinesspreliminary
ISO 42001, EU AI Act compliance, GPAI obligations, international commitments, incident reporting.
15
EE
External Engagementpreliminary
Survey participation, research support, transparency, behavior specs, open-source contributions.
30

Social Impact & Safety Profile

Limited

xAI develops frontier models (Grok) and operates one of the largest GPU clusters (Colossus). Despite the scale of deployment via X/Twitter integration, published safety research, governance structures, and social impact commitments are sparse compared to peers. No published responsible scaling policy or independent safety board.

foundation modelscompute infrastructure

Peer Comparison

Cohere
C-38

Foundation Models

Compare
Mistral AI
C-35

Foundation Models

Compare
AI21 Labs
D+33

Foundation Models

Compare
Stability AI
D-22

Foundation Models

Compare

Data Sources & Methodology

Scoring methodology v0.1 · 40 indicators · 6 frameworks

Last assessment: 2026-03-23 · Confidence: low · Evidence: 3 items

NIST AI RMF · EU AI Act · ISO 42001 · FLI AI Safety Index · MLCommons AILuminate · METR

Scores reflect publicly available information. A low score may indicate limited transparency rather than poor safety practices.