Screener/Elicit

Elicit

AI research assistant used by 2M+ researchers, automating literature reviews across 138M+ papers.

HQ🇺🇸 US
Est2021
Size11-50
elicit.com
Score
35.0 / 100
Evidence
4 items
Confidence
low

Developing safety practices - core foundations in place with room for improvement.

Weaknesses:Governance Maturity, Technical Safety, Risk Assessment, Regulatory Readiness, External Engagement
Focus Areas
ai research assistantliterature reviewscientific researchalignment

Strengths

No notable strengths identified

Risks

  • Regulatory score (25) - significant gap
  • Risk score (30) - significant gap
  • Engagement score (35) - significant gap
Table of Contents

Security Assessment

Security-relevant indicators for vendor evaluation

Security Posture
38
TS-01dim: 45
Red Teaming & Pre-deployment Testing
Adversarial testing before deployment
TS-05dim: 45
Robustness & Adversarial Resilience
Resistance to adversarial attacks
RA-01dim: 30
Sector-Specific Risk Assessment
Risk analysis for deployment context
RA-03dim: 30
Dual-Use & Misuse Risk
Dangerous capability awareness
RA-07dim: 30
Incident History & Track Record
Past incidents and response quality
EE-04dim: 35
Vulnerability Disclosure Program
Bug bounty or CVE reporting process
Incident History
Elicit incident records sourced from AIAAIC Repository and public reporting.
Integration: AIAAIC, OECD AI Incidents Monitor
Third-Party Audits
External audit reports, SOC 2 attestations, and ISO certifications verified where published.
Sources: Company filings, registry lookups
CVE & Disclosures
Known vulnerabilities and security advisories from NVD, GitHub Security Advisories, and vendor pages.
Sources: NVD, GHSA, vendor disclosure pages

Dimension Breakdown

GM
Governance Maturitypreliminary
Published policies, corporate structure, safety mandate, whistleblowing, executive commitment.
40
TS
Technical Safetypreliminary
Benchmarks, adversarial robustness, fine-tuning safety, watermarking, model cards, research output.
45
RA
Risk Assessmentpreliminary
Dangerous capability evaluations, thresholds, external testing, bug bounty, halt conditions.
30
RR
Regulatory Readinesspreliminary
ISO 42001, EU AI Act compliance, GPAI obligations, international commitments, incident reporting.
25
EE
External Engagementpreliminary
Survey participation, research support, transparency, behavior specs, open-source contributions.
35

Social Impact & Safety Profile

Moderate

Elicit is a Public Benefit Corporation incubated at Ought, a nonprofit focused on scaling human reasoning. Automates literature reviews and data extraction across 138M+ scientific papers. Millions in recurring revenue and 2M+ researcher users demonstrate product-market fit in academic and enterprise research contexts.

scientific researchliterature automationreasoning at scale

Peer Comparison

Redwood Research
B55

Alignment Research

Compare
Softmax
C41.3

Alignment Research

Compare
Conjecture
C40

Alignment Research

Compare
Fathom
D25

Alignment Research

Compare

Data Sources & Methodology

Scoring methodology v0.1 · 40 indicators · 6 frameworks

Last assessment: 2026-03-25 · Confidence: low · Evidence: 4 items

NIST AI RMF · EU AI Act · ISO 42001 · FLI AI Safety Index · MLCommons AILuminate · METR

Scores reflect publicly available information. A low score may indicate limited transparency rather than poor safety practices.