Generate bench-ready cosmetic formulations with built-in regulatory intelligence across 16 global markets.
theformulator.ai helps cosmetic formulators and innovation teams turn product briefs into structured prototype formulations — with ingredient guidance, regulatory screening across 16 markets, and formulation troubleshooting built in.
Designed for professional formulators and R&D teams. Not a consumer recipe tool.
Start free — no credit card required.
See sample output →Built for the teams that turn concepts into products.
R&D Formulators
Generate structured starting formulations and reduce dead-end iterations with market-validated ingredient systems.
Innovation Teams
Translate product briefs and claims into formulation pathways faster — from concept to bench-ready prototype.
Regulatory Affairs
Catch ingredient and market-fit issues earlier in development, across 16 regulatory markets including Brazil, Thailand, and Malaysia.
Technical & Sales Teams
Respond to customer briefs with higher-quality prototype concepts backed by real formulation intelligence.
Formulation development hasn't changed in decades.
Cosmetic formulators still work from disconnected PDFs, spreadsheets, and supplier portals. We've unified the intelligence.
Professional formulators still navigate regulatory requirements across PDFs and spreadsheets. Supplier data lives in disconnected portals. When something goes wrong in stability testing, you start over.
- Regulatory limits change across 16 markets — no single source of truth
- 605,000+ commercially launched products contain co-occurrence intelligence that nobody has organised
- CIR safety data, supplier TDS, research literature — all disconnected
- Every failed stability batch is a reformulation from scratch
One of the most comprehensive dedicated knowledge systems for cosmetic formulation and compliance.
ingredient co-occurrence intelligence across skin care, hair care, sun care, and personal care
ingredient-level data supporting selection, screening, and compatibility checking
historical prototype patterns to support generation and benchmarking
peer-reviewed safety data integrated at ingredient level
full-text extraction from PMC, Europe PMC, CORE — not just abstracts
safety, efficacy, and formulation data extracted from primary literature
EU, UK, USA, China, Japan, Korea, India, Australia, Canada, Brazil, Thailand, Malaysia, and more
Green — full regulatory coverage · Amber — coverage expanding in 2026
Sources include: 82,000+ peer-reviewed papers (PMC, Europe PMC, CORE, PubMed, OpenAlex) · CIR safety assessments · ECHA CLP/SVHC · SCCS opinions · IARC/NTP carcinogen classifications · EUR-Lex Annexes · MHLW Japan · NMPA China · MFDS Korea · MOCRA · CDSCO India
This corpus took years to assemble. It is the foundation that makes every formulation, market intelligence suggestion, and regulatory check meaningful — not generic.
Brief. Generate. Test. Ship.
Define the brief
Enter product format, claims, target markets, sensory goals, and technical constraints across 5 structured layers.
sulphate free · no silicones · EU + India · brightening
Generate formulation routes
Receive structured prototypes where every concentration is pre-validated against your target markets. Regulatory limits, ingredient compatibility, and HLB requirements are checked during generation — not after.
Aqua · Glycerin · Niacinamide · Centella Asiatica Extract · Bakuchiol
Screen for fit and risk
Evaluate ingredient suitability, market-specific regulatory constraints, and formulation confidence indicators across 16 markets.
Stability 82% · Performance 91% · Regulatory 76%
Refine and troubleshoot
Compare variants, diagnose instability concerns, and iterate toward a stronger starting formula with the Formulation Partner.
emulsion break · pH drift · preservative efficacy
Reverse-engineer any product on the shelf.
Paste an INCI list, search our 605,000+ product database, or photograph a label. Get a probable bench-ready formula with concentrations, phase assignments, and regulatory screening — in minutes.
Search 605K+ Products
Type a product name. If it's in our database of 605,000+ marketed products across 16 markets, the INCI list auto-populates instantly. Coverage spans Sephora, Nykaa, Olive Young, Douglas, and dozens more retailers.
3-Layer Blend Detection
Commercial INCI lists decompose supplier blends into constituent names. Our engine re-groups them using thousands of ingredient technical records, statistical co-occurrence from 605K products, and functional chemistry rules. See what the formulator actually weighed out.
Concentration Reconstruction
INCI order gives you descending concentration above 1%. We go further — cross-referencing regulatory limits, TDS usage ranges, clinical efficacy data, and market norms to estimate probable concentrations with confidence tiers.
Same credit cost as formulation. Same 15-layer retrieval. Same regulatory screening across 16 markets. Same INCI-only, supplier-agnostic output.
Three levels of intelligence.
Choose the depth your project needs.
Quick Formula
1 creditA bench-ready formula with regulatory pass/fail per market and preservation system. Built for speed — ideal for early-stage screening.
INCI formula table · regulatory status summary · preservation system
Download sample →Intelligence Brief
3 creditsFull formulation intelligence with safety scores, per-ingredient regulatory detail, market co-occurrence data, and preservation rationale. The workhorse for professional R&D.
Everything in Quick plus safety overview · market intelligence · formulation notes · ingredient-level regulatory table
Download sample →Dossier
5 creditsComplete technical package with stability risk assessment, manufacturing brief, technical ingredient data, and PubMed citations. Built for regulatory submissions and CMO handoff.
Everything in Brief plus stability risk assessment (ICH zones) · manufacturing brief · technical ingredient data · 3–5 PubMed citations per active
Download sample →All three variants are included in every report. Credits buy the intelligence depth, not the number of formulations.
How Our Formulation Engine Works
Every formulation is built from 13 specialised data sources — and the engine reasons twice before a single ingredient is committed.
Brief Analysis
Your formulation brief is parsed to identify product type, target markets, ingredient constraints, claims, and technical requirements. This determines which data sources activate and how deeply the engine retrieves.
Intelligent Ingredient Selection
The engine doesn't pick ingredients from a dropdown. It reasons about your brief against real data — surfactant mildness scores, preservative pH compatibility, market co-occurrence patterns, and safety profiles — to assemble a candidate pool. Every ingredient earns its place through data, not defaults.
Market Intelligence
Co-occurrence patterns from 605,000+ commercially launched products
Surfactant Properties
Mildness-ranked systems with Zein Number and clinical irritation data
Preservative Systems
Pre-validated systems ranked by spectrum, pH range, and natural origin percentage
Ingredient Hazard Data
Raw hazard classifications across 34,000+ ingredients from ECHA CLP, IARC Monographs, NTP, and REACH SVHC candidate lists
Ingredient Technical Data
Use levels, phase of addition, processing notes, solubility, HLB from 43,000+ profiles
COSING Reference Data
Canonical INCI names, CAS numbers, EC numbers, and regulatory functions for 30,000+ ingredients
Regulatory Intelligence
Per-ingredient status across 16 markets — prohibited, restricted, permitted, with concentration ceilings
Concentration Limits
Maximum use levels from EU Annexes, Japan Standards, Korea MFDS, India IS 4707, and more
Peer-Reviewed Literature
400,000+ extracted findings on safety, efficacy, and formulation science from 82,000+ papers
CIR Safety Assessments
Cosmetic-specific safety conclusions from 2,600+ peer-reviewed final reports
Proprietary Safety Engine
6-axis hazard scoring — carcinogenicity, reproductive toxicity, sensitization, systemic toxicity, irritation, and endocrine disruption — computed from ECHA, SCCS, SVHC, and IARC primary data with exposure-adjusted weighting
Retailer Compliance
Ingredient restriction lists for Sephora Clean, Credo, Ulta Conscious Beauty, and Whole Foods
Reference Formulations
25,000+ professional benchmark formulations with phase structure and concentration patterns
Formulation Generation
The selected ingredients are assembled into per-ingredient dossiers — each containing technical data, regulatory status, safety scores, literature findings, and a pre-resolved effective concentration range. A second, independent intelligence pass receives these dossiers and designs the complete formulation: phase assignments, concentrations, processing order, and preservation strategy. It formulates with full regulatory awareness — it doesn't guess concentrations, it checks limits.
4-Sweep Validation
Every output passes four independent validation sweeps: prohibited substance screening, concentration compliance against regulatory ceilings, restricted substance flagging, and data-driven phase placement verification. Trade names and supplier references are stripped at three independent layers — the platform is supplier-agnostic by architecture, not by policy.
13 data sources. Two independent intelligence passes. Four validation sweeps. The complete formulation — including regulatory screening across all target markets — generates in under 60 seconds.
Same Score, Different Story
Every ingredient scored across six hazard axes using data from regulatory authorities — not consumer databases. A single composite number hides where the risk actually lies.
Benzophenone-3 (Oxybenzone)
Composite 1.50·GREEN·HIGH confidence
Phenoxyethanol
Composite 1.50·GREEN·HIGH confidence
Both score 1.50 — but for entirely different reasons. A single number can't tell you this.
Six hazard axes
- Carcinogenicity— IARC Monographs, ECHA CLP
- DART— ECHA CLP reproductive toxicity classifications
- Sensitization— ECHA CLP, NACDG patch test data
- Systemic Toxicity— ECHA CLP, SCCS opinions
- Irritation— CIR assessments, Zein Number data
- Endocrine Disruption— ECHA ED assessments, REACH SVHC
Three core principles
- No data gap penalty — absence of evidence ≠ evidence of harm
- Exposure context matters — leave-on and rinse-off scored differently
- Source hierarchy enforced — ECHA/CIR/IARC data always trumps lower-tier sources
Scores derived from 11 authoritative sources including ECHA CLP Annex VI, IARC/NTP carcinogen classifications, CIR safety assessments, ESSCA clinical patch test data, and EPA CompTox predictions.
Built for how formulators actually work
Every feature is grounded in real regulatory data, real market intelligence, and real formulation science.
The platform flags regulatory conflicts before you generate.
As you build your brief, real-time intelligence surfaces market-specific tensions. EU/China conflicts, Ecocert constraints, prohibited ingredient flags — all resolved before a single INCI is committed.
Know what is already working in 605,000 products.
Co-occurrence analysis across 605,000 marketed formulations surfaces ingredient combinations that correlate with consumer acceptance in your target markets — giving every output a market-intelligence foundation.
Phase-structured. Regulatory-validated. Ready for the bench.
INCI-only output — no brand names, no supplier bias. Phase A through C clearly structured with critical control points flagged and 3-dimensional confidence scores across stability, performance, and regulatory safety.
Confidence indicators are generated using internal evaluation logic across ingredient compatibility, known formulation patterns, regulatory constraints, and category fit. They support expert review — they do not replace laboratory validation or formal safety assessment.
No supplier relationships. No commercial influence.
Every ingredient recommendation is ranked by performance data, market frequency, and regulatory status. No supplier pays to influence your formulation output. Platform neutrality is architectural.
34,000+ ingredient safety profiles integrated into every output.
Every ingredient is cross-referenced against 34,000+ safety profiles built from CIR assessments, ECHA dossiers, SCCS opinions, IARC/NTP classifications, and 82,000+ peer-reviewed research papers. Safety tiers are computed per-ingredient and flagged in the output before you reach the lab.
Paste a failing formula. Get a ranked diagnosis.
Describe an emulsion breaking at 40°C. Get three ranked root causes with probability scores, cross-referenced against 2,573 similar systems in the knowledge base. Fix recommendations included.
Your Formulations Stay Private
Every formulation you generate is stored in your private workspace. No one at theformulator.ai — including our team — can access your generated formulations. Export or delete your data anytime.
Private Workspace
Your generated formulations are stored in a workspace tied to your account. No theformulator.ai team member can read your formulation content.
No Internal Access
We do not use your formulation data for training, analytics, or any purpose other than delivering your results back to you.
You Always Own Your Data
Export your formulations at any time in full. If you leave, your data leaves with you.
We never share formulation data with ingredient suppliers, distributors, or any third party. Platform neutrality is architectural.
Start free. Upgrade when you're ready.
Registration is free — no credit card, no commitment. When you're ready to subscribe, we'll walk you through your options personally with sample reports so you can decide with full information.
Free account
7-day trial · 6 credits included
Register in 30 seconds. Get 6 credits to generate real formulations — enough for a Quick Formula and an Intelligence Brief, or six Quick Formulas. No credit card required.
Subscription access
When you're ready to subscribe, request access. We'll send you sample reports at every intelligence level on the same formulation, explain the credit system, and recommend the right tier for your workflow. No pressure, full information.
Subscription plans from $199/month
Quick Formula — 1 credit — up to 2 variants
Intelligence Brief — 3 credits — up to 3 variants
Dossier — 5 credits — up to 3 variants, stability protocol, manufacturing brief, research citations
Formulation Partner — 1 credit per 5 exchanges
Subscriptions activate within 24 hours of confirmation. Sample reports sent before you commit.
The formulation tool you didn't know you were missing.
Register free. Experience the platform. Request a subscription when you're ready.