Claude Mythos Preview (gated research preview)
Version: 1
Models from Partners and Community
These models constitute the vast majority of the Azure AI Foundry Models and are provided by trusted third-party organizations, partners, research labs, and community contributors. These models offer specialized and diverse AI capabilities, covering a wide array of scenarios, industries, and innovations. An example of models from Partners and community are the family of large language models developed by Anthropic. Anthropic includes Claude family of state-of-the-art large language models that support text and image input, text output, multilingual capabilities, and vision. See Anthropic's privacy policy to know more about privacy. Learn how to deploy Anthropic models . Characteristics of Models from Partners and Community:- Developed and supported by external partners and community contributors.
- Diverse range of specialized models catering to niche or broad use cases.
- Typically validated by providers themselves, with integration guidelines provided by Azure.
- Community-driven innovation and rapid availability of cutting-edge models.
- Standard Azure AI integration, with support and maintenance managed by the respective providers.
Key capabilities
About this model
Claude Mythos Preview (gated research preview) is a new class of intelligence built for ambitious projects, and the world's best model for cybersecurity, autonomous coding, and long-running agents. Only available as a gated research preview with access prioritized for defensive cybersecurity use cases.Key model capabilities
- Adaptive thinking is an upgrade to extended thinking that gives Claude the freedom to think as much or as little as needed depending on the task and effort level.
- Image & text input: With strong vision capabilities, Claude Mythos Preview can process images and return text outputs to analyze and understand charts, graphs, technical diagrams, reports, and other visual assets.
Use cases
See Responsible AI for additional consideration for responsible use.Key use cases
Claude Mythos Preview is a new class of intelligence built for ambitious projects, and the world's best model for cybersecurity, autonomous coding, and long-running agents. Only available as a gated research preview with access prioritized for defensive cybersecurity use cases.- Cybersecurity: Claude Mythos Preview is the world's best model for defensive security. It is capable of finding and suggesting fixes for real vulnerabilities in production codebases, then helping prove the fixes hold.
- Autonomous coding: Claude Mythos Preview is able to handle the full engineering cycle more effectively than any prior model. It investigates, implements, and tests across large codebases from objective to shipped.
- Long-running agents: Claude Mythos Preview sets a new bar for long-horizon agentic work. It can sustain coherent execution over extended, multi-hour tasks, adapting as conditions change and driving work forward with fewer interventions.
Out of scope use cases
Claude Mythos Preview is only available as a gated research preview with access prioritized for defensive cybersecurity use cases. Please refer to the Claude Mythos Preview system card .Pricing
Azure credits cannot be applied to Claude models. Usage will be billed to the credit card associated with your subscription. Users are advised to review documentation here for applicable pricing and terms prior to deployment to avoid unintended charges.Technical specs
Please refer to the Claude Mythos Preview system card .Training cut-off date
End of December 2025Input formats
Image & text input: With powerful vision capabilities, Claude Mythos Preview can process images and return text outputs to analyze and understand charts, graphs, technical diagrams, reports, and other visual assets. Text output: Claude Mythos Preview can output text of a variety of types and formats, such as prose, lists, Markdown tables, JSON, HTML, code in various programming languages, and more.Supported language
Claude Mythos Preview can understand and output a wide variety of languages, such as English, French, Standard Arabic, Mandarin Chinese, Japanese, Korean, Spanish, and Hindi. Performance will vary based on how well-resourced the language is.Supported Azure regions
Global StandardSample JSON response
Input
{"model":"claude-mythos-preview","max_tokens":4096,"messages":[{"role":"user","content":"Hi, who are you?"}]}
Output
200: {"model":"claude-mythos-preview","id":"msg_01XivxT7GeJ5C2rrJZP2174N","type":"message","role":"assistant","content":[{"type":"thinking","thinking":"The user is asking a simple identity question. I should be clear, friendly, and concise. I'll introduce myself as Claude, mention I'm made by Anthropic, and briefly describe what I can do. I shouldn't be overly verbose for such a simple greeting.","signature":"<base64-encoded cryptographic signature>"},{"type":"text","text":"Hi there! I'm Claude, an AI assistant made by Anthropic. I'm here to help with a wide range of tasks—answering questions, writing, analysis, coding, brainstorming, math, and much more.\n\nWhat can I help you with today?"}],"stop_reason":"end_turn","stop_sequence":null,"stop_details":null,"usage":{"input_tokens":69,"cache_creation_input_tokens":0,"cache_read_input_tokens":0,"cache_creation":{"ephemeral_5m_input_tokens":0,"ephemeral_1h_input_tokens":0},"iterations":[{"input_tokens":69,"output_tokens":185,"cache_read_input_tokens":0,"cache_creation_input_tokens":0,"cache_creation":{"ephemeral_5m_input_tokens":0,"ephemeral_1h_input_tokens":0},"type":"message"}],"output_tokens":185,"service_tier":"standard","inference_geo":"global"}}
4XX: {"type":"error","error":{"type":"invalid_request_error","message":"max_tokens: Field required"},"request_id":"req_011CZfRSX1xudxHbTnPvjSaQ"}
Model architecture
Please refer to the Claude Mythos Preview system card .Long context
1MOptimizing model performance
Please refer to the Claude Mythos Preview system card .Additional assets
- Claude Documentation : Visit Claude documentation for a wealth of resources on model capabilities, prompting techniques, use case guidelines, and more.
- Adaptive Thinking Guide : Understand how best to use extended thinking with Claude.
- Claude Prompting Resources : Check out Anthropic's prompting tools and guides to learn how to craft prompts that elicit more helpful, nuanced responses.
- Claude Cookbooks : Check out example code for a variety of complex tasks, such as RAG from various web sources, making SQL queries, function calling, multimodal prompting, and more.
Distribution channels
Claude Mythos Preview is only available as a gated research preview with access prioritized for defensive cybersecurity use cases.More information
NAData handling
By default, we may process customer data in select countries in the US, Europe, Asia and Australia. We will only store data in data centers located in the United States. For more on data handling and retention, see our Privacy Center.By default, we will not use your inputs or outputs from our commercial products (Anthropic API and Claude Code Enterprise) to train our models. If you explicitly report feedback or bugs to us or otherwise choose to allow us to use your data, then we may use your chats and coding sessions to train our models.
To find out more information regarding your use of an Anthropic commercial offering, or if you would like to know how to contact us regarding a privacy related topic, see our Trust Center and Commercial Terms.
Safety techniques
The Claude Mythos Preview system card describes in detail the wide range of evaluations Anthropic ran to assess the model's safety and alignment.Safety evaluations
Claude Mythos Preview is a new class of intelligence representing significant advances across cybersecurity, autonomous coding, and long-running agents. Due to its advanced capabilities, Claude Mythos Preview is being deployed as a gated research preview prioritizing defensive cybersecurity use cases. The Claude Mythos Preview system card details the most comprehensive safety testing Anthropic has ever conducted for a model launch, including: static behavioral evaluations, automated interactive behavioral evaluations, dictionary-learning interpretability methods, activation oracles, white-box steering and probing methods, non-assistant persona sampling, misalignment-related capability evaluations, training data review, feedback from pilot use internally and externally, automated analysis of internal pilot use, and third-party behavioral assessments from the UK AI Security Institute and Andon Labs. We recommend that users exercise significantly increased caution with this model when using it in contexts where it could gain access to important systems, at least in the absence of close human supervision.Known limitations
Please refer to the Claude Mythos Preview system card .Acceptable use
Acceptable use policy
Anthropic's Usage Policy is intended to help users stay safe and promote the responsible use of Anthropic products and services.Terms of Service
Terms of Service Link
Claude is a proprietary model developed by Anthropic. Usage is governed by Anthropic's Commercial Terms of Service for API access.Quality and performance evaluations
| Benchmark | Test name | Claude Mythos Preview (gated research preview) score |
|---|---|---|
| Agentic coding | SWE-bench Verified | 93.9% |
| Agentic coding | SWE-bench Pro | 77.8% |
| Agentic terminal coding | Terminal-Bench 2.0 | 82.0% |
| Multimodal coding | SWE-bench Multimodal (internal implementation) | 59.0% |
| Multilingual coding | SWE-bench Multilingual | 87.3% |
| Multidisciplinary reasoning | Humanity's Last Exam | Without tools: 56.8%, With tools: 64.7% |
| Graduate-level reasoning | GPQA Diamond | 94.6% |
| Cybersecurity vulnerability reproduction | CyberGym | 83.1% |
| Agentic search | BrowseComp | 86.9% |
| Agentic computer use | OSWorld Verified | 79.6% |
Benchmarking methodology
Please refer to the Claude Mythos Preview system card for benchmarking methodology.Public data summary
Model Specifications
Context Length1000000
Training DataDecember 2025
Last UpdatedApril 2026
Input TypeText,Image,Code
Output TypeText,Image,Code
ProviderAnthropic
Languages8 Languages