gpt-oss-120B
Advanced open reasoning model with enterprise-grade capabilities.
About model
Enterprise-Ready Open Reasoning:
gpt-oss-120B delivers sophisticated chain-of-thought reasoning capabilities in a fully open model. Built with community feedback and released under Apache 2.0, this 120B parameter model provides transparency, customization, and deployment flexibility for organizations requiring complete data security & privacy control.
Model | AIME 2025 | GPQA Diamond | HLE | LiveCodeBench | MATH500 | SWE-bench verified |
|---|---|---|---|---|---|---|
gpt-oss-120B | 75.8% | Related open-source models | Competitor closed-source models | |||
90.5% | 34.2% | 78.7% | ||||
83.3% | 24.9% | 99.2% | 62.3% | |||
76.8% | 96.4% | 48.9% | ||||
49.2% | 2.7% | 32.3% | 89.3% | 31.0% |
API usage
Endpoint:
Model card
Architecture Overview:
• Mixture-of-Experts (MoE) architecture with SwiGLU activations
• Alternating attention layers between full context and sliding 128-token window
• Learned attention sink per-head for enhanced performance
Training Methodology:
• Comprehensive safety training and evaluation protocols
• Community feedback integration from global listening sessions
• Rigorous testing under Preparedness Framework
• Standard GPT-4o tokenizer with additional Harmony format tokens
Performance Characteristics:
• Native FP4 quantization for efficient inference
• 128K context window with RoPE positional encoding
• Chain-of-thought reasoning with adjustable effort levelsApplications & use cases
Enterprise Applications:
• Complex reasoning and analysis tasks
• Research and development support
• Technical documentation generation
• Strategic planning and decision support
Developer Use Cases:
• Code generation and review
• API development and integration
• System architecture design
• Technical troubleshooting and debugging
Industry Solutions:
• Healthcare: Clinical decision support and medical research
• Finance: Risk analysis and regulatory compliance
• Legal: Contract analysis and legal research
• Education: Curriculum development and tutoring systems
Deployment Scenarios:
• On-premises infrastructure for data sovereignty
• Private cloud deployments for security compliance
• Custom fine-tuning for domain-specific applications
• Multi-modal integration with existing systems
- TypeReasoning
- Main use casesChatSmall & FastMedium General Purpose
- FeaturesJSON Mode
- SpeedHigh
- IntelligenceHigh
- DeploymentServerlessOn-Demand DedicatedMonthly Reserved
- Endpoint
- Parameters120B
- Context length128K
- Input price
$0.15 / 1M tokens
- Output price
$0.60 / 1M tokens
- Input modalitiesText
- Output modalitiesText
- ReleasedAugust 4, 2025
- Last updatedAugust 18, 2025
- External link
- CategoryChat