Official

openai / gpt-4.1-mini

Fast, affordable version of GPT-4.1

  • Public
  • 184.4K runs

Pricing

Official model
Pricing for official models works differently from other models. Instead of being billed by execution time, you're billed by input and output tokens, which makes costs more predictable.

This model is priced by the number of input tokens sent and the number of output tokens generated.

Check out our docs for more information about how per-token pricing works on Replicate.
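As a rough illustration of how per-token billing adds up, here is a minimal sketch of the arithmetic. The rates used below are placeholders, not this model's actual prices; check the pricing table and docs for the current per-token rates.

```python
# Minimal sketch of per-token cost estimation.
# The rates below are PLACEHOLDERS, not the model's actual prices;
# see Replicate's pricing docs for current per-token rates.

INPUT_PRICE_PER_MILLION = 0.40   # hypothetical $ per 1M input tokens
OUTPUT_PRICE_PER_MILLION = 1.60  # hypothetical $ per 1M output tokens

def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of a single request in dollars."""
    return (
        input_tokens / 1_000_000 * INPUT_PRICE_PER_MILLION
        + output_tokens / 1_000_000 * OUTPUT_PRICE_PER_MILLION
    )

# Example: a request with 2,000 input tokens and 500 output tokens.
print(f"${estimate_cost(2_000, 500):.6f}")
```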

Readme

GPT‑4.1 mini is a compact, high-performance model designed for real-world applications that require fast response times and low cost—without sacrificing intelligence. It delivers performance competitive with GPT‑4o while cutting latency nearly in half and reducing cost by 83%.

Key Features

  • Fast and lightweight, ideal for latency-sensitive use cases
  • High accuracy across coding, reasoning, and instruction following
  • Supports a context window of up to 1 million tokens
  • Cost-effective for large-scale deployments
  • Reliable for long-context and format-specific tasks

Benchmark Highlights

  • SWE-bench Verified (coding): 24%
  • MultiChallenge (instruction following): 36%
  • IFEval (format compliance): 84%
  • Aider (diff format accuracy): 45%
  • MMMU (vision QA): 73%

Use Cases

  • Chatbots and assistants
  • Lightweight code generation and review
  • Document Q&A and summarization
  • Image reasoning
  • High-volume, cost-sensitive tasks

Notes

  • Available via the OpenAI API and on Replicate (see the example below)
  • Not currently available in ChatGPT
  • Supports up to 1 million tokens of context
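
As a minimal sketch of running the model from this page through Replicate's Python client: the example assumes the model exposes a `prompt` input, as Replicate's official language models generally do, and that `REPLICATE_API_TOKEN` is set in your environment. Check the model's API schema for the exact input parameters.

```python
# Minimal sketch of running the model with Replicate's Python client.
# Assumes a `prompt` input, as official language models on Replicate
# typically accept; check the model's API schema for exact parameters.
# Requires the REPLICATE_API_TOKEN environment variable to be set.
import replicate

output = replicate.run(
    "openai/gpt-4.1-mini",
    input={
        "prompt": "Summarize the key trade-offs of using a smaller model.",
    },
)

# Language models on Replicate return output as chunks of text;
# join them to get the full response.
print("".join(output))
```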