Grok AI in 2026: What It Is, How It Works, and Whether It's Worth Using

314B: Parameters in the open-sourced Grok-1 model weights
500M+: X/Twitter users whose public posts Grok can query in real time
$6B: xAI funding raised as of early 2025 (Series C)

In This Article

  1. What Is Grok and Who Built It
  2. Grok 3 vs Grok 2: What Changed
  3. Grok vs ChatGPT vs Claude: Honest Comparison
  4. Grok's Real-Time X/Twitter Data Advantage
  5. Aurora: Grok's Image Generation Model
  6. When Grok Is the Right Choice
  7. Grok for Developers: API Setup and Code Examples
  8. Open Source Aspects: xAI's Approach
  9. Is Grok Catching Up to OpenAI and Anthropic?
  10. Frequently Asked Questions

Key Takeaways

Two years ago, Grok was a curiosity — a side project bolted onto X with an edgy personality and a few months of training data. In 2026, that characterization is out of date. Grok 3 benchmarks competitively against GPT-4o on several tasks, the xAI team has grown to hundreds of engineers, and the model's unique access to real-time X/Twitter data has become a genuine moat that neither OpenAI nor Anthropic has been able to replicate.

That does not mean Grok has surpassed the competition across the board. It has not. But the question is no longer whether xAI is serious — it clearly is — but where Grok is genuinely the best tool for the job and where it still falls short. That is what this guide covers, without the hype in either direction.

What Is Grok and Who Built It

Grok is a large language model built by xAI, Elon Musk's AI company founded in July 2023. It launched in November 2023 as an X Premium exclusive, trained partly on X/Twitter data — giving it native real-time social data access that no other major AI model has. Grok 3 (2025) is the current production model, available via the X platform and at console.x.ai.

Grok is a large language model (LLM) developed by xAI, the AI company Elon Musk founded in July 2023 after departing OpenAI's board. The name is a reference to Robert Heinlein's science fiction novel Stranger in a Strange Land, where "grokking" means to understand something so thoroughly that you merge with it. The branding was deliberate — and somewhat ironic given that Musk's departure from OpenAI was framed around concerns about AI safety and transparency, while Grok was subsequently positioned as a less filtered, more direct alternative to models like ChatGPT.

The first version of Grok launched in November 2023 as an exclusive feature for X Premium subscribers. It was trained partly on data from X/Twitter, which gave it something no other major model had: native access to real-time social data. Subsequent versions — Grok 1.5, Grok 2, and now Grok 3 — have expanded capabilities significantly while maintaining that real-time data integration as the core differentiator.

xAI at a Glance (2026)

Grok's personality is a notable departure from competitors. Where ChatGPT defaults to cautious, neutral responses and Claude leans heavily toward careful reasoning and safety caveats, Grok is deliberately more blunt — and has historically been more willing to engage with controversial topics that other models deflect. This is partly a product decision and partly a reflection of xAI's stated position that current AI models are over-censored. Whether that is an advantage or a liability depends on your use case.

Grok 3 vs Grok 2: What Changed

Grok 3 was trained on roughly 10x the compute of Grok 2 using xAI's 100,000-H100 Colossus cluster. The result: a 128K token context window, a "Think" mode chain-of-thought reasoning feature that pushes MATH benchmark performance from ~59% to ~76%, and Aurora image generation integrated into the same API — competitive with DALL-E 3 on photorealistic output.

Grok 3 represents xAI's first serious attempt to compete at the frontier of AI capability, not just in a niche. The improvements over Grok 2 are substantial across several dimensions.

Training Scale

Grok 3 was trained on xAI's Colossus cluster using over 100,000 H100 GPUs — roughly ten times the compute used for Grok 2. That raw scale matters. More compute, more data, and improved training techniques compound into meaningfully better models. xAI has not published full training details, but the benchmark improvements reflect a major investment in compute and data quality.

Reasoning Improvements

Grok 3 introduced a "Think" mode — a chain-of-thought reasoning approach similar to OpenAI's o1 series and Anthropic's extended thinking feature. When enabled, Grok 3 works through problems step by step before producing a final answer, which dramatically improves performance on math, logic, and multi-step coding tasks. On MATH and GPQA benchmarks published by xAI, Grok 3 with Think mode enabled outperforms GPT-4o on several categories.

Context Window

Grok 3 supports a 131,072-token context window (128K tokens), matching GPT-4o and falling just short of Claude 3.7's 200K token context. For most tasks this distinction is irrelevant, but for very long document analysis — entire codebases, lengthy legal documents, book-length research — Claude's larger context gives it an edge.
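For a rough sense of whether a long document will fit in that window before you send it, a character-count heuristic is often enough. This is a sketch: the 4-characters-per-token ratio is a common approximation for English text, not an exact tokenizer count, so leave a healthy margin.

```python
# Quick pre-flight check before sending a long document to a 128K-token
# model. ~4 characters per token is an English-text heuristic only.
CONTEXT_TOKENS = 131_072  # Grok 3's context window (128K)

def fits_in_context(text: str, reserved_for_output: int = 4_096) -> bool:
    """Return True if `text` likely fits, leaving room for the model's reply."""
    approx_tokens = len(text) // 4
    return approx_tokens <= CONTEXT_TOKENS - reserved_for_output
```

For precise counts you would run the provider's actual tokenizer, but for "can I paste this whole report?" decisions the heuristic is usually sufficient.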

Multimodality

Grok 3 handles both text and images as inputs. You can share screenshots, diagrams, and photos and ask questions about them. Image generation (not just understanding) is handled by Aurora, xAI's separate image model, which is covered in its own section below.

Grok 2 → Grok 3: The Numbers

Grok vs ChatGPT vs Claude: Honest Comparison

Grok 3 is competitive with GPT-4o and Claude 3.7 on math and reasoning benchmarks, but trails on instruction-following consistency and long-form writing quality. Its decisive advantage is real-time X/Twitter data access — a structural differentiator neither GPT-4o nor Claude can replicate via web search plugins. For current events, social sentiment, and X-specific tasks, Grok wins. For coding and analysis, GPT-4o and Claude still lead.

Here is the comparison most people actually want: how does Grok stack up against GPT-4o and Claude 3.7 on the dimensions that matter for real work? The table below reflects current performance as of Q1 2026 based on public benchmarks, community testing, and direct evaluation. No model wins every category.

| Capability | Grok 3 (xAI) | GPT-4o (OpenAI) | Claude 3.7 (Anthropic) |
| --- | --- | --- | --- |
| Coding | Good — competitive on common tasks, occasional inconsistency | Strong — large training data, reliable output | Best — highest consistency on complex, long-form code |
| Reasoning / Math | Strong — Think mode competes with o1 on hard math | Strong — o1/o3 series excel at deep reasoning | Good — extended thinking helps; slightly below o-series |
| Real-time data | Best — native X/Twitter integration, no web search lag | Good — web search via plugin, not native | Limited — web search available but not deeply integrated |
| Writing quality | Good — direct and clear; less nuanced on tone | Strong — versatile across styles | Best — most nuanced long-form writing |
| Context window | 128K | 128K | 200K |
| Image generation | Aurora — high-quality, integrated | DALL-E 3 — mature and widely used | None — no native image generation |
| API availability | Yes — newer, smaller ecosystem | Yes — mature, broadest ecosystem | Yes — strong developer tooling |
| Instruction following | Good — can drift on complex multi-step prompts | Strong | Best — highest scores on instruction benchmarks |
| Pricing (API, input) | ~$3 / 1M tokens | ~$2.50 / 1M tokens | ~$3 / 1M tokens |
| Open source | Partial — Grok-1 weights released; Grok 3 closed | Closed | Closed |

The honest summary: for most general-purpose tasks — coding assistance, writing, research, document analysis — GPT-4o and Claude 3.7 are still more consistent and reliable. Where Grok 3 wins decisively is anything requiring current data, social context, or X/Twitter-specific information. That is a genuinely unique advantage, and for certain applications, it is the only tool that can do the job.

Grok's Real-Time X/Twitter Data Advantage

Grok has native, real-time access to the full X/Twitter post graph — not scraped web pages or search results, but live posts, threads, and trending topics from 600M+ monthly active users as they occur. No other major AI model has a comparable real-time social data architecture. For journalists, market researchers, and anyone monitoring public discourse, this is a categorically different capability.

This is the most underappreciated part of Grok's story and the clearest reason to include it in your AI toolkit even if you use other models for most tasks.

Every other major AI model — GPT-4o, Claude, Gemini — has a training cutoff. When you ask them about current events, they either rely on stale knowledge or use a web search plugin that retrieves public web pages. Neither approach gives you access to the live stream of what people are saying, thinking, and sharing right now.

Grok has a fundamentally different architecture for current events: it has real-time, native access to the full X/Twitter post graph. Not scraped web pages. Not search engine results. The actual posts, threads, trending topics, and public conversations happening on X as they occur. This is not a feature — it is a structural advantage that competitors cannot easily replicate without a comparable social data asset.

600M+: Monthly active users on X, all of whose public posts feed Grok's real-time knowledge. No other AI model has comparable real-time social data access as a native capability.

What This Means in Practice

Ask Grok what people are saying about a product launch that happened three hours ago — it knows. Ask it to summarize the current narrative around a public figure or policy topic based on what X users are actually posting — it can do that with a granularity and immediacy that no other model matches. Ask it to identify emerging trends before they reach mainstream news — it has data that mainstream news does not yet have.

For journalists, market researchers, PR professionals, social media managers, and anyone whose work involves understanding public discourse in real time, this is a fundamentally different tool than anything else available. It does not replace GPT-4o or Claude for deep reasoning tasks, but for situational awareness of what the internet is actually saying right now, nothing else competes.

Use Cases Where Grok's Real-Time Data Wins

Aurora: Grok's Image Generation Model

Aurora is xAI's image generation model integrated directly into Grok and the Grok API. It produces photorealistic and illustrative images competitive with DALL-E 3, with fewer content restrictions than OpenAI applies — including images of public figures. For developers needing both text and image generation from a single API, Grok + Aurora eliminates a separate image provider integration.

Aurora is xAI's image generation model, integrated into Grok and available through the same interface and API. It was released alongside Grok 3 improvements and represents xAI's entry into the image generation market that DALL-E, Midjourney, and Stable Diffusion have dominated.

Aurora's quality is genuinely competitive with DALL-E 3 on photorealistic and illustrative image generation. Independent community comparisons have placed it in the same tier as DALL-E 3 and above Stable Diffusion XL on prompt adherence and visual coherence. It is not yet at the level of Midjourney v6 for artistic image generation, but for practical use cases — generating product mockups, illustrations for content, reference images for design — it performs well.

Where Aurora Stands Out

The main advantage Aurora has over DALL-E 3 is in generating images of real people, including public figures — a category where OpenAI applies aggressive restrictions. xAI has taken a less restrictive approach, which has drawn criticism from some quarters and significant interest from others. This is a deliberate policy difference, not a capability difference, and the right view of it depends on your use case and values.

Aurora is also available via the Grok API, meaning developers can integrate image generation into applications without managing a separate API key from a separate provider. For applications that need both text and image generation from a single model provider, the Grok API with Aurora integration simplifies the stack.
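Since Aurora is exposed through the same xAI API, an image request can reuse the OpenAI-style client and images endpoint. This is a hedged sketch: the model identifier "aurora" below is a placeholder (check console.x.ai for the name xAI actually exposes), and the request body follows the OpenAI images schema on the assumption that xAI mirrors it, as it does for chat completions.

```python
import os

def build_image_request(prompt: str, n: int = 1) -> dict:
    """Request body for an OpenAI-style images endpoint.

    "aurora" is a placeholder model identifier, not a confirmed name;
    verify the current identifier at console.x.ai.
    """
    return {"model": "aurora", "prompt": prompt, "n": n}

def generate_images(prompt: str, n: int = 1) -> list:
    """Send the request to the xAI API and return image URLs."""
    from openai import OpenAI  # lazy import so the sketch loads without the SDK
    client = OpenAI(api_key=os.environ["XAI_API_KEY"], base_url="https://api.x.ai/v1")
    response = client.images.generate(**build_image_request(prompt, n))
    return [image.url for image in response.data]
```

Keeping the request builder separate from the network call makes it easy to unit-test the payload without spending API credits.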

Aurora vs DALL-E 3 vs Midjourney v6

Aurora: Competitive photorealism, fewer content restrictions, integrated with Grok API. Best for real-person imagery and applications that need a single-vendor text + image stack.

DALL-E 3: Mature ecosystem, tightly integrated into ChatGPT and the OpenAI API, strong on abstract and creative prompts. More restrictive on certain content categories.

Midjourney v6: Still the leader for high-end artistic and stylized image generation. Not available via API. Best for design and creative work where aesthetics are the primary goal.

When Grok Is the Right Choice

Use Grok when you need real-time social data, current X/Twitter trends, breaking news context, or social sentiment analysis — no other model competes there. Use it when you want a single API for text and image generation via Aurora. For standard coding, writing, and deep reasoning tasks, GPT-4o and Claude 3.7 still deliver more consistent output.

The question most people need answered is not "is Grok good?" but "when should I reach for Grok instead of my default AI tool?" Here is a direct answer.

Use Grok when you need real-time social or news data. This is the clearest decision rule. If your task requires understanding what is happening right now — trending topics, breaking news context, current public sentiment, live event discussion — Grok is the only major model that can do this natively. No web search plugin from another provider matches the depth and immediacy of Grok's X data access.

Use Grok when you want image generation alongside text in a single API. For developers building applications that need both text and image generation, Grok + Aurora from xAI's single API is a cleaner integration than combining OpenAI for text and a separate image service.

Use Grok when you want less filtered responses on edge topics. If you are doing research or writing that requires a model willing to engage with controversial material that GPT-4o and Claude typically deflect, Grok's less restrictive defaults may serve you better. This is a practical consideration, not an endorsement of harmful content generation.

Stick with GPT-4o or Claude for most coding and writing work. On general-purpose coding, instruction-following, and long-form writing, GPT-4o and Claude 3.7 are still more consistent and produce fewer unexpected outputs. Grok 3 has closed the gap significantly, but for professional-grade work where reliability matters, the established leaders still have an edge on most tasks.
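These decision rules can be encoded directly as a routing function for applications that call more than one model. The task taxonomy and model identifiers below are illustrative choices for this sketch, not an official mapping from any provider.

```python
# Routing sketch following the decision rules above. Task names and
# model identifiers are this example's own convention.
REALTIME_TASKS = {"trending", "breaking_news", "social_sentiment", "x_search"}
IMAGE_TASKS = {"image_generation"}

def pick_model(task: str) -> str:
    """Map a task category to a model, per the rules in this section."""
    if task in REALTIME_TASKS:
        return "grok-3"             # only model with native X data access
    if task in IMAGE_TASKS:
        return "grok-3"             # Aurora rides on the same xAI API
    if task in {"long_form_writing", "complex_instructions"}:
        return "claude-3-7-sonnet"  # strongest instruction-following
    return "gpt-4o"                 # solid general-purpose default
```

A real router would classify the incoming request first (often with a cheap model like Grok 3 Mini) and then dispatch to the chosen provider.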

"Grok's real-time X data access is not a feature you can replicate by giving another model a web search tool. It is a structurally different kind of current-events intelligence."

Grok for Developers: API Setup and Code Examples

The Grok API is available at console.x.ai and intentionally mirrors the OpenAI API schema — switching from the OpenAI Python client requires only changing the base URL and model name. Pricing is ~$3/million input tokens and ~$15/million output tokens. It supports function calling, system prompts, streaming, and Aurora image generation via the same endpoint.

The Grok API is available through xAI's developer console at console.x.ai. The API follows the OpenAI API schema almost exactly — a deliberate choice by xAI to minimize migration friction for developers already using OpenAI's client libraries. In most cases, switching from the OpenAI API to the Grok API requires changing one URL and one model name.

Getting Started

  1. Create an account at console.x.ai
  2. Generate an API key from the dashboard
  3. Use the base URL https://api.x.ai/v1
  4. Specify the model as grok-3 or grok-3-mini
Python — Basic Grok API call (using OpenAI client)
from openai import OpenAI

client = OpenAI(
    api_key="YOUR_XAI_API_KEY",
    base_url="https://api.x.ai/v1"
)

response = client.chat.completions.create(
    model="grok-3",
    messages=[
        {"role": "system", "content": "You are a research assistant focused on current events."},
        {"role": "user", "content": "What are people on X saying about the Fed's rate decision today?"}
    ],
    temperature=0.7
)

print(response.choices[0].message.content)

Because xAI adopted OpenAI's API schema, you can use the official openai Python library with Grok simply by overriding the base URL. This means any application already built on the OpenAI SDK can be pointed at Grok with minimal changes — useful for A/B testing or for applications where you want real-time X data access without a full rebuild.
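One way to set up such an A/B test is a small provider table that the same OpenAI client reads from. This is a sketch; the environment-variable names are this example's own convention, not an SDK default.

```python
import os

# Provider table for A/B tests: same OpenAI client class, different
# base_url and model. Key-env names are this sketch's convention.
PROVIDERS = {
    "grok":   {"base_url": "https://api.x.ai/v1",       "model": "grok-3", "key_env": "XAI_API_KEY"},
    "openai": {"base_url": "https://api.openai.com/v1", "model": "gpt-4o", "key_env": "OPENAI_API_KEY"},
}

def make_client(provider: str):
    """Return (client, model_name) for the chosen provider."""
    from openai import OpenAI  # lazy import: config stays inspectable without the SDK
    cfg = PROVIDERS[provider]
    client = OpenAI(api_key=os.environ[cfg["key_env"]], base_url=cfg["base_url"])
    return client, cfg["model"]
```

Switching an experiment arm is then a one-string change, which is exactly the migration friction xAI's schema choice was meant to eliminate.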

Python — Grok API with streaming response
stream = client.chat.completions.create(
    model="grok-3",
    messages=[
        {"role": "user", "content": "Summarize the top 5 trending topics on X right now."}
    ],
    stream=True
)

for chunk in stream:
    if chunk.choices[0].delta.content is not None:
        print(chunk.choices[0].delta.content, end="", flush=True)
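The API also advertises OpenAI-style function calling. Below is a hedged sketch of a tool schema Grok could be asked to invoke; `get_trending_topics` is a hypothetical local function invented for this example, not an xAI built-in, and the schema follows the OpenAI "tools" format that xAI mirrors.

```python
# Function-calling sketch: an OpenAI-style tool schema. The function
# name get_trending_topics is hypothetical, defined by your own code.
TOOLS = [{
    "type": "function",
    "function": {
        "name": "get_trending_topics",
        "description": "Return trending topics for a region.",
        "parameters": {
            "type": "object",
            "properties": {
                "region": {"type": "string", "description": "e.g. 'US'"}
            },
            "required": ["region"],
        },
    },
}]

def request_kwargs(user_message: str) -> dict:
    """Keyword arguments for client.chat.completions.create(...)."""
    return {
        "model": "grok-3",
        "messages": [{"role": "user", "content": user_message}],
        "tools": TOOLS,
    }
```

When the model decides to call the tool, the response carries the function name and JSON arguments; your code executes the function and sends the result back in a follow-up message.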

Pricing

As of early 2026, Grok API pricing is approximately $3 per million input tokens and $15 per million output tokens for Grok 3. Grok 3 Mini (a smaller, faster model) is priced at roughly $0.30 per million input tokens and $0.50 per million output tokens — making it cost-competitive with GPT-4o Mini and Claude Haiku for high-volume applications where cost per token matters.
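Those per-token rates translate into budgets like this. A back-of-envelope sketch using the approximate prices above; verify current rates at console.x.ai before relying on them.

```python
# Approximate Q1 2026 prices, USD per million tokens (from the text above).
PRICES = {
    "grok-3":      {"input": 3.00, "output": 15.00},
    "grok-3-mini": {"input": 0.30, "output": 0.50},
}

def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost for a given token volume."""
    p = PRICES[model]
    return (input_tokens * p["input"] + output_tokens * p["output"]) / 1_000_000

# 10,000 requests of ~2K input / ~500 output tokens on grok-3-mini:
# estimate_cost("grok-3-mini", 10_000 * 2_000, 10_000 * 500)  # -> 8.5 (USD)
```

Note the 5x input/output asymmetry on Grok 3: for verbose-output workloads, output tokens dominate the bill, which is where the Mini tier's pricing pays off.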

API Pricing Summary (Approximate, Q1 2026)

Learn to build with Grok, GPT-4o, and Claude — in person.

Precision AI Academy's 3-day bootcamp covers the full modern AI stack: multiple model APIs, real-time data integration, prompt engineering, and production deployment. Five cities, October 2026.

Reserve Your Seat

$1,490 · Denver · NYC · Dallas · LA · Chicago · October 2026

Open Source Aspects: xAI's Approach

xAI released Grok-1's weights under Apache 2.0 in March 2024 — a genuine open-source release of a 314-billion-parameter Mixture of Experts model allowing commercial use and modification. Grok 2, Grok 3, and Aurora remain closed proprietary models. The open-source commitment is real but limited to the older, less capable model; the frontier production models are not open.

Elon Musk's original criticism of OpenAI was that it had abandoned its open-source, nonprofit roots. His founding of xAI was presented, in part, as a corrective. The reality of xAI's actual open-source commitments is more nuanced than the rhetoric.

Grok-1: The Real Open-Source Release

In March 2024, xAI released the weights for Grok-1 under an Apache 2.0 license. This was significant: Grok-1 is a 314-billion-parameter Mixture of Experts (MoE) model — at the time, one of the largest openly available model weights. Apache 2.0 allows commercial use, modification, and redistribution, making it a genuine open-source release rather than a research-only license like some Meta model releases.

The Grok-1 release was a meaningful contribution to the open-source AI community. Researchers and independent developers could download and run the full model (requiring substantial compute — around 8× A100 80GB GPUs at minimum), fine-tune it, or use it as a foundation for further research. Several derivative models and fine-tunes appeared in the months following the release.

The Frontier Models Stay Closed

Grok 2, Grok 3, and Aurora are not open source. The production models that power the X integration and the Grok API remain closed, with no announced plans to release their weights. This is the same position as OpenAI (despite the name) and Anthropic. xAI's open-source commitment is real in the sense that they released one model's weights — but the frontier production models are proprietary.

xAI Open Source Reality Check

For developers interested in running truly open models, the more relevant comparison is xAI's Grok-1 release against Meta's Llama 3.1 (70B and 405B), Mistral's models, and the broader open-weights ecosystem. Grok-1 is larger than most open alternatives, but Llama 3.1 405B is broadly considered more capable and has a more active fine-tuning community.

Is Grok Catching Up to OpenAI and Anthropic?

Yes — meaningfully, but not all the way. Grok 3 is a legitimate frontier model: competitive on math benchmarks with Think mode, in the same tier as DALL-E 3 on image generation, and definitively ahead on real-time social data. It still trails Claude 3.7 on instruction-following and long-form writing, and trails OpenAI's o3 on the hardest reasoning tasks. The pace of improvement from Grok 1 to Grok 3 is faster than OpenAI's at a comparable stage.

This is the question the whole industry is watching. The honest answer in early 2026: yes, meaningfully, but not all the way there yet.

The trajectory is the most important part of the story. Grok 1 was a proof-of-concept. Grok 2 was a credible competitor on a narrow set of tasks. Grok 3 is a legitimate frontier model that scores competitively on multiple major benchmarks and introduces novel capabilities (Think mode, Aurora, deep real-time X integration) that are not just catching up to competitors but establishing new categories.

Where the Gap Has Closed

On mathematical reasoning with Think mode enabled, Grok 3 matches or exceeds GPT-4o on several benchmark categories. On coding tasks with clear specifications, Grok 3 is competitive and developers report broadly similar quality outputs. On image generation with Aurora, Grok is in the same tier as DALL-E 3. And on real-time current events, Grok is definitively ahead of every competitor with no clear path for others to close that gap without owning a comparable real-time social platform.

Where the Gap Remains

Instruction-following consistency is still a gap. Claude 3.7 consistently outperforms Grok 3 on complex multi-step instructions where precise adherence to constraints matters. Long-form writing quality favors Claude. The OpenAI o3 model (the reasoning-specialized series) still outperforms Grok 3's Think mode on the hardest math and logic problems. And the ecosystem gap — integrations, libraries, community knowledge, fine-tuning guides — remains large because OpenAI and Anthropic have a two-year head start on developer adoption.

4th: Grok's approximate position in the frontier AI landscape as of early 2026 (behind OpenAI's o-series, Claude 3.7, and Gemini Ultra on most aggregate benchmarks), but moving up faster than any other model in the category.

The Trajectory Question

xAI has the compute (Colossus is among the largest GPU clusters in the world), the funding ($6B raised), the talent (several former OpenAI and DeepMind researchers), and the data asset (X/Twitter). The pace of improvement from Grok 1 to Grok 3 is faster than OpenAI's improvement over a comparable time period when they were at the same stage. Whether that trajectory continues is the open question. But dismissing Grok as not a serious competitor is a mistake — the same mistake many made about OpenAI itself in 2019.

The Practical Verdict on Grok in 2026

Grok does not replace your current AI tools. It extends them. Add Grok to your toolkit for real-time X data, social sentiment, and current events, where it has no equal. For coding, writing, and deep reasoning, continue to lean on GPT-4o and Claude 3.7. For image generation, Aurora is now a credible option alongside DALL-E 3. And watch the trajectory: xAI is improving quickly enough that committing to a single-vendor AI stack looks increasingly risky.

Build Real AI Applications — Not Just Demos

Understanding multiple AI models is the starting point. The real skill gap is in integrating them — routing Grok for real-time data, Claude or GPT-4o for reasoning, and Aurora or DALL-E for images into production applications that handle failure states, manage costs, and make architectural decisions at runtime. Most AI tutorials show a single model call; production apps chain multiple models.

Understanding Grok's capabilities is the starting point. Actually integrating multiple AI APIs — Grok for real-time data, GPT-4o or Claude for reasoning, Aurora or DALL-E for images — into production applications is where the real skill gap between developers gets created.

Most AI tutorials show you a single model call. Real applications chain multiple models, handle failure states, manage costs, stream responses, and make architectural decisions about which model to use for which task. Those decisions require hands-on experience with the full tool landscape, not just documentation reading.
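A minimal sketch of one such failure-handling pattern: chain providers in preference order with retries and exponential backoff. The structure is illustrative; production code would catch provider-specific exception types rather than bare `Exception`.

```python
import time

def call_with_fallback(callers, retries=2, backoff=0.5):
    """Try each provider-calling function in order, retrying transient failures.

    `callers` is a list of zero-argument functions, each wrapping one
    provider (e.g. Grok first for a real-time task, GPT-4o as fallback).
    """
    last_error = None
    for call in callers:
        for attempt in range(retries):
            try:
                return call()
            except Exception as err:  # narrow this to provider error types in real code
                last_error = err
                time.sleep(backoff * (2 ** attempt))  # exponential backoff between retries
    raise RuntimeError("all providers failed") from last_error
```

Combined with a task router, this is the skeleton of most multi-model production stacks: pick the preferred model per task, fall through to alternatives, and surface a single error only when every provider is down.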

What You Build in the Bootcamp

Bootcamp Details

Your employer can likely cover the cost. Under IRS Section 127, employers can provide up to $5,250 per year in educational assistance tax-free. At $1,490, this bootcamp falls comfortably within that limit. Read our guide on asking your employer to pay for AI training, including email templates you can use today.

Stop reading about AI models. Start building with them.

Three days. Five cities. Hands-on with Grok, GPT-4o, Claude, and the full modern AI stack. Small cohort, real deployments, and workflows you can use the next day at work.

Reserve Your Seat

Denver · Los Angeles · New York City · Chicago · Dallas · October 2026

The bottom line: Grok is not a ChatGPT or Claude replacement — it is a specialized tool with one clear structural advantage nobody else has: real-time native access to X/Twitter's full post graph. Add it to your AI toolkit for current events, social sentiment, and breaking news context. For coding, writing, and deep analysis, continue using your current primary model. Watch xAI closely — the improvement trajectory from Grok 1 to Grok 3 suggests the gap will keep narrowing.

Frequently Asked Questions

What is Grok AI and who made it?

Grok is a large language model built by xAI, the AI company founded by Elon Musk in 2023. It is integrated into the X platform (formerly Twitter) and available as a standalone assistant and via API. Grok's main differentiator is real-time access to data from X/Twitter, which no other major AI model has natively. The current version, Grok 3, was released in early 2025 and represents a significant capability leap over prior versions.

How does Grok 3 compare to ChatGPT and Claude?

Grok 3 is competitive with GPT-4o and Claude 3.7 on standard reasoning and coding benchmarks, though it trails on some tasks requiring nuanced instruction-following. Grok's unique advantage is real-time access to posts, trends, and public data from X/Twitter — something neither ChatGPT nor Claude can match natively. For general coding and writing tasks, GPT-4o and Claude 3.7 still have an edge in consistency. Grok is the clear winner when the task requires current events, social media data, or X-specific analysis.

Is the Grok API available and how much does it cost?

Yes, the Grok API is available through xAI's developer platform at console.x.ai. Pricing for Grok 3 is approximately $3 per million input tokens and $15 per million output tokens — competitive with GPT-4o and Claude 3.5 Sonnet. The API supports function calling, system prompts, and streaming. Because xAI adopted OpenAI's API schema, switching from the OpenAI Python client to Grok requires only changing the base URL and model name.

Is Grok open source?

Partially. xAI released the weights for Grok-1 (314 billion parameters) under an Apache 2.0 license in March 2024 — a genuine open-source release allowing commercial use and modification. However, Grok 2, Grok 3, and Aurora remain closed. xAI's production frontier models are proprietary, the same as OpenAI and Anthropic. The Grok-1 release was meaningful for the research community but does not represent the capability level of the current production models.



Bo Peng

AI Instructor & Founder, Precision AI Academy

Bo has trained 400+ professionals in applied AI across federal agencies and Fortune 500 companies. Former university instructor specializing in practical AI tools for non-programmers. Kaggle competitor and builder of production AI systems. He founded Precision AI Academy to bridge the gap between AI theory and real-world professional application.
