xAI's May 15 Update Makes X's Phoenix Ranking Engine Fully Runnable

This photograph taken on January 13, 2025 in Toulouse shows — LIONEL BONAVENTURE/AFP via Getty Images

Three days after xAI pushed a sweeping 187-file, 18,000-line update to its open-source recommendation repository, the developer community has produced something the platform's 600 million users never had before: a verified map of what X's algorithm structurally rewards. The most-shared finding, drawn from the original 2023 scoring weights and circulated by OpenTweet after the May 15 release, reorders years of creator strategy: when an original author replies back to someone who replied to their post, that single two-way exchange carries roughly the same weight as 150 likes in the Phoenix ranking model.

That number has cascaded through content and marketing communities faster than the code release itself. If accurate, it rewrites the premise of X growth strategy — passive like accumulation is a poor substitute for sparking genuine conversation, then participating in it.

The May 15 commit to xai-org/x-algorithm — xAI's open-source repository for the algorithm powering X's "For You" feed — is categorically different from earlier releases. The repository was first published in January 2026, when xAI committed to a four-week public update cadence that has held without interruption. Where earlier commits were primarily reference material, the May 15 update ships something a developer can actually run.

Phoenix, the Transformer Behind Your Feed, Is Now Downloadable

At the center of the update is a downloadable pre-trained model artifact known as mini Phoenix. The checkpoint, approximately 3 gigabytes and distributed via Git Large File Storage, implements 256-dimensional embeddings, four attention heads, and two transformer layers — a compressed but architecturally faithful representation of the production ranking system. For the first time, a developer with sufficient hardware can run the full retrieval-to-ranking sequence that determines what 600 million users see when they open the app.

The pipeline is now unified: a single entry point runs retrieval and ranking together, mirroring how the two stages are composed on X's production servers. Prior releases required separate scripts for each step. The Phoenix transformer itself is ported from xAI's open-source Grok-1 language model, adapted for engagement prediction rather than language generation. As of this update, the system has eliminated nearly all hand-engineered features from the ranking process — relevance is now learned entirely from user engagement sequences.

The repository, written 62.9% in Rust and 37.1% in Python under an Apache 2.0 license, also ships three other significant additions in the May 15 commit: an ad-blending module that exposes how sponsored content is woven into organic ranking, including the brand-safety signals that determine placement; a Grox content-understanding subsystem that classifies posts by topic and sentiment before they reach the ranker; and six new candidate hydrators — the enrichment layers that attach engagement counts, language codes, mutual-follow scores, and other features to posts before ranking.

The 150-Likes Finding Comes From 2023 — Not This Release

The reply-equals-150-likes figure driving creator attention right now requires an important caveat. In the January 2026 release, xAI redacted the numerical engagement weights from the codebase, citing security reasons. The specific scoring coefficients that analysts have circulated — including the +75.0 weight for an author replying back to a reply, versus +0.5 for a like — come from the 2023 Twitter/the-algorithm release, which did include explicit weights. Those 2023 figures are widely treated in the research community as directionally accurate for the current Phoenix system, but xAI has not confirmed they still apply.

John Thickstun, an assistant professor of computer science at Cornell University, reviewed the January release and described the codebase in Engadget as a "redacted" version of X's algorithm. "What troubles me about these releases is that they give you a pretense that they're being transparent for releasing code and the sense that someone might be able to use this release to do some kind of auditing work or oversight work," Thickstun said. "And the fact is that that's not really possible at all."

Ruggero Lazzaroni, a doctoral researcher at the University of Graz working on an EU-funded project exploring alternative recommendation algorithms, put the practical limitation plainly: "We have the code to run the algorithm, but we don't have the model that you need to run the algorithm."

The production weights powering the live X feed remain proprietary. Mini Phoenix is a high-fidelity architectural proxy, not a clone of what X's servers are running today.

What the Architecture Itself Confirms

Even without published weights, the structure of the 2026 release reveals a great deal about how Phoenix decides what to rank. The model predicts probabilities for multiple engagement types per post — likes, replies, reposts, quote posts, clicks, profile visits, bookmarks, video watch completions, dwell time, and negative signals such as mutes, blocks, and reports. Positive-action probabilities receive positive multipliers; negative-action probabilities reduce the final score. The weighted sum of all predictions becomes a post's rank score.

One key design choice visible in the code: during ranking, candidate posts cannot attend to each other in the transformer's attention layer. Each post is scored based on user context alone, not on what else is in the same processing batch. This keeps scores consistent and cacheable — a post's rank does not change based on what it is processed alongside.

Mohsen Foroughifar, an assistant professor of business technologies at Carnegie Mellon University, flagged the deeper limitation this creates for fairness research. Without access to the training data, he told Engadget, the model could carry learned biases from historical engagement patterns regardless of how cleanly the architecture is written.

The First Inspectable Production Recommendation Engine at Scale

The academic implications extend beyond creator strategy. Recommendation researchers have historically been limited to behavioral experiments, synthetic proxies, and occasional leaked documents when studying large-platform systems. The May 15 release gives researchers a directly inspectable codebase powering a platform at production scale, with explicit feature definitions in the candidate hydrators and a verified architecture for both retrieval and ranking.

Questions that were previously difficult to investigate systematically — how the system weights political content relative to entertainment, whether it amplifies certain languages or suppresses others, how mutual-follow relationships affect new users — now have a codebase to interrogate. A 2025 ACM conference paper auditing X's amplification patterns during the 2024 U.S. presidential election found that the algorithm disproportionately amplified high-popularity accounts and showed a default lean toward partisan content for new users. That kind of research, previously conducted through behavioral observation alone, can now be grounded against production architecture.

The European Commission has an active formal investigation into X's recommender systems under the Digital Services Act, extended from proceedings opened in December 2023. The Commission levied a €120 million fine on X in December 2025 for separate transparency failures, including non-compliance with advertising archive requirements. The open-source release does not resolve those proceedings, but it adds a layer of external auditability that no other platform of comparable scale currently provides.

No other platform with a user base in that range — Meta, TikTok, YouTube, Snapchat — has published a runnable production recommendation engine. Whether that reflects a genuine commitment to transparency, a bid to attract developer talent, or confidence that the competitive advantage lies in training data and compute rather than architecture is a reasonable question. What the May 15 commit resolves is narrower and more concrete: for the first time, the system that shapes what 600 million people see every day can be read, run, and tested by anyone with a terminal — even if the weights that make it perform like the production system remain out of reach.

The repository is available at github.com/xai-org/x-algorithm under an Apache 2.0 license.

Tags:XAI Digital Services Act

Join the Discussion

xAI’s May 15 Update Makes X’s Phoenix Ranking Engine Fully Runnable — and One Reply Outweighs 150 Likes

The largest commit to xAI’s open-source algorithm turns a code reference into an operational tool — but the scoring math that reshapes creator strategy comes from 2023 weights the company redacted in its newest release

Phoenix, the Transformer Behind Your Feed, Is Now Downloadable

The 150-Likes Finding Comes From 2023 — Not This Release

What the Architecture Itself Confirms

The First Inspectable Production Recommendation Engine at Scale

The Mandalorian and Grogu Depicts Real Ion Propulsion: NASA's Psyche Used at Mars Two Days Before LA Premiere

Top Digital Skills Everyone Should Learn for Career Growth and Everyday Life

ASUS, Xreal R1 Gaming Display Glasses Now Open For Preorders—Is it Worth it?

WhatsApp Rolls Out Liquid Glass Adoption on iOS to More Users—What to Expect?

Party Animals Studio Recreate Games Loses Steam "Very Positive" Rating in Under 24 Hours Over $75,000 AI Video Contest