SID AI’s SID-1 is the first retrieval model trained end-to-end with RL. Some observations through a search-and-IR lens: the middle of the retrieval pipeline collapses into one trained model, the NDCG reward gets deliberately bent toward recall, and the agentic-retrieval loop becomes a subagent you hand to a larger system.
Laurie Voss says applied-AI iteration has moved off the model and into "the harness". He’s right - and once you strip the new vocabulary, the harness is mostly a retrieval system.
xAI open-sourced the For You feed algorithm today. Three observations through a search-and-IR lens: two-tower’s quiet dominance, the retrieve/rank split surviving the bitter lesson, and recsys converging with search.
TurboQuant landed as a KV cache result, but the more interesting application might be ColBERT-style late interaction. Here’s the case, and the open questions.
A new quantisation method out of Google Research is making the rounds. Qdrant shipped it. Elastic ran the benchmarks and politely declined. Both responses tell you something useful.