Daily AI Brief · 2026-06-18

Models

New models, weights and benchmarks.

Improving health intelligence in ChatGPT

Learn how GPT-5.5 Instant improves ChatGPT’s health and wellness responses with stronger reasoning, better context, clearer communication, and physician-informed evaluations.

推理

Simon WillisonRSS·7d ago64

GLM-5.2 is probably the most powerful text-only open weights LLM

Chinese AI lab Z.ai released GLM-5.2 to their coding plan subscribers on June 13th, and then yesterday (June 16th) released the full open weights under an MIT license. Similar in size to their previous GLM-5 and GLM-5.1 releases, this is 753B parameter, 1.51TB monster - with 40 active parameters (Mixture of Experts). GLM-5.2 is a text input only model - Z.ai have a separate vision family most recently represented by GLM-5V-Turbo, but that one isn't open weights. GLM-5.2 has a 1 million token context window, up from GLM-5.1's 200,000. The buzz around this model is strong. Artificial Analysis, who run one of the most widely respected independent benchmarks: GLM-5.2 is the new leading open weights model on the Artificial Analysis Intelligence Index. GLM-5.2 is the leading open weights model on the Intelligence Index v4.1. At 51, it leads MiniMax-M3 (44), DeepSeek V4 Pro (max, 44) and Kimi K2.6 (43) They did however find it to be quite token-hungry: GLM-5.2 uses more output toke

多模态编程

Hugging FaceRSS·7d ago63

Beyond LoRA: Can you beat the most popular fine-tuning technique?

Hugging FaceRSS·7d ago63

Is it agentic enough? Benchmarking open models on your own tooling

智能体

Products

Product launches and noteworthy updates.

TechCrunch AIRSS·6d ago71

Almost half of U.S. singles feel negatively about AI in dating, Match says

About 47% of singles look negatively at the use of AI in dating -- but, many dating app users are open to AI helping with profile punch-ups and conversation starters.

TechCrunch AIRSS·6d ago71

‘Queer Eye’s’ life coach Karamo Brown launches Kē, a wellness app featuring his AI digital clone

Karamo Brown, famous for his pep talks on Netflix’s “Queer Eye,” has jumped into the wellness and AI space with his new app, Kē. After spending a year and a half focusing on his own journey—from fitness and nutrition to meditation, sobriety, relationships, and personal growth—Brown wants to help others do the same. Kē offers […]

TechCrunch AIRSS·6d ago71

Pixi’s new iOS app turns text messages into interactive AR experiences

Forget stickers, GIFs, and emoji reactions. Pixi is betting that the next evolution of messaging is interactive augmented reality (AR).

Industry

Funding, policy and market moves.

TechCrunch AIRSS·6d ago71

AI inference startup Baseten reportedly raising $1.5B months after its last mega round

Startup Baseten is reportedly close to finalizing a $1.5 billion round at a $13 billion as the “inference gold rush" marches on.

推理

TechCrunch AIRSS·6d ago71

Snap spins off AI video team into new company, Dotmo, due to costs

The Snapchat maker is spinning off yet another internal unit. Dotmo will be comprised of current Snap staff who are leaving the social media company to focus on AI video development.

多模态

TechCrunch AIRSS·6d ago71

AI data centers just got a government-mandated fast lane to the grid

FERC told grid operators to give data centers a fast lane for interconnections, but it failed to address electricity supply shortages.

TechCrunch AIRSS·6d ago71

The smartphone era created an attention crisis. Slowtech is fixing it

“People just really want to take back control of their time, their lives, their attention... They’re down for whatever helps them do that.”

TechCrunch AIRSS·6d ago71

General Intuition in talks to raise $300M at around $2B valuation

General Intuition is in talks to raise around $300 million at a roughly $2 billion valuation from backers including Jeff Bezos. The startup trains AI agents on spatial-temporal reasoning.

推理融资

Papers

Research worth a read.

Hugging FaceRSS·6d ago70

MosaicLeaks: Can your research agent keep a secret?

智能体

arXiv cs.LGPaper·7d ago61

DRIFT: Refining Instruction Data via On-Policy Data Attribution

arXiv:2606.18307v1 Announce Type: new Abstract: Optimizing the training data distribution for Supervised Fine-Tuning (SFT) dictates the capability of Large Language Models (LLMs). While existing data curation methods excel at accelerating training under constrained budgets, they are less suited to elevating the capability upper bound. The challenge here is no longer to identify a smaller subset that preserves performance, but to refine the data distribution toward instances most capable of improving the final model. To address this problem, we explore instance-level data attribution using Influence Functions (IF). We identify that standard IF formulations struggle in this setting due to two structural limitations: a proximity gap caused by off-policy validation targets, and a severe bias towards gradient norm. We propose DRIFT (Data Refinement via On-Policy Influence Functions for Supervised Fine-Tuning). Instead of relying on external reference data, DRIFT utilizes the model's on-pol

arXiv cs.CLPaper·7d ago61

Towards Scalable Customization and Deployment of Multi-Agent Systems for Enterprise Applications

arXiv:2606.18502v1 Announce Type: new Abstract: Large language model (LLM)-based multi-agent systems demonstrate strong performance on complex reasoning and task execution, enabling broad enterprise applications. However, production deployment remains challenging due to domain-specific customization requirements and high latency and inference costs in agentic workflows. We propose a unified framework for customization and efficient deployment of multi-agent systems in real-world settings. The first stage, Agentic Model Customization, combines continual pretraining, supervised fine-tuning, and preference optimization to adapt a compact model to specialized domains while retaining strong agentic capabilities. The second stage, Inference Optimization, integrates speculative decoding and FP8 quantization with targeted calibration to enable cost-efficient serving with minimal quality loss. Across enterprise workloads, our framework enables rapid domain adaptation and achieves a 4.48x speed

推理智能体

arXiv cs.CLPaper·7d ago61

MCompassRAG: Topic Metadata as a Semantic Compass for Paragraph-Level Retrieval

arXiv:2606.18508v1 Announce Type: new Abstract: Retrieval-augmented generation (RAG) systems depend critically on how documents are chunked and searched. Fine-grained chunks can improve retrieval precision but expand the search space, increasing latency and cost; larger chunks reduce the number of candidates but make dense similarity less reliable, as the representation for each chunk mixes multiple topics and introduces more semantic noise. This trade-off becomes especially limiting in deep research tasks, where retrieval must be both fast and precise across large, heterogeneous corpora. We introduce MCompassRAG, a metadata-guided retrieval framework that uses topic-level signals as a semantic compass for selecting relevant evidence. Instead of relying only on cosine similarity between queries and noisy chunk embeddings, MCompassRAG enriches chunk representations with topic metadata in the same embedding space and trains a lightweight retriever through LLM-teacher distillation. At in

arXiv cs.CLPaper·7d ago61

Speech-Driven End-to-End Language Discrimination towards Chinese Dialects

arXiv:2606.18584v1 Announce Type: new Abstract: Language discrimination among similar languages, varieties, and dialects is a challenging natural language processing task. The traditional text-driven focus leads to poor results. In this paper, we explore the effectiveness of speech-driven features towards language discrimination among Chinese dialects. First, we systematically explore the appropriateness of speech-driven MFCC features towards CNN-based language discrimination. Then, we design an end-to-end speech recognition model based on HMM-DNN to predict Chinese dialect words. We adopt attention to extract the discriminative words related to different Chinese dialects. Finally, through a CNN, we combine the word-level embedding and the MFCC-based features. Evaluation of two benchmark Chinese dialect corpora shows the appropriateness and effectiveness of the proposed speech-driven approach to fine-grained Chinese dialect discrimination compared to the state-of-the-art methods.

Big Tech

What the major labs and platforms shipped.

OpenAI BlogRSS·6d ago79

New usage analytics and updated spend controls for enterprises

OpenAI introduces new spend controls and usage analytics for ChatGPT Enterprise, helping organizations manage costs and scale AI with confidence.

OpenAI BlogRSS·6d ago79

Using AI to help physicians diagnose rare genetic diseases affecting children

Researchers used an OpenAI reasoning model to help diagnose rare diseases, identifying 18 new diagnoses in previously unsolved cases.

推理

TechCrunch AIRSS·6d ago71

OpenAI is bringing on some big guns in the lead-up to its IPO

OpenAI is bulking up before its IPO, landing Transformer co-inventor Noam Shazeer from Google DeepMind and former Trump AI policy official Dean Ball in the same week.

TechCrunch AIRSS·6d ago71

Amazon hopes to challenge Nvidia more directly by selling its AI chips

AWS is in talks to sell its chips to other data centers. CEO Andy Jassy has said this represents a $50 billion opportunity for the company.

TechCrunch AIRSS·7d ago71

How to turn off AI in your Google Docs

Here's what you need to do to get those pesky "write with Gemini" pop-ups to go away.

AI Hot Daily Brief · 2026-06-18

Models

Improving health intelligence in ChatGPT

GLM-5.2 is probably the most powerful text-only open weights LLM

Beyond LoRA: Can you beat the most popular fine-tuning technique?

Is it agentic enough? Benchmarking open models on your own tooling

Products

Almost half of U.S. singles feel negatively about AI in dating, Match says

‘Queer Eye’s’ life coach Karamo Brown launches Kē, a wellness app featuring his AI digital clone

Pixi’s new iOS app turns text messages into interactive AR experiences

Industry

AI inference startup Baseten reportedly raising $1.5B months after its last mega round

Snap spins off AI video team into new company, Dotmo, due to costs

AI data centers just got a government-mandated fast lane to the grid

The smartphone era created an attention crisis. Slowtech is fixing it

General Intuition in talks to raise $300M at around $2B valuation

Papers

MosaicLeaks: Can your research agent keep a secret?

DRIFT: Refining Instruction Data via On-Policy Data Attribution

Towards Scalable Customization and Deployment of Multi-Agent Systems for Enterprise Applications

MCompassRAG: Topic Metadata as a Semantic Compass for Paragraph-Level Retrieval

Speech-Driven End-to-End Language Discrimination towards Chinese Dialects

Big Tech

New usage analytics and updated spend controls for enterprises

Using AI to help physicians diagnose rare genetic diseases affecting children

OpenAI is bringing on some big guns in the lead-up to its IPO

Amazon hopes to challenge Nvidia more directly by selling its AI chips

How to turn off AI in your Google Docs