KubeCon EU 2025 Summary

Posted on 2025-04-11 in Work • 316 words • 2 minute read

Tags: 2025, opensource, kubecon

Just finished the KubeCon EU 2025 and will end my vocation tomorrow, I’m going to write down something to summarize my KubeCon trip.

The White Cliffs

Posted on 2025-04-08 in Travel • 6 words • 1 minute read

Tags: 2025, england, hiking

Hiking in the South of England.

Stamford Bridge

Posted on 2025-04-03 in Travel • 2 words • 1 minute read

Tags: 2025, stadium, football, england, london

Stamford Bridge.

KubeCon London - Sailing Multi-Host Inference with LWS

Posted on 2025-04-02 in Work • 134 words • 1 minute read

Tags: 2025, kubernetes, kubecon, opensource, inference, talk, england, london

[Slides] [Project]

Inference workloads are becoming increasingly prevalent and vital in Cloud Native world. However, it’s not easy, one of the biggest challenges is large foundation model can not fit into a single node, which brings out the distributed inference with model parallelism, again, make serving inference workloads more complicated.

Emirates Stadium

Posted on 2025-04-01 in Travel • 2 words • 1 minute read

Tags: 2025, stadium, football, england, london

Emirates Stadium.

Anfield Stadium

Posted on 2025-03-31 in Travel • 2 words • 1 minute read

Tags: 2025, stadium, football, england, liverpool

Anfield Stadium.

Old Trafford

Posted on 2025-03-30 in Travel • 4 words • 1 minute read

Tags: 2025, stadium, football, england, manchester

Raining in Old Trafford.

Etihad Stadium

Posted on 2025-03-29 in Travel • 2 words • 1 minute read

Tags: 2025, stadium, football, england, manchester

Etihad Stadium.

Tottenham Hotspur Stadium

Posted on 2025-03-28 in Travel • 3 words • 1 minute read

Tags: 2025, stadium, football, england, london

Tottenham Hotspur Stadium.

KServe, AIBrix, and llmaz

Posted on 2025-03-05 in Work • 364 words • 2 minute read

Tags: 2025, ai, inference

As a follower and active contributor for inference platform, I created the llmaz project to provide an unified inference platform for LLMs and also joined the AIBrix community to build the next-gen GenAI infrastructure.