Yineng Zhang zhyncs

🔭 Lead Software Engineer at Baseten, focusing on model performance optimization
💼 Previously at Meituan, where I specialized in GPU inference using TensorFlow/TensorRT for CTR models and PyTorch for LLMs. Earlier at Baidu, where I became familiar with bRPC and Babylon
💻 Open Source: Team Member at LMSYS Org, core developer of SGLang, committer for FlashInfer and LMDeploy
👀 Check out my talk on SGLang at GPU MODE
📫 Contact: me@zhyncs.com | Telegram
📄 More: LinkedIn | Homepage
🙌 The best way to reach me is through the SGLang Slack. We're seeking open-source enthusiasts and learners to help develop the SGLang project and community. If you'd like to talk to me on Google Meet, you can schedule a time through my Calendly.
