- π Lead Software Engineer at Baseten, focusing on model performance optimization
- πΌ Previously at Meituan, specialized in GPU inference using TensorFlow/TensorRT for CTR and PyTorch for LLMs. Earlier at Baidu, familiar with bRPC and Babylon
- π» Open Source: Team Member at LMSYS Org, core developer of SGLang, committer for FlashInfer and LMDeploy
- π Check out my talk on SGLang at GPU MODE
- π« Contact: me@zhyncs.com | Telegram
- π More: LinkedIn | Homepage
- π The best way to reach me is through the SGLang Slack. We're seeking open-source enthusiasts and learners to help develop the SGLang project and community. If you'd like to talk to me on Google Meet, you can schedule a time through my Calendly.
zhyncs
Follow
π―
Pinned Loading
-
sgl-project/sglang
sgl-project/sglang PublicSGLang is a fast serving framework for large language models and vision language models.
-
flashinfer-ai/flashinfer
flashinfer-ai/flashinfer PublicFlashInfer: Kernel Library for LLM Serving
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.