- 👩 I’m Sheng Zhou, a PhD student from China, currently visiting the National University of Singapore.
- 🧐 My focus is multimedia learning, especially VQA, and I’m currently exploring multimodal LLMs.
✈️ 🏊 🎶 I enjoy traveling, sports like swimming and yoga, and music in all its moods, which energizes me for work.- 💬 As an ENFJ-A, I thrive on meaningful collaboration and communication.
- 📫 You can reach me at hzgn97@gmail.com—let’s connect!
- 🏃♀️ I believe scientific research is a marathon. Let’s keep going!
🐢
Focusing
Pinned Loading
-
Awesome-MLLM-TextVQA
Awesome-MLLM-TextVQA Public✨✨Latest Research on Multimodal Large Language Models on Scene-Text VQA Tasks
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.