Skip to content
View zhousheng97's full-sized avatar
🐢
Focusing
🐢
Focusing
  • Hefei University of Technology
  • China
  • 11:28 (UTC +08:00)

Block or report zhousheng97

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
zhousheng97/README.md

Hi there 👋

  • 👩 I’m Sheng Zhou, a PhD student from China, currently visiting the National University of Singapore.
  • 🧐 My focus is multimedia learning, especially VQA, and I’m currently exploring multimodal LLMs.
  • ✈️ 🏊 🎶 I enjoy traveling, sports like swimming and yoga, and music in all its moods, which energizes me for work.
  • 💬 As an ENFJ-A, I thrive on meaningful collaboration and communication.
  • 📫 You can reach me at hzgn97@gmail.com—let’s connect!
  • 🏃‍♀️ I believe scientific research is a marathon. Let’s keep going!

Pinned Loading

  1. Awesome-MLLM-TextVQA Awesome-MLLM-TextVQA Public

    ✨✨Latest Research on Multimodal Large Language Models on Scene-Text VQA Tasks

    5

  2. ViTXT-GQA ViTXT-GQA Public

    ✨✨ Scene-Text Grounding for Text-Based Video Question Answering (arxiv)

    Python 12 1

  3. GPIN GPIN Public

    Graph Pooling Inference Network for Text-based VQA (ACM TOMM'2024)

    Python 3

  4. SSGN SSGN Public

    Exploring Sparse Spatial Relation in Graph Inference for Text-Based VQA (IEEE TIP'2023)

    Python 3