Skip to content
View wondervictor's full-sized avatar
🤡
coding
🤡
coding

Highlights

  • Pro

Organizations

@hustvl @msra-alumni @HRNet @TencentARC

Block or report wondervictor

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
wondervictor/README.md

Hi there 👋

I'm Tianheng Cheng, and now a researcher at ByteDance Seed Team and working on cutting-edge large multimodal models and world models. I have finished my Ph.D. career at the HUST Vision Lab of Huazhong University of Science and Technology.

My lifelong research goal is to enable machines/robots to comprehend world knowledge and interact with environments like human beings.

Previous works/publications are listed at Google Scholar 📚.

Currently, I'm devoted to research about large multimodal models, foundational visual-language modeling, and image generation. Before that, I mainly focused on fundamental tasks such as object detection and instance segmentation, as well as visual perception for autonomous driving.

Pinned Loading

  1. AILab-CVC/YOLO-World AILab-CVC/YOLO-World Public

    [CVPR 2024] Real-Time Open-Vocabulary Object Detection

    Python 5.7k 539

  2. hustvl/SparseInst hustvl/SparseInst Public

    [CVPR 2022] SparseInst: Sparse Instance Activation for Real-Time Instance Segmentation

    Python 606 74

  3. hustvl/GKT hustvl/GKT Public

    Efficient and Robust 2D-to-BEV Representation Learning via Geometry-guided Kernel Transformer

    Python 239 18

  4. hustvl/Symphonies hustvl/Symphonies Public

    [CVPR 2024] Symphonies (Scene-from-Insts): Symphonize 3D Semantic Scene Completion with Contextual Instance Queries

    Python 185 6

  5. hustvl/EVF-SAM hustvl/EVF-SAM Public

    Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"

    Python 429 19

  6. hustvl/ControlAR hustvl/ControlAR Public

    [ICLR 2025] ControlAR: Controllable Image Generation with Autoregressive Models

    Python 277 8