Synthical
Your space
Profile
Activity
Favorites
Folders
Feeds
All articles
Simple
Original
Articles by
Rui Qian
Reasoning to Attend: Try to Understand How <SEG> Token Works
20 January 2025 by
Rui Qian
and
others
Computer Vision and Pattern Recognition
OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?
9 January 2025 by
Yifei Li
and
others
Computer Vision and Pattern Recognition
,
Artificial Intelligence
Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction
6 January 2025 by
Rui Qian
and
others
Computer Vision and Pattern Recognition
SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree
17 December 2024 by
Shuangrui Ding
and
others
Computer Vision and Pattern Recognition
Imagen 3
13 December 2024 by
Imagen-Team-Google
and
others
Computer Vision and Pattern Recognition
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
12 December 2024 by
Pan Zhang
and
others
Computer Vision and Pattern Recognition
,
Artificial Intelligence
SimC3D: A Simple Contrastive 3D Pretraining Framework Using RGB Images
6 December 2024 by
Jiahua Dong
and
others
Computer Vision and Pattern Recognition
Rethinking Image-to-Video Adaptation: An Object-centric Perspective
9 July 2024 by
Rui Qian
and
others
Computer Vision and Pattern Recognition
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
3 July 2024 by
Pan Zhang
and
others
Computer Vision and Pattern Recognition
,
Computation and Language
Greedy Growing Enables High-Resolution Pixel-Based Diffusion Models
27 May 2024 by
Cristina Vasconcelos
and
others
at
Google
Computer Vision and Pattern Recognition
,
Machine Learning
Streaming Long Video Understanding with Large Language Models
25 May 2024 by
Rui Qian
and
others
Computer Vision and Pattern Recognition
SongComposer: A Large Language Model for Lyric and Melody Composition in Song Generation
27 February 2024 by
Shuangrui Ding
and
others
Sound
,
Artificial Intelligence
VideoPrism: A Foundational Visual Encoder for Video Understanding
20 February 2024 by
Long Zhao
and
others
Computer Vision and Pattern Recognition
,
Artificial Intelligence
Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation
29 November 2023 by
Shuangrui Ding
and
others
at
Chinese University of Hong Kong
Computer Vision and Pattern Recognition
Semantics Meets Temporal Correspondence: Self-supervised Object-centric Learning in Videos
19 August 2023 by
Rui Qian
and
others
Computer Vision and Pattern Recognition
Prune Spatio-temporal Tokens by Semantic-aware Temporal Accumulation
8 August 2023 by
Shuangrui Ding
and
others
Computer Vision and Pattern Recognition
Spatial Localization Ability of Planarians Identified Through the Light Maze Paradigm
19 July 2023 by
Ruisen Qian
and
others
at
China Agricultural University
Animal Behavior and Cognition
Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation
18 March 2023 by
Lingting Zhu
and
others
Computer Vision and Pattern Recognition
,
Sound
Motion-inductive Self-supervised Object Discovery in Videos
1 October 2022 by
Shuangrui Ding
and
others
Computer Vision and Pattern Recognition
Static and Dynamic Concepts for Self-supervised Video Representation Learning
26 July 2022 by
Rui Qian
and
others
Computer Vision and Pattern Recognition
,
Machine Learning
Exploring Fine-Grained Audiovisual Categorization with the SSW60 Dataset
21 July 2022 by
Grant Van Horn
and
others
Computer Vision and Pattern Recognition
,
Machine Learning
Multimodal Open-Vocabulary Video Classification via Pre-Trained Vision and Language Models
15 July 2022 by
Rui Qian
and
others
Computer Vision and Pattern Recognition
,
Machine Learning
Dual Contrastive Learning for Spatio-temporal Representation
12 July 2022 by
Shuangrui Ding
and
others
Computer Vision and Pattern Recognition
3D Object Detection for Autonomous Driving: A Survey
24 May 2022 by
Rui Qian
and
others
Computer Vision and Pattern Recognition
Contextualized Spatio-Temporal Contrastive Learning with Self-Supervision
2 April 2022 by
Liangzhe Yuan
and
others
Computer Vision and Pattern Recognition
Controllable Augmentations for Video Representation Learning
1 April 2022 by
Rui Qian
and
others
Computer Vision and Pattern Recognition
Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation
24 March 2022 by
Xian Liu
and
others
Computer Vision and Pattern Recognition
Scintillator Tile Batch Test of CEPC AHCAL
21 March 2022 by
Yuying Duan
and
others
Instrumentation and Detectors
,
High Energy Physics
A survey of optimal strategy for signature-based drug repositioning and an application to liver cancer
22 February 2022 by
Chunlin Yang
and
others
Bioinformatics
Visual Sound Localization in the Wild by Cross-Modal Interference Erasing
13 February 2022 by
Xian Liu
and
others
Computer Vision and Pattern Recognition
,
Sound
Load more