[go: up one dir, main page]

关注
Manyuan ZHANG
Manyuan ZHANG
MMLab, CUHK&Meituan, HK
在 link.cuhk.edu.hk 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Motion-i2v: Consistent and controllable image-to-video generation with explicit motion modeling
X Shi, Z Huang, FY Wang, W Bian, D Li, Y Zhang, M Zhang, KC Cheung, ...
ACM SIGGRAPH 2024 Conference Papers, 1-11, 2024
1922024
Flowformer++: Masked cost volume autoencoding for pretraining optical flow estimation
X Shi, Z Huang, D Li, M Zhang, KC Cheung, S See, H Qin, J Dai, H Li
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023
1872023
Videoflow: Exploiting temporal cues for multi-frame optical flow estimation
X Shi, Z Huang, W Bian, D Li, M Zhang, KC Cheung, S See, H Qin, J Dai, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
1552023
Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step
Z Guo, R Zhang, C Tong, Z Zhao, R Huang, H Zhang, M Zhang, J Liu, ...
arXiv preprint arXiv:2501.13926, 2025
117*2025
Lumina-image 2.0: A unified and efficient image generative framework
Q Qin, L Zhuo, Y Xin, R Du, Z Li, B Fu, Y Lu, X Li, D Liu, X Zhu, W Beddow, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2025
65*2025
Deep reward supervisions for tuning text-to-image diffusion models
X Wu, Y Hao, M Zhang, K Sun, Z Huang, G Song, Y Liu, H Li
European Conference on Computer Vision, 108-124, 2024
402024
Decoupled detr: Spatially disentangling localization and classification for improved end-to-end object detection
M Zhang, G Song, Y Liu, H Li
Proceedings of the IEEE/CVF international conference on computer vision …, 2023
402023
Discriminability distillation in group representation learning
M Zhang, G Song, H Zhou, Y Liu
European Conference on Computer Vision, 1-19, 2020
262020
Longcat-flash technical report
MLC Team, B Li, B Lei, B Wang, B Rong, C Wang, C Zhang, C Gao, ...
arXiv preprint arXiv:2509.01322, 2025
232025
Towards flops-constrained face recognition
JY Yu Liu, Guanglu Song, Manyuan Zhang, Jihao Liu, Yucong Zhou
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019
212019
Are Video Models Ready as Zero-Shot Reasoners? An Empirical Study with the MME-CoF Benchmark
Z Guo, X Chen, R Zhang, R An, Y Qi, D Jiang, X Li, M Zhang, H Li, ...
arXiv preprint arXiv:2510.26802, 2025
14*2025
Think with 3d: Geometric imagination grounded spatial reasoning from limited views
Z Chen, M Zhang, X Yu, X Luo, M Sun, Z Pan, Y Feng, P Pei, X Cai, ...
arXiv preprint arXiv:2510.18632, 2025
102025
DI-drive: OpenDILab decision intelligence platform for autonomous driving simulation
D Drive Contributors
102021
Onethinker: All-in-one reasoning model for image and video
K Feng, M Zhang, H Li, K Fan, S Chen, Y Jiang, D Zheng, P Sun, Y Zhang, ...
arXiv preprint arXiv:2512.03043, 2025
72025
Ares: Multimodal adaptive reasoning via difficulty-aware token-level entropy shaping
S Chen, Y Guo, Y Ye, S Huang, W Hu, H Li, M Zhang, J Chen, S Guo, ...
arXiv preprint arXiv:2510.08457, 2025
72025
Switchable k-class hyperplanes for noise-robust representation learning
B Liu, G Song, M Zhang, H You, Y Liu
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
72021
Deep learning for computational science and engineering
J Adie, J Yang, M Zhang, S See
GPU technology conference, 4-7, 2018
72018
Thinking-while-generating: Interleaving textual reasoning throughout visual generation
Z Guo, R Zhang, H Li, M Zhang, X Chen, S Wang, Y Feng, P Pei, PA Heng
arXiv preprint arXiv:2511.16671, 2025
62025
1st place solution for ava-kinetics crossover in acitivitynet challenge 2020
S Chen, J Pan, G Song, M Zhang, H Shao, Z Lin, J Shao, H Li, Y Liu
arXiv preprint arXiv:2006.09116, 2020
62020
Codeplot-cot: Mathematical visual reasoning by thinking with code-driven images
C Duan, K Sun, R Fang, M Zhang, Y Feng, Y Luo, Y Liu, K Wang, P Pei, ...
arXiv preprint arXiv:2510.11718, 2025
52025
系统目前无法执行此操作,请稍后再试。
文章 1–20