Talk & Coverage
Invited Talk
- M3P: Learning Universal Representations via Multitask Multilingual Multimodal Pre-training, Qingyuan Talk
Media Coverage
- Microsoft Researchers Propose Open-Vocabulary Responsible Visual Synthesis (ORES) with the Two-Stage Intervention Framework, MarkTechPost
- Microsoft Researchers Propose NUWA-XL: A Novel Diffusion Over Diffusion Architecture For Extremely Long Video Generation, MarkTechPost
- An Undergraduate Student Who Studies NLP Submitted A Paper to A Top CV Conference and Got Accepted as the First Author? (Chinese), AI Tech Review
- Overcoming Language Barriers! Harbin Institute of Technology, in Collaboration with MSRA, Has Proposed a Unified Pre-training Model for Multi-task, Multi-modal, and Multi-language Applications, M3P (CVPR 2021) (Chinese), FightingCV
- 16 Descriptions Generate an 11-minute Animation! The New Member of the NUWA Series: the Ultra-long Video Generation Model NUWA-XL (Chinese), AI Era
- CVPR 2021 | 9 Selected Papers: An Overview of the Latest Advances in Visual Research at Microsoft Research Asia (Chinese), Microsoft Research Asia
- Microsoft Research Asia’s Multimodal Model NÜWA: Creating Visual Content with Natural Language (Chinese), Microsoft News Centre