Yang Gao (高扬)Technical Staff
Mosi Intelligence (MOSI) |
![]() |
I am currently a Member of Technical Staff at Mosi Intelligence (MOSI), focusing on performance optimization for large-scale training and inference systems. My work spans multimodal systems including audio/video understanding and generation.
Previously, I worked on large language model pre-training at InternLM (书生-浦语) in Shanghai AI Laboratory. I received Master (2022) and Bachelor (2019) degrees in School of Computer Science and Technology from Northwestern Polytechnical University (NWPU), advised by Prof. Peng Wang.
My research and engineering interests focus on:
We are hiring engineers and researchers interested in AI infrastructure, large-scale training/inference systems, and multimodal foundation models. Please feel free to contact me if you are interested in joining us.
M.S. at Northwestern Polytechnical University (Postgraduate recommendation)
B.S. at Northwestern Polytechnical University
Shanghai AI Laboratory, Research Engineer on Large Language Model Pre-training
SenseTime Research, Engineer Intern on Automated Machine Learning
Baidu Research, Research Intern on Light-weight Object DetectionMOSS-TTS Technical Report
OpenMOSS Team
[paper]
[code]
[huggingface]
MOVA: Towards Scalable and Synchronized Video-Audio Generation
OpenMOSS Team
[paper]
[code]
[huggingface]
Intern-S1: A Scientific Multimodal Foundation Model
InternLM Team
[paper]
[code]
[huggingface]
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
Pan Zhang, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Rui Qian, Lin Chen, Qipeng Guo, Haodong Duan, Bin Wang, Linke Ouyang, Songyang Zhang, Wenwei Zhang, Yining Li, Yang Gao, Peng Sun, Xinyue Zhang, Wei Li, Jingwen Li, Wenhai Wang, Hang Yan, Conghui He, Xingcheng Zhang, Kai Chen, Jifeng Dai, Yu Qiao, Dahua Lin, Jiaqi Wang
[paper]
[code]
[huggingface]
Lins: Reducing Communication Overhead of ZeRO for Efficient LLM Training
Qiaoling Chen, Qinghao Hu, Guoteng Wang, Yingtong Xiong, Ting Huang, Xun Chen, Yang Gao, Hang Yan, Yonggang Wen, Tianwei Zhang, Peng Sun
IWQoS 2024.
[paper]
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD
Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Songyang Zhang, Haodong Duan, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Zhe Chen, Xinyue Zhang, Wei Li, Jingwen Li, Wenhai Wang, Kai Chen, Conghui He, Xingcheng Zhang, Jifeng Dai, Yu Qiao, Dahua Lin, Jiaqi Wang
NeurIPS 2024.
[paper]
[code]
[huggingface]
InternLM2 Technical Report
InternLM Team
[paper]
[code]
[huggingface]
Internlm: A multilingual language model with progressively enhanced capabilities
InternLM Team
[paper]
[code]
[huggingface]
NAS-FCOS: Fast neural architecture search for object detection
*Ning Wang, *Yang Gao, *Hao Chen, Peng Wang, Zhi Tian, Chunhua Shen, Yanning Zhang
CVPR 2020.
[paper]
[code]