Career Profile

  • In the field of large language models, my work spans several key areas: optimizing attention mechanisms, optimizing cross-entropy-like objective functions, and improving reasoning ability on complex downstream tasks such as math problem solving and code generation.
  • In the field of efficient deep learning, my work focuses on model compression.
  • In the field of multimodal large language models, my work spans two key areas: multimodal alignment and fusion, and hallucination mitigation.
A detailed list of publications can be found here.

Publications

VQ-logits: Compressing the Output Bottleneck of Large Language Models via Vector Quantized Logits
Jintian Shao*, Hongyi Huang, Yiming Cheng, Jiayi Wu, Beiwen Zhang, ZhiYu Wu, You Shan, MingKai Zheng†
arXiv preprint
ComplexFormer: Disruptively Advancing Transformer Inference Ability via Head-Specific Complex Vector Attention
Jintian Shao*, Hongyi Huang, Yiming Cheng, Jiayi Wu, Beiwen Zhang, You Shan, MingKai Zheng†
arXiv preprint
Towards Analyzing and Understanding the Limitations of VAPO: A Theoretical Perspective
Jintian Shao*, Yiming Cheng*
arXiv preprint
Power-Law Decay Loss for Large Language Model Finetuning: A Theory Perspective
Jintian Shao*, Yiming Cheng*
arXiv preprint

Experiences

Research Assistant

2025.02 - 2025.05
Southern University of Science and Technology of China, Shenzhen
  • Worked under the supervision of Professor Mingkai Zheng. My research focused primarily on large language models, efficient deep learning, and multimodal large language models.

Quant Developer

2024.01 - 2024.12
Paretone Capital, Remote Hybrid
  • Developed a reliable, high-performance, low-latency C++ trading system.

Algorithm Engineer

2023.04 - 2023.12
Amazon, Beijing
  • Developed a recall module for the search system, improving metrics such as AUC and recall.

Algorithm Engineer

2020.10 - 2021.12
Tencent, Shenzhen
  • Developed an A/B testing platform that helps refine PCG products (e.g., Tencent Video, Tencent News, Wesee).
  • Led the team in investigating and analyzing performance issues with benchmarks.

Software Engineer Intern

2020.04 - 2020.07
DiDi, Beijing
  • Maintained the Mobile Cloud Module: solved a cross-origin problem.
  • Added observability facilities to DiDiFarm to analyze bottlenecks and implemented a tool called performance monitor, which reduced the tail latency by 10.