Career Profile
- In the field of large language models, my work spans several key areas: optimizing attention mechanisms, optimizing cross-entropy-like objective functions, and optimizing reasoning ability for complex downstream tasks such as math solving and code generation.
- In the field of efficient deep learning, my work centers on model compression.
- In the field of multimodal large language models, my work spans several key areas: multimodal alignment and fusion, and hallucination mitigation.
Publications
VQ-Logits: Compressing the Output Bottleneck of Large Language Models via Vector Quantized Logits
arXiv preprint
ComplexFormer: Disruptively Advancing Transformer Inference Ability via Head-Specific Complex Vector Attention
arXiv preprint
Towards Analyzing and Understanding the Limitations of VAPO: A Theoretical Perspective
arXiv preprint
Power-Law Decay Loss for Large Language Model Finetuning: A Theory Perspective
arXiv preprint
Experience
- Under the supervision of Professor Mingkai Zheng, my research primarily focuses on large language models, efficient deep learning, and multimodal large language models.
- Developed a reliable, high-performance, low-latency C++ trading system
- Developed the recall module of a search system, improving metrics such as AUC and recall
- Developed an A/B testing platform used to refine PCG products (e.g., Tencent Video, Tencent News, WeSee)
- Led the team in investigating and analyzing performance issues using benchmarks
- Maintained the Mobile Cloud module: solved cross-origin problems
- Added observability facilities to DiDiFarm to analyze bottlenecks and implemented a tool called Performance Monitor, which reduced tail latency by 10