Connecting World's top Talents with Premier Jobs and Networking.
Register
Connecting World's top Talents with Premier Jobs and Networking.

腾讯智能座舱-大模型量化部署工程师

Apply instagram Share link

Job Source

腾讯集团

Location

China, Beijing

Salary

Negotiable

Designation

Full Stack Developer

Job Type

Full Time

Language

Job Posted Date

29-09-2025

Job Description

1.负责座舱端侧大模型量化部署,如高通/MTK/Nvidia等座舱芯片平台;
2.探索不同芯片平台的算子能力与工程新特性,设计不同的量化策略与验证方法,优化量化前后精度损失;
3.负责端侧大模型部署过程中的性能优化,优化token生成速度与减少内存带宽的使用;
4.研究端侧大模型前沿的量化部署方法,提升端侧大模型整体性能与精度。

Job Requirements

1.熟练掌握 C/C++、Python语言,有良好计算机体系结构知识;
2.具备高通/MTK/nvidia等芯片平台的端侧量化部署经验,至少一个平台;
3.熟悉大模型常见的部署框架(如TensorRT-LLM/vLLM/QNN等)和量化算法;
4.熟悉端侧大模型推理机制如计算图的执行、算子融合、KV 缓存优化、投机采样策略等;
5.精通 Transformer 等大模型核心算子(Attention、FFN、LayerNorm)底层实现机制与性能优化方法;
6.具有多模态大模型量化部署与优化经验者优先;
7.具备大模型训练和推理过程中的调试与优化实操经验者优先。。加分项:1.在同等条件下,通过腾讯云认证或取得同等资格认证的候选人,我们会优先考虑。



腾讯集团




Just one more quick step more to complete your application!

 

Welcome to Linkedtour! Please complete your profile first and then enjoy your trip in Linkedtour!

 

Just one more quick step more to complete your application!

 

Please complete now your information at our partner site and click to apply. Good luck !