大模型异构引擎研发高级工程师(深圳/北京/上海/杭州)Apply |
|
Job Source |
腾讯集团 |
Location |
China, Shenzhen |
Salary |
Negotiable |
Designation |
Internet/AI |
Job Type |
Full Time |
Language |
|
Job Posted Date |
01-09-2025 |
Job Description |
|
1.研发及优化大模型推理引擎、PD分离推理调度系统;
2.支持主流GPU和异构AI芯片,优化大模型推理性能,打造极致性能成本优势。 |
|
Job Requirements |
|
1.熟练掌握C/C++、Python编程语言,具备良好的coding和调试能力;
2.熟悉GPU/AI芯片编程,如CUDA,OpenCL,Ascend C等; 3.熟悉常见的算子编译优化和算子调优手段,如torch.compile,triton等; 4.熟悉各类深度学习网络和算子底层实现细节,训练和推理模型调试、调优有实操经验优先; 5.熟悉主流大模型推理框架,如vllm,sglang,tensorrt-llm,FasterFransformer等优先; 6.熟悉并行策略,如模型并行、流水线并行等,了解NVLINK、GPU通信者优先; 7.具备GPU、AI芯片体系结构知识,熟悉芯片特性,具备系统性能分析和调优经验优先。。加分项:1.机器学习或者体系结构相关顶会论文; 2.参与vllm、sglang等开源项目贡献者; 3.熟悉推理服务框架,具备服务部署经验者优先,有超大模型分布式部署经验优先。 |
Welcome to Linkedtour! Please complete your profile first and then enjoy your trip in Linkedtour!
Please complete now your information at our partner site and click to apply. Good luck !