Yequan's Academic
Yequan's Academic
Home
Projects
Publications
Patents
Talks
Contact
Light
Dark
Automatic
English
English
中文 (简体)
Tele-FLM
52B to 1T: Lessons Learned via Tele-FLM Series
As scaling laws underscore the potential of increasing model sizes, the academic community has intensified its investigations into LLMs with capacities exceeding 50 billion parameters. This technical report builds on our prior work with Tele-FLM (also known as FLM-2), a publicly available 52-billion-parameter model.
Xiang Li
,
Yiqun Yao
,
Xin Jiang
,
Xuezhi Fang
,
China Telecom
,
Yequan Wang
,
Zhongjiang He
,
Zhongyuan Wang
,
Xuelong Li
,
Tiejun Huang
PDF
Cite
Project
Tele-FLM-1T
Tele-FLM
Cite
×