[2025-01-18] For better promotion of the events, the categories in this system will be adjusted. For details, please refer to the announcement of this system. The link is https://indico-tdli.sjtu.edu.cn/news/1-warm-reminder-on-adjusting-indico-tdli-categories-indico

Jul 15 – 18, 2023
Asia/Shanghai timezone

Emergence of Large Language Model and Application in Scientific Discoveries

Jul 17, 2023, 2:00 PM
50m
计算方法与技术 Invited Talks

Speaker

Hengkui Wu (SuperSymmetry Technologies)

Description

ChatGPT demonstrates extraordinary emergent abilities in complex reasoning and language generation. In this talk we introduce Big Bang Transformer[乾元], a pretrained LLM trained on blend of general text, scientific papers and code datasets. We discuss how LLMs acquire emergent abilities by instruct finetuning and RLHF. We propose to use statistical mechanics principles to study the fundamental mechanism of the emergence of intelligence in large parameter language models. Also, we discuss the potential application of large language model in scientific data analysis and discoveries, particularly in the field of experimental particle physics.

Primary author

Hengkui Wu (SuperSymmetry Technologies)

Presentation materials