AMGPT: A large language model for contextual querying in additive manufacturing

Achuth Chandrasekhar; Jonathan Chan; Francis Ogoke; Olabode Ajenifujah; Amir Barati Farimani

doi:10.1016/j.addlet.2024.100232

Additive Manufacturing Letters (Dec 2024)

Affiliations

Achuth Chandrasekhar: Materials Science and Engineering, Carnegie Mellon University, Pittsburgh, 15213, PA, USA
Jonathan Chan: Mechanical Engineering, Carnegie Mellon University, Pittsburgh, 15213, PA, USA
Francis Ogoke: Mechanical Engineering, Carnegie Mellon University, Pittsburgh, 15213, PA, USA
Olabode Ajenifujah: Mechanical Engineering, Carnegie Mellon University, Pittsburgh, 15213, PA, USA
Amir Barati Farimani: Mechanical Engineering, Carnegie Mellon University, Pittsburgh, 15213, PA, USA; Biomedical Engineering, Carnegie Mellon University, Pittsburgh, 15213, PA, USA; Chemical Engineering, Carnegie Mellon University, Pittsburgh, 15213, PA, USA; Machine Learning Department, Carnegie Mellon University, Pittsburgh, 15213, PA, USA; Correspondence to: 5000, Forbes Avenue, USA.

DOI: https://doi.org/10.1016/j.addlet.2024.100232
Journal volume & issue: Vol. 11, p. 100232

Abstract

Generalized large language models (LLMs) such as GPT-4 may not provide specific answers to queries formulated by materials science researchers. These models may produce a high-level outline but lack the capacity to return detailed instructions on manufacturing and material properties of novel alloys. We introduce “AMGPT”, a specialized LLM text generator designed for metal additive manufacturing (AM) queries. The goal of AMGPT is to assist researchers and users in navigating a curated corpus of literature. Instead of training from scratch, we employ a pre-trained Llama2-7B model from Hugging Face in a Retrieval-Augmented Generation (RAG) setup, utilizing it to dynamically incorporate information from ∼50 AM papers and textbooks in PDF format. Mathpix is used to convert these PDF documents into TeX format, facilitating their integration into the RAG pipeline managed by LlamaIndex. A query retrieval function has also been added, enabling the system to fetch relevant literature from Elsevier journals based on the context of the query. Expert evaluations of this project highlight that specific embeddings from the RAG setup accelerate response times and maintain coherence in the generated text.
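
The retrieve-then-generate pattern described in the abstract can be illustrated with a minimal sketch. This is not the authors' pipeline: it replaces the LlamaIndex retriever and learned embeddings with a plain bag-of-words cosine similarity, stops at prompt assembly rather than calling Llama2-7B, and uses three made-up text chunks in place of the converted AM corpus. All names (`CHUNKS`, `embed`, `retrieve`, `build_prompt`) are hypothetical and chosen only for illustration.

```python
import math
import re
from collections import Counter

# Hypothetical stand-ins for text chunks extracted from an AM corpus.
CHUNKS = [
    "Laser powder bed fusion melts thin layers of metal powder with a scanning laser.",
    "Keyhole porosity forms when excessive energy density vaporizes metal in the melt pool.",
    "Directed energy deposition feeds powder or wire into a melt pool created by a focused heat source.",
]


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def embed(text):
    """Toy embedding (assumption): a bag-of-words term-count vector."""
    return Counter(tokenize(text))


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, chunks, top_k=2):
    """Return the top_k chunks ranked by similarity to the query."""
    q = embed(query)
    ranked = sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)
    return ranked[:top_k]


def build_prompt(query, chunks, top_k=2):
    """Assemble the augmented prompt that would be passed to the LLM."""
    context = "\n".join(f"- {c}" for c in retrieve(query, chunks, top_k))
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {query}\nAnswer:"
    )


if __name__ == "__main__":
    print(build_prompt("What causes keyhole porosity in the melt pool?", CHUNKS))
```

In a real system of the kind the abstract describes, the term-count vectors would be replaced by dense embeddings stored in an index, and the assembled prompt would be sent to the language model for generation.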

Published in Additive Manufacturing Letters

ISSN: 2772-3690 (Online)
Publisher: Elsevier
Country of publisher: Netherlands
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering
Website: https://www.journals.elsevier.com/additive-manufacturing-letters/
