A Brief Overview of Few-Shot Prompting in the Large Language Models

Vladlen Kulikov; Radoslav Neychev

doi:10.5281/zenodo.8005317

Proceedings of the XXth Conference of Open Innovations Association FRUCT (May 2023)

A Brief Overview of Few-Shot Prompting in the Large Language Models

Vladlen Kulikov,
Radoslav Neychev

Affiliations

Vladlen Kulikov: Consultant
Radoslav Neychev: Harbour.Space University

DOI: https://doi.org/10.5281/zenodo.8005317
Journal volume & issue: Vol. 33, no. 2
pp. 364 – 370

Abstract

Read online

Working on a larger, more general topic: «Large Language Models (LLMs). Learning and Reasoning at the Inference Stage», among other things, we investigated the following specific questions: 1. What is more important for the emergent abilities (few- shot prompting and augmented prompting) observed at the inference stage in LLMs – model’s size (number of model parameters) or actual training dataset size (number of training tokens)? 2. What is the composition of datasets on which LLMs demonstrating these abilities were trained and are there any correlations with the compositions and sizes of datasets? 3. What are the qualitative data requirements for observing emergent inference abilities, i.e., is there something in the language data that causes these abilities? To answer these questions, we present analysis of selected theoretical and experimental results focused on LLMs.

fine-tuning zero-shot one-shot few-shot language modeling in-context learning augmented prompting

Published in Proceedings of the XXth Conference of Open Innovations Association FRUCT

ISSN: 2305-7254 (Print); 2343-0737 (Online)
Publisher: FRUCT
Country of publisher: Finland
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Telecommunication
Website: http://fruct.org/publication

About the journal

Abstract

Keywords