GPTMB 2024, The First International Conference on Generative Pre-trained Transformer Models and Beyond


Comparison of Large Language Models for Deployment Requirements

Authors:
Alper Yaman
Jannik Schwab
Christof Nitsche
Abhirup Sinha
Marco Huber

Keywords: generative AI; large language models; model comparison; HuggingFace

Abstract:
Large Language Models (LLMs), such as Generative Pre-trained Transformers (GPTs), are revolutionizing the generation of human-like text, producing contextually relevant and syntactically correct content. Despite challenges like biases and hallucinations, these Artificial Intelligence (AI) models excel in tasks such as content creation, translation, and code generation. Fine-tuning and novel architectures, such as Mixture of Experts (MoE), address these issues. Over the past two years, numerous open-source foundational and fine-tuned models have been introduced, which makes it harder for researchers and companies to select the optimal LLM with respect to licensing and hardware requirements. To navigate the rapidly evolving LLM landscape and facilitate LLM selection, we present a comparative list of foundational and domain-specific models, focusing on features such as release year, licensing, and hardware requirements. This list is published on GitLab and will be continuously updated.

Pages: 41 to 44

Copyright: Copyright (c) IARIA, 2024

Publication date: June 30, 2024

Published in: conference

ISBN: 978-1-68558-182-4

Location: Porto, Portugal

Dates: from June 30, 2024 to July 4, 2024