Reader small image

You're reading from  Mastering NLP from Foundations to LLMs

Product typeBook
Published inApr 2024
PublisherPackt
ISBN-139781804619186
Edition1st Edition
Right arrow
Authors (2):
Lior Gazit
Lior Gazit
author image
Lior Gazit

Lior Gazit is a highly skilled Machine Learning professional with a proven track record of success in building and leading teams drive business growth. He is an expert in Natural Language Processing and has successfully developed innovative Machine Learning pipelines and products. He holds a Master degree and has published in peer-reviewed journals and conferences. As a Senior Director of the Machine Learning group in the Financial sector, and a Principal Machine Learning Advisor at an emerging startup, Lior is a respected leader in the industry, with a wealth of knowledge and experience to share. With much passion and inspiration, Lior is dedicated to using Machine Learning to drive positive change and growth in his organizations.
Read more about Lior Gazit

Meysam Ghaffari
Meysam Ghaffari
author image
Meysam Ghaffari

Meysam Ghaffari is a Senior Data Scientist with a strong background in Natural Language Processing and Deep Learning. Currently working at MSKCC, where he specialize in developing and improving Machine Learning and NLP models for healthcare problems. He has over 9 years of experience in Machine Learning and over 4 years of experience in NLP and Deep Learning. He received his Ph.D. in Computer Science from Florida State University, His MS in Computer Science - Artificial Intelligence from Isfahan University of Technology and his B.S. in Computer Science at Iran University of Science and Technology. He also worked as a post doctoral research associate at University of Wisconsin-Madison before joining MSKCC.
Read more about Meysam Ghaffari

View More author details
Right arrow

Different types of LLMs

LLMs are generally neural network architectures that are trained on a large corpus of text data. The term “large” refers to the size of these models in terms of the number of parameters and the scale of training data. Here are some examples of LLMs.

Transformer models

Transformer models have been at the forefront of the recent wave of LLMs. They are based on the “Transformer” architecture, which uses self-attention mechanisms to weigh the relevance of different words in the input when making predictions. Transformers are a type of neural network architecture introduced in the paper Attention is All You Need by Vaswani et al. One of their significant advantages, particularly for training LLMs, is their suitability for parallel computing.

In traditional RNN models, such as LSTM and GRU, the sequence of tokens (words, subwords, or characters in the text) must be processed sequentially. That’s because each token’...

lock icon
The rest of the page is locked
Previous PageNext Page
You have been reading a chapter from
Mastering NLP from Foundations to LLMs
Published in: Apr 2024Publisher: PacktISBN-13: 9781804619186

Authors (2)

author image
Lior Gazit

Lior Gazit is a highly skilled Machine Learning professional with a proven track record of success in building and leading teams drive business growth. He is an expert in Natural Language Processing and has successfully developed innovative Machine Learning pipelines and products. He holds a Master degree and has published in peer-reviewed journals and conferences. As a Senior Director of the Machine Learning group in the Financial sector, and a Principal Machine Learning Advisor at an emerging startup, Lior is a respected leader in the industry, with a wealth of knowledge and experience to share. With much passion and inspiration, Lior is dedicated to using Machine Learning to drive positive change and growth in his organizations.
Read more about Lior Gazit

author image
Meysam Ghaffari

Meysam Ghaffari is a Senior Data Scientist with a strong background in Natural Language Processing and Deep Learning. Currently working at MSKCC, where he specialize in developing and improving Machine Learning and NLP models for healthcare problems. He has over 9 years of experience in Machine Learning and over 4 years of experience in NLP and Deep Learning. He received his Ph.D. in Computer Science from Florida State University, His MS in Computer Science - Artificial Intelligence from Isfahan University of Technology and his B.S. in Computer Science at Iran University of Science and Technology. He also worked as a post doctoral research associate at University of Wisconsin-Madison before joining MSKCC.
Read more about Meysam Ghaffari