Reader small image

You're reading from  Machine Learning Techniques for Text

Product typeBook
Published inOct 2022
PublisherPackt
ISBN-139781803242385
Edition1st Edition
Right arrow
Author (1)
Nikos Tsourakis
Nikos Tsourakis
author image
Nikos Tsourakis

Nikos Tsourakis is a professor of computer science and business analytics at the International Institute in Geneva, Switzerland, and a research associate at the University of Geneva. He has over 20 years of experience designing, building, and evaluating intelligent systems using speech and language technologies. He has also co-authored over 50 research publications in the area. In the past, he worked as a software engineer, developing products for major telecommunication vendors. He also served as an expert for the European Commission and is currently a certified educator at the Amazon Web Services Academy. He holds a degree in electronic and computer engineering, a master's in management, and a PhD in multilingual information processing.
Read more about Nikos Tsourakis

Right arrow

Understanding BERT

Looking at the transformer’s encoder/decoder architecture discussed in the Introducing transformers section of Chapter 7, Summarizing Wikipedia Articles, we can observe a clear separation of tasks. The encoder is responsible for extracting features from an input sentence, such as syntax, grammar, and context. At the same time, the decoder maps it to a target sequence – for example, translates it to another language. This separation makes the two components self-contained; therefore, they can be used independently.

This section introduces a state-of-the-art transformer-based technique to generate language representation models named Bidirectional Encoder Representation from Transformers (BERT). BERT incorporates a stack of transformer encoders to understand the language better.

Similarly to word embedding, the method belongs to the self-supervised learning family because it does not require human-annotated observation labels. Therefore, BERT can...

lock icon
The rest of the page is locked
Previous PageNext Page
You have been reading a chapter from
Machine Learning Techniques for Text
Published in: Oct 2022Publisher: PacktISBN-13: 9781803242385

Author (1)

author image
Nikos Tsourakis

Nikos Tsourakis is a professor of computer science and business analytics at the International Institute in Geneva, Switzerland, and a research associate at the University of Geneva. He has over 20 years of experience designing, building, and evaluating intelligent systems using speech and language technologies. He has also co-authored over 50 research publications in the area. In the past, he worked as a software engineer, developing products for major telecommunication vendors. He also served as an expert for the European Commission and is currently a certified educator at the Amazon Web Services Academy. He holds a degree in electronic and computer engineering, a master's in management, and a PhD in multilingual information processing.
Read more about Nikos Tsourakis