You're reading from Soar with Haskell

Product typeBook

Published inDec 2023

Reading LevelBeginner

PublisherPackt

ISBN-139781805128458

Edition1st Edition

Languages

Haskell

Concepts

Programming Language

Author (1)

Tom Schrijvers

Parser Combinators

Parsing is the act of turning a usually human-readable structured text into a data structure that can be easily processed in software. The best-known application of parsers is on compilers. A compiler frontend takes the text written by programmers, the source code, and turns it into an abstract syntax tree, which is a more convenient data structure to work with in the next stages of the compiler. Besides parsing source code, many other structured text formats are parsed, such as JSON, YMAL, and XML, which are used for all kinds of data entry, data exchange, and configuration.

To make parsing possible, the text cannot take on an arbitrary form. It needs to be structured in a particular way, and follow certain rules. Formally, the expected text structure is sometimes codified in a grammar, which serves as a non-executable specification for the parser. This formal background, together with the development of many sophisticated and intricate parsing approaches for...

Parsing

Before we dive into parser combinators, let’s briefly review what parsing is, and consider some alternative options for obtaining Haskell parsers.

What is parsing?

The basic idea of parsing is to take a string representation of some data and turn it into the corresponding value of a structured data type. For example, consider the "1 + 2" string. With a parser, we could turn this into a value, Plus (Lit 1) (Lit 2), of the Expr algebraic data type:

data Expr = Lit Int | Plus Expr Expr

Typically, the textual value is convenient for humans, but further programmatic processing is much easier on the structured value. For example, it is much easier to write an evaluation function that takes Expr than a string.

Because the string can easily be ill-formed – for example, "1 +" or "1 ++ 2" – parsing is an operation that may fail. We model this in Haskell with the result type, Maybe Expr. Thus, the basic parsing interface...

Parser combinators

Parser combinators are a compositional approach for defining parsers in the style of the embedded DSLs, which we covered in the previous chapter. In this section, we will cover basic combinators and the essence of their implementation.

A parser for sums

A parser is a kind of data structure that is assembled from combinators. The (abstract) type of the parser is Parser a; this type denotes a parser that processes a string and produces a result of the a type. For example, to parse sum expressions, we want a parser, exprP, of the Parser Expr type.

Once we have a parser, we can apply it to an input string using the following interpretation function:

parse :: Parser a -> String -> Maybe a

Because the input string may not be in the expected format, the parser can fail; this is modeled with the Maybe a result type.

We expect the following behavior:

*Main> parse exprP "1+2"
Just (Plus (Lit 1) (Lit 2))
*Main> parse exprP "1+...

The Parsec library

In practice, you will want to use an off-the-shelf parser combinator library. In this chapter, we’ll study Parsec, which is one of the older and more established libraries. On Hackage, you can find various new libraries with more bells and whistles, but Parsec will do nicely as a starting point.

Different types of parsers

To provide additional flexibility and expressive power, Parsec’s parser type features three additional type parameters beyond the a parameter for the result type:

ParsecT s u m a

Let’s discuss the three additional parameters from right to left:

The monad parameter, m, signals that ParsecT s u is a monad transformer; it can be layered on top of a monad, m. For basic uses, we don’t need any underlying monad and can default to using the trivial Identity monad. For that, Parsec provides a type synonym:
```
type Parsec s u = ParsecT s u Identity
```
The u parameter is for a user-defined state. Indeed, some parsers...

Parsing challenges – expressions revisited

In this section, we’ll revisit the parser for simple arithmetic expressions, now using Parsec. We explore several variations and extensions, consolidate earlier lessons, and handle new challenges.

As our starting point, consider again the simple type for arithmetic expressions:

data Expr = Lit Int | Plus Expr Expr

We could write a very native parser for it that follows the structure of the data type definition:

exprP = literalP <|> sumP
literalP = Lit <$> number
sumP = do x <- exprP
          plus
          y <- exprP
          pure (Plus x y)

When used naively, the parser does not consume all the input:

*Main> parse exprP "1+2"
Right (Lit 1)

To amend this, we have to check for the end of the file:

*Main> parse (exprP <...

Summary

In this chapter, we learned how parser combinators make it easy to turn strings into structured data. We covered how to write parsers and also looked under the hood of a minimal parser combinator implementation to get a basic understanding of how the approach works. Then, we moved on to the industrial-strength Parsec library for parser combinators. We studied its character consumption behavior and its support for error messages. Finally, we explored how to satisfy several requirements and avoid common pitfalls when writing parsers.

Chapter 15, Lenses, presents an elegant, purely functional approach to a mundane but ubiquitous programming task: data access in nested data types. First, we’ll identify the disadvantages of Haskell’s built-in support for record access and then present the concept of lenses as a much more convenient alternative for both reading and updating fields. We’ll not only show that lenses compose trivially to reach deep into data structures...

Questions

Answer the following questions to test your knowledge of this chapter:

What is parsing?
What are parser combinators?
What are the advantages and disadvantages of parser combinators?
What are the particular features of Parsec?

Answers

Here are the answers to this chapter’s questions:

Parsing is the act of turning a, usually human-readable, structured text into a data structure that can be easily processed in software.
Parser combinators are an embedded DSL for parsing. They define parsers in a compositional way, assembling primitive parsers into larger ones.
Parser combinators have several advantages:
- They are more convenient to write than hand-rolling your own parser
- As an embedded DSL, they are easily integrated into a code base and do not hamper the development process
- They do not require learning a new language and have a low threshold for entry
- Monad parsers are very flexible and expressive; the parsing behavior can be determined dynamically
Parser combinators also have several disadvantages compared to parser generators:
- Their performance is usually not as good as that of parser generators and can be pathologically bad if we aren’t careful (for example, in the case of left...

The rest of the chapter is locked

You have been reading a chapter from

Soar with Haskell

Published in: Dec 2023Publisher: PacktISBN-13: 9781805128458

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

undefined

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at €14.99/month. Cancel anytime

Author (1)

Tom Schrijvers

Tom Schrijvers is a professor of computer science at KU Leuven in Belgium since 2014, and previously from 2011 until 2014 at Ghent University in Belgium. He has over 20 years of research experience in programming languages and has co-authored more than 100 scientific papers. Much of his research focuses on functional programming and on the Haskell programming language in particular: he has made many contributions to the language, its ecosystem and applications, and chaired academic events like the Haskell Symposium. At the same time, he has more than a decade of teaching experience (including functional programming with Haskell) and received several teaching awards.
Read more about Tom Schrijvers

Personalised recommendations for you

Based on your interests and search pattern

C++ Programming for Linux Systems

This book covers the essential system programming tools and helps you explore the features of C++20. It emphasizes important details to maintain code quality and tackle everyday challenges of developing software for high performance, optimization, and more.

BookSep 2023288 pages

Expert C++

Discover advanced programming techniques, the latest features of C++17 and C++20, and best practices for memory management, debugging, testing, and large-scale application design with Expert C++. Ideal for experienced developers advancing to proficient programmers and building professional-grade C++ applications.

BookAug 2023604 pages

iOS 17 Programming for Beginners

iOS 17 Programming for Beginners, Eighth Edition is your comprehensive guide to learning the art of iOS app development. Whether you dream of creating the next chart-topping app or simply want to enhance your programming skills, this book is your trusted companion on this exciting journey.

BookOct 2023604 pages4

Developer Career Masterplan

Written by industry experts that have spent the last 20+ years helping developers grow their career path towards senior developer positions and beyond. This book provides a comprehensive guide, sharing examples and stories from their global careers. By the end, you’ll have the knowledge to create a clear career progression plan as a technical professional.

BookSep 2023310 pages

Refactoring with C#

In Refactoring with C#, you’ll explore the process of safely refactoring modern .NET code using Visual Studio features, advanced unit tests, AI assistance, and custom Roslyn analyzers.

BookNov 2023434 pages

Python Real-World Projects

Amplify your developer journey by curating a dynamic project portfolio that outshines traditional resumes. Delve into the Python realm through immersive projects, mastering core concepts while constructing comprehensive modules and applications. From data acquisition prowess to impactful data visualization, Python Real-World Projects arms you with essential skills to beat the competition.

BookSep 2023478 pages5

The MVVM Pattern in .NET MAUI

The MVVM Pattern in .NET MAUI enables developers to master MVVM principles and effectively apply them to .NET MAUI. This book uses real-life examples and covers complex problems to help you successfully apply MVVM with .NET MAUI to confidently develop robust and high-performing cross-platform apps.

BookNov 2023386 pages

Extending Microsoft Business Central with Power Platform

Extending Business Central with the Power Platform is a step-by-step guide for Business Central professionals to create solutions that automate business processes, explain complex workflow approvals, and integrate with hundreds of other systems, without traditional development. It’ll guide you in customizing Business Central with Power Platform.

BookAug 2023458 pages5

Extending Microsoft Business Central with Power Platform

Extending Business Central with the Power Platform is a step-by-step guide for Business Central professionals to create solutions that automate business processes, explain complex workflow approvals, and integrate with hundreds of other systems, without traditional development. It’ll guide you in customizing Business Central with Power Platform.

BookAug 2023458 pages5

Quantum Computing Algorithms

The book emphasizes intuitive ideas behind quantum algorithms in ways that other books don’t cover, striking a careful balance between no math and too much math. To get the most from this book, you should be comfortable with basic algebra and writing simple computer code. No prior understanding of quantum physics is needed to get started.

BookSep 2023342 pages

Python – Complete Python, Django, Data Science and ML Guide

Unlock Python's full potential with this 50+ hour course! From programming to web and game development, data manipulation, and machine learning, gain the skills required to succeed in various Python-related careers. With practical tasks, hands-on experience, and a strong foundation in Python, you'll be ready to tackle real-world challenges and take advantage of the many opportunities this versatile language offers.

VideoNov 202350 hours 30 minutes5

Python – Complete Python, Django, Data Science and ML Guide

Unlock Python's full potential with this 50+ hour course! From programming to web and game development, data manipulation, and machine learning, gain the skills required to succeed in various Python-related careers. With practical tasks, hands-on experience, and a strong foundation in Python, you'll be ready to tackle real-world challenges and take advantage of the many opportunities this versatile language offers.

VideoNov 202350 hours 30 minutes5

You're reading from Soar with Haskell

Parser Combinators

Parsing

What is parsing?

Parser combinators

A parser for sums

The Parsec library

Different types of parsers

Parsing challenges – expressions revisited

Summary

Questions

Further reading

Answers

Unlock this book and the full library FREE for 7 days

Author (1)

C++ Programming for Linux Systems

This book covers the essential system programming tools and helps you explore the features of C++20. It emphasizes important details to maintain code quality and tackle everyday challenges of developing software for high performance, optimization, and more.

Expert C++

iOS 17 Programming for Beginners

iOS 17 Programming for Beginners, Eighth Edition is your comprehensive guide to learning the art of iOS app development. Whether you dream of creating the next chart-topping app or simply want to enhance your programming skills, this book is your trusted companion on this exciting journey.

Developer Career Masterplan

Refactoring with C#

In Refactoring with C#, you’ll explore the process of safely refactoring modern .NET code using Visual Studio features, advanced unit tests, AI assistance, and custom Roslyn analyzers.

Python Real-World Projects

The MVVM Pattern in .NET MAUI

The MVVM Pattern in .NET MAUI enables developers to master MVVM principles and effectively apply them to .NET MAUI. This book uses real-life examples and covers complex problems to help you successfully apply MVVM with .NET MAUI to confidently develop robust and high-performing cross-platform apps.

Extending Microsoft Business Central with Power Platform

Extending Microsoft Business Central with Power Platform

Quantum Computing Algorithms

Python – Complete Python, Django, Data Science and ML Guide

Python – Complete Python, Django, Data Science and ML Guide