Reader small image

You're reading from  Principles of Strategic Data Science

Product typeBook
Published inJun 2019
Reading LevelExpert
Publisher
ISBN-139781838985295
Edition1st Edition
Languages
Concepts
Right arrow
Author (1)
Peter Prevos
Peter Prevos
author image
Peter Prevos

Dr Peter Prevos is a civil engineer and social scientist who also dabbles in theatrical magic. Peter has almost three decades of experience as a water engineer and manager, working in Europe, Africa, Asia, and Australia. He has worked on marine engineering, drinking water, and sewage treatment projects. Throughout his career, analysing data has been a central theme. He also has a PhD in marketing and is the author of Customer Experience Management for Water Utilities. In his work, he aims to combine the social sciences with engineering to create value for customers. Peter occasionally lectures marketing for MBA students. He is currently responsible for developing and implementing the data science strategy for a water utility in regional Australia. The objective of this strategy is to create value from data through useful, sound, and aesthetic data science. His mission is to breed unicorn data scientists by motivating other water professionals to ditch their spreadsheets and learn how to write code.
Read more about Peter Prevos

Right arrow

Process

Chapter 2, Good Data Science, mentioned the requirement for governance in data science to ensure the outcomes of projects are sound. The process of creating value from data follows an iterative workflow that works from raw data to a finished project. (Wickham, H., & Grolemund, G. (2016). R for data science: Import, Tidy, Transform, Visualize, and Model Data Sebastopol, CA: O'Reilly. Available athttps://r4ds.had.co.nz/). The workflow starts with defining a problem that needs solving as shown in Figure 4.2. The next step involves loading and transforming the data into a format that is suitable for the required analysis. The data science workflow contains a loop that consists of exploration, modelling, and reflection, which is repeated until the problem is solved or is shown to be unsolvable.

Figure 4.2: Data science workflow
Figure 4.2: Data science workflow

The workflow for a data project is independent of the aspect of the data science continuum under consideration. The same principles...

Process


Chapter 2, Good Data Science, mentioned the requirement for governance in data science to ensure the outcomes of projects are sound. The process of creating value from data follows an iterative workflow that works from raw data to a finished project. (Wickham, H., & Grolemund, G. (2016). R for data science: Import, Tidy, Transform, Visualize, and Model Data Sebastopol, CA: O'Reilly. Available at—https://r4ds.had.co.nz/). The workflow starts with defining a problem that needs solving as shown in Figure 4.2. The next step involves loading and transforming the data into a format that is suitable for the required analysis. The data science workflow contains a loop that consists of exploration, modelling, and reflection, which is repeated until the problem is solved or is shown to be unsolvable.

Figure 4.2: Data science workflow

The workflow for a data project is independent of the aspect of the data science continuum under consideration. The same principles apply to every type of analysis...

The Limitations of Data Science


The problems that data science can solve seem to be limitless when reading some of the articles on the web and in literature. There are, however, theoretical and practical limitations to what a computational analysis can achieve. Besides limits to what we can do, there are also ethical limits to what we should be doing. This final section reflects on some of the boundaries of data science.

The Limits of Computation

The limitations of computation relate to some complex and deeply philosophical issues concerning the limits of human knowledge. This short introduction to this problem looks at the restrictions of measuring physical and social processes, and the limitations of algorithms.

  • Limitations of Measurement: The first limitation relates to the fact that our collection of data will always be an incomplete description of reality. In a physical system, choose which points to measure, at which frequency, by which method, and so on. We need to make a lot of choices...

Process

Chapter 2, Good Data Science, mentioned the requirement for governance in data science to ensure the outcomes of projects are sound. The process of creating value from data follows an iterative workflow that works from raw data to a finished project. (Wickham, H., & Grolemund, G. (2016). R for data science: Import, Tidy, Transform, Visualize, and Model Data Sebastopol, CA: O'Reilly. Available athttps://r4ds.had.co.nz/). The workflow starts with defining a problem that needs solving as shown in Figure 4.2. The next step involves loading and transforming the data into a format that is suitable for the required analysis. The data science workflow contains a loop that consists of exploration, modelling, and reflection, which is repeated until the problem is solved or is shown to be unsolvable.

Figure 4.2: Data science workflow
Figure 4.2: Data science workflow

The workflow for a data project is independent of the aspect of the data science continuum under consideration...

The Limitations of Data Science

The problems that data science can solve seem to be limitless when reading some of the articles on the web and in literature. There are, however, theoretical and practical limitations to what a computational analysis can achieve. Besides limits to what we can do, there are also ethical limits to what we should be doing. This final section reflects on some of the boundaries of data science.

The Limits of Computation

The limitations of computation relate to some complex and deeply philosophical issues concerning the limits of human knowledge. This short introduction to this problem looks at the restrictions of measuring physical and social processes, and the limitations of algorithms.

  • Limitations of Measurement: The first limitation relates to the fact that our collection of data will always be an incomplete description of reality. In a physical system, choose which points to measure, at which frequency, by which method, and so on. We need...
lock icon
The rest of the chapter is locked
You have been reading a chapter from
Principles of Strategic Data Science
Published in: Jun 2019Publisher: ISBN-13: 9781838985295
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
undefined
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime

Author (1)

author image
Peter Prevos

Dr Peter Prevos is a civil engineer and social scientist who also dabbles in theatrical magic. Peter has almost three decades of experience as a water engineer and manager, working in Europe, Africa, Asia, and Australia. He has worked on marine engineering, drinking water, and sewage treatment projects. Throughout his career, analysing data has been a central theme. He also has a PhD in marketing and is the author of Customer Experience Management for Water Utilities. In his work, he aims to combine the social sciences with engineering to create value for customers. Peter occasionally lectures marketing for MBA students. He is currently responsible for developing and implementing the data science strategy for a water utility in regional Australia. The objective of this strategy is to create value from data through useful, sound, and aesthetic data science. His mission is to breed unicorn data scientists by motivating other water professionals to ditch their spreadsheets and learn how to write code.
Read more about Peter Prevos