
You're reading from Talend Open Studio Cookbook

Product type: Book
Published in: Oct 2013
Reading level: Intermediate
Publisher: Packt
ISBN-13: 9781782167266
Edition: 1st Edition
Author (1)
Rick Barton

Rick Barton is a freelance consultant who has specialized in data integration and ETL for the last 13 years as part of an IT career spanning over 25 years. After gaining a degree in Computer Systems from Cardiff University, he began his career as a firmware programmer before moving into mainframe data processing and then into ETL tools in 1999. He has provided technical consultancy to some of the UK's largest companies, including banks and telecommunications companies, and was a founding partner of a Big Data integration consultancy. Four years ago he moved back into freelance development and has been working almost exclusively with Talend Open Studio and Talend Integration Suite on multiple projects of various sizes in the UK. It is on these projects that he has learned many of the lessons that can be found in this, his first book.

Capturing file information


Another useful Talend feature is the ability to capture information about a file for use within downstream processing, most probably to perform validation prior to processing.
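
For readers who want to see what this kind of file information looks like outside Talend, the following is a minimal plain-Java sketch using only the standard library. It is not how the Talend components are implemented; the class name FileInfoSketch, its method names, and the sample file contents are all hypothetical and chosen only for illustration.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.stream.Stream;

// Hypothetical helper class; not part of Talend or of the book's job.
final class FileInfoSketch {

    // Size of the file in bytes, comparable to the size value captured by tFileProperties.
    static long sizeInBytes(Path file) throws IOException {
        return Files.size(file);
    }

    // Number of lines in the file, comparable to the count produced by tFileRowCount.
    static long countRows(Path file) throws IOException {
        // Files.lines keeps the file open, so close the stream when done.
        try (Stream<String> lines = Files.lines(file, StandardCharsets.UTF_8)) {
            return lines.count();
        }
    }

    public static void main(String[] args) throws IOException {
        // Build a small throwaway file so the sketch runs on its own.
        Path sample = Files.createTempFile("customerData", ".txt");
        try {
            Files.write(sample, "id;name\n1;Ann\n2;Bob\n".getBytes(StandardCharsets.UTF_8));
            System.out.println("fileSize=" + sizeInBytes(sample));
            System.out.println("numberOfRows=" + countRows(sample));
        } finally {
            Files.delete(sample);
        }
    }
}
```

Both values are cheap to obtain before any row-level processing starts, which is why they are natural inputs to an up-front validation step.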

Getting ready

Open the jo_cook_ch08_0110_fileInformation job.

How to do it...

The steps for capturing file information are as follows:

  1. Drag a tFileProperties component from the right-hand panel onto the canvas. Open tFileProperties, and set the file name to context.cookbookData+"/chapter8/chapter08_jo_0110_customerData.txt".

  2. Drag tFlowToIterate to the canvas, and link the row from tFileProperties to it. Name the flow "properties", so that its columns are stored in globalMap under keys such as properties.size.

  3. Drag tFileRowCount to the canvas and set the filename to match the tFileProperties component.

  4. Add an OnSubjobOk trigger from tFileProperties to tFileRowCount, and another from tFileRowCount to tFixedFlowInput, so that the three subjobs run in sequence.

     [Figure: the completed job layout]

  5. Open tFixedFlowInput.

  6. Add ((Long)globalMap.get("properties.size")) to the field fileSize.

  7. Add ((Integer)globalMap.get("tFileRowCount_1_COUNT")) to the field numberOfRows...
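
The two expressions in steps 6 and 7 are ordinary Java casts applied to values read from a map of Object values. The sketch below imitates that lookup with a plain HashMap so the casting pattern can be tried outside a job. The class GlobalMapSketch, the looksValid rule, and the sample values are hypothetical; in a real job the entries are populated by the components themselves, not by hand.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for a Talend job's globalMap; the values below are made up.
final class GlobalMapSketch {

    // Reads the two values with the same casts used in steps 6 and 7.
    static String describe(Map<String, Object> globalMap) {
        Long fileSize = (Long) globalMap.get("properties.size");
        Integer numberOfRows = (Integer) globalMap.get("tFileRowCount_1_COUNT");
        return "fileSize=" + fileSize + ", numberOfRows=" + numberOfRows;
    }

    // Hypothetical validation rule: accept only a non-empty file with at least one row.
    static boolean looksValid(Map<String, Object> globalMap) {
        Long fileSize = (Long) globalMap.get("properties.size");
        Integer numberOfRows = (Integer) globalMap.get("tFileRowCount_1_COUNT");
        return fileSize != null && fileSize > 0
            && numberOfRows != null && numberOfRows > 0;
    }

    public static void main(String[] args) {
        Map<String, Object> globalMap = new HashMap<>();
        globalMap.put("properties.size", 2048L);      // boxed as Long
        globalMap.put("tFileRowCount_1_COUNT", 100);  // boxed as Integer
        System.out.println(describe(globalMap));
        System.out.println("valid=" + looksValid(globalMap));
    }
}
```

The casts must match the stored types: reading the size as Integer or the row count as Long would throw a ClassCastException at run time, and a key that has not yet been set returns null.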
