Tapping into Unstructured Data and over 2 million other books are available for Amazon Kindle . Learn more

Buy New

or
Sign in to turn on 1-Click ordering.
Buy Used
Used - Good See details
Price: £4.99

or
 
   
More Buying Choices
Have one to sell? Sell yours here
Sorry, this item is not available in
Image not available for
Colour:
Image not available

 
Start reading Tapping into Unstructured Data on your Kindle in under a minute.

Don't have a Kindle? Get your Kindle here, or download a FREE Kindle Reading App.

Tapping into Unstructured Data: Integrating Unstructured Data and Textual Analytics into Business Intelligence [Paperback]

William H. Inmon , Anthony Nesavich
2.0 out of 5 stars  See all reviews (1 customer review)
RRP: £31.99
Price: £27.53 & FREE Delivery in the UK. Details
You Save: £4.46 (14%)
o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o
Only 1 left in stock (more on the way).
Dispatched from and sold by Amazon. Gift-wrap available.
Want it Thursday, 25 Sep.? Choose Express delivery at checkout. Details

Formats

Amazon Price New from Used from
Kindle Edition £24.82  
Paperback £27.53  

Book Description

11 Dec 2007 0132360292 978-0132360296 1

“The authors, the best minds on the topic, are breaking new ground. They show how every organization can realize the benefits of a system that can search and present complex ideas or data from what has been a mostly untapped source of raw data.”

--Randy Chalfant, CTO, Sun Microsystems

 

The Definitive Guide to Unstructured Data Management and Analysis--From the World’s Leading Information Management Expert

A wealth of invaluable information exists in unstructured textual form, but organizations have found it difficult or impossible to access and utilize it. This is changing rapidly: new approaches finally make it possible to glean useful knowledge from virtually any collection of unstructured data.

 

William H. Inmon--the father of data warehousing--and Anthony Nesavich introduce the next data revolution: unstructured data management. Inmon and Nesavich cover all you need to know to make unstructured data work for your organization. You’ll learn how to bring it into your existing structured data environment, leverage existing analytical infrastructure, and implement textual analytic processing technologies to solve new problems and uncover new opportunities. Inmon and Nesavich introduce breakthrough techniques covered in no other book--including the powerful role of textual integration, new ways to integrate textual data into data warehouses, and new SQL techniques for reading and analyzing text. They also present five chapter-length, real-world case studies--demonstrating unstructured data at work in medical research, insurance, chemical manufacturing, contracting, and beyond.

 

This book will be indispensable to every business and technical professional trying to make sense of a large body of unstructured text: managers, database designers, data modelers, DBAs, researchers, and end users alike.

 

Coverage includes

  • What unstructured data is, and how it differs from structured data
  • First generation technology for handling unstructured data, from search engines to ECM--and its limitations
  • Integrating text so it can be analyzed with a common, colloquial vocabulary: integration engines, ontologies, glossaries, and taxonomies
  • Processing semistructured data: uncovering patterns, words, identifiers, and conflicts
  • Novel processing opportunities that arise when text is freed from context
  • Architecture and unstructured data: Data Warehousing 2.0
  • Building unstructured relational databases and linking them to structured data
  • Visualizations and Self-Organizing Maps (SOMs), including Compudigm and Raptor solutions
  • Capturing knowledge from spreadsheet data and email
  • Implementing and managing metadata: data models, data quality, and more

William H. Inmon is founder, president, and CTO of Inmon Data Systems. He is the father of the data warehouse concept, the corporate information factory, and the government information factory. Inmon has written 47 books on data warehouse, database, and information technology management; as well as more than 750 articles for trade journals such as Data Management Review, Byte, Datamation, and ComputerWorld. His b-eye-network.com newsletter currently reaches 55,000 people.

Anthony Nesavich worked at Inmon Data Systems, where he developed multiple reports that successfully query unstructured data.

 

Preface xvii

1          Unstructured Textual Data in the Organization 1

2          The Environments of Structured Data and Unstructured Data 15

3          First Generation Textual Analytics 33

4          Integrating Unstructured Text into the Structured Environment 47

5          Semistructured Data 73

6          Architecture and Textual Analytics 83

7          The Unstructured Database 95

8          Analyzing a Combination of Unstructured Data and Structured Data 113

9          Analyzing Text Through Visualization 127

10        Spreadsheets and Email 135

11        Metadata in Unstructured Data 147

12        A Methodology for Textual Analytics 163

13        Merging Unstructured Databases into the Data Warehouse 175

14        Using SQL to Analyze Text 185

15        Case Study--Textual Analytics in Medical Research 195

16        Case Study--A Database for Harmful Chemicals 203

17        Case Study--Managing Contracts Through an Unstructured Database 209

18        Case Study--Creating a Corporate Taxonomy (Glossary) 215

19        Case Study--Insurance Claims 219

Glossary 227

Index 233

 


Product details

  • Paperback: 264 pages
  • Publisher: Prentice Hall; 1 edition (11 Dec 2007)
  • Language: English
  • ISBN-10: 0132360292
  • ISBN-13: 978-0132360296
  • Product Dimensions: 23.1 x 18.1 x 1.4 cm
  • Average Customer Review: 2.0 out of 5 stars  See all reviews (1 customer review)
  • Amazon Bestsellers Rank: 1,752,964 in Books (See Top 100 in Books)
  • See Complete Table of Contents

More About the Author

Discover books, learn about writers, and more.

Product Description

From the Back Cover

“The authors, the best minds on the topic, are breaking new ground. They show how every organization can realize the benefits of a system that can search and present complex ideas or data from what has been a mostly untapped source of raw data.”

--Randy Chalfant, CTO, Sun Microsystems

 

The Definitive Guide to Unstructured Data Management and Analysis--From the World’s Leading Information Management Expert

A wealth of invaluable information exists in unstructured textual form, but organizations have found it difficult or impossible to access and utilize it. This is changing rapidly: new approaches finally make it possible to glean useful knowledge from virtually any collection of unstructured data.

 

William H. Inmon--the father of data warehousing--and Anthony Nesavich introduce the next data revolution: unstructured data management. Inmon and Nesavich cover all you need to know to make unstructured data work for your organization. You’ll learn how to bring it into your existing structured data environment, leverage existing analytical infrastructure, and implement textual analytic processing technologies to solve new problems and uncover new opportunities. Inmon and Nesavich introduce breakthrough techniques covered in no other book--including the powerful role of textual integration, new ways to integrate textual data into data warehouses, and new SQL techniques for reading and analyzing text. They also present five chapter-length, real-world case studies--demonstrating unstructured data at work in medical research, insurance, chemical manufacturing, contracting, and beyond.

 

This book will be indispensable to every business and technical professional trying to make sense of a large body of unstructured text: managers, database designers, data modelers, DBAs, researchers, and end users alike.

 

Coverage includes

  • What unstructured data is, and how it differs from structured data
  • First generation technology for handling unstructured data, from search engines to ECM--and its limitations
  • Integrating text so it can be analyzed with a common, colloquial vocabulary: integration engines, ontologies, glossaries, and taxonomies
  • Processing semistructured data: uncovering patterns, words, identifiers, and conflicts
  • Novel processing opportunities that arise when text is freed from context
  • Architecture and unstructured data: Data Warehousing 2.0
  • Building unstructured relational databases and linking them to structured data
  • Visualizations and Self-Organizing Maps (SOMs), including Compudigm and Raptor solutions
  • Capturing knowledge from spreadsheet data and email
  • Implementing and managing metadata: data models, data quality, and more

William H. Inmon is founder, president, and CTO of Inmon Data Systems. He is the father of the data warehouse concept, the corporate information factory, and the government information factory. Inmon has written 47 books on data warehouse, database, and information technology management; as well as more than 750 articles for trade journals such as Data Management Review, Byte, Datamation, and ComputerWorld. His b-eye-network.com newsletter currently reaches 55,000 people.

Anthony Nesavich worked at Inmon Data Systems, where he developed multiple reports that successfully query unstructured data.

 

Preface xvii

1          Unstructured Textual Data in the Organization 1

2          The Environments of Structured Data and Unstructured Data 15

3          First Generation Textual Analytics 33

4          Integrating Unstructured Text into the Structured Environment 47

5          Semistructured Data 73

6          Architecture and Textual Analytics 83

7          The Unstructured Database 95

8          Analyzing a Combination of Unstructured Data and Structured Data 113

9          Analyzing Text Through Visualization 127

10        Spreadsheets and Email 135

11        Metadata in Unstructured Data 147

12        A Methodology for Textual Analytics 163

13        Merging Unstructured Databases into the Data Warehouse 175

14        Using SQL to Analyze Text 185

15        Case Study--Textual Analytics in Medical Research 195

16        Case Study--A Database for Harmful Chemicals 203

17        Case Study--Managing Contracts Through an Unstructured Database 209

18        Case Study--Creating a Corporate Taxonomy (Glossary) 215

19        Case Study--Insurance Claims 219

Glossary 227

Index 233

 

About the Author

Bill Inmon--the "father of data warehousing"--has written 50 books and published in nine languages on subjects such as data warehousing, database design, and architecture.

For current events, seminars, conference speaking schedules, and a lot of other information related to data warehousing, unstructured data, and textual ETL, take a look at Bill Inmon’s Web site at www.inmoncif.com.

Anthony aka “Tony” Nesavich received his master's degree in computer information technology from Regis University in Denver, Colorado. He worked with Bill Inmon at Inmon Data Systems (IDS) where he was instrumental in the development of the IDS Foundation software. Much of Tony’s contributions to IDS are discussed in this book. Tony lives in Denver, Colorado, with his wife Melissa and his faithful dog, Lola.

 


Inside This Book (Learn More)
Browse Sample Pages
Front Cover | Copyright | Table of Contents | Excerpt | Index | Back Cover
Search inside this book:

Customer Reviews

5 star
0
4 star
0
3 star
0
1 star
0
2.0 out of 5 stars
2.0 out of 5 stars
Most Helpful Customer Reviews
2 of 2 people found the following review helpful
2.0 out of 5 stars High-Level Description of Text Analysis 10 Jun 2011
By John M. Ford TOP 500 REVIEWER
Format:Kindle Edition
William Inman and Anthony Nesavich introduce the concepts of text analysis. They build on readers' familiarity with analysis of structured data from spreadsheets and databases. They describe how to transform text data into numbers and categories that can be analyzed with these traditional methods.

The book's chapters are of two types. The first fourteen chapters review the nature of business intelligence, discuss the challenges of analyzing structured and unstructured data, and lay out a general process for organizing and categorizing text data. Some sections are particularly good. Chapter 1, for example, suggests where useful unstructured data can be found in a typical organization. This is helpful guidance for an new analyst. Chapter 12 develops the framework of "A Methodology For Textual Analytics" that covers some of the key issues. This chapter contains most of the book's new information about text analysis.

The last five chapters present business intelligence case studies which used text analysis. The settings include conducting medical research, monitoring toxic chemicals, managing contract documents, creating a common corporate vocabulary, and imposing consistency on insurance claims.

The book has two weaknesses. First, it comes too slowly to its core material about text analysis. Early chapters review well-established business and data management practices to excess. There is far too much agonizing over whether to integrate unstructured data with structured data or analyze it separately. The second and more serious weakness is the abstract and almost cursory description of text analysis techniques. This is certainly not a technical tutorial. It is also not an adequate high-level description of the challenges and variations in such projects.
Read more ›
Was this review helpful to you?
Most Helpful Customer Reviews on Amazon.com (beta)
Amazon.com: 2.7 out of 5 stars  3 reviews
11 of 11 people found the following review helpful
2.0 out of 5 stars High-Level Description of Text Analysis 1 Dec 2010
By John M. Ford - Published on Amazon.com
Format:Paperback|Verified Purchase
William Inman and Anthony Nesavich introduce the concepts of text analysis. They build on readers' familiarity with analysis of structured data from spreadsheets and databases. They describe how to transform text data into numbers and categories that can be analyzed with these traditional methods.

The book's chapters are of two types. The first fourteen chapters review the nature of business intelligence, discuss the challenges of analyzing structured and unstructured data, and lay out a general process for organizing and categorizing text data. Some sections are particularly good. Chapter 1, for example, suggests where useful unstructured data can be found in a typical organization. This is helpful guidance for an new analyst. Chapter 12 develops the framework of "A Methodology For Textual Analytics" that covers some of the key issues. This chapter contains most of the book's new information about text analysis.

The last five chapters present business intelligence case studies which used text analysis. The settings include conducting medical research, monitoring toxic chemicals, managing contract documents, creating a common corporate vocabulary, and imposing consistency on insurance claims.

The book has two weaknesses. First, it comes too slowly to its core material about text analysis. Early chapters review well-established business and data management practices to excess. There is far too much agonizing over whether to integrate unstructured data with structured data or analyze it separately. The second and more serious weakness is the abstract and almost cursory description of text analysis techniques. This is certainly not a technical tutorial. It is also not an adequate high-level description of the challenges and variations in such projects. It doesn't tell you how to fix the potholes or drive around them.

Read chapters 1 and 12 of this book for a quick overview of text analysis. Then try something with more depth, such as Svenja Adolphs' Introducing Electronic Text Analysis. For more detailed technical guidance, try Weis, Indurkhya and Zhang's Fundamentals of Predictive Text Mining.
2 of 2 people found the following review helpful
1.0 out of 5 stars He might have got it wrong 30 Sep 2012
By aussiejim - Published on Amazon.com
Format:Kindle Edition|Verified Purchase
Like many of his earlier books Bill tends to generalise a little and although it all seems to make sense some of the thoughts are becoming outdated.
In this book Bill suggest to bring unstructured data INTO the warehouse but the current trend is to dump unstructured data into nosql databases with little or no modelling and apply statistical analysis to this BIG data. Then someone might wish to then test what they are infering by taking structured data out of the warehouse to compare or what if etc

Thus it seems to be going the other way, traditional data warehouses are taking the back seat to emerging big data technologies like hadoop, mapreduce, base, no sql, yada, yada, yada etc
People wouldnt dream of dumping all that data into a traditional warehouse IMHO
1 of 15 people found the following review helpful
5.0 out of 5 stars Good Book 26 Jan 2009
By J. Wood - Published on Amazon.com
Format:Paperback|Verified Purchase
I need this book for my Information Quality course. It seems to be a good book so far. Easy to read.
Were these reviews helpful?   Let us know
Search Customer Reviews
Only search this product's reviews

Customer Discussions

This product's forum
Discussion Replies Latest Post
No discussions yet

Ask questions, Share opinions, Gain insight
Start a new discussion
Topic:
First post:
Prompts for sign-in
 

Search Customer Discussions
Search all Amazon discussions
   


Look for similar items by category


Feedback