• The Data Warehouse ETL Toolkit: Practical Techniques for Extracting, Cleaning, Conforming, and Delivering Data See large image

    The Data Warehouse ETL Toolkit: Practical Techniques for Extracting, Cleaning, Conforming, and Delivering Data (Mixed media product) By (author) Ralph Kimball, By (author) Joe Caserta

    $33.51 - Save $16.53 33% off - RRP $50.04 Free delivery worldwide Available
    Dispatched in 3 business days
    When will my order arrive?
    Add to basket | Add to wishlist |

    DescriptionCowritten by Ralph Kimball, the world's leading data warehousing authority, whose previous books have sold more than 150,000 copies Delivers real-world solutions for the most time- and labor-intensive portion of data warehousing-data staging, or the extract, transform, load (ETL) process Delineates best practices for extracting data from scattered sources, removing redundant and inaccurate data, transforming the remaining data into correctly formatted data structures, and then loading the end product into the data warehouse Offers proven time-saving ETL techniques, comprehensive guidance on building dimensional structures, and crucial advice on ensuring data quality

Other books

Other people who viewed this bought | Other books in this category
Showing items 1 to 10 of 10


Reviews | Bibliographic data
  • Full bibliographic data for The Data Warehouse ETL Toolkit

    The Data Warehouse ETL Toolkit
    Practical Techniques for Extracting, Cleaning, Conforming, and Delivering Data
    Authors and contributors
    By (author) Ralph Kimball, By (author) Joe Caserta
    Physical properties
    Format: Mixed media product
    Number of pages: 528
    Width: 188 mm
    Height: 234 mm
    Thickness: 28 mm
    Weight: 771 g
    ISBN 13: 9780764567575
    ISBN 10: 0764567578

    BIC E4L: COM
    Nielsen BookScan Product Class 3: S10.2
    B&T Book Type: NF
    B&T Merchandise Category: TXT
    B&T Modifier: Region of Publication: 01
    B&T Modifier: Subject Development: 20
    BIC subject category V2: UF
    B&T Modifier: Academic Level: 03
    B&T Modifier: Text Format: 01
    LC subject heading:
    Warengruppen-Systematik des deutschen Buchhandels: 16350
    LC subject heading:
    DC22: 005.74
    LC subject heading:
    Ingram Subject Code: XD
    BISAC V2.8: COM021000
    B&T General Subject: 228
    LC subject heading: ,
    LC classification: QA76.9.D37 K53 2004
    Thema V1.0: UN
    John Wiley & Sons Inc
    Imprint name
    John Wiley & Sons Inc
    Publication date
    01 October 2004
    Publication City/Country
    New York
    Author Information
    RALPH KIMBALL, PhD, founder of the Kimball Group, has been a leading visionary in the data warehousing industry since 1982 and is one of today's best-known speakers and educators. He is the author of several bestselling titles published on data warehousing, including The Data Warehouse Toolkit (Wiley). JOE CASERTA is the founder of Caserta Concepts, LLC, a data warehousing consulting firm. He writes frequently for print and online magazines, and is an active contributor to DWList, the major online community for data warehousing professionals.
    Back cover copy
    The single most authoritative guide on the most difficult phase of building a data warehouseThe extract, transform, and load (ETL) phase of the data warehouse development life cycle is far and away the most difficult, time-consuming, and labor-intensive phase of building a data warehouse. Done right, companies can maximize their use of data storage; if not, they can end up wasting millions of dollars storing obsolete and rarely used data. Bestselling author Ralph Kimball, along with Joe Caserta, shows you how a properly designed ETL system extracts the data from the source systems, enforces data quality and consistency standards, conforms the data so that separate sources can be used together, and finally delivers the data in a presentation-ready format.Serving as a road map for planning, designing, building, and running the back-room of a data warehouse, this book provides complete coverage of proven, timesaving ETL techniques. Beginning with a quick overview of ETL fundamentals, it then looks at ETL data structures, both relational and dimensional. The authors show how to build useful dimensional structures, providing practical examples of techniques. Along the way you'll learn how to: Plan and design your ETL systemChoose the appropriate architecture from the many possible optionsBuild the development/test/production suite of ETL processesBuild a comprehensive data cleaning subsystemTune the overall ETL process for optimum performance
    Table of contents
    Acknowledgments. About the Authors. Introduction. Part I: Requirements, Realities, and Architecture. Chapter 1: Surrounding the Requirements. Chapter 2: ETL Data Structures. Part II: Data Flow. Chapter 3: Extracting. Chapter 4: Cleaning and Conforming. Chapter 5: Delivering Dimension Tables. Chapter 6: Delivering Fact Tables. Part III: Implementation and operations. Chapter 7: Development. Chapter 8: Operations. Chapter 9: Metadata. Chapter 10: Responsibilities. Part IV: Real Time Streaming ETL Systems. Chapter 11: Real-Time ETL Systems. Chapter 12: Conclusions. Index.