What Is a Data Warehouse? Explain It with Key Features
What is a Data Warehouse?
- Data warehousing provides architectures and tools for business executives to systematically organize, understand, and use their data to make strategic decisions.
- A data warehouse refers to a database that is maintained separately from an organization’s operational databases.
- Data warehouse systems allow for the integration of a variety of application systems.
- They support information processing by providing a solid platform of consolidated historical data for analysis.
- According to William H. Inmon, a leading architect in the construction of data warehouse systems, “A data warehouse is a subject-oriented, integrated, time-variant, and nonvolatile collection of data in support of management’s decision-making process.”
- The four keywords, subject-oriented, integrated, time-variant, and nonvolatile, distinguish data warehouses from other data repository systems, such as relational database systems, transaction processing systems, and file systems.
Subject-oriented
- A data warehouse is organized around major subjects, such as customer, supplier, product, and sales.
- Rather than concentrating on the day-to-day operations and transaction processing of an organization, a data warehouse focuses on the modeling and analysis of data for decision makers.
- Data warehouses typically provide a simple and concise view around particular subject issues by excluding data that are not useful in the decision support process.
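The idea of a subject-oriented view can be sketched in a few lines of Python. This is only an illustration under assumptions: the record fields (`terminal_id`, `session_token`, and so on) and the choice of which fields count as useful for decision support are hypothetical and not taken from any real system.

```python
# Hypothetical operational records: all field names and values are illustrative.
operational_orders = [
    {"order_id": 1, "customer": "Asha", "product": "Laptop", "amount": 900.0,
     "terminal_id": "T-07", "session_token": "abc123"},
    {"order_id": 2, "customer": "Ravi", "product": "Phone", "amount": 400.0,
     "terminal_id": "T-02", "session_token": "def456"},
    {"order_id": 3, "customer": "Asha", "product": "Phone", "amount": 400.0,
     "terminal_id": "T-07", "session_token": "ghi789"},
]

# Assumption: only these fields matter for decision support on the "sales" subject.
SALES_SUBJECT_FIELDS = ("customer", "product", "amount")


def to_sales_subject(records):
    """Keep only the decision-support fields; operational details are dropped."""
    return [{field: r[field] for field in SALES_SUBJECT_FIELDS} for r in records]


def total_by(records, dimension):
    """Sum the sales amount for each value of one subject dimension."""
    totals = {}
    for r in records:
        totals[r[dimension]] = totals.get(r[dimension], 0.0) + r["amount"]
    return totals


sales = to_sales_subject(operational_orders)
sales_by_customer = total_by(sales, "customer")
sales_by_product = total_by(sales, "product")
```

Here the same cleaned records can be analyzed by customer or by product, while fields that only serve day-to-day processing never enter the subject view.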
Integrated
- A data warehouse is usually constructed by integrating multiple heterogeneous sources, such as relational databases, flat files, and on-line transaction records.
- Data cleaning and data integration techniques are applied to ensure consistency in naming conventions, encoding structures, attribute measures, and so on.
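As a rough sketch of what such integration might look like, the Python below merges two made-up sources into one consistent format. The source layouts, the column names, and the gender codes (`1` for male, `0` for female in the flat file) are assumptions chosen for illustration.

```python
import csv
import io

# Source A: rows as they might arrive from a relational database (hypothetical).
crm_rows = [
    {"cust_id": 101, "gender": "M", "height_cm": 180.0},
    {"cust_id": 102, "gender": "F", "height_cm": 165.0},
]

# Source B: a flat file with different names, encodings, and units (hypothetical).
flat_file = io.StringIO("CustomerNo,Sex,HeightIn\n201,1,70\n202,0,64\n")

# Assumed encodings: both sources are mapped onto one shared vocabulary.
GENDER_CODES = {"M": "male", "F": "female", "1": "male", "0": "female"}
CM_PER_INCH = 2.54


def from_crm(row):
    """Rename fields and decode gender for source A."""
    return {
        "customer_id": int(row["cust_id"]),
        "gender": GENDER_CODES[row["gender"]],
        "height_cm": float(row["height_cm"]),
    }


def from_flat_file(row):
    """Rename fields, decode gender, and convert inches to centimetres for source B."""
    return {
        "customer_id": int(row["CustomerNo"]),
        "gender": GENDER_CODES[row["Sex"]],
        "height_cm": round(float(row["HeightIn"]) * CM_PER_INCH, 1),
    }


integrated = [from_crm(r) for r in crm_rows]
integrated += [from_flat_file(r) for r in csv.DictReader(flat_file)]
```

After this step every record uses the same field names (naming conventions), the same gender values (encoding structures), and the same unit for height (attribute measures).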
Time-variant
- Data are stored to provide information from a historical perspective (e.g., the past 5–10 years).
- Every key structure in the data warehouse contains, either implicitly or explicitly, an element of time.
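One way to picture a key that carries a time element is a store keyed by a (product, date) pair. The sketch below is a toy illustration; the function names `record_price` and `price_as_of` and the sample prices are hypothetical.

```python
from datetime import date

# Each key is (product, snapshot date), so the time element is explicit in the key.
price_history = {}


def record_price(product, as_of, price):
    """Store a snapshot; earlier snapshots for the same product are kept."""
    price_history[(product, as_of)] = price


def price_as_of(product, when):
    """Return the most recent recorded price on or before `when`, or None."""
    candidates = [d for (p, d) in price_history if p == product and d <= when]
    if not candidates:
        return None
    return price_history[(product, max(candidates))]


record_price("Laptop", date(2019, 1, 1), 1000.0)
record_price("Laptop", date(2020, 1, 1), 950.0)
record_price("Laptop", date(2021, 1, 1), 900.0)

mid_2020_price = price_as_of("Laptop", date(2020, 6, 15))
```

Because old snapshots are never overwritten, a question about any past date can still be answered, which is the point of the historical perspective.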
Nonvolatile
- A data warehouse is always a physically separate store of data transformed from the application data found in the operational environment.
- Due to this separation, a data warehouse does not require transaction processing, recovery, and concurrency control mechanisms.
- It usually requires only two operations in data accessing: initial loading of data and access of data.
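The two operations can be mimicked with a small Python class that offers loading and read access but no update or delete. This is a toy model under assumptions, not how a real warehouse is built; the class name `NonvolatileStore` and its methods are hypothetical.

```python
class NonvolatileStore:
    """Toy store exposing only two operations: load and read access."""

    def __init__(self):
        self._rows = []

    def load(self, rows):
        """Append a batch of already-transformed rows; stored rows are never changed."""
        self._rows.extend(dict(r) for r in rows)
        return len(self._rows)

    def query(self, predicate=lambda row: True):
        """Return copies of matching rows so callers cannot modify stored data."""
        return [dict(r) for r in self._rows if predicate(r)]


store = NonvolatileStore()
store.load([{"year": 2020, "sales": 120}, {"year": 2021, "sales": 150}])

result = store.query(lambda r: r["year"] == 2021)
result[0]["sales"] = 0  # changes only the returned copy, not the stored row
```

Since nothing is updated in place, the sketch needs no locking or rollback logic, which mirrors why the separation removes the need for transaction processing, recovery, and concurrency control mechanisms.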
Raju Singhaniya
Oct 14, 2021