Chapter 29 Database System (8)

Data Warehousing

Data warehouses contain consolidated data from many sources, spanning long time periods, and augmented with summary information. Warehouses are much larger than other kinds of databases; sizes ranging from several gigabytes to terabytes are common. Typical workloads involve ad hoc, fairly complex queries, and fast response time is important. These characteristics differentiate warehouse applications from OLTP applications, and different DBMS design and implementation techniques must be used to achieve satisfactory results. A distributed DBMS with good scalability and high availability (achieved by storing tables redundantly at more than one site) is required for very large warehouses.

An organization's daily operations access and modify operational databases. Data from these operational databases and other external sources (e.g., customer profiles supplied by external consultants) are extracted by using gateways, or standard external interfaces supported by the underlying DBMS. Standards such as Open Database Connectivity (ODBC) from Microsoft are emerging for gateways; ODBC is an application program interface that allows client programs to generate SQL statements to be executed at a server.

There are many challenges in creating and maintaining a large data warehouse. A good database schema must be designed to hold an integrated collection of data copied from diverse sources. For example, a company warehouse might include the Inventory and Personnel departments' databases, together with Sales databases maintained by offices in different countries. Since the source databases are often created and maintained by different groups, there are a number of semantic mismatches across these databases, such as different currency units, different names for the same attribute, and differences in how tables are normalized or structured; these differences must be reconciled when data is brought into the warehouse. After the warehouse schema is designed, the warehouse must be populated, and over time, it must be kept consistent with the primary data sources.
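
As a rough illustration of reconciling two of the mismatches mentioned above (different attribute names and different currency units), the following sketch maps rows from two sales sources onto one warehouse layout. The source names, column names, and exchange rates are all hypothetical and chosen only for the example; a real warehouse would take them from its own schema design.

```python
# Hypothetical exchange rates to a common currency (USD); illustrative values only.
RATES_TO_USD = {"USD": 1.0, "EUR": 1.25}

# Hypothetical mapping from each source's attribute names to the warehouse names.
COLUMN_MAP = {
    "us_sales": {"cust": "customer_id", "amt": "amount", "cur": "currency"},
    "de_sales": {"kunde": "customer_id", "betrag": "amount", "waehrung": "currency"},
}


def reconcile(source, row):
    """Rename attributes to the warehouse names and convert the amount to USD."""
    renamed = {COLUMN_MAP[source][name]: value for name, value in row.items()}
    rate = RATES_TO_USD[renamed["currency"]]
    return {
        "customer_id": renamed["customer_id"],
        "amount_usd": round(renamed["amount"] * rate, 2),
        "source": source,  # keep track of where the row came from
    }


us = reconcile("us_sales", {"cust": 7, "amt": 100.0, "cur": "USD"})
de = reconcile("de_sales", {"kunde": 9, "betrag": 200.0, "waehrung": "EUR"})
print(us)
print(de)
```

Both rows end up with the same attribute names and a single currency, which is the kind of agreement the warehouse schema needs before rows from different offices can be queried together.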

Data extracted from operational databases and external sources is first cleaned to minimize errors and fill in missing information when possible, and transformed to reconcile semantic mismatches. Transforming data is typically accomplished by defining a relational view over the tables in the data sources (the operational databases and other external sources). Loading data consists of materializing such views and storing them in the warehouse. Unlike a standard view in a relational DBMS, therefore, the view is stored in a database (the warehouse) that is different from the database(s) containing the tables it is defined over.
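
The idea of defining a view over source tables and then materializing it in a separate database can be sketched with Python's built-in `sqlite3` module. This is only a small-scale analogy, not how a production warehouse is loaded: the attached schema name `ops`, the table and view names, and the sample rows are assumptions made for the example.

```python
import sqlite3

# The main database plays the role of the warehouse; a second in-memory
# database, attached under the hypothetical name "ops", plays the role of
# an operational source.
conn = sqlite3.connect(":memory:")
conn.execute("ATTACH DATABASE ':memory:' AS ops")

conn.execute(
    "CREATE TABLE ops.orders (order_id INTEGER, cust TEXT, amount_cents INTEGER)"
)
conn.executemany(
    "INSERT INTO ops.orders VALUES (?, ?, ?)",
    [(1, "alice", 1250), (2, "bob", 300), (3, "alice", 750)],
)

# The transformation is expressed as a view over the source table.
# A TEMP view is used because SQLite only lets temporary views refer to
# tables in another attached database.
conn.execute(
    """CREATE TEMP VIEW orders_clean AS
       SELECT order_id, UPPER(cust) AS customer, amount_cents / 100.0 AS amount
       FROM ops.orders"""
)

# Loading = materializing the view as a stored table in the warehouse.
conn.execute("CREATE TABLE main.orders_wh AS SELECT * FROM orders_clean")

rows = conn.execute(
    "SELECT order_id, customer, amount FROM orders_wh ORDER BY order_id"
).fetchall()

# A later change at the source is visible through the view but not in the
# materialized copy, which is why the warehouse must be refreshed over time.
conn.execute("INSERT INTO ops.orders VALUES (4, 'carol', 500)")
view_count = conn.execute("SELECT COUNT(*) FROM orders_clean").fetchone()[0]
warehouse_count = conn.execute("SELECT COUNT(*) FROM orders_wh").fetchone()[0]
print(rows, view_count, warehouse_count)
```

After the extra insert, the view reports four rows while the warehouse table still holds three, showing that the materialized copy lives apart from the tables it was defined over.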

The cleaned and transformed data is finally loaded into the warehouse. Additional preprocessing such as sorting and generation of summary information is carried out at this stage. Data is partitioned and indexes are built for efficiency. The large volume of data to be loaded means that loading is a slow process; loading a terabyte of data sequentially can take weeks. Parallelism is therefore important for loading warehouses.
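
A minimal sketch of partitioned, parallel loading follows. It assumes a simple round-robin partitioning scheme and uses a thread pool as a stand-in for the parallel loader processes of a real system; the region names and amounts are made-up sample data. Each worker sorts its partition and computes partial summary totals, and the partial summaries are then merged.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor

# Made-up sample rows: (region, sales amount).
rows = [("east", 10), ("west", 5), ("east", 7),
        ("north", 3), ("west", 8), ("north", 1)]
NUM_PARTITIONS = 3


def partition(rows, n):
    """Split rows into n partitions round-robin (one of many possible schemes)."""
    parts = [[] for _ in range(n)]
    for i, row in enumerate(rows):
        parts[i % n].append(row)
    return parts


def load_partition(part):
    """Sort one partition and compute its partial per-region totals."""
    part_sorted = sorted(part)
    totals = Counter()
    for region, amount in part_sorted:
        totals[region] += amount
    return part_sorted, totals


# Each partition is handled by its own worker.
with ThreadPoolExecutor(max_workers=NUM_PARTITIONS) as pool:
    results = list(pool.map(load_partition, partition(rows, NUM_PARTITIONS)))

# Merge the partial summaries into the warehouse-wide summary.
summary = Counter()
for _, partial in results:
    summary.update(partial)
print(dict(summary))
```

Because the per-partition totals are simple sums, they can be merged in any order, which is what makes this kind of summary generation easy to parallelize.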
