{"id":4639,"date":"2022-06-24T23:29:41","date_gmt":"2022-06-25T07:29:41","guid":{"rendered":"https:\/\/www.gudusoft.com\/?p=4639"},"modified":"2022-06-27T18:36:48","modified_gmt":"2022-06-28T02:36:48","slug":"metadata-management-for-data-warehouses","status":"publish","type":"post","link":"https:\/\/www.gudusoft.com\/fr\/metadata-management-for-data-warehouses\/","title":{"rendered":"Gestion des m\u00e9tadonn\u00e9es pour les entrep\u00f4ts de donn\u00e9es | Gudu SQLFlow"},"content":{"rendered":"<div class=\"fusion-fullwidth fullwidth-box fusion-builder-row-1 fusion-flex-container nonhundred-percent-fullwidth non-hundred-percent-height-scrolling\" style=\"background-color: rgba(255,255,255,0);background-position: center center;background-repeat: no-repeat;border-width: 0px 0px 0px 0px;border-color:#e8eaf0;border-style:solid;\" ><div class=\"fusion-builder-row fusion-row fusion-flex-align-items-flex-start\" style=\"max-width:1310.4px;margin-left: calc(-4% \/ 2 );margin-right: calc(-4% \/ 2 );\"><div class=\"fusion-layout-column fusion_builder_column fusion-builder-column-0 fusion_builder_column_1_1 1_1 fusion-flex-column\"><div class=\"fusion-column-wrapper fusion-flex-justify-content-flex-start fusion-content-layout-column\" style=\"background-position:left top;background-repeat:no-repeat;-webkit-background-size:cover;-moz-background-size:cover;-o-background-size:cover;background-size:cover;padding: 0px 0px 0px 0px;\"><div class=\"fusion-text fusion-text-1\" style=\"line-height:26px;\"><h2>Gestion des m\u00e9tadonn\u00e9es pour les entrep\u00f4ts de donn\u00e9es<\/h2>\n<p><strong><a href=\"https:\/\/www.gudusoft.com\/fr\/quest-ce-que-la-gestion-des-metadonnees\/\">Gestion des m\u00e9tadonn\u00e9es<\/a><\/strong> is the foundation of enterprise <strong><a href=\"https:\/\/www.gudusoft.com\/fr\/category\/data-governance-101\/\">gouvernance des donn\u00e9es<\/a><\/strong> and the improvement of <strong><a href=\"https:\/\/www.gudusoft.com\/fr\/what-is-a-data-warehouse\/\">entrep\u00f4t de donn\u00e9es<\/a><\/strong>. As someone who often works with data, the first task is to understand metadata management. This article will sort out the concept of metadata and introduce metadata management for data warehouses.<\/p>\n<h3>What is metadata management for data warehouses?<\/h3>\n<p>Before going any further, let&#8217;s figure out <strong>qu&#039;est-ce que les m\u00e9tadonn\u00e9es<\/strong> et <strong>what is a data warehouse<\/strong>.<\/p>\n<p><strong>M\u00e9tadonn\u00e9es<\/strong>, also known as intermediary data and relay data, is data about data. Most of the time, metadata can be divided into business metadata and technical metadata according to the different meanings they represent.<\/p>\n<p>UN <strong>entrep\u00f4t de donn\u00e9es<\/strong> is a central repository of information that can be analyzed to make better decisions. Typically, data regularly flows into a data warehouse from transactional systems, relational databases, and other sources. Business analysts, <strong><a href=\"https:\/\/www.gudusoft.com\/fr\/data-engineers\/\">data engineers<\/a><\/strong>, <a href=\"https:\/\/www.gudusoft.com\/fr\/scientifiques-des-donnees\/\"><strong>scientifiques des donn\u00e9es<\/strong><\/a>, and decision makers access data through business intelligence (BI) tools, SQL clients, and other analytical applications.<\/p>\n<div id=\"attachment_4664\" style=\"width: 955px\" class=\"wp-caption aligncenter\"><img aria-describedby=\"caption-attachment-4664\" decoding=\"async\" class=\"size-full wp-image-4664\" src=\"https:\/\/www.gudusoft.com\/wp-content\/uploads\/2022\/06\/Metadata_Management_for_Data_Warehouses-2.png\" alt=\"Gestion des m\u00e9tadonn\u00e9es pour les entrep\u00f4ts de donn\u00e9es\" width=\"945\" height=\"532\" srcset=\"https:\/\/www.gudusoft.com\/wp-content\/uploads\/2022\/06\/Metadata_Management_for_Data_Warehouses-2-200x113.png 200w, https:\/\/www.gudusoft.com\/wp-content\/uploads\/2022\/06\/Metadata_Management_for_Data_Warehouses-2-300x169.png 300w, https:\/\/www.gudusoft.com\/wp-content\/uploads\/2022\/06\/Metadata_Management_for_Data_Warehouses-2-400x225.png 400w, https:\/\/www.gudusoft.com\/wp-content\/uploads\/2022\/06\/Metadata_Management_for_Data_Warehouses-2-600x338.png 600w, https:\/\/www.gudusoft.com\/wp-content\/uploads\/2022\/06\/Metadata_Management_for_Data_Warehouses-2-768x432.png 768w, https:\/\/www.gudusoft.com\/wp-content\/uploads\/2022\/06\/Metadata_Management_for_Data_Warehouses-2-800x450.png 800w, https:\/\/www.gudusoft.com\/wp-content\/uploads\/2022\/06\/Metadata_Management_for_Data_Warehouses-2.png 945w\" sizes=\"(max-width: 945px) 100vw, 945px\" \/><p id=\"caption-attachment-4664\" class=\"wp-caption-text\">Gestion des m\u00e9tadonn\u00e9es pour les entrep\u00f4ts de donn\u00e9es<\/p><\/div>\n<p><strong>Then what is metadata management for data warehouses?<\/strong><\/p>\n<p>The metadata in the data warehouse mainly records the definition of each theme, the mapping relationship between different levels, the data status of the monitoring data warehouse and the task running status of the <a href=\"https:\/\/www.gudusoft.com\/fr\/meilleurs-outils-etl\/\"><strong>ETL<\/strong><\/a>. Generally, metadata is stored and managed uniformly through the metadata repository, and its main purpose is to achieve coordination and consistency in the design, deployment, operation and management of the data warehouse.<\/p>\n<p>Metadata is an important part of data warehouse management system. Metadata management is a key component in enterprise data warehouse. It runs through the whole process of data warehouse construction and directly affects the construction, use and maintenance of data warehouse.<\/p>\n<h3>Pourquoi les entrep\u00f4ts de donn\u00e9es ont-ils besoin d\u2019une gestion des m\u00e9tadonn\u00e9es\u00a0?<\/h3>\n<ol>\n<li><strong>A must for building data warehouse<\/strong>: The data warehouse is obtained from external data, business data and documents through some ETL tools. Without a clear and clear rule, it is impossible to realize this process.<\/li>\n<li><strong>Help to quickly understand the data warehouse system<\/strong>: On the one hand, the data warehouse is essentially an important project of a department or even a company, and the development time is lengthy. There will inevitably be a flow of people in the middle. If there is no clear metadata, it will have a major impact on the entire system and the entire project. On the other hand, the data warehouse serves as the analytical data export for the entire department and company, not only for data personnel. The DM layer is unavoidable for business people and DIM for other developers. If there is clear metadata to describe the data warehouse system, it will save a lot of communication time between the two parties.<\/li>\n<li><strong>Efficient and accurate communication<\/strong>: On the one hand, the management metadata in the metadata records the data permissions of different users, roles, and departments. If there is data that needs to be notified, you can quickly query the system to communicate by mass emails, etc., so as to avoid the situation of lack of people and too many people in the communication link. On the other hand, when communicating with products or communicating with R&amp;D, you can confirm the meaning of indicators and dimensions of mutual communication based on business metadata, so as to avoid the ambiguity of communication at the root, and thus improve the efficiency of communication.<\/li>\n<li><strong>Guaranteed<\/strong> <a href=\"https:\/\/www.gudusoft.com\/fr\/comment-ameliorer-la-qualite-des-donnees\/\"><strong>qualit\u00e9 des donn\u00e9es<\/strong><\/a>: The ideal metadata describes the structure of the data warehouse, the schema of the warehouse, the dimensions, measures, hierarchies, definitions of the databases everywhere, and the location and content of the <strong><a href=\"https:\/\/www.gudusoft.com\/fr\/quest-ce-quun-entrepot-de-donnees\/\">datamarts<\/a><\/strong>. Therefore, we can judge with certainty which data is definitely accurate, which data may be faulty, and which data is definitely faulty. Simply put, each field should have its value range, business definition and other information. Once metadata is defined, it can be applied to data quality detection, evaluation, etc., so as to truly improve the data quality of enterprises through the data quality management process.<\/li>\n<li><strong>Reduce data system construction costs<\/strong>: If the metadata is well established, the information will be obtained more accurately and quickly, so that the data system construction will not be reworked or less reworked, the workload of analysis will be reduced, the unified understanding and communication efficiency of all parties will be strengthened, and the development cost will be minimized.<\/li>\n<li><strong>Quickly analyze change impact<\/strong>: Because metadata is centrally maintained and managed with reference relationships, when changes occur, the metadata management system can be used to analyze in real time the affected business functions, application systems, personnel involved, and whether supervision is involved.<\/li>\n<li><strong>Prepare for the future<\/strong>: Strategic-level application systems of enterprises such as big data, artificial intelligence, <a href=\"https:\/\/www.gudusoft.com\/fr\/quest-ce-quun-lac-de-donnees\/\"><strong>lac de donn\u00e9es<\/strong><\/a>, data center, and business intelligence can rely on good metadata management to exert their due effects.<\/li>\n<\/ol>\n<h3>Sc\u00e9narios d&#039;application des m\u00e9tadonn\u00e9es<\/h3>\n<ol>\n<li><strong>Impact analysis<\/strong>: During development, we often encounter the following problems: If I want to change a table or ETL, what will be the impact? If there is no metadata, then we may need to traverse all scripts and data to get the desired answer; however, if there is mature metadata management, then we can get the answer directly and save a lot of time.<\/li>\n<li><a href=\"https:\/\/www.gudusoft.com\/fr\/quest-ce-que-la-lignee-des-donnees-pourquoi-est-elle-importante\/\"><strong>Data lineage analysis<\/strong><\/a>: Data lineage analysis is a technical means used to comprehensively track the data processing process, so as to find all related metadata objects starting from a data object and the relationship between these metadata objects. The relationship between metadata objects specifically refers to the data flow input and output relationship representing these metadata objects. After the metadata management system is formed, we can analyze the data health, data distribution, concentration, and data heat in the data warehouse through lineage relationship analysis.<\/li>\n<li><strong>ETL automation management<\/strong>: In the warehouse, a large part of ETL are boring and repetitive steps. For example, at the source system-ODS layer: table input &#8211; table output. Another example is ODS-DW: SQL input &#8211; data cleaning &#8211; data processing &#8211; table output. The above rules are actually part of the metadata. That can be achieved in theory, write a fixed script, and then select it through the front-end &#8211; or api interface. In this way, the repeated ETL can be automatically managed to reduce the time cost of ETL development.<\/li>\n<li><strong>Data quality management<\/strong>: The logic of data cleaning can be simply divided into different data types and designated special processing columns. We only need to specify default cleaning rules for different data types and special processing logic for some special columns to achieve intelligent and fast data cleaning. Data quality management belongs to the intersection of data governance and metadata management, and is more inclined to data governance.<\/li>\n<li><strong>Data security management<\/strong>: In the data center that Ali advocates, all data interface indicators will be exported from the data warehouse. Therefore, in theory, you only need to configure the metadata management permission in this metadata to achieve data security management for the whole company.<\/li>\n<\/ol>\n<h3>Conclusion<\/h3>\n<p>Merci d&#039;avoir lu notre article et nous esp\u00e9rons qu&#039;il pourra vous aider \u00e0 mieux comprendre <strong>metadata management for data warehouses<\/strong>. If you want to learn more about metadata management for data warehouses, we would like to advise you to visit <strong><a href=\"https:\/\/www.gudusoft.com\/fr\/\">Gudu SQLFlow<\/a><\/strong> pour plus d&#039;informations.<\/p>\n<p>En tant que l&#039;un des\u00a0<strong><a href=\"https:\/\/www.dpriver.com\/blog\/2022\/05\/11\/best-data-lineage-tools\/\" target=\"_blank\" rel=\"noopener noreferrer\">meilleurs outils de lignage de donn\u00e9es<\/a><\/strong> Disponible sur le march\u00e9 aujourd&#039;hui, Gudu SQLFlow peut non seulement analyser les fichiers de script SQL, obtenir la lign\u00e9e des donn\u00e9es et effectuer un affichage visuel, mais \u00e9galement permettre aux utilisateurs de fournir la lign\u00e9e des donn\u00e9es au format CSV et d&#039;effectuer un affichage visuel.\u00a0<strong>(Published by Ryan on Jun 25, 2022)<\/strong><\/p>\n<\/div><\/div><\/div><style type=\"text\/css\">.fusion-body .fusion-builder-column-0{width:100% !important;margin-top : 0px;margin-bottom : 0px;}.fusion-builder-column-0 > .fusion-column-wrapper {padding-top : 0px !important;padding-right : 0px !important;margin-right : 1.92%;padding-bottom : 0px !important;padding-left : 0px !important;margin-left : 1.92%;}@media only screen and (max-width:1024px) {.fusion-body .fusion-builder-column-0{width:100% !important;}.fusion-builder-column-0 > .fusion-column-wrapper {margin-right : 1.92%;margin-left : 1.92%;}}@media only screen and (max-width:640px) {.fusion-body .fusion-builder-column-0{width:100% !important;}.fusion-builder-column-0 > .fusion-column-wrapper {margin-right : 1.92%;margin-left : 1.92%;}}<\/style><\/div><style type=\"text\/css\">.fusion-body .fusion-flex-container.fusion-builder-row-1{ padding-top : 0px;margin-top : 0px;padding-right : 0px;padding-bottom : 0px;margin-bottom : 0px;padding-left : 0px;}<\/style><\/div>","protected":false},"excerpt":{"rendered":"","protected":false},"author":27,"featured_media":4663,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[178],"tags":[210,152,213,151,211,212],"_links":{"self":[{"href":"https:\/\/www.gudusoft.com\/fr\/wp-json\/wp\/v2\/posts\/4639"}],"collection":[{"href":"https:\/\/www.gudusoft.com\/fr\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.gudusoft.com\/fr\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.gudusoft.com\/fr\/wp-json\/wp\/v2\/users\/27"}],"replies":[{"embeddable":true,"href":"https:\/\/www.gudusoft.com\/fr\/wp-json\/wp\/v2\/comments?post=4639"}],"version-history":[{"count":28,"href":"https:\/\/www.gudusoft.com\/fr\/wp-json\/wp\/v2\/posts\/4639\/revisions"}],"predecessor-version":[{"id":4669,"href":"https:\/\/www.gudusoft.com\/fr\/wp-json\/wp\/v2\/posts\/4639\/revisions\/4669"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.gudusoft.com\/fr\/wp-json\/wp\/v2\/media\/4663"}],"wp:attachment":[{"href":"https:\/\/www.gudusoft.com\/fr\/wp-json\/wp\/v2\/media?parent=4639"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.gudusoft.com\/fr\/wp-json\/wp\/v2\/categories?post=4639"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.gudusoft.com\/fr\/wp-json\/wp\/v2\/tags?post=4639"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}