{"id":41308,"date":"2021-06-18T10:06:47","date_gmt":"2021-06-18T10:06:47","guid":{"rendered":"https:\/\/polarising.com\/?p=41308"},"modified":"2022-08-29T15:50:55","modified_gmt":"2022-08-29T15:50:55","slug":"smart-monitoring-and-hdd-reporting","status":"publish","type":"post","link":"https:\/\/polarising.com\/techinside\/smart-monitoring-and-hdd-reporting\/","title":{"rendered":"Be S.M.A.R.T. and check your data! HDD Monitoring and Reporting."},"content":{"rendered":"\n<p>Cloud adoption is on the <a aria-label=\"rise (opens in a new tab)\" class=\"rank-math-link\" href=\"https:\/\/ec.europa.eu\/eurostat\/statistics-explained\/index.php?title=Cloud_computing_-_statistics_on_the_use_by_enterprises\" target=\"_blank\" rel=\"noreferrer noopener\">rise<\/a> and for good reasons. The simplified application hosting and delegated resource management provides companies and individuals the security of a monitored infrastructure, combined with the ease of access to computing resources. Still, these features come at a cost, and <strong>cloud computing<\/strong> is not a cheap endeavor.<\/p>\n\n\n\n<p>In order to reduce the cloud computing budget, many organizations decide to run their production environments on the cloud while maintaining quality environments in premises. It would be easy to overlook the necessity of proper resource monitoring, since the cloud has accustomed us to its resilience and fault tolerance. However, doing so could potentially lead to data loss and hours spent on system recovery. In this article we focus on <strong>monitoring the health of our infrastructure building blocks<\/strong>, the hard disk drives [HDD]s.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"self-monitoring-analysis-and-reporting-technology-system-the-smart-approach\">Self-Monitoring, Analysis and Reporting Technology System: the SMART approach. <\/h2>\n\n\n\n<p>The truth is all drives will eventually fail. Our objective is, therefore, to collect and interpret the signs and symptoms an HDD shows before failing, so that we can be ready for that event. To do so, <strong>Self-Monitoring, Analysis and Reporting Technology System (SMART)<\/strong> built into the most modern disks, can be of assistance. This system reports internal information about the drive, including failure events. Among these, an increase in the following metrics indicates impending HDD failure:<\/p>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-28f84493 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:100%\">\n<ul class=\"wp-block-list\"><li>SMART 5: Reallocated_Sector_Count.<\/li><li>SMART 197: Current_Pending_Sector_Count.<\/li><li>SMART 198: Offline_Uncorrectable.<\/li><\/ul>\n\n\n\n<p>For monitoring these metrics and conducting HDD tests, SMART tools like <a aria-label=\"SmartMonTools (opens in a new tab)\" class=\"rank-math-link\" href=\"https:\/\/www.smartmontools.org\/\" target=\"_blank\" rel=\"noreferrer noopener\">SmartMonTools<\/a> can be used. These programs allow the definition of disk analysis on pre-defined schedules, interpret metric results and notify the system administrator in case of detected failures.<\/p>\n\n\n\n<p>A typical report of a healthy drive consists of two parts. Firstly, the <strong>HDD information section<\/strong>, where it is important to make sure that the SMART support is available and enabled.<\/p>\n<\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"895\" height=\"594\" src=\"https:\/\/polarising.com\/site21\/wp-content\/uploads\/2021\/05\/smartmonitoring_image_smartactivationcheck.png\" alt=\"\" class=\"wp-image-41363\" srcset=\"https:\/\/polarising.com\/techinside\/wp-content\/uploads\/2021\/05\/smartmonitoring_image_smartactivationcheck.png 895w, https:\/\/polarising.com\/techinside\/wp-content\/uploads\/2021\/05\/smartmonitoring_image_smartactivationcheck-300x199.png 300w, https:\/\/polarising.com\/techinside\/wp-content\/uploads\/2021\/05\/smartmonitoring_image_smartactivationcheck-768x510.png 768w\" sizes=\"auto, (max-width: 895px) 100vw, 895px\" \/><figcaption>Image: SMART Activation Check<\/figcaption><\/figure><\/div>\n\n\n\n<p>And then, the <strong>metrics report<\/strong>, where an increase on critical metrics should be closely monitored by the system administrator, and, if needed, replace the affected disk.<\/p>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"895\" height=\"594\" src=\"https:\/\/polarising.com\/site21\/wp-content\/uploads\/2021\/05\/smartmonitoring_image_smarttestresults.png\" alt=\"\" class=\"wp-image-41364\" srcset=\"https:\/\/polarising.com\/techinside\/wp-content\/uploads\/2021\/05\/smartmonitoring_image_smarttestresults.png 895w, https:\/\/polarising.com\/techinside\/wp-content\/uploads\/2021\/05\/smartmonitoring_image_smarttestresults-300x199.png 300w, https:\/\/polarising.com\/techinside\/wp-content\/uploads\/2021\/05\/smartmonitoring_image_smarttestresults-768x510.png 768w\" sizes=\"auto, (max-width: 895px) 100vw, 895px\" \/><figcaption>Image: SMART Test Results<\/figcaption><\/figure><\/div>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"what-about-failure\">What about failure?<\/h2>\n\n\n\n<p>It is important to notice that a single error does not mean the drive is about to fail. A small number of failure events occurs during the HDD lifetime, however, a sudden spike or fast increase over a short time span of the before mentioned critical metrics, is a strong indicator of impending disk failure.<\/p>\n\n\n\n<p>Although these tools are a very important part of a successful infrastructure, always remember that they are here to prevent outages on your system and backups are still essential.<\/p>\n\n\n\n<p><strong>Tiago Diogo<\/strong><br>Software Engineer<\/p>\n\n\n\n<p>Support Links: <a href=\"https:\/\/ec.europa.eu\/eurostat\/statistics-explained\/index.php?title=Cloud_computing_-_statistics_on_the_use_by_enterprises_https:\/\/www.smartmontools.org\/\" target=\"_blank\" rel=\"noreferrer noopener\">Cloud computing &#8211; statistics on the use by enterprises<\/a><br><\/p>\n\n\n\n<p class=\"has-text-align-center\">Contact us to know more about IT Services.<\/p>\n\n\n\n<div class=\"wp-block-buttons is-horizontal is-content-justification-center is-layout-flex wp-container-core-buttons-is-layout-03627597 wp-block-buttons-is-layout-flex\">\n<div class=\"wp-block-button\"><a class=\"wp-block-button__link\" href=\"https:\/\/polarising.com\/services\/\" target=\"_blank\" rel=\"noreferrer noopener\">Polarising<\/a><\/div>\n<\/div>\n\n\n\n<div style=\"height:50px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<p class=\"has-text-align-center\"><em> <\/em> <\/p>\n\n\n\n<hr class=\"wp-block-separator\"\/>\n","protected":false},"excerpt":{"rendered":"<p>In this article we focus on monitoring the health of our infrastructure building blocks, the hard disk drives [HDD]s.<\/p>\n","protected":false},"author":4,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"inline_featured_image":false,"_uag_custom_page_level_css":"","site-sidebar-layout":"default","site-content-layout":"default","ast-site-content-layout":"","site-content-style":"default","site-sidebar-style":"default","ast-global-header-display":"","ast-banner-title-visibility":"","ast-main-header-display":"","ast-hfb-above-header-display":"","ast-hfb-below-header-display":"","ast-hfb-mobile-header-display":"","site-post-title":"","ast-breadcrumbs-content":"","ast-featured-img":"","footer-sml-layout":"","theme-transparent-header-meta":"default","adv-header-id-meta":"","stick-header-meta":"","header-above-stick-meta":"","header-main-stick-meta":"","header-below-stick-meta":"","astra-migrate-meta-layouts":"default","ast-page-background-enabled":"default","ast-page-background-meta":{"desktop":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-gradient":""},"tablet":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-gradient":""},"mobile":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-gradient":""}},"ast-content-background-meta":{"desktop":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-gradient":""},"tablet":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-gradient":""},"mobile":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-gradient":""}},"footnotes":""},"categories":[4],"tags":[],"class_list":["post-41308","post","type-post","status-publish","format-standard","hentry","category-technology"],"uagb_featured_image_src":{"full":false,"thumbnail":false,"medium":false,"medium_large":false,"large":false,"1536x1536":false,"2048x2048":false,"authorship-box-avatar":false,"authorship-box-related":false},"uagb_author_info":{"display_name":"Tiago Diogo","author_link":"https:\/\/polarising.com\/techinside\/author\/tiago-diogo\/"},"uagb_comment_info":0,"uagb_excerpt":"In this article we focus on monitoring the health of our infrastructure building blocks, the hard disk drives [HDD]s.","_links":{"self":[{"href":"https:\/\/polarising.com\/techinside\/wp-json\/wp\/v2\/posts\/41308","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/polarising.com\/techinside\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/polarising.com\/techinside\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/polarising.com\/techinside\/wp-json\/wp\/v2\/users\/4"}],"replies":[{"embeddable":true,"href":"https:\/\/polarising.com\/techinside\/wp-json\/wp\/v2\/comments?post=41308"}],"version-history":[{"count":1,"href":"https:\/\/polarising.com\/techinside\/wp-json\/wp\/v2\/posts\/41308\/revisions"}],"predecessor-version":[{"id":42963,"href":"https:\/\/polarising.com\/techinside\/wp-json\/wp\/v2\/posts\/41308\/revisions\/42963"}],"wp:attachment":[{"href":"https:\/\/polarising.com\/techinside\/wp-json\/wp\/v2\/media?parent=41308"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/polarising.com\/techinside\/wp-json\/wp\/v2\/categories?post=41308"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/polarising.com\/techinside\/wp-json\/wp\/v2\/tags?post=41308"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}