{"id":58,"date":"2017-09-26T10:24:46","date_gmt":"2017-09-26T10:24:46","guid":{"rendered":"https:\/\/techvidvan.com\/tutorials\/?p=58"},"modified":"2017-09-26T10:24:46","modified_gmt":"2017-09-26T10:24:46","slug":"top-features-of-big-data-hadoop","status":"publish","type":"post","link":"https:\/\/techvidvan.com\/tutorials\/top-features-of-big-data-hadoop\/","title":{"rendered":"Top 10 Features of Big Data Hadoop"},"content":{"rendered":"<p>In this Hadoop Tutorial, we will discuss 10 best features of Hadoop. If you are not familiar with Apache Hadoop, so you can refer our <strong>Hadoop Introduction<\/strong> blog to get detailed knowledge of Apache Hadoop framework.<\/p>\n<p>In this blog, we are going to over most important features of Big data Hadoop such as Hadoop Fault Tolerance, Distributed Processing in Hadoop, Scalability<b>,\u00a0<\/b>Reliability<b>,\u00a0<\/b>High Availability, Economic,\u00a0 Flexibility,\u00a0\u00a0Data locality in Hadoop.<\/p>\n<h3>Hadoop Introduction<\/h3>\n<p>Hadoop is an open source software framework that supports distributed storage and processing of huge amount of data set. It is most powerful big data tool in the market because of its features. Features like Fault tolerance, Reliability, High Availability etc.<\/p>\n<p>Hadoop provides-<\/p>\n<ul>\n<li><strong>HDFS\u00a0<\/strong>&#8211; World most reliable storage layer<\/li>\n<li><strong>MapReduce\u00a0<\/strong>&#8211; Distributed processing layer<\/li>\n<li><strong>YARN\u00a0<\/strong>&#8211; Resource management layer<\/li>\n<\/ul>\n<h3>Important Features of Big data Hadoop<\/h3>\n<p>There are so many features that Apache Hadoop provides. Let&#8217;s discuss these features of Hadoop in detail.<\/p>\n<h4>a. Open source<\/h4>\n<p>It is an open source Java-based programming framework. Open source means it is freely available and even we can change its source code as per your requirements.<\/p>\n<h4>b. Fault Tolerance<\/h4>\n<p>Hadoop control faults by the process of replica creation. When client stores a file in HDFS, Hadoop framework divide the file into blocks. Then client distributes data blocks across different machines present in HDFS cluster.<\/p>\n<p>And, then create the replica of each block is on other machines present in the cluster. HDFS, by default, creates 3 copies of a block on other machines present in the cluster.<\/p>\n<p>If any machine in the cluster goes down or fails due to unfavorable conditions. Then also, the user can easily access that data from other machines.<\/p>\n<h4>c. Distributed Processing<\/h4>\n<p>Hadoop stores huge amount of data in a distributed manner in HDFS. Process the data in parallel on a cluster of nodes.<\/p>\n<h4>d. Scalability<strong> \u00a0<\/strong><\/h4>\n<p>Hadoop is an open-source platform. This makes it extremely scalable platform. So, new nodes can be easily added without any downtime. Hadoop provides horizontal scalability so new node added on the fly model to the system. In Apache hadoop, applications run on more than thousands of node.<\/p>\n<h4>e. Reliability<\/h4>\n<p>Data is reliably stored on the cluster of machines despite machine failure due to replication of data. So, if any of the nodes fails, then also we can store data reliably.<\/p>\n<h4>f. High Availability<\/h4>\n<p>Due to multiple copies of data, data is highly available and accessible despite hardware failure. So, any machine goes down data can be retrieved from the other path. Learn Hadoop High Availability feature in detail.<\/p>\n<h4>g. Economic<\/h4>\n<p>Hadoop is not very expensive as it runs on the cluster of commodity hardware. As we are using low-cost commodity hardware, we don\u2019t need to spend a huge amount of money for scaling out your Hadoop cluster.<\/p>\n<h4>i. Flexibility<\/h4>\n<p>Hadoop is very flexible in terms of ability to deal with all kinds of data. It deals with structured, semi-structured or unstructured.<\/p>\n<h4>j. Easy to use<\/h4>\n<p>No need of client to deal with distributed computing, the framework takes care of all the things. So it is easy to use.<\/p>\n<h4>k. Data locality<\/h4>\n<p>It refers to the ability to move the computation close to where actual data resides on the node. Instead of moving data to computation. This minimizes network congestion and increases the over throughput of the system. Learn more about<strong> Data Locality.<\/strong><\/p>\n<h3>Conclusion<\/h3>\n<p>In conclusion, we can say, Hadoop is highly fault-tolerant. It reliably stores huge amount of data despite hardware failure. It provides High scalability and high availability.<\/p>\n<p>Hadoop is cost efficient as it runs on a cluster of commodity hardware. Hadoop work on Data locality as moving computation is cheaper than moving data. All these features of Big data Hadoop make it powerful for the Big data processing.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>In this Hadoop Tutorial, we will discuss 10 best features of Hadoop. If you are not familiar with Apache Hadoop, so you can refer our Hadoop Introduction blog to get detailed knowledge of Apache&#46;&#46;&#46;<\/p>\n","protected":false},"author":1,"featured_media":73141,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[544],"tags":[538,457,539,540,541,542,543],"class_list":["post-58","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-hadoop","tag-apache-hadoop","tag-big-data","tag-big-data-hadoop","tag-features-of-hadoop","tag-hadoop","tag-hadoop-features","tag-hadoop-tutorial"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.7 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Top 10 Features of Big Data Hadoop - TechVidvan<\/title>\n<meta name=\"description\" content=\"Top 10 features of Big data Hadoop-Fault Tolerance,Distributed Processing, Scalability,Reliability,High Availability,Economic,Flexibility,Data locality etc.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/techvidvan.com\/tutorials\/top-features-of-big-data-hadoop\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Top 10 Features of Big Data Hadoop - TechVidvan\" \/>\n<meta property=\"og:description\" content=\"Top 10 features of Big data Hadoop-Fault Tolerance,Distributed Processing, Scalability,Reliability,High Availability,Economic,Flexibility,Data locality etc.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/techvidvan.com\/tutorials\/top-features-of-big-data-hadoop\/\" \/>\n<meta property=\"og:site_name\" content=\"TechVidvan\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/TechVidvan\/\" \/>\n<meta property=\"article:published_time\" content=\"2017-09-26T10:24:46+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/techvidvan.com\/tutorials\/wp-content\/uploads\/2019\/11\/Hadoop-Features-01.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"1200\" \/>\n\t<meta property=\"og:image:height\" content=\"628\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"TechVidvan Team\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@vidvantech\" \/>\n<meta name=\"twitter:site\" content=\"@vidvantech\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"TechVidvan Team\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"3 minutes\" \/>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Top 10 Features of Big Data Hadoop - TechVidvan","description":"Top 10 features of Big data Hadoop-Fault Tolerance,Distributed Processing, Scalability,Reliability,High Availability,Economic,Flexibility,Data locality etc.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/techvidvan.com\/tutorials\/top-features-of-big-data-hadoop\/","og_locale":"en_US","og_type":"article","og_title":"Top 10 Features of Big Data Hadoop - TechVidvan","og_description":"Top 10 features of Big data Hadoop-Fault Tolerance,Distributed Processing, Scalability,Reliability,High Availability,Economic,Flexibility,Data locality etc.","og_url":"https:\/\/techvidvan.com\/tutorials\/top-features-of-big-data-hadoop\/","og_site_name":"TechVidvan","article_publisher":"https:\/\/www.facebook.com\/TechVidvan\/","article_published_time":"2017-09-26T10:24:46+00:00","og_image":[{"width":1200,"height":628,"url":"https:\/\/techvidvan.com\/tutorials\/wp-content\/uploads\/2019\/11\/Hadoop-Features-01.jpg","type":"image\/jpeg"}],"author":"TechVidvan Team","twitter_card":"summary_large_image","twitter_creator":"@vidvantech","twitter_site":"@vidvantech","twitter_misc":{"Written by":"TechVidvan Team","Est. reading time":"3 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/techvidvan.com\/tutorials\/top-features-of-big-data-hadoop\/#article","isPartOf":{"@id":"https:\/\/techvidvan.com\/tutorials\/top-features-of-big-data-hadoop\/"},"author":{"name":"TechVidvan Team","@id":"https:\/\/techvidvan.com\/tutorials\/#\/schema\/person\/e9c26e74dd3d87421f7ada9433b8cd22"},"headline":"Top 10 Features of Big Data Hadoop","datePublished":"2017-09-26T10:24:46+00:00","mainEntityOfPage":{"@id":"https:\/\/techvidvan.com\/tutorials\/top-features-of-big-data-hadoop\/"},"wordCount":634,"commentCount":0,"publisher":{"@id":"https:\/\/techvidvan.com\/tutorials\/#organization"},"image":{"@id":"https:\/\/techvidvan.com\/tutorials\/top-features-of-big-data-hadoop\/#primaryimage"},"thumbnailUrl":"https:\/\/techvidvan.com\/tutorials\/wp-content\/uploads\/2019\/11\/Hadoop-Features-01.jpg","keywords":["apache hadoop","big data","big data hadoop","Features of hadoop","hadoop","hadoop features","hadoop tutorial"],"articleSection":["Hadoop Tutorials"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/techvidvan.com\/tutorials\/top-features-of-big-data-hadoop\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/techvidvan.com\/tutorials\/top-features-of-big-data-hadoop\/","url":"https:\/\/techvidvan.com\/tutorials\/top-features-of-big-data-hadoop\/","name":"Top 10 Features of Big Data Hadoop - TechVidvan","isPartOf":{"@id":"https:\/\/techvidvan.com\/tutorials\/#website"},"primaryImageOfPage":{"@id":"https:\/\/techvidvan.com\/tutorials\/top-features-of-big-data-hadoop\/#primaryimage"},"image":{"@id":"https:\/\/techvidvan.com\/tutorials\/top-features-of-big-data-hadoop\/#primaryimage"},"thumbnailUrl":"https:\/\/techvidvan.com\/tutorials\/wp-content\/uploads\/2019\/11\/Hadoop-Features-01.jpg","datePublished":"2017-09-26T10:24:46+00:00","description":"Top 10 features of Big data Hadoop-Fault Tolerance,Distributed Processing, Scalability,Reliability,High Availability,Economic,Flexibility,Data locality etc.","breadcrumb":{"@id":"https:\/\/techvidvan.com\/tutorials\/top-features-of-big-data-hadoop\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/techvidvan.com\/tutorials\/top-features-of-big-data-hadoop\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/techvidvan.com\/tutorials\/top-features-of-big-data-hadoop\/#primaryimage","url":"https:\/\/techvidvan.com\/tutorials\/wp-content\/uploads\/2019\/11\/Hadoop-Features-01.jpg","contentUrl":"https:\/\/techvidvan.com\/tutorials\/wp-content\/uploads\/2019\/11\/Hadoop-Features-01.jpg","width":1200,"height":628,"caption":"Hadoop Features"},{"@type":"BreadcrumbList","@id":"https:\/\/techvidvan.com\/tutorials\/top-features-of-big-data-hadoop\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/techvidvan.com\/tutorials\/"},{"@type":"ListItem","position":2,"name":"Top 10 Features of Big Data Hadoop"}]},{"@type":"WebSite","@id":"https:\/\/techvidvan.com\/tutorials\/#website","url":"https:\/\/techvidvan.com\/tutorials\/","name":"TechVidvan Blogs","description":"","publisher":{"@id":"https:\/\/techvidvan.com\/tutorials\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/techvidvan.com\/tutorials\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/techvidvan.com\/tutorials\/#organization","name":"TechVidvan","url":"https:\/\/techvidvan.com\/tutorials\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/techvidvan.com\/tutorials\/#\/schema\/logo\/image\/","url":"https:\/\/techvidvan.com\/tutorials\/wp-content\/uploads\/2024\/03\/techvidvan-logo-200x50-1.webp","contentUrl":"https:\/\/techvidvan.com\/tutorials\/wp-content\/uploads\/2024\/03\/techvidvan-logo-200x50-1.webp","width":200,"height":50,"caption":"TechVidvan"},"image":{"@id":"https:\/\/techvidvan.com\/tutorials\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/TechVidvan\/","https:\/\/x.com\/vidvantech"]},{"@type":"Person","@id":"https:\/\/techvidvan.com\/tutorials\/#\/schema\/person\/e9c26e74dd3d87421f7ada9433b8cd22","name":"TechVidvan Team","description":"The TechVidvan Team delivers practical, beginner-friendly tutorials on programming, Java, Python, C++, DSA, AI, ML, data Science, Android, Flutter, MERN, Web Development, and technology. Our experts are here to help you upskill and excel in today\u2019s tech industry."}]}},"amp_enabled":true,"_links":{"self":[{"href":"https:\/\/techvidvan.com\/tutorials\/wp-json\/wp\/v2\/posts\/58","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/techvidvan.com\/tutorials\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/techvidvan.com\/tutorials\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/techvidvan.com\/tutorials\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/techvidvan.com\/tutorials\/wp-json\/wp\/v2\/comments?post=58"}],"version-history":[{"count":0,"href":"https:\/\/techvidvan.com\/tutorials\/wp-json\/wp\/v2\/posts\/58\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/techvidvan.com\/tutorials\/wp-json\/wp\/v2\/media\/73141"}],"wp:attachment":[{"href":"https:\/\/techvidvan.com\/tutorials\/wp-json\/wp\/v2\/media?parent=58"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/techvidvan.com\/tutorials\/wp-json\/wp\/v2\/categories?post=58"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/techvidvan.com\/tutorials\/wp-json\/wp\/v2\/tags?post=58"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}