{"id":1842,"date":"2026-02-23T09:41:06","date_gmt":"2026-02-23T02:41:06","guid":{"rendered":"https:\/\/base.binus.ac.id\/business-engineering\/?p=1842"},"modified":"2026-02-23T09:41:11","modified_gmt":"2026-02-23T02:41:11","slug":"classification-in-data-mining","status":"publish","type":"post","link":"https:\/\/base.binus.ac.id\/business-engineering\/2026\/02\/23\/classification-in-data-mining\/","title":{"rendered":"Classification in Data Mining"},"content":{"rendered":"<h1>Classification in Data Mining<\/h1>\n<p>Businesses collect enormous amounts of data every day, such as transactions, customer behavior, sensor readings, and website activity.<\/p>\n<p>However, raw data alone doesn't create value. Data mining methods extract value from data by answering questions that involve prediction and\/or labeling, and one such method is classification.<\/p>\n<p>&nbsp;<\/p>\n<h2>What is Classification?<\/h2>\n<p>Classification is a data processing technique that assigns data to groups according to the criteria that the data points share [1], [2]. These groupings are not random guesses. 
Instead, a model learns from past labeled examples and uses what it has learned to predict labels for new data.<\/p>\n<p>For example:<\/p>\n<table style=\"border-collapse: collapse;width: 100%\" border=\"1\">\n<tbody>\n<tr>\n<td style=\"width: 50%;text-align: center\"><strong>Problem<\/strong><\/td>\n<td style=\"width: 50%;text-align: center\"><strong>Classification Output<\/strong><\/td>\n<\/tr>\n<tr>\n<td style=\"width: 50%\">Email filtering<\/td>\n<td style=\"width: 50%\">Spam \/ Not Spam<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 50%\">Loan approval<\/td>\n<td style=\"width: 50%\">Approve \/ Reject<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 50%\">Customer churn<\/td>\n<td style=\"width: 50%\">Stay \/ Leave<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 50%\">Product quality<\/td>\n<td style=\"width: 50%\">Defective \/ Non-Defective<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 50%\">Medical diagnosis<\/td>\n<td style=\"width: 50%\">[Disease] \/ Healthy<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Classification outputs are discrete classes rather than continuous numbers. This is what distinguishes classification from prediction methods such as regression, which output numeric values.<\/p>\n<p>&nbsp;<\/p>\n<h2>How Classification Works<\/h2>\n<p>A classification model follows three main stages:<\/p>\n<h3>1. 
Training<\/h3>\n<p>At this stage, the model studies historical labeled data.<\/p>\n<p>Example dataset:<\/p>\n<table style=\"border-collapse: collapse;width: 100%\" border=\"1\">\n<tbody>\n<tr>\n<td style=\"width: 33.3333%;text-align: center\"><strong>Customer Age<\/strong><\/td>\n<td style=\"width: 33.3333%;text-align: center\"><strong>Monthly Usage<\/strong><\/td>\n<td style=\"width: 33.3333%;text-align: center\"><strong>Churn<\/strong><\/td>\n<\/tr>\n<tr>\n<td style=\"width: 33.3333%;text-align: center\">21<\/td>\n<td style=\"width: 33.3333%;text-align: center\">Low<\/td>\n<td style=\"width: 33.3333%;text-align: center\">Yes<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 33.3333%;text-align: center\">45<\/td>\n<td style=\"width: 33.3333%;text-align: center\">High<\/td>\n<td style=\"width: 33.3333%;text-align: center\">No<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 33.3333%;text-align: center\">30<\/td>\n<td style=\"width: 33.3333%;text-align: center\">Medium<\/td>\n<td style=\"width: 33.3333%;text-align: center\">No<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>The algorithm then learns patterns that connect the features to the outcomes.<\/p>\n<h3>2. Learning the Pattern<\/h3>\n<p>The algorithm builds decision rules from the training dataset. From the example dataset, it may learn that young customers with low usage are more likely to churn. This step is also known as the \"intelligence building\" phase.<\/p>\n<h3>3. 
Prediction<\/h3>\n<p>After the patterns have been learned, the model is given new data and predicts the outcomes.<\/p>\n<p>New data example:<\/p>\n<table style=\"border-collapse: collapse;width: 100%;height: 48px\" border=\"1\">\n<tbody>\n<tr style=\"height: 24px\">\n<td style=\"width: 50%;text-align: center;height: 24px\"><strong>Customer Age<\/strong><\/td>\n<td style=\"width: 50%;text-align: center;height: 24px\"><strong>Monthly Usage<\/strong><\/td>\n<\/tr>\n<tr style=\"height: 24px\">\n<td style=\"width: 50%;text-align: center;height: 24px\">24<\/td>\n<td style=\"width: 50%;text-align: center;height: 24px\">Low<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>The model will most likely predict that this customer will churn (churn = yes), because the record describes a young customer with low monthly usage. In the historical data, customers with this profile churned.<\/p>\n<p>&nbsp;<\/p>\n<h2>Common Classification Algorithms<\/h2>\n<ul>\n<li><strong>Decision Tree:<\/strong> It creates human-readable rules using branching logic, which are easy to interpret. An example rule is, \"IF usage is low AND complaints are high, THEN the customer will churn.\"<\/li>\n<li><strong>k-Nearest Neighbor (kNN):<\/strong> It classifies data based on similarity to nearby observations.<\/li>\n<li><strong>Logistic Regression:<\/strong> It outputs the probability that an outcome will occur. For example, the model may estimate an 82% chance that a customer will churn.<\/li>\n<li><strong>Na\u00efve Bayes:<\/strong> It is a probability-based classifier that assumes the features are statistically independent given the class. This algorithm is fast and effective for text classification, such as spam detection or sentiment analysis.<\/li>\n<li><strong>Support Vector Machine (SVM):<\/strong> This algorithm separates classes using optimal boundaries. 
It works well for complex patterns and high-dimensional data.<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<h2>References<\/h2>\n<p>[1] D. Vaya and T. Hadpawat, \u201cClassification in Data Mining: A Survey,\u201d <i>International Journal of Advanced Science and Technology<\/i>, vol. 29, no. 3, pp. 13061\u201313071, Jan. 2020.<\/p>\n<div>\n<p>[2] N. a\u2019yuni Ramadhani and H. A. Rosyid, \u201cReview: Algoritma Data Mining untuk Klasifikasi Data,\u201d <i>Jurnal Inovasi Teknologi Dan Edukasi Teknik<\/i>, vol. 2, no. 12, pp. 550\u2013556, Dec. 2022, doi: <a href=\"https:\/\/doi.org\/10.17977\/um068v2i122022p550-556\"><span class=\"url\">10.17977\/um068v2i122022p550-556<\/span><\/a>.<\/p>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>Classification in Data Mining Businesses collect enormous amounts of data every day, such as transactions, customer behavior, sensor readings, website activity, etc. However, raw data alone doesn't create value. Hence, data mining methods are used to give values to data by answering questions that involve prediction and\/or labeling, one of which is through a method [&hellip;]<\/p>\n","protected":false},"author":15,"featured_media":1846,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[6],"tags":[],"class_list":["post-1842","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-articles"],"featured_image":{"phone":"https:\/\/base.binus.ac.id\/business-engineering\/wp-content\/uploads\/sites\/4\/2026\/02\/classification-480x268.jpg","tablet":"https:\/\/base.binus.ac.id\/business-engineering\/wp-content\/uploads\/sites\/4\/2026\/02\/classification.jpg"},"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v14.4.1 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Classification in Data Mining - Business Engineering<\/title>\n<meta name=\"description\" content=\"This article gives a short, simple description 
of classification model in data mining. It gives basic insights of how this model works and how it is used.\" \/>\n<meta name=\"robots\" content=\"index, follow\" \/>\n<meta name=\"googlebot\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<meta name=\"bingbot\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/base.binus.ac.id\/business-engineering\/2026\/02\/23\/classification-in-data-mining\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Classification in Data Mining - Business Engineering\" \/>\n<meta property=\"og:description\" content=\"This article gives a short, simple description of classification model in data mining. It gives basic insights of how this model works and how it is used.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/base.binus.ac.id\/business-engineering\/2026\/02\/23\/classification-in-data-mining\/\" \/>\n<meta property=\"og:site_name\" content=\"Business Engineering\" \/>\n<meta property=\"article:published_time\" content=\"2026-02-23T02:41:06+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2026-02-23T02:41:11+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/base.binus.ac.id\/business-engineering\/wp-content\/uploads\/sites\/4\/2026\/02\/classification.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"523\" \/>\n\t<meta property=\"og:image:height\" content=\"292\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebSite\",\"@id\":\"https:\/\/base.binus.ac.id\/business-engineering\/#website\",\"url\":\"https:\/\/base.binus.ac.id\/business-engineering\/\",\"name\":\"Business Engineering\",\"description\":\"BINUS 
UNIVERSITY\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":\"https:\/\/base.binus.ac.id\/business-engineering\/?s={search_term_string}\",\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"ImageObject\",\"@id\":\"https:\/\/base.binus.ac.id\/business-engineering\/2026\/02\/23\/classification-in-data-mining\/#primaryimage\",\"inLanguage\":\"en-US\",\"url\":\"https:\/\/base.binus.ac.id\/business-engineering\/wp-content\/uploads\/sites\/4\/2026\/02\/classification.jpg\",\"width\":523,\"height\":292},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/base.binus.ac.id\/business-engineering\/2026\/02\/23\/classification-in-data-mining\/#webpage\",\"url\":\"https:\/\/base.binus.ac.id\/business-engineering\/2026\/02\/23\/classification-in-data-mining\/\",\"name\":\"Classification in Data Mining - Business Engineering\",\"isPartOf\":{\"@id\":\"https:\/\/base.binus.ac.id\/business-engineering\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/base.binus.ac.id\/business-engineering\/2026\/02\/23\/classification-in-data-mining\/#primaryimage\"},\"datePublished\":\"2026-02-23T02:41:06+00:00\",\"dateModified\":\"2026-02-23T02:41:11+00:00\",\"author\":{\"@id\":\"https:\/\/base.binus.ac.id\/business-engineering\/#\/schema\/person\/f9b77807075fb6cd0764cc54d29813b5\"},\"description\":\"This article gives a short, simple description of classification model in data mining. 
It gives basic insights of how this model works and how it is used.\",\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/base.binus.ac.id\/business-engineering\/2026\/02\/23\/classification-in-data-mining\/\"]}]},{\"@type\":[\"Person\"],\"@id\":\"https:\/\/base.binus.ac.id\/business-engineering\/#\/schema\/person\/f9b77807075fb6cd0764cc54d29813b5\",\"name\":\"authorbasebuseng\",\"image\":{\"@type\":\"ImageObject\",\"@id\":\"https:\/\/base.binus.ac.id\/business-engineering\/#personlogo\",\"inLanguage\":\"en-US\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/7008c0cb08f8d2b192b47fb862de8a6f?s=96&d=mm&r=g\",\"caption\":\"authorbasebuseng\"}}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","_links":{"self":[{"href":"https:\/\/base.binus.ac.id\/business-engineering\/wp-json\/wp\/v2\/posts\/1842"}],"collection":[{"href":"https:\/\/base.binus.ac.id\/business-engineering\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/base.binus.ac.id\/business-engineering\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/base.binus.ac.id\/business-engineering\/wp-json\/wp\/v2\/users\/15"}],"replies":[{"embeddable":true,"href":"https:\/\/base.binus.ac.id\/business-engineering\/wp-json\/wp\/v2\/comments?post=1842"}],"version-history":[{"count":3,"href":"https:\/\/base.binus.ac.id\/business-engineering\/wp-json\/wp\/v2\/posts\/1842\/revisions"}],"predecessor-version":[{"id":1845,"href":"https:\/\/base.binus.ac.id\/business-engineering\/wp-json\/wp\/v2\/posts\/1842\/revisions\/1845"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/base.binus.ac.id\/business-engineering\/wp-json\/wp\/v2\/media\/1846"}],"wp:attachment":[{"href":"https:\/\/base.binus.ac.id\/business-engineering\/wp-json\/wp\/v2\/media?parent=1842"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/base.binus.ac.id\/business-engineering\/wp-json\/wp\/v2\/categories?post=1842"},{"taxonomy":"post_tag","embeddable":true,"hr
ef":"https:\/\/base.binus.ac.id\/business-engineering\/wp-json\/wp\/v2\/tags?post=1842"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}