{"nbformat":4,"nbformat_minor":0,"metadata":{"colab":{"provenance":[],"collapsed_sections":["51Rqt_IBsSSk","2wwVqbUcs-R5","FNcS2pkK8XYu"],"authorship_tag":"ABX9TyNc3Rq8UXGeqNPKG/G+ChOn"},"kernelspec":{"name":"python3","display_name":"Python 3"},"language_info":{"name":"python"}},"cells":[{"cell_type":"markdown","source":["# **Assignment 2 | Data Discretization**"],"metadata":{"id":"lr9BFevqpF0I"}},{"cell_type":"markdown","source":["Discretization is the process of replacing a continuum with a finite set of points. In digital computing, discretization occurs when a continuous-time signal, such as audio or video, is reduced to a discrete signal. It is an integral part of analog-to-digital conversion and is closely related to quantization.\n","\n","- Find a dataset with numeric features (a classification dataset)\n","\n","- Discretize it using equal width and equal frequency binning\n","\n","- Discretize it using an entropy-based method\n","\n","- Submit the assignment as a GitHub link (a static site built with Jupyter Book)"],"metadata":{"id":"-anjEgSW6-WD"}},{"cell_type":"code","execution_count":1,"metadata":{"colab":{"base_uri":"https://localhost:8080/","height":424},"id":"VWbmzb9PnHY4","executionInfo":{"status":"ok","timestamp":1670808125869,"user_tz":-420,"elapsed":1258,"user":{"displayName":"Caca Erha","userId":"13359221303846732984"}},"outputId":"c4496cc7-599c-4f29-f2d2-6882eba5d16a"},"outputs":[{"output_type":"execute_result","data":{"text/plain":[" sepal.length sepal.width petal.length petal.width variety\n","0 5.1 3.5 1.4 0.2 Setosa\n","1 4.9 3.0 1.4 0.2 Setosa\n","2 4.7 3.2 1.3 0.2 Setosa\n","3 4.6 3.1 1.5 0.2 Setosa\n","4 5.0 3.6 1.4 0.2 Setosa\n",".. ... ... ... ... ...\n","145 6.7 3.0 5.2 2.3 Virginica\n","146 6.3 2.5 5.0 1.9 Virginica\n","147 6.5 3.0 5.2 2.0 Virginica\n","148 6.2 3.4 5.4 2.3 Virginica\n","149 5.9 3.0 5.1 1.8 Virginica\n","\n","[150 rows x 5 columns]"],"text/html":["\n","
\n"," "]},"metadata":{},"execution_count":1}],"source":["import pandas as pd \n","import matplotlib.pyplot as plt\n","import numpy as np\n","import math\n","data = pd.read_csv(\"https://gist.githubusercontent.com/netj/8836201/raw/6f9306ad21398ea43cba4f7d537619d0e07d5ae3/iris.csv\")\n","data"]},{"cell_type":"markdown","source":["## After Transformation into Categories"],"metadata":{"id":"ie_lseWhVwIJ"}},{"cell_type":"markdown","source":["## Equal Width"],"metadata":{"id":"dEuoqaN9pLyf"}},{"cell_type":"markdown","source":["Equal width binning splits the range of values into intervals of the same width.\n","\n","**Equal width formula:**\n","> $w = \\frac{max - min}{x}$\n","\n","```\n","where:\n","  w   = width of each bin\n","  max = maximum value in the data\n","  min = minimum value in the data\n","  x   = number of categories / bins\n","```"],"metadata":{"id":"A1sxv2DUpPTS"}},{"cell_type":"markdown","source":["In the pandas library, `pd.cut` performs equal width binning."],"metadata":{"id":"cOR6ZPA650AF"}},{"cell_type":"code","source":["label = ['A','B','C']\n","data['category_sepal_length'] = pd.cut(data['sepal.length'], 3, labels=label)\n","data['category_sepal_width'] = pd.cut(data['sepal.width'], 3, labels=label)\n","data['category_petal_length'] = pd.cut(data['petal.length'], 3, labels=label)\n","data['category_petal_width'] = pd.cut(data['petal.width'], 3, labels=label)\n","data[['category_sepal_length','category_sepal_width','category_petal_length','category_petal_width','variety']]"],"metadata":{"colab":{"base_uri":"https://localhost:8080/","height":424},"id":"DN5dvi-CpYIg","executionInfo":{"status":"ok","timestamp":1670808125869,"user_tz":-420,"elapsed":29,"user":{"displayName":"Caca Erha","userId":"13359221303846732984"}},"outputId":"ad3283cf-d79b-462f-f4a9-293aec6664a9"},"execution_count":2,"outputs":[{"output_type":"execute_result","data":{"text/plain":[" category_sepal_length category_sepal_width category_petal_length \\\n","0 A B A \n","1 A B A 
\n","2 A B A \n","3 A B A \n","4 A B A \n",".. ... ... ... \n","145 B B C \n","146 B A C \n","147 B B C \n","148 B B C \n","149 B B C \n","\n"," category_petal_width variety \n","0 A Setosa \n","1 A Setosa \n","2 A Setosa \n","3 A Setosa \n","4 A Setosa \n",".. ... ... \n","145 C Virginica \n","146 C Virginica \n","147 C Virginica \n","148 C Virginica \n","149 C Virginica \n","\n","[150 rows x 5 columns]"],"text/html":["\n","
\n"," "]},"metadata":{},"execution_count":2}]},{"cell_type":"markdown","source":["## Equal Frequency"],"metadata":{"id":"B9qwpT-Mq1eL"}},{"cell_type":"markdown","source":["Equal frequency binning splits the range of values into intervals that each contain roughly the same number of data points.\n","\n","**Equal frequency formula:**\n","> $Freq = \\frac{n}{x}$\n","\n","```\n","where:\n","  Freq = number of data points per bin (frequency)\n","  x    = number of categories / bins\n","  n    = number of data points in the list (count)\n","```"],"metadata":{"id":"SyZ8Le-iq-iS"}},{"cell_type":"markdown","source":["In the pandas library, `pd.qcut` performs equal frequency binning."],"metadata":{"id":"vQvJo7EG6D6o"}},{"cell_type":"code","source":["label = ['A','B','C']\n","data['category_sepal_length'] = pd.qcut(data['sepal.length'], 3, labels=label)\n","data['category_sepal_width'] = pd.qcut(data['sepal.width'], 3, labels=label)\n","data['category_petal_length'] = pd.qcut(data['petal.length'], 3, labels=label)\n","data['category_petal_width'] = pd.qcut(data['petal.width'], 3, labels=label)\n","data[['category_sepal_length','category_sepal_width','category_petal_length','category_petal_width','variety']]"],"metadata":{"colab":{"base_uri":"https://localhost:8080/","height":424},"id":"_Athrev0rDnd","executionInfo":{"status":"ok","timestamp":1670808125870,"user_tz":-420,"elapsed":29,"user":{"displayName":"Caca Erha","userId":"13359221303846732984"}},"outputId":"9e1e8d39-c70c-40dc-f016-1bfded226056"},"execution_count":3,"outputs":[{"output_type":"execute_result","data":{"text/plain":[" category_sepal_length category_sepal_width category_petal_length \\\n","0 A C A \n","1 A B A \n","2 A B A \n","3 A B A \n","4 A C A \n",".. ... ... ... \n","145 C B C \n","146 B A C \n","147 C B C \n","148 B C C \n","149 B B C \n","\n"," category_petal_width variety \n","0 A Setosa \n","1 A Setosa \n","2 A Setosa \n","3 A Setosa \n","4 A Setosa \n",".. ... ... 
\n","145 C Virginica \n","146 C Virginica \n","147 C Virginica \n","148 C Virginica \n","149 C Virginica \n","\n","[150 rows x 5 columns]"],"text/html":["\n","
\n"," "]},"metadata":{},"execution_count":3}]},{"cell_type":"markdown","source":["## **Entropy**"],"metadata":{"id":"kvW3QchLrOJY"}},{"cell_type":"markdown","source":["Entropy-based discretization is applied to the `petal.length` column. The labels in `category_petal_length` are the equal frequency categories assigned by `pd.qcut` in the previous step."],"metadata":{"id":"tSS55NoErSHy"}},{"cell_type":"code","source":["panjang_petal = data[['petal.length','category_petal_length']]\n","panjang_petal"],"metadata":{"colab":{"base_uri":"https://localhost:8080/","height":424},"id":"53QmG8uorU2Q","executionInfo":{"status":"ok","timestamp":1670808125870,"user_tz":-420,"elapsed":28,"user":{"displayName":"Caca Erha","userId":"13359221303846732984"}},"outputId":"370be75f-7846-4545-a71a-fc554ea29730"},"execution_count":4,"outputs":[{"output_type":"execute_result","data":{"text/plain":[" petal.length category_petal_length\n","0 1.4 A\n","1 1.4 A\n","2 1.3 A\n","3 1.5 A\n","4 1.4 A\n",".. ... ...\n","145 5.2 C\n","146 5.0 C\n","147 5.2 C\n","148 5.4 C\n","149 5.1 C\n","\n","[150 rows x 2 columns]"],"text/html":["\n","
\n"," "]},"metadata":{},"execution_count":4}]},{"cell_type":"markdown","source":["### Functions for the Entropy Process"],"metadata":{"id":"Jv-72qtsrz7l"}},{"cell_type":"markdown","source":["**Counting the data in each category**"],"metadata":{"id":"8TPNgyQW64Bx"}},{"cell_type":"code","source":["def keseluruhan(data, labels, kolom, kategori):\n","    # Count the rows of `kolom` in each category, in the order given by `labels`\n","    data_keseluruhan = []\n","    # observed=False keeps empty categories so that their count is 0\n","    group = data.groupby(kategori, observed=False).count()\n","    for i in labels:\n","        data_keseluruhan.append(group.loc[i,kolom])\n","    return data_keseluruhan"],"metadata":{"id":"CtualzAOr46o","executionInfo":{"status":"ok","timestamp":1670808125871,"user_tz":-420,"elapsed":28,"user":{"displayName":"Caca Erha","userId":"13359221303846732984"}}},"execution_count":5,"outputs":[]},{"cell_type":"markdown","source":["**Splitting the data above at a cut point**"],"metadata":{"id":"HaAR_S_aACfU"}},{"cell_type":"code","source":["def split(titik, data, labels, kolom, kategori):\n","    # Partition the rows at the cut point `titik`: below it and at or above it\n","    grupa = data[data[kolom] < titik]\n","    grupb = data[data[kolom] >= titik]\n","    # Per-category counts for each partition\n","    panjanggrupa = keseluruhan(grupa, labels, kolom, kategori)\n","    panjanggrupb = keseluruhan(grupb, labels, kolom, kategori)\n","    return (panjanggrupa, panjanggrupb)"],"metadata":{"id":"rn3YOmtnsLXI","executionInfo":{"status":"ok","timestamp":1670808125871,"user_tz":-420,"elapsed":28,"user":{"displayName":"Caca Erha","userId":"13359221303846732984"}}},"execution_count":6,"outputs":[]},{"cell_type":"markdown","source":["**Computing the entropy of the data $(D_1 / D_2)$**"],"metadata":{"id":"jEZcml456umA"}},{"cell_type":"markdown","source":["\n","**Entropy formula:**\n","> $Entropy(D_1) = - \\sum_{i=1}^{m} p_i \\log_2 p_i$\n","\n","```\n","where:\n","  D   = set of cases (here D1 or D2)\n","  m   = number of partitions (categories) in D\n","  p_i = proportion of D_i relative to D\n","``` "],"metadata":{"id":"kMaydRhd-nKG"}},{"cell_type":"code","source":["def entropy(data):\n","    # `data` is a list of per-category counts\n","    total = sum(data)\n","    data_entropy = []\n","    for i in data:\n","        if i != 0:\n","            p = i/total\n","            data_entropy.append(-p*math.log2(p))\n","        else:\n","            # An empty category contributes 0 to the entropy\n","            data_entropy.append(0)\n","    return (sum(data_entropy))"],"metadata":{"id":"K_QCOUbMsOj_","executionInfo":{"status":"ok","timestamp":1670808125872,"user_tz":-420,"elapsed":28,"user":{"displayName":"Caca Erha","userId":"13359221303846732984"}}},"execution_count":7,"outputs":[]},{"cell_type":"markdown","source":["## **$info_a$**"],"metadata":{"id":"51Rqt_IBsSSk"}},{"cell_type":"markdown","source":["### **Function for computing $info_a$**"],"metadata":{"id":"vNQdnjQOsWPR"}},{"cell_type":"markdown","source":["**Formula:**\n","> $info_a = \\frac{|D_1|}{|D|} Entropy(D_1) + \\frac{|D_2|}{|D|} Entropy(D_2)$\n","\n","```\n","where:\n","  |D1|, |D2|  = number of data points in each partition\n","  |D|         = total number of data points\n","  Entropy(Di) = entropy of partition Di\n","```"],"metadata":{"id":"DjHts1I170fp"}},{"cell_type":"code","source":["def info(hasil_split, data):\n","    # Weighted sum of the entropies of the partitions produced by a split\n","    infoa = []\n","    for i in hasil_split:\n","        angka = (sum(i)/data.shape[0])*entropy(i)\n","        infoa.append(angka)\n","    return sum(infoa)"],"metadata":{"id":"YUPw7owhsakY","executionInfo":{"status":"ok","timestamp":1670808125872,"user_tz":-420,"elapsed":28,"user":{"displayName":"Caca Erha","userId":"13359221303846732984"}}},"execution_count":8,"outputs":[]},{"cell_type":"markdown","source":["## **Gain $(E_{new})$**"],"metadata":{"id":"2wwVqbUcs-R5"}},{"cell_type":"markdown","source":["### **Function for computing Gain $(E_{new})$**"],"metadata":{"id":"sIrJhJEttBja"}},{"cell_type":"markdown","source":["**Gain formula:**\n","> $Gain(E_{new}) = E_{initial} - E_{new}$\n","\n","```\n","where:\n","  Gain     = information gain of the split\n","  Einitial = entropy of the whole data (entropy_1)\n","  Enew     = info_a of the split\n","```"],"metadata":{"id":"_TeSC0uy7-aE"}},{"cell_type":"code","source":["def gain(Einisial,Enew):\n","    return Einisial - Enew"],"metadata":{"id":"RFxBNx3MtE2-","executionInfo":{"status":"ok","timestamp":1670808125873,"user_tz":-420,"elapsed":28,"user":{"displayName":"Caca Erha","userId":"13359221303846732984"}}},"execution_count":9,"outputs":[]},{"cell_type":"markdown","source":["## Applying the Functions to the Dataset 
iris"],"metadata":{"id":"M7rlW5K7vKi9"}},{"cell_type":"code","source":["keseluruhan_data = keseluruhan(panjang_petal,label,'petal.length','category_petal_length')\n","keseluruhan_data"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"su_740yivSCM","executionInfo":{"status":"ok","timestamp":1670808125873,"user_tz":-420,"elapsed":28,"user":{"displayName":"Caca Erha","userId":"13359221303846732984"}},"outputId":"8a212a24-b20a-4ed6-97af-822e32c1bcad"},"execution_count":10,"outputs":[{"output_type":"execute_result","data":{"text/plain":["[50, 54, 46]"]},"metadata":{},"execution_count":10}]},{"cell_type":"markdown","source":["Compute the entropy of the whole data"],"metadata":{"id":"nIDbg90Mymzc"}},{"cell_type":"code","source":["entropy_1 = entropy(keseluruhan_data)\n","entropy_1"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"1WN-ZDhpyqzY","executionInfo":{"status":"ok","timestamp":1670808125874,"user_tz":-420,"elapsed":27,"user":{"displayName":"Caca Erha","userId":"13359221303846732984"}},"outputId":"6ea4407e-f312-47fc-bc30-acdefc7d4d98"},"execution_count":11,"outputs":[{"output_type":"execute_result","data":{"text/plain":["1.58188145993464"]},"metadata":{},"execution_count":11}]},{"cell_type":"markdown","source":["### Evaluating Split 1: 4.4"],"metadata":{"id":"wffJAzOyzBzC"}},{"cell_type":"code","source":["split_1 = split(4.4,panjang_petal,label,'petal.length','category_petal_length')\n","split_1"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"IpG_2XOlzGJV","executionInfo":{"status":"ok","timestamp":1670808125874,"user_tz":-420,"elapsed":25,"user":{"displayName":"Caca Erha","userId":"13359221303846732984"}},"outputId":"3d27ac15-8e15-4f00-8d1c-209d37ce42a6"},"execution_count":12,"outputs":[{"output_type":"execute_result","data":{"text/plain":["([50, 25, 0], [0, 29, 46])"]},"metadata":{},"execution_count":12}]},{"cell_type":"markdown","source":["Compute $info_a$ for split 1 
: 4.4"],"metadata":{"id":"yxoYxeGSz2Y7"}},{"cell_type":"code","source":["infoa_1 = info(split_1,panjang_petal)\n","infoa_1"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"i1FQMavWz6tn","executionInfo":{"status":"ok","timestamp":1670808125875,"user_tz":-420,"elapsed":25,"user":{"displayName":"Caca Erha","userId":"13359221303846732984"}},"outputId":"cfb78f11-63e0-49b9-821a-a52535f036e8"},"execution_count":13,"outputs":[{"output_type":"execute_result","data":{"text/plain":["0.9404552700263706"]},"metadata":{},"execution_count":13}]},{"cell_type":"markdown","source":["Compute the gain for split 1: 4.4"],"metadata":{"id":"2pfSH5Ts4Zkc"}},{"cell_type":"code","source":["gain_1 = gain(entropy_1,infoa_1)\n","gain_1"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"M9ONQdPP3gxV","executionInfo":{"status":"ok","timestamp":1670808125875,"user_tz":-420,"elapsed":24,"user":{"displayName":"Caca Erha","userId":"13359221303846732984"}},"outputId":"a1bd2754-0281-4353-bb12-28ac2a0c7b30"},"execution_count":14,"outputs":[{"output_type":"execute_result","data":{"text/plain":["0.6414261899082694"]},"metadata":{},"execution_count":14}]},{"cell_type":"markdown","source":["### Evaluating Split 2: 5.5"],"metadata":{"id":"QNYdQBgx3zDv"}},{"cell_type":"markdown","source":["Compute split 2: 5.5"],"metadata":{"id":"BpwD9ZIhAhV6"}},{"cell_type":"code","source":["split_2 = split(5.5,panjang_petal,label,'petal.length','category_petal_length')\n","split_2"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"dc8yBqqj39tb","executionInfo":{"status":"ok","timestamp":1670808125876,"user_tz":-420,"elapsed":24,"user":{"displayName":"Caca Erha","userId":"13359221303846732984"}},"outputId":"a5e04fb1-52f1-41c7-8516-f6e941655902"},"execution_count":15,"outputs":[{"output_type":"execute_result","data":{"text/plain":["([50, 54, 18], [0, 0, 28])"]},"metadata":{},"execution_count":15}]},{"cell_type":"markdown","source":["Compute $info_a$ for split 2: 
5.5"],"metadata":{"id":"lD-neky8AlvR"}},{"cell_type":"code","source":["infoa_2 = info(split_2,panjang_petal)\n","infoa_2"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"fxJUnZef4G3Y","executionInfo":{"status":"ok","timestamp":1670808125876,"user_tz":-420,"elapsed":23,"user":{"displayName":"Caca Erha","userId":"13359221303846732984"}},"outputId":"82dd4cce-69cc-4fb1-8d9e-8eef785e13b9"},"execution_count":16,"outputs":[{"output_type":"execute_result","data":{"text/plain":["1.183563803674313"]},"metadata":{},"execution_count":16}]},{"cell_type":"markdown","source":["Compute the gain for split 2: 5.5"],"metadata":{"id":"p4zwnRybAqfz"}},{"cell_type":"code","source":["gain_2 = gain(entropy_1,infoa_2)\n","gain_2"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"HGYj0KIo4Pa-","executionInfo":{"status":"ok","timestamp":1670808125877,"user_tz":-420,"elapsed":23,"user":{"displayName":"Caca Erha","userId":"13359221303846732984"}},"outputId":"823f201a-2b9d-4f3d-8d26-d4460f6bdd9b"},"execution_count":17,"outputs":[{"output_type":"execute_result","data":{"text/plain":["0.398317656260327"]},"metadata":{},"execution_count":17}]},{"cell_type":"markdown","source":["### Evaluating Split 3: 6.6\n"],"metadata":{"id":"ejPkBp2K4e3W"}},{"cell_type":"markdown","source":["Compute split 3: 6.6"],"metadata":{"id":"spQ5ptizAv-x"}},{"cell_type":"code","source":["split_3 = split(6.6,panjang_petal,label,'petal.length','category_petal_length')\n","split_3"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"Sq-dl0-24k0J","executionInfo":{"status":"ok","timestamp":1670808125877,"user_tz":-420,"elapsed":22,"user":{"displayName":"Caca Erha","userId":"13359221303846732984"}},"outputId":"9215fe84-fc5f-4b3f-a647-a772d82468db"},"execution_count":18,"outputs":[{"output_type":"execute_result","data":{"text/plain":["([50, 54, 42], [0, 0, 4])"]},"metadata":{},"execution_count":18}]},{"cell_type":"markdown","source":["Compute $info_a$ for split 3: 
6.6"],"metadata":{"id":"S-g8GooHA0cD"}},{"cell_type":"code","source":["infoa_3 = info(split_3,panjang_petal)\n","infoa_3"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"fxqv0Y854pzh","executionInfo":{"status":"ok","timestamp":1670808125878,"user_tz":-420,"elapsed":22,"user":{"displayName":"Caca Erha","userId":"13359221303846732984"}},"outputId":"8ad90719-6eb6-4aae-ca63-a50629315391"},"execution_count":19,"outputs":[{"output_type":"execute_result","data":{"text/plain":["1.535202128228074"]},"metadata":{},"execution_count":19}]},{"cell_type":"markdown","source":["Compute the gain for split 3: 6.6"],"metadata":{"id":"jCLT6fCQA3PW"}},{"cell_type":"code","source":["gain_3 = gain(entropy_1,infoa_3)\n","gain_3"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"d4G-6AYt4zCG","executionInfo":{"status":"ok","timestamp":1670808125878,"user_tz":-420,"elapsed":21,"user":{"displayName":"Caca Erha","userId":"13359221303846732984"}},"outputId":"977a49f3-4ea0-46f0-b779-03e977ab3f41"},"execution_count":20,"outputs":[{"output_type":"execute_result","data":{"text/plain":["0.04667933170656613"]},"metadata":{},"execution_count":20}]},{"cell_type":"markdown","source":["## **Conclusion**"],"metadata":{"id":"FNcS2pkK8XYu"}},{"cell_type":"markdown","source":["The larger the Gain, that is, the further **Enew** falls below **Einisial**, the better the split separates the categories. Of the three candidates, split `4.4` has the highest gain (about 0.641, compared with about 0.398 for `5.5` and about 0.047 for `6.6`), so `4.4` is the best split."],"metadata":{"id":"Qswqq2v38Zri"}}]}