Title: Evaluation of Chabot Text Classification Using Machine Learning
Abstract: Chapter 13 Evaluation of Chabot Text Classification Using Machine Learning P. Kumaraguru Diderot, P. Kumaraguru Diderot Department of ECE, Hindustan Institute of Technology and Science, Chennai, IndiaSearch for more papers by this authorK. Sakthidasan Sankaran, K. Sakthidasan Sankaran Department of ECE, Hindustan Institute of Technology and Science, Chennai, IndiaSearch for more papers by this authorMalik Jawarneh, Malik Jawarneh Faculty of Computing Sciences, Gulf College, Al-Khuwair, OmanSearch for more papers by this authorHriakumar Pallathadka, Hriakumar Pallathadka Manipur International University, Manipur, IndiaSearch for more papers by this authorJosé Luis Arias-Gonzáles, José Luis Arias-Gonzáles University of British Columbia, Lima, PeruSearch for more papers by this authorDomenic T. Sanchez, Domenic T. Sanchez Cebu Technological University-NEC, Cebu City, PhilippinesSearch for more papers by this author P. Kumaraguru Diderot, P. Kumaraguru Diderot Department of ECE, Hindustan Institute of Technology and Science, Chennai, IndiaSearch for more papers by this authorK. Sakthidasan Sankaran, K. Sakthidasan Sankaran Department of ECE, Hindustan Institute of Technology and Science, Chennai, IndiaSearch for more papers by this authorMalik Jawarneh, Malik Jawarneh Faculty of Computing Sciences, Gulf College, Al-Khuwair, OmanSearch for more papers by this authorHriakumar Pallathadka, Hriakumar Pallathadka Manipur International University, Manipur, IndiaSearch for more papers by this authorJosé Luis Arias-Gonzáles, José Luis Arias-Gonzáles University of British Columbia, Lima, PeruSearch for more papers by this authorDomenic T. Sanchez, Domenic T. Sanchez Cebu Technological University-NEC, Cebu City, PhilippinesSearch for more papers by this author Book Editor(s):Romil Rawat, Romil RawatSearch for more papers by this authorRajesh Kumar Chakrawarti, Rajesh Kumar ChakrawartiSearch for more papers by this authorSanjaya Kumar Sarangi, Sanjaya Kumar SarangiSearch for more papers by this authorPiyush Vyas, Piyush VyasSearch for more papers by this authorMary Sowjanya Alamanda, Mary Sowjanya AlamandaSearch for more papers by this authorKotagiri Srividya, Kotagiri SrividyaSearch for more papers by this authorKrishnan Sakthidasan Sankaran, Krishnan Sakthidasan SankaranSearch for more papers by this author First published: 27 January 2024 https://doi.org/10.1002/9781394200801.ch13 AboutPDFPDF ToolsRequest permissionExport citationAdd to favoritesTrack citation ShareShareShare a linkShare onEmailFacebookTwitterLinkedInRedditWechat Summary A chatbot is a type of software with artificial intelligence (AI) that is meant to talk like a person, usually over the internet. More and more people use chatbots. Every chatbot is built from the ground up with the main idea that it can have an intelligent conversation with a human user (often through text messages) and respond to their questions in the right way. This chapter gives a detailed literature review of the different ways that chatbot text features can be optimized. In the literature review section, there is also a detailed look at how machine learning techniques can be used to classify texts and group them together. A framework for classifying chatbot text based on machine learning is also shown. The normalization method is used to remove noise during preprocessing. Particle Swarm Optimization is used to improve the features, and KNN, SVM, and Naïve Bayes techniques are used to classify the text. References Ali , A.X. , Morris , M.R. , Wobbrock , J.O. , Crowdsourcing similarity judgments for agreement analysis in end-user elicitation studies , in: The 31st Annual ACM Symposium on User Interface Software and Technology , pp. 177 – 188 , ACM , 2018 . Google Scholar Raghuvanshi , A. , Singh , U. , Sajja , G. , Pallathadka , H. , Asenso , E. , Kamal , M. et al ., Intrusion detection using machine learning for risk mitigation in IoT-enabled smart irrigation in smart farming . J. Of Food Qual. , 2022 , 1 – 8 , 2022 , doi: 10.1155/2022/3955514 . 10.1155/2022/3955514 Web of Science®Google Scholar Hemamalini , V. , Rajarajeswari , S. , Nachiyappan , S. , Sambath , M. , Devi , T. , Singh , B. , Food quality inspection and grading using efficient image segmentation and machine learning-based system . J. Of Food Qual. , 2022 , 1 – 6 , 2022 , doi: 10.1155/2022/5262294 . 10.1155/2022/5262294 Web of Science®Google Scholar Prasad , J.V.D. , Zamani , A.S., K, A. , Naved , M. , Pallathadka , H. , Sammy , F. , Kaliyaperumal , K. , Computational technique based on machine learning and image processing for medical image analysis of breast cancer diagnosis . Secur. Commun. Networks , 2022 , Article ID 1918379, 7 , 2022 , https://doi.org/10.1155/2022/1918379 . 10.1155/2022/1918379 Web of Science®Google Scholar Chaudhury , S. , Krishna , A.N. , Gupta , S. , Sankaran , K.S. , Khan , S. , Sau , K. , Sammy , F. , Effective image processing and segmentation-based machine learning techniques for diagnosis of breast cancer . Comput. Math. Methods Med. , 2022 , Article ID 6841334, 6 , 2022 , https://doi.org/10.1155/2022/6841334 . 10.1155/2022/6841334 Web of Science®Google Scholar Zamani , A.S. , Anand , L. , Rane , K.P. , Prabhu , P. , Buttar , A.M. , Pallathadka , H. , Dugbakie , B.N. , Performance of machine learning and image processing in plant leaf disease detection . J. Food Qual. , 2022 , Article ID 1598796, 7 , 2022 . https://doi.org/10.1155/2022/1598796 . 10.1155/2022/1598796 Web of Science®Google Scholar Abualigah , L.M. , Khader , A.T. , Al-Betar , M.A. , Alomari , O.A. , Text feature selection with a robust weight scheme and dynamic dimension reduction to text document clustering . Expert Syst. Appl. , 84 , 24 – 36 , 2017 . 10.1016/j.eswa.2017.05.002 Web of Science®Google Scholar Aghdam , M.H. , Ghasem-Aghaee , N. , Basiri , M.E. , Text feature selection using ant colony optimization . Expert Syst. Appl. , 36 , 3 , 6843 – 6853 , 2009 . 10.1016/j.eswa.2008.08.022 Web of Science®Google Scholar Alghamdi , H.S. , Tang , H.L. , Alshomrani , S. , Hybrid ACO and TOFA feature selection approach for text classification , in: 2012 IEEE Congress on Evolutionary Computation , IEEE , pp. 1 – 6 , 2012 , June. 10.1109/CEC.2012.6252960 Google Scholar Subanya , B. and Rajalaxmi , R.R. , Feature selection using artificial Bee Colony for cardiovascular disease classification . 2014 International Conference on Electronics and Communication Systems (ICECS) , Coimbatore , pp. 1 – 6 , 2014 , doi: 10.1109/ECS.2014.6892729 . Google Scholar Younus , Z.S. , Mohamad , D. , Saba , T. , Alkawaz , M.H. , Rehman , A. , Al-Rodhaan , M. , Al-Dhelaan , A. , Content-based image retrieval using PSO and k-means clustering algorithm . Arabian J. Geosci. , 8 , 8 , 6211 – 6224 , 2015 . 10.1007/s12517-014-1584-7 Web of Science®Google Scholar Ahmad , S.R. , Yusop , N.M.M. , Bakar , A.A. , Yaakub , M.R. , Statistical analysis for validating ACO-KNN algorithm as feature selection in sentiment analysis , in: AIP Conference Proceedings , AIP Publishing , vol. 1891 , no. 1, p. 020018 , 2017 , October. Google Scholar Zhang , N. , Xiong , J. , Zhong , J. , Thompson , L. , Feature selection method using BPSO-EA with ENN classifier , in: 2018 Eighth International Conference on Information Science and Technology (ICIST) , IEEE , pp. 364 – 369 , 2018 , June. 10.1109/ICIST.2018.8426154 Google Scholar Hamdani , T.M. , Won , J.M. , Alimi , A.M. , Karray , F. , Hierarchical genetic algorithm with new evaluation function and bi-coded representation for the selection of features considering their confidence rate . Appl. Soft Comput. , 11 , 2 , 2501 – 2509 , 2011 . 10.1016/j.asoc.2010.08.020 Web of Science®Google Scholar Suguna , N. and Thanushkodi , K.G. , An independent rough set approach hybrid with artificial bee colony algorithm for dimensionality reduction . Am. J. Appl. Sci. , 8 , 3 , 261 , 2011 . 10.3844/ajassp.2011.261.266 Google Scholar Basiri , M.E. and Nemati , S. , A novel hybrid ACO-GA algorithm for text feature selection , in: 2009 IEEE Congress on Evolutionary Computation , IEEE , pp. 2561 – 2568 , 2009 . Google Scholar Karabulut , M. , Fuzzy unordered rule induction algorithm in text categorization on top of geometric particle swarm optimization term selection . Knowl.-Based Syst. , 54 , 288 – 297 , 2013 . 10.1016/j.knosys.2013.09.020 Web of Science®Google Scholar Iyyer , M. , Manjunatha , V. , Boyd-Graber , J. , Daumé III , H. , Deep unordered composition rivals syntactic methods for text classification , in: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers) , vol. 1 , pp. 1681 – 1691 , 2015 . 10.3115/v1/P15-1162 Google Scholar Acharya , A. , Goel , R. , Metallinou , A. , Dhillon , I. , Online embedding compression for text classification using low rank matrix factorization , in: Proceedings of the AAAI Conference on Artificial Intelligence , vol. 33 , pp. 6196 – 6203 , 2019 . 10.1609/aaai.v33i01.33016196 Google Scholar Vilar , E. , Word embedding, neural networks and text classification: What is the state-of-the-art? Junior Manag. Sci. , 4 , 1 , 35 – 62 , 2019 . Google Scholar Gulin , V.V. and Frolov , A.B. , On the classification of text documents taking into account their structural features . J. Comput. Syst. Sci. Int. , 55 , 3 , 394 – 403 , 2016 . 10.1134/S1064230716030102 Web of Science®Google Scholar Kanika , and Sangeeta , Applying machine learning algorithms for news articles categorization: Using SVM and kNN with TF-IDF approach , in: Smart Computational Strategies: Theoretical and Practical Aspects , A. Luhach , K. Hawari , I. Mihai , P.A. Hsiung , R. Mishra (eds.), Springer , Singapore , 2019 . 10.1007/978-981-13-6295-8_9 Google Scholar Kowsari , K. , Brown , D.E. , Heidarysafa , M. , Meimandi , K.J. , Gerber , M.S. , Barnes , L.E. , Hdltex: Hierarchical deep learning for text classification , in: 2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA) , IEEE , pp. 364 – 371 , 2017 . 10.1109/ICMLA.2017.0-134 Google Scholar Zhang , X. , Zhao , J. , LeCun , Y. , Character-level convolutional networks for text classification , in: Advances in Neural Information Processing Systems , pp. 649 – 657 , 2015 . Google Scholar Lai , S. , Xu , L. , Liu , K. , Zhao , J. , Recurrent convolutional neural networks for text classification , in: Twenty-ninth AAAI conference on artificial intelligence , 2015 . 10.1609/aaai.v29i1.9513 Google Scholar Miao , Y.Q. and Kamel , M. , Pairwise optimized Rocchio algorithm for text categorization . Pattern Recognit. Lett. , 32 , 2 , 375 – 382 , 2011 . 10.1016/j.patrec.2010.09.018 Web of Science®Google Scholar Abasi , A.K. , Khader , A.T. , Al-Betar , M.A. , Naim , S. , Makhadmeh , S.N. , Alyasseri , Z.A.A. , A text feature selection technique based on binary multi-verse optimizer for text clustering , in: 2019 IEEE Jordan International Joint Conference on Electrical Engineering and Information Technology (JEEIT) , IEEE , pp. 1 – 6 , 2019 . 10.1109/JEEIT.2019.8717491 Google Scholar Garg , S. , Perot , V. , Limtiaco , N. , Taly , A. , Chi , E.H. , Beutel , A. , Counterfactual fairness in text classification through robustness , in: Proceedings of the 2019 AAAI/ACM Conference on AI, Ethics, and Society , ACM , pp. 219 – 226 , 2019 . 10.1145/3306618.3317950 Google Scholar Pietramala , A. , Policicchio , V.L. , Rullo , P. , Sidhu , I. , A genetic algorithm for text classification rule induction , in: Joint European Conference on Machine Learning and Knowledge Discovery in Databases , pp. 188 – 203 , Berlin, Heidelberg , Springer , 2008 , September. 10.1007/978-3-540-87481-2_13 Google Scholar Beil , F. , Ester , M. , Xu , X. , Frequent term-based text clustering , in: Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining , ACM , pp. 436 – 442 , 2002 . 10.1145/775047.775110 Google Scholar Aggarwal , C.C. and Zhai , C. , A survey of text classification algorithms , in: Mining Text Data , pp. 163 – 222 , Springer , Boston, MA , 2012 . 10.1007/978-1-4614-3223-4_6 Google Scholar Liu , T. , Liu , S. , Chen , Z. , Ma , W.-Y. , An evaluation on feature selection for text clustering , in: Proceedings of the 20th International Conference on Machine Learning (ICML-03) , pp. 488 – 495 , 2003 . Google Scholar Li , Y. , Luo , C. , Chung , S.M. , Text clustering with feature selection by using statistical data . IEEE Trans. knowl. Data Eng. , 20 , 5 , 641 – 652 , 2008 . 10.1109/TKDE.2007.190740 Web of Science®Google Scholar Zhang , W. , Yoshida , T. , Tang , X. , Wang , Q. , Text clustering using frequent itemsets . Knowl.-Based Syst. , 23 , 5 , 379 – 388 , 2010 . 10.1016/j.knosys.2010.01.011 CASPubMedWeb of Science®Google Scholar Jo , T. , Text Clustering: Approaches , in: Text Mining , pp. 203 – 224 , Springer , Cham , 2019 . 10.1007/978-3-319-91815-0_10 Google Scholar Bharti , K.K. and Singh , P.K. , Chaotic gradient artificial bee colony for text clustering . Soft Comput. , 20 , 3 , 1113 – 1126 , 2016 . 10.1007/s00500-014-1571-7 Web of Science®Google Scholar Mijangos , V. , Sierra , G. , Montes , A. , Sentence level matrix representation for document spectral clustering . Pattern Recognit. Lett. , 85 , 29 – 34 , 2017 . 10.1016/j.patrec.2016.11.008 Web of Science®Google Scholar Xu , Z. and Ke , Y. , Effective and efficient spectral clustering on text and link data , in: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management , ACM , pp. 357 – 366 , 2016 . 10.1145/2983323.2983708 Google Scholar Li , A.-D. , Xue , B. , Zhang , M. , A forward search inspired particle swarm optimization algorithm for feature selection in classification . 2021 IEEE Congress on Evolutionary Computation (CEC) , pp. 786 – 793 , 2021 , doi: 10.1109/CEC45853.2021.9504949 . Google Scholar Javid , J. , Mughal , M.A. , Karim , M. , Using kNN algorithm for classification of distribution transformers health index . 2021 International Conference on Innovative Computing (ICIC) , pp. 1 – 6 , 2021 , doi: 10.1109/ICIC53490.2021.9693013 . Google Scholar Al-Jumaili , S. , Duru , A.D. , Uçan , O.N. , Covid-19 ultrasound image classification using SVM based on kernels deduced from convolutional neural network . 2021 5th International Symposium on Multidisciplinary Studies and Innovative Technologies (ISMSIT) , pp. 429 – 433 , 2021 , doi: 10.1109/ISMSIT52890.2021.9604551 . Google Scholar Li , C. , Chen , L. , Wu , S. , Mo , Y. , Chen , L. , Application research of naive bayes algorithm based on DIKW in weather website . 2021 IEEE 23rd Int Conf on High Performance Computing & Communications; 7th Int Conf on Data Science & Systems; 19th Int Conf on Smart City; 7th Int Conf on Dependability in Sensor, Cloud & Big Data Systems & Application (HPCC/DSS/SmartCity/DependSys) , pp. 2253 – 2257 , 2021 , doi: 10.1109/HPCC-DSS-SmartCity-DependSys53884.2021.00337 . Google Scholar http://www.cs.cmu.edu/~ark/QA-data/ . Google Scholar Mahor , V. , Bijrothiya , S. , Rawat , R. , Kumar , A. , Garg , B. , Pachlasiya , K. , IoT and artificial intelligence techniques for public safety and security , in: Smart Urban Computing Applications , p. 111 , 2023 . Google Scholar Mahor , V. , Pachlasiya , K. , Garg , B. , Chouhan , M. , Telang , S. , Rawat , R. , Mobile operating system (Android) vulnerability analysis using machine learning , in: Proceedings of International Conference on Network Security and Blockchain Technology: ICNSBT 2021 , pp. 159 – 169 , Springer Nature Singapore , Singapore , 2022 , June. 10.1007/978-981-19-3182-6_13 Google Scholar Rawat , R. , Garg , B. , Pachlasiya , K. , Mahor , V. , Telang , S. , Chouhan , M. , Mishra , R. , SCNTA: Monitoring of network availability and activity for identification of anomalies using machine learning approaches . Int. J. Inf. Technol. Web Eng. (IJITWE) , 17 , 1 , 1 – 19 , 2022 . 10.4018/IJITWE.304051 Web of Science®Google Scholar Rawat , R. , Rimal , Y.N. , William , P. , Dahima , S. , Gupta , S. , Sankaran , K.S. , Malware threat affecting financial organization analysis using machine learning approach . Int. J. Inf. Technol. Web Eng. (IJITWE) , 17 , 1 , 1 – 20 , 2022 . 10.4018/IJITWE.304051 Web of Science®Google Scholar Rawat , R. , Mahor , V. , Chouhan , M. , Pachlasiya , K. , Telang , S. , Garg , B. , Systematic literature review (SLR) on social media and the digital transformation of drug trafficking on darkweb , in: International Conference on Network Security and Blockchain Technology , pp. 181 – 205 , Springer , Singapore , 2022 . 10.1007/978-981-19-3182-6_15 Google Scholar Rawat , R. , Ayodele Oki , O. , Sankaran , S. , Florez , H. , Ajagbe , S.A. , Techniques for predicting dark web events focused on the delivery of illicit products and ordered crime . Int. J. Electr. Comput. Eng. (IJECE) , 13 , 5 , 5354 – 5365 , Oct. 2023 , doi: 10.11591/ijece.v13i5.pp5354-5365 . 10.11591/ijece.v13i5.pp5354-5365 Google Scholar Rawat , R. , Garg , B. , Mahor , V. , Telang , S. , Pachlasiya , K. , Chouhan , M. , Organ trafficking on the dark web—The data security and privacy concern in healthcare systems , in: Internet of Healthcare Things: Machine Learning for Security and Privacy , pp. 189 – 216 , 2022 . 10.1002/9781119792468.ch9 Google Scholar Vyas , P. , Vyas , G. , Chauhan , A. , Rawat , R. , Telang , S. , Gottumukkala , M. , Anonymous trading on the dark online marketplace: An exploratory study , in: Using Computational Intelligence for the Dark Web and Illicit Behavior Detection , pp. 272 – 289 , IGI Global , 2022 . 10.4018/978-1-6684-6444-1.ch015 Google Scholar Rawat , R. , Oki , O.A. , Sankaran , K.S. , Olasupo , O. , Ebong , G.N. , Ajagbe , S.A. , A new solution for cyber security in big data using machine learning approach , in: Mobile Computing and Sustainable Informatics: Proceedings of ICMCSI 2023 , pp. 495 – 505 , Springer Nature Singapore , Singapore , 2023 . 10.1007/978-981-99-0835-6_35 Google Scholar Rawat , R. , Chakrawarti , R.K. , Raj , A. , Mani , G. , Chidambarathanu , K. , Bhardwaj , R. , Association rule learning for threat analysis using traffic analysis and packet filtering approach . Int. J. Inf. Technol. , 1 – 11 , 2023 . Google Scholar Rawat , R. , Logical concept mapping and social media analytics relating to cyber criminal activities for ontology creation . Int. J. Inf. Technol. , 15 , 2 , 893 – 903 , 2023 . Google Scholar Rawat , R. , Mahor , V. , Álvarez , J.D. , Ch , F. , Cognitive systems for dark web cyber delinquent association malignant data crawling: A review , in: Handbook of Research on War Policies, Strategies, and Cyber Wars , pp. 45 – 63 , 2023 . 10.4018/978-1-6684-6741-1.ch003 Google Scholar Rawat , R. , Chakrawarti , R.K. , Vyas , P. , Gonzáles , J.L.A. , Sikarwar , R. , Bhardwaj , R. , Intelligent fog computing surveillance system for crime and vulnerability identification and tracing . Int. J. Inf. Secur. Priv. (IJISP) , 17 , 1 , 1 – 25 , 2023 . 10.4018/IJISP.317371 Web of Science®Google Scholar R. Rawat , A.M. Sowjanya , S.I. Patel , V. Jaiswal , I. Khan , A. Balaram (Eds.), Using Machine Intelligence: Autonomous Vehicles Volume 1 , John Wiley & Sons , 2022 . Google Scholar Rawat , R. , Mahor , V. , Díaz-Álvarez , J. , Chávez , F. , Rooted learning model at fog computing analysis for crime incident surveillance , in: 2022 International Conference on Smart Generation Computing, Communication and Networking (SMART GENCON) , pp. 1 – 9 , IEEE , 2022 , December. 10.1109/SMARTGENCON56628.2022.10084316 Google Scholar Rawat , R. and Shrivastav , S.K. , SQL injection attack detection using SVM . Int. J. Comput. Appl. , 42 , 13 , 1 – 4 , 2012 . Google Scholar Rawat , R. , Bhardwaj , P. , Kaur , U. , Telang , S. , Chouhan , M. , Sankaran , K.S. , Smart vehicles for communication, volume 2 , John Wiley & Sons , 2023 . Google Scholar Mahor , V. , Garg , B. , Telang , S. , Pachlasiya , K. , Chouhan , M. , Rawat , R. , Cyber threat phylogeny assessment and vulnerabilities representation at thermal power station , in: Proceedings of International Conference on Network Security and Blockchain Technology: ICNSBT 2021 , pp. 28 – 39 , Springer Nature Singapore , Singapore , 2022 , June. 10.1007/978-981-19-3182-6_3 Google Scholar Rawat , R. , Gupta , S. , Sivaranjani , S. , Cu , O.K. , Kuliha , M. , Sankaran , K.S. , Malevolent information crawling mechanism for forming structured illegal organisations in hidden networks . Int. J. Cyber Warf. Terror. (IJCWT) , 12 , 1 , 1 – 14 , 2022 . 10.4018/IJCWT.311422 Web of Science®Google Scholar Conversational Artificial Intelligence ReferencesRelatedInformation
Publication Year: 2024
Publication Date: 2024-01-27
Language: en
Type: other
Indexed In: ['crossref']
Access and Citation
Cited By Count: 1
AI Researcher Chatbot
Get quick answers to your questions about the article from our AI researcher chatbot