International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
|
Volume 185 - Issue 35 |
Published: Sep 2023 |
Authors: Ahmad Farhan Alshammari |
![]() |
Ahmad Farhan Alshammari . Implementation of Keyword Extraction using Term Frequency-Inverse Document Frequency (TF-IDF) in Python. International Journal of Computer Applications. 185, 35 (Sep 2023), 9-14. DOI=10.5120/ijca2023923137
@article{ 10.5120/ijca2023923137, author = { Ahmad Farhan Alshammari }, title = { Implementation of Keyword Extraction using Term Frequency-Inverse Document Frequency (TF-IDF) in Python }, journal = { International Journal of Computer Applications }, year = { 2023 }, volume = { 185 }, number = { 35 }, pages = { 9-14 }, doi = { 10.5120/ijca2023923137 }, publisher = { Foundation of Computer Science (FCS), NY, USA } }
%0 Journal Article %D 2023 %A Ahmad Farhan Alshammari %T Implementation of Keyword Extraction using Term Frequency-Inverse Document Frequency (TF-IDF) in Python%T %J International Journal of Computer Applications %V 185 %N 35 %P 9-14 %R 10.5120/ijca2023923137 %I Foundation of Computer Science (FCS), NY, USA
The goal of this research is to develop a keyword extraction program using Term Frequency-Inverse Document Frequency (TF-IDF) in Python. The purpose of keyword extraction is to identify the set of words (keywords) that describe the content of the text. The TF-IDF method is used to measure the importance of words in the text. The basic steps of keyword extraction are explained: preprocessing text, creating list of words, creating bag of words, creating word frequency (TF), creating inverse document frequency (IDF), creating word frequency-inverse document frequency (TF-IDF), creating keywords, and sorting keywords. The developed program was tested on an experimental text from Wikipedia. The program successfully performed the basic steps of keyword extraction and provided the required results.