Tags

Type your tag names separated by a space and hit enter

Graph-based extractive text summarization method for Hausa text.
PLoS One. 2023; 18(5):e0285376.Plos

Abstract

Automatic text summarization is one of the most promising solutions to the ever-growing challenges of textual data as it produces a shorter version of the original document with fewer bytes, but the same information as the original document. Despite the advancements in automatic text summarization research, research involving the development of automatic text summarization methods for documents written in Hausa, a Chadic language widely spoken in West Africa by approximately 150,000,000 people as either their first or second language, is still in early stages of development. This study proposes a novel graph-based extractive single-document summarization method for Hausa text by modifying the existing PageRank algorithm using the normalized common bigrams count between adjacent sentences as the initial vertex score. The proposed method is evaluated using a primarily collected Hausa summarization evaluation dataset comprising of 113 Hausa news articles on ROUGE evaluation toolkits. The proposed approach outperformed the standard methods using the same datasets. It outperformed the TextRank method by 2.1%, LexRank by 12.3%, centroid-based method by 19.5%, and BM25 method by 17.4%.

Authors+Show Affiliations

School of Computing, Universiti Teknologi Malaysia, Johor, Malaysia.School of Computing, Universiti Teknologi Malaysia, Johor, Malaysia.School of Computing, Universiti Teknologi Malaysia, Johor, Malaysia.School of Computing, Universiti Teknologi Malaysia, Johor, Malaysia.Department of Computer Science, Yusuf Maitama Sule University, Kano, Nigeria.

Pub Type(s)

Journal Article
Research Support, Non-U.S. Gov't

Language

eng

PubMed ID

37159449

Citation

Bichi, Abdulkadir Abubakar, et al. "Graph-based Extractive Text Summarization Method for Hausa Text." PloS One, vol. 18, no. 5, 2023, pp. e0285376.
Bichi AA, Samsudin R, Hassan R, et al. Graph-based extractive text summarization method for Hausa text. PLoS One. 2023;18(5):e0285376.
Bichi, A. A., Samsudin, R., Hassan, R., Hasan, L. R. A., & Ado Rogo, A. (2023). Graph-based extractive text summarization method for Hausa text. PloS One, 18(5), e0285376. https://doi.org/10.1371/journal.pone.0285376
Bichi AA, et al. Graph-based Extractive Text Summarization Method for Hausa Text. PLoS One. 2023;18(5):e0285376. PubMed PMID: 37159449.
* Article titles in AMA citation format should be in sentence-case
TY - JOUR T1 - Graph-based extractive text summarization method for Hausa text. AU - Bichi,Abdulkadir Abubakar, AU - Samsudin,Ruhaidah, AU - Hassan,Rohayanti, AU - Hasan,Layla Rasheed Abdallah, AU - Ado Rogo,Abubakar, Y1 - 2023/05/09/ PY - 2021/11/27/received PY - 2023/04/23/accepted PY - 2023/5/11/medline PY - 2023/5/9/pubmed PY - 2023/5/9/entrez SP - e0285376 EP - e0285376 JF - PloS one JO - PLoS One VL - 18 IS - 5 N2 - Automatic text summarization is one of the most promising solutions to the ever-growing challenges of textual data as it produces a shorter version of the original document with fewer bytes, but the same information as the original document. Despite the advancements in automatic text summarization research, research involving the development of automatic text summarization methods for documents written in Hausa, a Chadic language widely spoken in West Africa by approximately 150,000,000 people as either their first or second language, is still in early stages of development. This study proposes a novel graph-based extractive single-document summarization method for Hausa text by modifying the existing PageRank algorithm using the normalized common bigrams count between adjacent sentences as the initial vertex score. The proposed method is evaluated using a primarily collected Hausa summarization evaluation dataset comprising of 113 Hausa news articles on ROUGE evaluation toolkits. The proposed approach outperformed the standard methods using the same datasets. It outperformed the TextRank method by 2.1%, LexRank by 12.3%, centroid-based method by 19.5%, and BM25 method by 17.4%. SN - 1932-6203 UR - https://www.unboundmedicine.com/medline/citation/37159449/Graph_based_extractive_text_summarization_method_for_Hausa_text_ DB - PRIME DP - Unbound Medicine ER -