Publication:
Automatic Text Summarization for Hindi Using Real Coded Genetic Algorithm

dc.contributor.author: Jain, Arti
dc.contributor.author: Arora, Anuja
dc.contributor.author: Morato Lara, Jorge Luis
dc.contributor.author: Yadav, Divakar
dc.contributor.author: Kumar, Kumar Vimal
dc.date.accessioned: 2023-10-24T07:10:29Z
dc.date.available: 2023-10-24T07:10:29Z
dc.date.issued: 2022-06-01
dc.description.abstract: Automatic Text Summarization (ATS) is in great demand as a way to cope with the ever-growing volume of text data available online and to discover relevant information faster. This research proposes an ATS methodology for the Hindi language using a Real Coded Genetic Algorithm (RCGA) over a health corpus available as a Kaggle dataset. The methodology comprises five phases: preprocessing, feature extraction, processing, sentence ranking, and summary generation. Rigorous experimentation is performed on varied feature sets, in which two distinguishing features, sentence similarity and named entity, are combined with others for computing the evaluation metrics. The top 14 feature combinations are evaluated through the Recall-Oriented Understudy for Gisting Evaluation (ROUGE) measure. RCGA computes appropriate feature weights through strings of features, chromosome selection, and two reproduction operators: Simulated Binary Crossover and Polynomial Mutation. To extract the highest-scored sentences as the corpus summary, different compression rates are tested. In comparison with existing summarization tools, the ATS extractive method gives a summary reduction of 65%. [es]
dc.identifier.bibliographicCitation: Jain, A.; Arora, A.; Morato, J.; Yadav, D.; Kumar, K.V. Automatic Text Summarization for Hindi Using Real Coded Genetic Algorithm. Appl. Sci. 2022, 12, 6584. https://doi.org/10.3390/app12136584 [en]
dc.identifier.doi: https://doi.org/10.3390/app12136584
dc.identifier.issn: 2076-3417
dc.identifier.publicationfirstpage: 1
dc.identifier.publicationissue: 13
dc.identifier.publicationlastpage: 23
dc.identifier.publicationtitle: Applied Sciences [en]
dc.identifier.publicationvolume: 12
dc.identifier.uri: https://hdl.handle.net/10016/38657
dc.identifier.uxxi: AR/0000031148
dc.language.iso: eng [en]
dc.publisher: MDPI [en]
dc.rights: © 2022 by the authors. [en]
dc.rights: Atribución 3.0 España (Attribution 3.0 Spain) [*]
dc.rights.accessRights: open access [es]
dc.rights.uri: http://creativecommons.org/licenses/by/3.0/es/
dc.subject.eciencia: Informática (Computer Science) [es]
dc.subject.other: automatic text summarization [en]
dc.subject.other: extractive summary [en]
dc.subject.other: feature set [en]
dc.subject.other: hindi language [en]
dc.subject.other: Hindi [en]
dc.subject.other: health data [en]
dc.subject.other: named entity [en]
dc.subject.other: real coded genetic algorithm [en]
dc.subject.other: rouge metric [en]
dc.subject.other: summarization tool [en]
dc.title: Automatic Text Summarization for Hindi Using Real Coded Genetic Algorithm [en]
dc.type: research article [*]
dc.type.hasVersion: VoR [*]
dspace.entity.type: Publication
Files
Original bundle
Name: automatic_AS_2022.pdf
Size: 2.2 MB
Format: Adobe Portable Document Format