Performance Evaluation of Recurrent Neural Network on Large-Scale Translated Dataset for Question Generation in NLP for Educational Purposes

dc.contributor.authorMamani Maquera, Fidel
dc.contributor.authorPaz Valderrama, Alfredo
dc.contributor.authorCastro Gutierrez, Eveling
dc.date.accessioned2019-08-17T03:07:59Z
dc.date.accessioned2022-02-22T12:02:42Z
dc.date.available2019-08-17T03:07:59Z
dc.date.available2022-02-22T12:02:42Z
dc.date.issued2019-07
dc.description.abstractIn recent years, neural networks have been used widely to solve many NLP tasks that involve large-scale datasets. Recently, Question Generation (QG) has called great attention since it is a subtask of Question Answering (QA) that has many applications in the real world, mainly for educational purposes. The importance of it could be seen on many recently released large-scale datasets prepared exclusively for this task, most the data used in NLP are available in the English language, but it is not the case for the rest of the languages, like Spanish, which is the third most used language in the world. This research is focused on analyzing the performance of current state-of-the-art neural network models used in QG using translated Spanish large-scale dataset from English. To know the accuracy of the translated Spanish data from English, it has been used state-of-the-art OpenNMT machine translator and Google Translation API, then the results have been analyzed with the corresponding automatic metrics - BLEU, METEOR, ROUGE - and human evaluations such as fluency and adequacy, later, it has been trained a state-of-the-art question generation (QG) neural network model using Spanish translated data to generate automatic questions in Spanish language. Surprisingly, the results outperform the original English results in average 37% on all automatic evaluation metrics. To the best of our knowledge, this work is the first one using large-scale Spanish translated data for QG task using recurrent neural networks for educational purposes.en_US
dc.description.countryPeruen
dc.description.institutionUniversidad Nacional de San Agustín de Arequipaen
dc.description.trackInformation Technology, Technology Management, Ethics, Technology and Societyen
dc.identifier.isbn978-958-52071-4-1
dc.identifier.issn2414-6390
dc.identifier.otherhttp://laccei.org/LACCEI2019-MontegoBay/meta/FP178.html
dc.identifier.urihttp://dx.doi.org/10.18687/LACCEI2019.1.1.178
dc.identifier.urihttps://axces.info/handle/10.18687/20190101_178
dc.journal.referatopeerReview
dc.language.isoEnglishen_US
dc.publisherLACCEI, Inc.en_US
dc.rightsLACCEI License
dc.rights.urihttps://laccei.org/blog/copyright-laccei-papers/
dc.subjectgoogle translationen_US
dc.subjectrecurrent neural network (RNN)en_US
dc.subjectnatural language processingen_US
dc.subjecttranslated dataen_US
dc.subjectsquad dataseten_US
dc.titlePerformance Evaluation of Recurrent Neural Network on Large-Scale Translated Dataset for Question Generation in NLP for Educational Purposes
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
FP178.pdf
Size:
390.95 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
64 B
Format:
Plain Text
Description: