Näytä suppeat kuvailutiedot

Named Entity Recognition : Deep Learning with Automated Pipeline for Lead Processing

Nguyen, Phan Khanh (2020)

dc.contributor.authorNguyen, Phan Khanh
dc.date.accessioned2020-05-11T10:25:21Z
dc.date.available2020-05-11T10:25:21Z
dc.date.issued2020-
dc.identifier.urihttp://www.theseus.fi/handle/10024/337561
dc.description.abstractOver the past few years, various breakthroughs have been made in many artificial intelligence tasks due to the increasing popularity of artificial neural networks. Named entity recognition is a subtask of natural language processing, in which the aim is to detect and extract potential named entities from unstructured text. The goal of this thesis is to develop a functional Named Entity Recognition system using an artificial neural network for the company Vainu. The end model was constructed by using different architectures of artificial neural networks, such as Recurrent Neural Network and Convolutional Neural Network. Some methods of transfer learning such as word embeddings were also applied. The trained model was then deployed as a microservice using Python and Docker. A training pipeline for the Named Entity Recognition model consisting of a continuous integration system with automated building and testing processes was also implemented. Through many experiments and testing, the objective of this thesis was accomplished. The final model was able to perform the entity extracting task with high accuracy. With the new Named Entity Recognition application, Vainu gets a new AI that can be freely adapted to suit its requirements, increases the matching performance of the company and reduces the operation expense compared to using third-party software. The training pipeline was also implemented in a highly scalable way to ensure that new models for new languages can be added to the system with ease if necessary.-
dc.language.isoeng-
dc.rightsCC BY-NC 4.0-
dc.titleNamed Entity Recognition : Deep Learning with Automated Pipeline for Lead Processing-
dc.type.ontasotfi=AMK-opinnäytetyö|sv=YH-examensarbete|en=Bachelor's thesis|-
dc.identifier.urnURN:NBN:fi:amk-202005118219-
dc.subject.specializationSoftware Engineering-
dc.subject.degreeprogramfi=Tieto- ja viestintätekniikka|sv=Informations- och kommunikationsteknik|en=Information and Communications Technology|-
dc.subject.ysoNLP-
dc.subject.ysonamed-entity recognition-
dc.subject.disciplineDegree Programme in Information Technology-


Tiedostot

Thumbnail

Viite kuuluu kokoelmiin:

Näytä suppeat kuvailutiedot