Master's Thesis : Development of server side document processing and OCR services

Master's Thesis : Development of server side document processing and OCR services

Maréchal, Grégory

Date de soutenance : 7-sep-2020/9-sep-2020 • URL permanente : `http://hdl.handle.net/2268.2/10882`

Détails

Titre :	Master's Thesis : Development of server side document processing and OCR services
Titre traduit :	[fr] Développement de services de traitement de documents et de services de reconnaissance optique des caractères côté serveur
Auteur :	Maréchal, Grégory
Date de soutenance :	7-sep-2020/9-sep-2020
Promoteur(s) :	Leduc, Guy
Membre(s) du jury :	Boigelot, Bernard Donnet, Benoît Hannay, Sébastien
Langue :	Anglais
Nombre de pages :	55 (65 avec annexes)
Mots-clés :	[en] Android mobile [en] Spring java server [en] deep learning [en] classification [en] online training [en] image processing
Discipline(s) :	Ingénierie, informatique & technologie > Ingénierie civile
Intitulé du projet de recherche :	Self training classification of medical documents for a distributed mobile application
Public cible :	Professionnels du domaine Etudiants
URL complémentaire :	https://www.andaman7.com/fr
Institution(s) :	Université de Liège, Liège, Belgique
Diplôme :	Master : ingénieur civil en informatique, à finalité spécialisée en "management"
Faculté :	Mémoires de la Faculté des Sciences appliquées

Résumé

[en] Andaman7 is the name of a company and of a mobile app whose goal is to empower patients (medical term) by giving them easier access and more control on their medical data. However, the processes currently in place to import this data into the application are long and/or tedious. In this project, we will start an exploration of the possibility to use machine learning algorithms in order to automate as much as possible the process of importing data.

To do so, we will implement what will be called the dataflow, which is a complete data processing scheme, including front-end and back-end services, allowing the user to send data for automated metadata extraction, but also to review samples for which the machine learning algorithm would not be confident. This last element will allow Andaman7 to rely on online training to compensate for the lack of data.

The dataflow will then be completed with an actual machine learning algorithm which will be used to classify the sent samples. Finally, the conclusion will include a short discussion about what could be done to extract more metadata from the samples than just the class.

Fichier(s)

Document(s)

Master_Thesis.pdf
Description: Main document
Taille: 3.53 MB
Format: Adobe PDF

Demander un tiré à part

Summary.pdf
Description: Summary
Taille: 57.9 kB
Format: Adobe PDF

Demander un tiré à part

Annexe(s)

picture_1.png
Description: picture 1
Taille: 217.72 kB
Format: image/png

Demander un tiré à part

picture_2.jpg
Description: picture 2
Taille: 134.63 kB
Format: JPEG

Demander un tiré à part

picture_3.jpg
Description: picture 3
Taille: 120.37 kB
Format: JPEG

Demander un tiré à part

picture_4.png
Description: picture 4
Taille: 77.34 kB
Format: image/png

Demander un tiré à part

picture_5.jpg
Description: picture 5
Taille: 272.84 kB
Format: JPEG

Demander un tiré à part

picture_6.jpg
Description: picture 6
Taille: 70.81 kB
Format: JPEG

Demander un tiré à part

Citer ce mémoire

Tous les documents disponibles sur MatheO sont protégés par le droit d'auteur et soumis aux règles habituelles de bon usage.
L'Université de Liège ne garantit pas la qualité scientifique de ces travaux d'étudiants ni l'exactitude de l'ensemble des informations qu'ils contiennent.

Mémoire

Master's Thesis : Development of server side document processing and OCR services

Maréchal, Grégory

Promoteur(s) : Leduc, Guy

Date de soutenance : 7-sep-2020/9-sep-2020 • URL permanente : http://hdl.handle.net/2268.2/10882

Détails

Résumé

Fichier(s)

Document(s)

Annexe(s)

Auteur

Promoteur(s)

Membre(s) du jury

Citer ce mémoire

Date de soutenance : 7-sep-2020/9-sep-2020 • URL permanente : `http://hdl.handle.net/2268.2/10882`