Business Card Data Extraction
• Primary Goal
: To develop a system that can extract structured data from business cards.
•
Text from images were extracted using
Pytesseract
and
data wrangling
was carried out.
•
NER
data was labelled manually using BIO tagging.
•
Prepared training data for
Spacy
and data converted into spacy format.
•
Configured and trained
NER model
.
•
Created
data pipeline
for parsing text.
•
Built a
document scanner web app
using
Flask
.