18 Application example photo OCR
18-1 Problem description and pipeline
The photo OCR problem
1.Text detection
2.Character segmentation
3.Character classification (recognition)
4.*Spelling correction
Photo OCR pipeline
18-2 Sliding windows
Text detection | Pedestrian detection
Supervised learning for pedestrian detection
Sliding window detection
Text detection
1D Sliding window for character segmentation
18-3 Getting lots of data: Artificial data synthesis
Character recognition
Artificial data synthesis for photo OCR
Synthesizing data by introducing distortions
Discussion on getting more data
-
Make sure you have a low bias classifier before expending the effort(Plot learning curves). E.g. keep increasing the number of features/number of hidden units in neural network until you have a low bias classifier -
How much work would it be to get 10x as much data as we currently have?
-
Artificial data synthesis -
Collect/label it yourself -
Crowd source"(E.g. Amazon Mechanical Turk)
18-4 Ceiling analysis What part of the pipeline to work on next
Estimating the errors due to each component(ceiling analysis
What part of the pipeline should you spend the most time trying to improve?
Another ceiling analysis example Face recognition from images (Artificial example
19 Conclusion-Summary and Thank you
Summary: Main topics Supervised Learning -Linear regression, logistic regression, neural networks, SVMS Unsupervised Learning -K-means, PCA, Anomaly detection Special applications/special topics -Recommender systems, large scale machine learning Advice on building a machine learning system -Bias/variance, regularization; deciding what to work on next: evaluation of learning algorithms, learning curves, error analysis, ceiling analysis
|