Photo OCR (Optical Character Recognition)

September 7, 2019 less than 1 minute read

Tags: coursera-machine-learning, getting-more-data

Photo OCR: Problem Description and Pipeline

OCR: Optical Character Recognition

OCR example: Sliding Windows
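
A minimal sketch of the sliding-window idea, assuming a toy binary image stored as a list of rows. The names `sliding_windows`, `detect`, and `all_ink` are hypothetical, and `all_ink` is only a stand-in for a trained text / non-text classifier:

```python
def sliding_windows(image_height, image_width, window_height, window_width, step):
    """Yield (top, left, bottom, right) boxes that fit inside the image."""
    for top in range(0, image_height - window_height + 1, step):
        for left in range(0, image_width - window_width + 1, step):
            yield (top, left, top + window_height, left + window_width)


def detect(image, window_height, window_width, step, classifier):
    """Return the boxes whose patch the classifier flags as positive."""
    hits = []
    boxes = sliding_windows(len(image), len(image[0]),
                            window_height, window_width, step)
    for top, left, bottom, right in boxes:
        patch = [row[left:right] for row in image[top:bottom]]
        if classifier(patch):
            hits.append((top, left, bottom, right))
    return hits


# Toy 4x6 "image": 1 marks ink, 0 marks background.
image = [
    [0, 0, 0, 0, 0, 0],
    [0, 0, 1, 1, 0, 0],
    [0, 0, 1, 1, 0, 0],
    [0, 0, 0, 0, 0, 0],
]


def all_ink(patch):
    """Hypothetical stand-in classifier: positive if every pixel is ink."""
    return all(all(pixel == 1 for pixel in row) for row in patch)


boxes = detect(image, window_height=2, window_width=2, step=1, classifier=all_ink)
print(boxes)
```

A larger `step` scans fewer windows and runs faster, at the cost of possibly skipping over a match.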

Artificial data synthesis
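
A minimal sketch of one way to synthesize extra examples from an existing one, assuming small translations are a plausible distortion for the task. The helper names are hypothetical, and shifting is only a simple stand-in for richer distortions such as warping:

```python
import random


def shift_right(image, pixels, fill=0):
    """Shift every row right by `pixels`, padding the left edge with `fill`."""
    width = len(image[0])
    return [([fill] * pixels + row)[:width] for row in image]


def shift_down(image, pixels, fill=0):
    """Shift the image down by `pixels`, padding the top with `fill` rows."""
    height, width = len(image), len(image[0])
    blank = [[fill] * width for _ in range(pixels)]
    return (blank + [row[:] for row in image])[:height]


def synthesize(image, label, copies, rng):
    """Create `copies` shifted variants of `image`, all keeping the same label."""
    examples = []
    for _ in range(copies):
        distorted = shift_right(image, rng.randint(0, 1))
        distorted = shift_down(distorted, rng.randint(0, 1))
        examples.append((distorted, label))
    return examples


# Toy 3x3 glyph; a seeded generator keeps the run repeatable.
glyph = [
    [0, 1, 0],
    [0, 1, 0],
    [0, 0, 0],
]
augmented = synthesize(glyph, "I", copies=4, rng=random.Random(0))
print(len(augmented))
```

The label is carried over unchanged, so the distortion has to be one that does not change what the example shows.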

Discussion on getting more data

Make sure you have a low-bias classifier before expending the effort (plot learning curves). E.g., keep increasing the number of features or the number of hidden units in the neural network until you have a low-bias classifier.

“How much work would it be to get 10x as much data as we currently have?”

Artificial data synthesis

Collect / label it yourself

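 # Sometimes it really pays to just sit down and label the data yourself.
 # Worked out carefully, it is only a day or two (or even a few hours) of work,
 # yet it can make the model much better.
 #
 # ex: M = 1,000 examples
 #     labeling one example by hand takes 10 seconds
 #     total: 1,000 * 10 seconds = 10,000 seconds (about 2.8 hours)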

“Crowd source” (E.g. Amazon Mechanical Turk)
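
The labeling estimate in the note above (1,000 examples at 10 seconds each) can be worked out with a small helper. This is only a back-of-the-envelope sketch; `labeling_hours` is a hypothetical name, and the 10x figure simply scales the same assumption:

```python
def labeling_hours(num_examples, seconds_per_example):
    """Estimate the hours needed to hand-label a dataset."""
    return num_examples * seconds_per_example / 3600


current = labeling_hours(1000, 10)         # 10,000 seconds
ten_times = labeling_hours(10 * 1000, 10)  # 100,000 seconds

print(f"1,000 examples:  {current:.1f} hours")
print(f"10,000 examples: {ten_times:.1f} hours")
```

At this rate, 10x the data is a few working days of labeling.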

Ceiling analysis: What part of the pipeline to work on next
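
A minimal sketch of how a ceiling analysis can be tabulated. The stage names and accuracies below are illustrative assumptions, not figures taken from this post. Each accuracy is the overall system accuracy when that stage and every earlier stage are given ground-truth outputs:

```python
def ceiling_analysis(baseline, accuracy_with_perfect_stage):
    """Return (stage, gain) pairs, where gain is the overall accuracy
    improvement from making that stage perfect, given that all earlier
    stages are already perfect."""
    gains = []
    previous = baseline
    for stage, accuracy in accuracy_with_perfect_stage:
        gains.append((stage, round(accuracy - previous, 4)))
        previous = accuracy
    return gains


# Illustrative numbers only (assumed for this sketch).
baseline_accuracy = 0.72
stages = [
    ("text detection", 0.89),
    ("character segmentation", 0.90),
    ("character recognition", 1.00),
]

gains = ceiling_analysis(baseline_accuracy, stages)
best_stage = max(gains, key=lambda pair: pair[1])[0]

for stage, gain in gains:
    print(f"{stage}: +{gain:.2f}")
print("work on next:", best_stage)
```

The stage with the largest gain is the one where improvement has the most room to raise overall accuracy.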

Face Recognition Example

Conclusion: Summary and Thank you

I did great!!!!!!!!!!!!

Supervised Learning

Linear regression, logistic regression, neural networks, SVMs

Unsupervised Learning

K-means, PCA, Anomaly detection

Special applications/special topics

Recommender systems, large scale machine learning

Advice on building a machine learning system

Bias/variance, regularization; deciding what to work on next: evaluation of learning algorithms, learning curves, error analysis, ceiling analysis
