Google Drive Blog
The latest news and updates from the Google Drive team.
Optical character recognition (OCR) in Google Docs
Tuesday, June 22, 2010
A couple of months ago, my co-worker, Mike, showed up at my desk with a pile of paper, each of the yellowed sheets densely covered with an ancient-looking typewriter font. His wife had recently discovered parts of her family chronicles in the attic, typed up by her grandmother many years ago! Now he was wondering if there was a way for her to continue writing the chronicles in Google Docs.
The papers sat on my desk for a while, but recently, I returned them to Mike with a smile, cheerfully telling him that what started as my
20% project
is now ready for everyone to use -- Google Docs now officially supports importing scanned documents. What we launched as an
experimental feature for the Documents List Data API
last year is now available on the upload page: check the “Convert text from PDF or image files to Google Docs documents”, upload your scanned images (JPEG, GIF, PNG) or PDFs, and Google Docs will extract text and formatting from the scans for you to edit away.
For the technically curious: we’re using
Optical Character Recognition
(OCR) that our friends from
Google Books
helped us set up. OCR works best with high-resolution images, and not all formatting may be preserved. The original images will be included in the new document to make it easier for you to correct mistakes. Supported languages include English, French, Italian, German and Spanish, with more languages and character sets on their way. We’re looking forward to get feedback from you while we keep improving the feature over the next months.
And Mike’s scanned family chronicles have even been extended by an additional chapter in Google Docs: his wife recently had a baby boy named James!
Posted by: Jaron Schaeffer, Software Engineer, Google Docs
Labels
#SafeOnline
accessibility
add-ons
Android
app scripts
apps
attachments
avery
back to school
blind
braille
charts
chat
Chrome
Chrome extensions
chrome web apps
Cloud Connect
collaboration
comments
community
discussions
docs
docs editors
document list
documents
documents list
drawings
Drive
drivebacktoschool
easybib
education
enterprise
Faces of Docs
folders
forms
gmail
gone google
Google Apps Blog
Google Apps Script
Google Cloud Connect
google docs
Google Docs Viewer
google documents
google drive
Google Drive Blog
Google Pack
Google Sites
Google+
googlenew
Guest Post
hangout on air
help
holiday
images
iOS
Keep
letterfeed
low-vision
mailchimp
mobile
nanowrimo
OCR
office compatibility mode
offline
paperless
pdfs
photo
photos
presentations
product ideas
profiles
quickoffice
Reddit
research
save to drive
screen reader
scripts
security
sharing
sheet
sheets
shortcut
slides
spell check
spreadsheets
stock photos
storage
students
suggested edits
tables
teachers
team
templates
videos
Viewer
work
Archive
2016
Sep
May
Apr
Feb
Jan
2015
Dec
Nov
Oct
Jul
Mar
Feb
Jan
2014
Dec
Nov
Oct
Sep
Aug
Jun
Apr
Mar
Jan
2013
Dec
Nov
Oct
Sep
Aug
Jun
May
Apr
Mar
Feb
Jan
2012
Dec
Nov
Oct
Sep
Aug
Jul
Jun
May
Apr
Mar
Feb
2011
Dec
Nov
Oct
Sep
Aug
Jul
Jun
May
Apr
Mar
Feb
Jan
2010
Dec
Nov
Oct
Sep
Aug
Jul
Jun
May
Apr
Mar
Feb
Jan
2009
Dec
Nov
Oct
Sep
Aug
Jul
Jun
May
Apr
Mar
Feb
Jan
2008
Dec
Nov
Oct
Sep
Aug
Jul
Jun
May
Apr
Mar
Feb
Jan
2007
Dec
Nov
Oct
Sep
Aug
Jul
Jun
May
Apr
Mar
Feb
Jan
2006
Dec
Nov
Oct
Feed
Visit our site
Google Drive
Google Docs, Sheets, Slides
Google
on
Follow @googledrive
Give us feedback in our
Product Forum
.