OCR in Python with pytesseract: A Worked Example

2022-06-02 22:52


OCR in Python: pytesseract

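pytesseract is a common choice in Python for recognizing text in images, that is, OCR. The complete code is short, just the few lines below, but in practice the environment setup is where things tend to go wrong.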


from PIL import Image
import pytesseract

# Open the image with Pillow and pass it to Tesseract for text recognition.
text = pytesseract.image_to_string(Image.open('/Users/alice/Documents/Develop/PythonCode/textinphoto.PNG'))
print(text)

Before running it, install the pillow and pytesseract packages (for example, with pip3 install pillow pytesseract).

Even with both packages installed, the script still fails at runtime:
raise TesseractNotFoundError()
pytesseract.pytesseract.TesseractNotFoundError: tesseract is not installed or it's not in your path

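The cause is that the Tesseract OCR engine itself is not installed; pytesseract is only a wrapper that calls the tesseract executable. Running pip3 install tesseract does not fix this: pip reports the package as installed, yet the tesseract command is still missing, as the session below shows: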


alicedembp:~ alice$ pip3 install tesseract
Requirement already satisfied: tesseract in /Library/Frameworks/Python.framework/Versions/3.7/lib/python3.7/site-packages (0.1.3)
alicedembp:~ alice$ tesseract
-bash: tesseract: command not found
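The session above suggests that the pip package named tesseract does not put a tesseract executable on the PATH. A minimal sketch for checking this from Python before calling pytesseract, using only the standard library. The helper names find_tesseract and describe_tesseract are hypothetical and not part of pytesseract:

```python
import shutil
from typing import Callable, Optional


def find_tesseract(which: Callable[[str], Optional[str]] = shutil.which) -> Optional[str]:
    """Return the full path of the tesseract executable, or None if it is not on PATH.

    `which` is injectable only so the lookup can be exercised without Tesseract installed.
    """
    return which("tesseract")


def describe_tesseract(path: Optional[str]) -> str:
    """Turn the lookup result into a short human-readable message."""
    if path is None:
        return "tesseract not found on PATH; install the OCR engine itself (for example with brew)"
    return f"tesseract found at {path}"
```

Calling print(describe_tesseract(find_tesseract())) before running OCR shows whether the engine is reachable. If the binary is installed somewhere outside the PATH, pytesseract can also be pointed at it by assigning the full path to pytesseract.pytesseract.tesseract_cmd.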

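So the command is still unusable. Many tutorials online say to install Tesseract with brew instead, and that solved the problem. The steps are: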

First, install Homebrew (brew):


alicedembp:~ alice$ /bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/HEAD/install.sh)"

Next, use brew to install leptonica:


alicedembp:~ alice$ brew install leptonica

Then use brew to install tesseract:


alicedembp:~ alice$ brew install tesseract

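Once that finishes, run tesseract -v on the command line to verify the installation. If a version number is printed, the installation succeeded: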


alicedembp:~ alice$ tesseract
Usage:
  tesseract --help | --help-extra | --version
  tesseract --list-langs
  tesseract imagename outputbase [options...] [configfile...]
 
OCR options:
  -l LANG[+LANG]        Specify language(s) used for OCR.
NOTE: These options must occur before any configfile.
 
Single options:
  --help                Show this help message.
  --help-extra          Show extra help for advanced users.
  --version             Show version information.
  --list-langs          List available languages for tesseract engine.
 
alicedembp:~ alice$ tesseract -v
tesseract 4.0.0
 leptonica-1.78.0
  libgif 5.1.4 : libjpeg 9c : libpng 1.6.36 : libtiff 4.0.10 : zlib 1.2.11 : libwebp 1.0.2 : libopenjp2 2.3.1
 Found AVX2
 Found AVX
 Found SSE
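The same version check can be done from a script. The following is a sketch only, assuming the banner starts with a line of the form "tesseract X.Y.Z" as in the output above; the function names are hypothetical, and the code relies only on the standard library:

```python
import re
import subprocess
from typing import Optional


def parse_tesseract_version(output: str) -> Optional[str]:
    """Extract the version from a line that starts with 'tesseract X.Y.Z', if any."""
    match = re.search(r"^tesseract\s+v?(\S+)", output, flags=re.MULTILINE)
    return match.group(1) if match else None


def installed_tesseract_version() -> Optional[str]:
    """Run `tesseract --version` and return the parsed version, or None if unavailable."""
    try:
        result = subprocess.run(
            ["tesseract", "--version"],
            capture_output=True,
            text=True,
            check=False,
        )
    except OSError:
        # The executable is missing from PATH or could not be started.
        return None
    # Some builds print the banner to stderr, so search both streams.
    return parse_tesseract_version(result.stdout + result.stderr)
```

A return value of None from installed_tesseract_version() corresponds to the "command not found" situation seen earlier, while a string such as the one printed above indicates that the engine is installed and callable.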

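Tesseract can now be used directly from the command line:


alicedembp:~ alice$ tesseract /Users/alice/Documents/Develop/PythonCode/textinphoto.png /Users/alice/Documents/Develop/PythonCode/output

This opens the image textinphoto.PNG and writes the recognized text to output.txt. Tesseract appends the .txt extension to the output base name itself, so the base is given here without it.

[Image: textinphoto.PNG]

The command runs successfully and produces output.txt, which contains the text recognized from the image.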
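The command-line call can also be driven from Python without pytesseract. This is a rough sketch under stated assumptions: build_ocr_command and ocr_to_text are hypothetical helper names, the tesseract executable is assumed to be on the PATH, and the -l option is the language flag listed in the usage output earlier ("eng" is Tesseract's default language):

```python
import subprocess
from pathlib import Path
from typing import List


def build_ocr_command(image_path: str, output_base: str, lang: str = "eng") -> List[str]:
    """Build the argument list for `tesseract imagename outputbase -l LANG`."""
    return ["tesseract", image_path, output_base, "-l", lang]


def ocr_to_text(image_path: str, output_base: str, lang: str = "eng") -> str:
    """Run Tesseract on an image and return the recognized text.

    Tesseract appends `.txt` to the output base, so the result is read
    from `output_base + ".txt"`.
    """
    # check=True raises CalledProcessError if tesseract exits with an error.
    subprocess.run(build_ocr_command(image_path, output_base, lang), check=True)
    return Path(output_base + ".txt").read_text(encoding="utf-8")
```

For example, ocr_to_text("textinphoto.png", "output") would run the same kind of command as the one shown above and return the contents of output.txt. Passing the arguments as a list avoids shell quoting problems with paths that contain spaces.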
