摘 要: 针对目前互联网上关于页面自动登录环节出现的难点,由于部分登录界面有验证码的存在,自动登录的 时长被增加,并且有的验证码难以识别,这就提出了基于Python和卷积神经网络(CNN)相结合的验证码识别。首先本 文对三千多张验证码的样本集进行图片预处理,分别有灰度化处理、二值化处理和去噪点处理三步操作。然后利用三个 池化层和一个全连接层的结构设计卷积神经网络,随后训练样本集,并对随机的十个样本进行预测。 |
关键词: 验证码;Python;二值化;卷积神经网络 |
中图分类号: TP315
文献标识码: A
|
|
Verification Code Recognition Based on Python and CNN |
JIN Dapeng,ZHANG Tianxin,LIU Tao
|
( University of Shanghai for Science and Technology, Shanghai 200093, China)
|
Abstract: The paper focuses on the current difficulties in the automatic login of pages on the Internet.Due to the existence of verifications code in some login interfaces,the duration of automatic login is increased,and some verification codes are difficult to identify.Accordingly,based on Python and Convolutional Neural Network (CNN),a combined identification of verification codes is proposed in this study.Firstly,this paper preprocesses the sample set of more than 3,000 verification codes,including the three processing steps of graying,binarization and denoising.The Convolutional Neural Networ is then designed using three pooled layers and a fully connected layer structure,followed by training the sample set and predicting ten random samples. |
Keywords: verification code;Python;binarization;Convolutional Neural Network |