Breaking CAPTCHA in Python ~ Robert Gawron

Breaking CAPTCHA in Python

in image processing, programming, Python / with 28 comments /

Usually CAPTCHAs are analyzed by using neural network, it's a good approach, but it may be overcomplicated in simple cases. Presented below, much shorter algorithm can produce sufficient results for uncomplicated CAPTCHAs.

In this algorithm an image with unknown letter is compared with samples of known letters, the letter in the most similar sample is probably also the letter in analyzed image. It was implemented as a Python script, usage presented below:

captcha breaker in python, sample of usage

bash-3.2$ python cracker.py test1.png 
e

other sample of usage of the script for breaking CAPTCHAs in Python

bash-3.2$ python cracker.py test2.png 
p

It can't be directly used on a raw CAPTCHA, firstly small artifacts have to be removed from the CAPTCHA, secondly each letter should be stored in a separate image.

The script below requires samples directory with samples of letters. A sample set and this CAPTCHA breaker can be downloaded from my GitHub (CaptachaCracker directory).

import sys, os
import math
import string
import Image
import PIL.ImageChops

if __name__=="__main__":
    input = sys.argv[1]
    base = Image.open(input).convert('L')

    class Fit:
        letter = None
        difference = 0 

    best = Fit()

    for letter in string.lowercase:
        current = Fit()
        current.letter = letter

        sample_path = os.path.join('samples', letter + '.png')
        sample = Image.open(sample_path).convert('L').resize(base.size)
        difference = PIL.ImageChops.difference(base, sample)
        
        for x in range(difference.size[0]):
            for y in range(difference.size[1]):
                current.difference += difference.getpixel((x, y))

        if not best.letter or best.difference > current.difference:
            best = current

    print best.letter

I was surprised that this task can be done in less than 50 lines! Of course it's not good enough to break complicated CAPTCHAs, but they also aren't easy task for more complicated algorithms.

28 comments:

UnknownAugust 8, 2013 at 1:47 PM
wow!!!
the great blog.the post is very informative and very useful.
keep blogging.

image decoding
ReplyDelete
Replies
AnonymousSeptember 18, 2013 at 10:53 AM
OCR software can be used as well. The problem with recognition of the letters is that there isn't a good way to recognise cambered letters (popular "fish eye" effects in Google CAPTCHA or Open Captcha).
ReplyDelete
Replies
UnknownJuly 14, 2016 at 6:05 AM
Hi, where is the sample directory at GitHub?
ReplyDelete
Replies
AcidRainOctober 25, 2016 at 3:26 PM
You can use this captcha solver service for better captcha type support https://www.captchasolutions.com/
ReplyDelete
Replies
AnonymousJune 27, 2022 at 3:26 PM
en son çıkan perde modelleri
uc satın al
en son çıkan perde modelleri
nft nasıl alınır
minecraft premium
özel ambulans
yurtdışı kargo
lisans satın al
ReplyDelete
Replies
website kurmaOctober 25, 2022 at 3:47 AM
Congratulations on your article, it was very helpful and successful. 66a5b1b78247971704bd770b86651825
numara onay
sms onay
website kurma
ReplyDelete
Replies
define dedektörüOctober 29, 2022 at 6:59 AM
Thank you for your explanation, very good content. 5ccf0f6b8bf4cc6ceb44a169bbabcab3
altın dedektörü
ReplyDelete
Replies
evde iş imkanıNovember 16, 2022 at 6:48 PM
Thanks for your article. 3480323888524bdde9ee3f6affaf9f0f
evden iş imkanı
ReplyDelete
Replies
mrbahisDecember 17, 2022 at 1:10 AM
Good content. You write beautiful things.
mrbahis
sportsbet
vbet
hacklink
sportsbet
hacklink
vbet
taksi
korsan taksi
ReplyDelete
Replies
azraJuly 31, 2023 at 10:07 AM
maraş
bursa
tokat
uşak
samsun

8ON6
ReplyDelete
Replies
AyşeAugust 13, 2023 at 12:02 PM
salt likit
salt likit
dr mood likit
big boss likit
dl likit
dark likit
Z1S3
ReplyDelete
Replies
alpSeptember 6, 2023 at 1:41 AM
https://saglamproxy.com
metin2 proxy
proxy satın al
knight online proxy
mobil proxy satın al
PWWJA
ReplyDelete
Replies
Şengül2September 27, 2023 at 11:00 AM
https://bayanlarsitesi.com/
Altınşehir
Karaköy
Alemdağ
Gürpınar
LFFN
ReplyDelete
Replies
CelestialCipherXOctober 18, 2023 at 4:17 PM
ankara parça eşya taşıma
takipçi satın al
antalya rent a car
antalya rent a car
ankara parça eşya taşıma
Z5B726
ReplyDelete
Replies
95DA5Linda53F21November 9, 2023 at 12:19 AM
578AE
Kalıcı Makyaj
Karaman Lojistik
Bitci Güvenilir mi
Şırnak Şehirler Arası Nakliyat
Batman Parça Eşya Taşıma
Ünye Oto Lastik
Bitexen Güvenilir mi
Çerkezköy Çamaşır Makinesi Tamircisi
Antalya Lojistik
ReplyDelete
Replies
AE8B3Brittany80A00November 9, 2023 at 12:59 AM
05680
Kastamonu Parça Eşya Taşıma
Ünye Kurtarıcı
Silivri Boya Ustası
İzmir Şehir İçi Nakliyat
İstanbul Lojistik
Bursa Şehir İçi Nakliyat
Sincan Fayans Ustası
Kırıkkale Şehir İçi Nakliyat
Sinop Evden Eve Nakliyat
ReplyDelete
Replies
AF16EConradC1831December 23, 2023 at 4:52 PM
688F3
istanbul parasız sohbet
hatay sohbet muhabbet
düzce canlı sohbet ücretsiz
bartın görüntülü sohbet sitesi
Bilecik Telefonda Rastgele Sohbet
görüntülü sohbet canlı
diyarbakır canlı sohbet et
Eskişehir Yabancı Görüntülü Sohbet Uygulamaları
görüntülü sohbet
ReplyDelete
Replies
C7E6EBlaze23447January 4, 2024 at 10:13 PM
71A71
kocaeli rastgele sohbet siteleri
balıkesir sesli görüntülü sohbet
ordu mobil sohbet
kocaeli mobil sohbet et
adana görüntülü canlı sohbet
adana parasız görüntülü sohbet
karaman parasız görüntülü sohbet uygulamaları
kastamonu rastgele canlı sohbet
Mersin Bedava Görüntülü Sohbet
ReplyDelete
Replies
3C587LeelaD4B5EJanuary 6, 2024 at 4:29 PM
A66FF
Twitch Takipçi Satın Al
Parasız Görüntülü Sohbet
Kripto Para Üretme
Bitcoin Üretme
Tiktok İzlenme Satın Al
Coin Nasıl Alınır
Binance Referans Kodu
Spotify Takipçi Hilesi
Youtube Abone Hilesi
ReplyDelete
Replies
349A0AndrewDEFFEJanuary 7, 2024 at 9:53 PM
29B25
Nonolive Takipçi Satın Al
Mexc Borsası Güvenilir mi
Threads Yeniden Paylaş Hilesi
MEME Coin Hangi Borsada
Telegram Abone Satın Al
Coin Nasıl Kazılır
Kripto Para Kazanma
Instagram Takipçi Hilesi
Mith Coin Hangi Borsada
ReplyDelete
Replies
B5846Israel36F4AJanuary 21, 2024 at 7:48 AM
3F2CB
ledger desktop
avax wallet
trust wallet web
trezor suite
web bitbox wallet
ledger web
ledger web
web arculus
arculus wallet web
ReplyDelete
Replies

Add comment