StackOverflow2013

Note that there are some explanatory texts on larger screens.

plurals

PORemoving background noisy lines from Captcha Image using PYTHON PIL
primarykey
Id
15319528
data
AcceptedAnswerId
0
AnswerCount
3
ClosedDate
CommentCount
1
CommunityOwnedDate
CreationDate
2013-03-10T06:17:21.293
FavoriteCount
6
LastActivityDate
2014-09-19T14:28:27.857
LastEditDate
2013-04-15T05:15:04.753
LastEditorUserId
1618788
OwnerUserId
1618788
ParentId
0
PostTypeId
1
Score
3
ViewCount
5583
LastEditorDisplayName
text
Body
I have a processed captcha image(Enlarged) look like : <img src="https://i.stack.imgur.com/oeDUH.gif" alt="captcha"> As you can see, the font-size of the "TEXT" is bit larger than the width of the Noisy Lines. So I need an algorithm or code to remove the noisy lines from this image. With the help of Python PIL Library and the chopping algorithm mentioned below I din't get the output image which could be easily read by OCRs. Here's Python code that I tried : <pre><code>import PIL.Image import sys # python chop.py [chop-factor] [in-file] [out-file] chop = int(sys.argv[1]) image = PIL.Image.open(sys.argv[2]).convert('1') width, height = image.size data = image.load() # Iterate through the rows. for y in range(height): for x in range(width): # Make sure we're on a dark pixel. if data[x, y] > 128: continue # Keep a total of non-white contiguous pixels. total = 0 # Check a sequence ranging from x to image.width. for c in range(x, width): # If the pixel is dark, add it to the total. if data[c, y] < 128: total += 1 # If the pixel is light, stop the sequence. else: break # If the total is less than the chop, replace everything with white. if total <= chop: for c in range(total): data[x + c, y] = 255 # Skip this sequence we just altered. x += total # Iterate through the columns. for x in range(width): for y in range(height): # Make sure we're on a dark pixel. if data[x, y] > 128: continue # Keep a total of non-white contiguous pixels. total = 0 # Check a sequence ranging from y to image.height. for c in range(y, height): # If the pixel is dark, add it to the total. if data[x, c] < 128: total += 1 # If the pixel is light, stop the sequence. else: break # If the total is less than the chop, replace everything with white. if total <= chop: for c in range(total): data[x, y + c] = 255 # Skip this sequence we just altered. y += total image.save(sys.argv[3]) </code></pre> So, basically I would like to know a better algorithm/code to get rid of the noise and thus able to make the image readable by the OCR (Tesseract or pytesser).
Tags
<python><algorithm><image-processing><python-imaging-library><captcha>
Title
Removing background noisy lines from Captcha Image using PYTHON PIL
singulars
PostAcceptedAnswerId
1. This table or related slice is empty.
PostParentId
1. This table or related slice is empty.
PostTypePostTypeId
1. PTQuestion
UserLastEditorUserId
1. USdjadmin
UserOwnerUserId
1. USdjadmin
plurals
PostLinksPostIdRelatedPostId
1. This table or related slice is empty.
PostLinksRelatedPostIdPostId
1. This table or related slice is empty.
PostsAcceptedAnswerId
1. This table or related slice is empty.
PostsParentIdCreationDate
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
2. PO
 singulars
 PostTypePostTypeId
 PTAnswer
VotesPostIdCreationDate
1. VO
 singulars
 PostPostId
 PORemoving background noisy lines from Captcha Image using PYTHON PIL
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
2. VO
 singulars
 PostPostId
 PORemoving background noisy lines from Captcha Image using PYTHON PIL
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
3. VO
 singulars
 PostPostId
 PORemoving background noisy lines from Captcha Image using PYTHON PIL
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTApproveEditSuggestion
CommentsPostId
1. This table or related slice is empty.

Querying!

Guidance

A row detail

Detail views are divided into sections. All the information in the data section comes from columns in the selected row. The other sections display data from other, related rows.

Related data can be related in a to-one or a to-many fashion. Captions of data related in a to-many fashion link to a list view showing a filtered view of the table.

Try moving around until you find a non-empty to-many entry and click on the label to get to one. You can move back to the root by clicking on the database name in the header.