iOS - How to recognise texts as a block using tesseract OCR
I am using google's tesseract OCR (https://github.com/gali8/Tesseract-OCR-iOS) to perform image to text conversion in my iOS app.
I'm able to scan and get the string using the following code.
let tesseract:G8Tesseract = G8Tesseract(language:"eng")
tesseract.delegate = self
tesseract.image = imageTaken // image taken from camera
tesseract.engineMode = .tesseractCubeCombined
tesseract.recognize()
print(tesseract.recognizedText)
It scans and retrieves the text in line by line approach. (Mixes the lines from other paragraphs). Like this.
Image 1

Now, How can I get texts as a block and read lines from each blocks separately. Like this.
Image 2

Things I tried.
- print(tesseract.recognizedBlocks(by: .block))
- print(tesseract.recognizedBlocks(by: .paragraph))
Still it mixes the lines from different paragraphs and considers the texts as a single line as shown in the image 1.
Any help will be appreciated. Thanks in advance.
ios swift ocr tesseract text-recognition
add a comment |
I am using google's tesseract OCR (https://github.com/gali8/Tesseract-OCR-iOS) to perform image to text conversion in my iOS app.
I'm able to scan and get the string using the following code.
let tesseract:G8Tesseract = G8Tesseract(language:"eng")
tesseract.delegate = self
tesseract.image = imageTaken // image taken from camera
tesseract.engineMode = .tesseractCubeCombined
tesseract.recognize()
print(tesseract.recognizedText)
It scans and retrieves the text in line by line approach. (Mixes the lines from other paragraphs). Like this.
Image 1

Now, How can I get texts as a block and read lines from each blocks separately. Like this.
Image 2

Things I tried.
- print(tesseract.recognizedBlocks(by: .block))
- print(tesseract.recognizedBlocks(by: .paragraph))
Still it mixes the lines from different paragraphs and considers the texts as a single line as shown in the image 1.
Any help will be appreciated. Thanks in advance.
ios swift ocr tesseract text-recognition
add a comment |
I am using google's tesseract OCR (https://github.com/gali8/Tesseract-OCR-iOS) to perform image to text conversion in my iOS app.
I'm able to scan and get the string using the following code.
let tesseract:G8Tesseract = G8Tesseract(language:"eng")
tesseract.delegate = self
tesseract.image = imageTaken // image taken from camera
tesseract.engineMode = .tesseractCubeCombined
tesseract.recognize()
print(tesseract.recognizedText)
It scans and retrieves the text in line by line approach. (Mixes the lines from other paragraphs). Like this.
Image 1

Now, How can I get texts as a block and read lines from each blocks separately. Like this.
Image 2

Things I tried.
- print(tesseract.recognizedBlocks(by: .block))
- print(tesseract.recognizedBlocks(by: .paragraph))
Still it mixes the lines from different paragraphs and considers the texts as a single line as shown in the image 1.
Any help will be appreciated. Thanks in advance.
ios swift ocr tesseract text-recognition
I am using google's tesseract OCR (https://github.com/gali8/Tesseract-OCR-iOS) to perform image to text conversion in my iOS app.
I'm able to scan and get the string using the following code.
let tesseract:G8Tesseract = G8Tesseract(language:"eng")
tesseract.delegate = self
tesseract.image = imageTaken // image taken from camera
tesseract.engineMode = .tesseractCubeCombined
tesseract.recognize()
print(tesseract.recognizedText)
It scans and retrieves the text in line by line approach. (Mixes the lines from other paragraphs). Like this.
Image 1

Now, How can I get texts as a block and read lines from each blocks separately. Like this.
Image 2

Things I tried.
- print(tesseract.recognizedBlocks(by: .block))
- print(tesseract.recognizedBlocks(by: .paragraph))
Still it mixes the lines from different paragraphs and considers the texts as a single line as shown in the image 1.
Any help will be appreciated. Thanks in advance.
ios swift ocr tesseract text-recognition
ios swift ocr tesseract text-recognition
asked Nov 12 '18 at 12:07
TesterTester
1661114
1661114
add a comment |
add a comment |
0
active
oldest
votes
Your Answer
StackExchange.ifUsing("editor", function ()
StackExchange.using("externalEditor", function ()
StackExchange.using("snippets", function ()
StackExchange.snippets.init();
);
);
, "code-snippets");
StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "1"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);
else
createEditor();
);
function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);
);
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53261872%2fios-how-to-recognise-texts-as-a-block-using-tesseract-ocr%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
0
active
oldest
votes
0
active
oldest
votes
active
oldest
votes
active
oldest
votes
Thanks for contributing an answer to Stack Overflow!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53261872%2fios-how-to-recognise-texts-as-a-block-using-tesseract-ocr%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown