From b85909ddd3ad7d34f2e511ef482f6fc121ecb96b Mon Sep 17 00:00:00 2001
From: BobLd <38405645+BobLd@users.noreply.github.com>
Date: Fri, 31 Jan 2020 17:09:10 +0000
Subject: [PATCH] Updated Document Layout Analysis (markdown)

---
 Document-Layout-Analysis.md | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/Document-Layout-Analysis.md b/Document-Layout-Analysis.md
index 9ce7e1e..6234167 100644
--- a/Document-Layout-Analysis.md
+++ b/Document-Layout-Analysis.md
@@ -53,7 +53,8 @@ using (var document = PdfDocument.Open(@"document.pdf"))
 
 The algorithm was used on this [map](https://upload.wikimedia.org/wikipedia/commons/6/64/APISmap1.pdf) that has a complex layout, with glyphs/words having very diverse text directions. The algorithm is able to rebuild words independently of their direction.
 Words are indicated by red rectangles:
-TO DO: Image
+
+![nearest neighbour word example](https://github.com/UglyToad/PdfPig/blob/master/documentation/Document%20Layout%20Analysis/nearest%20neighbour%20word%20example.PNG)
 
 # Page segmenters
 Page segmenters deal with the task of finding block of text in a page. 3 different methods are currently available: