Updated Document Layout Analysis (markdown)

BobLd
2020-01-31 17:09:10 +00:00
parent 6338aab9cb
commit b85909ddd3

@@ -53,7 +53,8 @@ using (var document = PdfDocument.Open(@"document.pdf"))
The algorithm was used on this [map](https://upload.wikimedia.org/wikipedia/commons/6/64/APISmap1.pdf) that has a complex layout, with glyphs/words having very diverse text directions. The algorithm is able to rebuild words independently of their direction.
Words are indicated by red rectangles:
TO DO: Image
![nearest neighbour word example](https://github.com/UglyToad/PdfPig/blob/master/documentation/Document%20Layout%20Analysis/nearest%20neighbour%20word%20example.PNG)
# Page segmenters
Page segmenters deal with the task of finding block of text in a page. 3 different methods are currently available: