mirror of
https://github.com/UglyToad/PdfPig.git
synced 2026-03-10 00:23:29 +08:00
Add Page segmenters use case table
@@ -73,6 +73,16 @@ Page segmenters deal with the task of finding block of text in a page. 3 differe
|
||||
- [__Recursive XY Cut__](https://github.com/UglyToad/PdfPig/wiki/Document-Layout-Analysis#recursive-xy-cut-method) – a top-down method
|
||||
- [__Docstrum for bounding boxes__](https://github.com/UglyToad/PdfPig/wiki/Document-Layout-Analysis#docstrum-for-bounding-boxes-method) – a bottom-up method
|
||||
|
||||
## Use cases
|
||||
|Text | Default method | Recursive XY Cut | Docstrum |
|
||||
|-------------------------|:----------------:|:------------------:|:----------:|
|
||||
|Single Column | ✔️ | ✔️ | ✔️ |
|
||||
|Multi Columns | ✖️ | ✔️ | ✔️ |
|
||||
|L-shaped text | ❌ | ❌ | ✔️ |
|
||||
|Rotated lines/paragraphs | ❌ | ✖️ | ✔️ |
|
||||
|
||||
Legend: ✔️: supported, ✖️: partial support, ❌: not supported
|
||||
|
||||
## [Default method](https://github.com/UglyToad/PdfPig/blob/master/src/UglyToad.PdfPig.DocumentLayoutAnalysis/DefaultPageSegmenter.cs)
|
||||
### Description
|
||||
This method returns one single block, containing all words in the page.
|
||||
|
||||
Reference in New Issue
Block a user