Commit Graph

990 Commits

Author SHA1 Message Date
BobLd
6d31ef80a7 add RecursiveXYCutTests 2020-05-30 13:03:59 +01:00
BobLd
aa0e75d768 update DocstrumBoundingBoxesTests 2020-05-30 13:03:59 +01:00
BobLd
208e1dd8f2 add DocstrumBoundingBoxesTests 2020-05-30 13:03:59 +01:00
BobLd
75e9046c16 add DlaHelper for tests and correct minor typos 2020-05-30 13:03:59 +01:00
BobLd
05d96cd9c4 add documents for tests 2020-05-30 13:03:59 +01:00
BobLd
465cf3f072 update word rotated bbox with previous PdfRectangle constructor order 2020-05-30 13:03:59 +01:00
BobLd
dacf816a86 add summary doc to Clipper 2020-05-30 13:03:59 +01:00
BobLd
f883b56e72 completely rework DocstrumBoundingBoxes, now handle rotated text 2020-05-30 13:03:59 +01:00
BobLd
a16f377d5a update DefaultPageSegmenter to use DlaOptions 2020-05-30 13:03:59 +01:00
BobLd
1438fec741 update RecursiveXYCut to use DlaOptions 2020-05-30 13:03:59 +01:00
BobLd
5362a335f5 update XYLeaf with word separator 2020-05-30 13:03:59 +01:00
BobLd
79b78f486a add ReadingOrderHelper 2020-05-30 13:03:59 +01:00
BobLd
ec613d337f correct Word bounding box 2020-05-30 13:03:59 +01:00
BobLd
8f1ab2022f update NearestNeighbourWordExtractor to use DlaOptions, stop ordering words 2020-05-30 13:03:59 +01:00
BobLd
43a68693ba allow oriented bounding box for TextBlock 2020-05-30 13:03:59 +01:00
BobLd
5b0b0a6db3 allow oriented bounding box for TextLine 2020-05-30 13:03:59 +01:00
BobLd
bb94348127 add text Separator in TextBlock and TextLine 2020-05-30 13:03:59 +01:00
BobLd
5f75205e41 rename TextDirection into TextOrientation 2020-05-30 13:03:59 +01:00
BobLd
33ee66af42 add PageSegmenterOptions abstract class 2020-05-30 13:03:59 +01:00
BobLd
dd546dcfc8 update IPageSegmenter with DlaOptions 2020-05-30 13:03:59 +01:00
BobLd
3cf7c45994 add DlaOptions abstract class 2020-05-30 13:03:59 +01:00
BobLd
a00660cd6e update nn word extractor for new clustering type 2020-05-30 13:03:59 +01:00
BobLd
78da925263 add AlmostEqualsToZero() and AlmostEquals(). 2020-05-30 13:03:59 +01:00
BobLd
08300f6a3a use IReadOnlyList<PdfPoint> i/o PdfPoint[] in KdTree 2020-05-30 13:03:59 +01:00
BobLd
ca4111ec1b better parameters for FindIndexNearest 2020-05-30 13:03:59 +01:00
BobLd
d2c2a2f592 add angle bounding functions 2020-05-30 13:03:59 +01:00
BobLd
404d6621de return grouped elements i/o grouped indexes in Clustering.NearestNeighbours 2020-05-30 13:03:59 +01:00
BobLd
eccfb35397 Merge branch 'master' of https://github.com/UglyToad/PdfPig 2020-05-23 18:26:19 +01:00
BobLd
78d57ad5f9 Add italic-bold test 2020-05-23 16:26:39 +01:00
BobLd
40afe977a3 check for ItalicAngle != 0 2020-05-23 16:26:39 +01:00
BobLd
496a9bc6c7 Merge branch 'master' of https://github.com/UglyToad/PdfPig 2020-05-22 13:46:31 +01:00
Eliot Jones
a7a2ef0630 remove old text from the readme 2020-05-10 16:40:05 +01:00
Eliot Jones
256c2833ab 0.1.2-alpha002 2020-05-10 16:36:14 +01:00
BobLd
bb33741552 Fix KdTree.FindNearestNeighbours(k) returning the pivot itself 2020-05-10 15:49:00 +01:00
Eliot Jones
0512bb1e4f handle indirect references appearing in cid font widths array #174 2020-05-10 15:46:38 +01:00
BobLd
1b51eba496 Merge branch 'master' of https://github.com/UglyToad/PdfPig 2020-04-28 22:11:56 +01:00
BobLd
a53d87a6f5 Fix KdTree.FindNearestNeighbours(k) returning the pivot itself 2020-04-28 22:01:50 +01:00
BobLd
f91acefcfa Set ClipPaths to false if no ParsingOptions given (consistent behaviour) 2020-04-27 17:21:52 +01:00
BobLd
1a3edabd1f Set ClipPaths to false if no ParsingOptions given (consistent behaviour) 2020-04-27 12:51:02 +01:00
Eliot Jones
09b951f667 expose font details on individual letters (also fixes a regression for image extraction) 2020-04-25 17:15:26 +01:00
Eliot Jones
98dd736f94 0.1.2-alpha001 2020-04-25 15:20:07 +01:00
Eliot Jones
ae62197178 merge pull request #167 from bobld/master: Improve NearestNeighbourWordExtractor 2020-04-25 15:07:18 +01:00
Eliot Jones
e264583c21 add merging to the readme 2020-04-25 12:06:11 +01:00
Eliot Jones
19047f62ae fix name output for merged documents 2020-04-25 11:23:37 +01:00
Eliot Jones
391b650e3c add more examples to the examples solution 2020-04-25 10:14:05 +01:00
BobLd
ae82c30a31 Merge branch 'master' of https://github.com/UglyToad/PdfPig 2020-04-25 10:10:08 +01:00
Eliot Jones
27e251f921 make filter provider and filter public and use tryget for image bytes 2020-04-25 09:42:24 +01:00
Eliot Jones
635c4b4c5e formatting tidy-up 2020-04-25 09:11:16 +01:00
BobLd
c2de52423e Make NearestNeighbours public 2020-04-25 08:40:43 +01:00
BobLd
d4210cd5d1 Make clustering algos public and use shorter names 2020-04-25 08:40:43 +01:00