Commit Graph

963 Commits

Author SHA1 Message Date
Eliot Jones
0f65397f48 Merge pull request #184 from BobLd/docstrum-v2.3: Fix DocstrumBoundingBoxes when dXj=0 2020-06-20 15:27:05 +01:00
BobLd
4b88f4adbe correct typo in section numers 2020-06-20 14:27:13 +01:00
BobLd
4f78e58195 remove unnecessary 'inheritdoc' 2020-06-20 14:24:29 +01:00
BobLd
924c0138e0 fix Docstrum's GetTranslatedPoint() to handle dXj=0 2020-06-20 14:19:23 +01:00
BobLd
091c17bdf8 Merge branch 'master' of https://github.com/UglyToad/PdfPig 2020-06-20 13:30:30 +01:00
Eliot Jones
5fb04582a7 0.1.2-alpha003 2020-06-20 12:54:31 +01:00
Eliot Jones
982c331935 re-use truetype parser for opentype cid fonts 2020-06-20 12:46:41 +01:00
BobLd
4758820db5 Merge branch 'master' of https://github.com/UglyToad/PdfPig 2020-06-20 12:29:39 +01:00
Eliot Jones
79dea8d314 Merge pull request #179 from BobLd/master: Improve ContentStreamProcessor 2020-06-20 12:13:40 +01:00
BobLd
8ef70d9a9d Merge branch 'master' of https://github.com/BobLd/PdfPig 2020-06-18 18:31:29 +01:00
BobLd
7a393383de Merge branch 'master' of https://github.com/UglyToad/PdfPig 2020-06-17 22:56:41 +01:00
BobLd
33f92cd11c handle page rotation by updating initial TransformationMatrix 2020-06-02 16:12:24 +01:00
BobLd
6e773446df simplify double cast 2020-06-01 14:55:45 +01:00
BobLd
2d9a4e5adb fix CurrentTransformationMatrix multiplication order in ProcessFormXObject 2020-06-01 14:00:17 +01:00
BobLd
958beada48 Merge branch 'master' of https://github.com/UglyToad/PdfPig 2020-06-01 13:54:01 +01:00
Eliot Jones
bf45602ac5 fix #176, allow startxref to appear earlier in the document 2020-05-31 17:01:38 +01:00
BobLd
4312aa470e minor optimisations 2020-05-30 13:03:59 +01:00
BobLd
755e199fed Add new dla images 2020-05-30 13:03:59 +01:00
BobLd
b3665f10c9 Delete recursive xy cut example.png 2020-05-30 13:03:59 +01:00
BobLd
0ec6e2389f Delete nearest neighbour word example.PNG 2020-05-30 13:03:59 +01:00
BobLd
1d6b819579 Delete docstrum bounding boxes example.png 2020-05-30 13:03:59 +01:00
BobLd
3ac26bb1bc fix bbox for TextLine and TextBlock 2020-05-30 13:03:59 +01:00
BobLd
14454184ad update RecursiveXYCutTests 2020-05-30 13:03:59 +01:00
BobLd
6d31ef80a7 add RecursiveXYCutTests 2020-05-30 13:03:59 +01:00
BobLd
aa0e75d768 update DocstrumBoundingBoxesTests 2020-05-30 13:03:59 +01:00
BobLd
208e1dd8f2 add DocstrumBoundingBoxesTests 2020-05-30 13:03:59 +01:00
BobLd
75e9046c16 add DlaHelper for tests and correct minor typos 2020-05-30 13:03:59 +01:00
BobLd
05d96cd9c4 add documents for tests 2020-05-30 13:03:59 +01:00
BobLd
465cf3f072 update word rotated bbox with previous PdfRectangle constructor order 2020-05-30 13:03:59 +01:00
BobLd
dacf816a86 add summary doc to Clipper 2020-05-30 13:03:59 +01:00
BobLd
f883b56e72 completely rework DocstrumBoundingBoxes, now handle rotated text 2020-05-30 13:03:59 +01:00
BobLd
a16f377d5a update DefaultPageSegmenter to use DlaOptions 2020-05-30 13:03:59 +01:00
BobLd
1438fec741 update RecursiveXYCut to use DlaOptions 2020-05-30 13:03:59 +01:00
BobLd
5362a335f5 update XYLeaf with word separator 2020-05-30 13:03:59 +01:00
BobLd
79b78f486a add ReadingOrderHelper 2020-05-30 13:03:59 +01:00
BobLd
ec613d337f correct Word bounding box 2020-05-30 13:03:59 +01:00
BobLd
8f1ab2022f update NearestNeighbourWordExtractor to use DlaOptions, stop ordering words 2020-05-30 13:03:59 +01:00
BobLd
43a68693ba allow oriented bounding box for TextBlock 2020-05-30 13:03:59 +01:00
BobLd
5b0b0a6db3 allow oriented bounding box for TextLine 2020-05-30 13:03:59 +01:00
BobLd
bb94348127 add text Separator in TextBlock and TextLine 2020-05-30 13:03:59 +01:00
BobLd
5f75205e41 rename TextDirection into TextOrientation 2020-05-30 13:03:59 +01:00
BobLd
33ee66af42 add PageSegmenterOptions abstract class 2020-05-30 13:03:59 +01:00
BobLd
dd546dcfc8 update IPageSegmenter with DlaOptions 2020-05-30 13:03:59 +01:00
BobLd
3cf7c45994 add DlaOptions abstract class 2020-05-30 13:03:59 +01:00
BobLd
a00660cd6e update nn word extractor for new clustering type 2020-05-30 13:03:59 +01:00
BobLd
78da925263 add AlmostEqualsToZero() and AlmostEquals(). 2020-05-30 13:03:59 +01:00
BobLd
08300f6a3a use IReadOnlyList<PdfPoint> i/o PdfPoint[] in KdTree 2020-05-30 13:03:59 +01:00
BobLd
ca4111ec1b better parameters for FindIndexNearest 2020-05-30 13:03:59 +01:00
BobLd
d2c2a2f592 add angle bounding functions 2020-05-30 13:03:59 +01:00
BobLd
404d6621de return grouped elements i/o grouped indexes in Clustering.NearestNeighbours 2020-05-30 13:03:59 +01:00