Commit Graph

83 Commits

Author
SHA1 Message Date
Eliot Jones
ad0fb4ec5b version 0.1.3 2020-11-15 12:08:37 -04:00
BobLd
d3b619b6ee Fix Error CS0136 and tidy up code 2020-09-29 13:50:05 +01:00
BobLd
90dacb1fcf Improve UnsupervisedReadingOrderDetector customisation: allows changing the spatial reasoning rule in use and using or not rendering order. 2020-09-26 12:29:24 +01:00
Eliot Jones
58ecfbf963 0.1.3-alpha001 2020-09-04 13:19:03 +01:00
Eliot Jones
98af575ee3 version 0.1.2 2020-07-04 16:55:14 +01:00
Eliot Jones
7d166131ad Merge pull request #186 from BobLd/dupOverlap
Implement DuplicateOverlappingTextProcessor
2020-07-01 16:09:48 +01:00
BobLd
4cda06c2fa make Letter public and flag letter as bold 2020-06-26 10:29:44 +01:00
BobLd
fe8bc0e5be Implement DuplicateOverlappingTextProcessor 2020-06-25 15:00:20 +01:00
BobLd
4b88f4adbe correct typo in section numers 2020-06-20 14:27:13 +01:00
BobLd
4f78e58195 remove unnecessary 'inheritdoc' 2020-06-20 14:24:29 +01:00
BobLd
924c0138e0 fix Docstrum's GetTranslatedPoint() to handle dXj=0 2020-06-20 14:19:23 +01:00
Eliot Jones
5fb04582a7 0.1.2-alpha003 2020-06-20 12:54:31 +01:00
BobLd
4312aa470e minor optimisations 2020-05-30 13:03:59 +01:00
BobLd
3ac26bb1bc fix bbox for TextLine and TextBlock 2020-05-30 13:03:59 +01:00
BobLd
f883b56e72 completely rework DocstrumBoundingBoxes, now handle rotated text 2020-05-30 13:03:59 +01:00
BobLd
a16f377d5a update DefaultPageSegmenter to use DlaOptions 2020-05-30 13:03:59 +01:00
BobLd
1438fec741 update RecursiveXYCut to use DlaOptions 2020-05-30 13:03:59 +01:00
BobLd
5362a335f5 update XYLeaf with word separator 2020-05-30 13:03:59 +01:00
BobLd
79b78f486a add ReadingOrderHelper 2020-05-30 13:03:59 +01:00
BobLd
8f1ab2022f update NearestNeighbourWordExtractor to use DlaOptions, stop ordering words 2020-05-30 13:03:59 +01:00
BobLd
43a68693ba allow oriented bounding box for TextBlock 2020-05-30 13:03:59 +01:00
BobLd
5b0b0a6db3 allow oriented bounding box for TextLine 2020-05-30 13:03:59 +01:00
BobLd
bb94348127 add text Separator in TextBlock and TextLine 2020-05-30 13:03:59 +01:00
BobLd
5f75205e41 rename TextDirection into TextOrientation 2020-05-30 13:03:59 +01:00
BobLd
33ee66af42 add PageSegmenterOptions abstract class 2020-05-30 13:03:59 +01:00
BobLd
dd546dcfc8 update IPageSegmenter with DlaOptions 2020-05-30 13:03:59 +01:00
BobLd
3cf7c45994 add DlaOptions abstract class 2020-05-30 13:03:59 +01:00
BobLd
a00660cd6e update nn word extractor for new clustering type 2020-05-30 13:03:59 +01:00
BobLd
78da925263 add AlmostEqualsToZero() and AlmostEquals(). 2020-05-30 13:03:59 +01:00
BobLd
08300f6a3a use IReadOnlyList<PdfPoint> i/o PdfPoint[] in KdTree 2020-05-30 13:03:59 +01:00
BobLd
ca4111ec1b better parameters for FindIndexNearest 2020-05-30 13:03:59 +01:00
BobLd
d2c2a2f592 add angle bounding functions 2020-05-30 13:03:59 +01:00
BobLd
404d6621de return grouped elements i/o grouped indexes in Clustering.NearestNeighbours 2020-05-30 13:03:59 +01:00
Eliot Jones
256c2833ab 0.1.2-alpha002 2020-05-10 16:36:14 +01:00
BobLd
bb33741552 Fix KdTree.FindNearestNeighbours(k) returning the pivot itself 2020-05-10 15:49:00 +01:00
Eliot Jones
98dd736f94 0.1.2-alpha001 2020-04-25 15:20:07 +01:00
BobLd
ae82c30a31 Merge branch 'master' of https://github.com/UglyToad/PdfPig 2020-04-25 10:10:08 +01:00
BobLd
c2de52423e Make NearestNeighbours public 2020-04-25 08:40:43 +01:00
BobLd
d4210cd5d1 Make clustering algos public and use shorter names 2020-04-25 08:40:43 +01:00
BobLd
0a6ec3946b NearestNeighbourWordExtractor:
- Improve results by using PointSize
- Make 'filterFunction' public for ad hoc GetWords()
- Allow text in different direction

Make Letter.PointSize public and add warning (needed for NNWordExtractor)
Remove Page.GetPointSize(Letter letter)
2020-04-20 13:09:35 +01:00
Eliot Jones
407ee5ca51 add content order text extractor and example of use 2020-04-19 17:06:34 +01:00
BobLd
ab94746252 Improve default RecursiveXYCut dominant font width and height functions 2020-04-13 10:25:34 +01:00
Eliot Jones
75c04eb81c fix namespace using order 2020-04-05 17:58:57 +01:00
Eliot Jones
2a0a3fae69 add test for svg exporter and escape xml characters 2020-04-05 17:58:57 +01:00
Eliot Jones
2998b1b2eb tidy up clipper related code 2020-04-05 17:58:57 +01:00
BobLd
c14646fcc0 Fix Mode() 2020-04-05 17:58:57 +01:00
BobLd
a7fe39fc32 Add SvgTextExporter
Modify WriteSvg to account for height
2020-04-05 17:58:57 +01:00
BobLd
43b40da5d5 Change Subpath to path where necessary 2020-04-05 17:58:57 +01:00
BobLd
ab6a0f11fc Change name from PdfPath to PdfSubpath 2020-04-05 17:58:57 +01:00
Eliot Jones
9abe9f4b2f #158 add strong naming to the solution 2020-04-04 16:59:51 +01:00