Commit Graph

96 Commits

Author SHA1 Message Date
Eliot Jones
03692cf42f set version to alpha of 0.1.7 for future nightly builds 2022-04-25 10:06:46 -04:00
Eliot Jones
5597a8f38c version 0.1.6 2022-04-25 09:22:47 -04:00
Eliot Jones
a12fc0ce34 #441 option to replace tabs and whitespace with space character in generated text 2022-04-22 16:31:53 -04:00
Eliot Jones
10680299cb Merge pull request #432 from BobLd/master
Refactor DLA
2022-04-14 19:20:59 -04:00
Eliot Jones
d7898d851c add net 6 as a target framework and dual target tests 2022-04-03 14:48:50 -04:00
BobLD
05aba1cfe7 Make DlaOptions an interface, add IWordExtractorOptions, remove GetBlocks(words, options), GetWords(letters, options) and put options in constructors - Fix #424. Tidy up code 2022-03-12 13:41:28 +00:00
Eliot Jones
a538aaf0de bump version so that nightly builds for pre-release have correct version 2022-01-10 12:40:47 +00:00
davmarksman
b3ace45e96 PageXmlTextExporter: ensure no coords on the page boundaries #399 2021-12-19 15:21:15 +00:00
BobLD
7ba28d2b56 Clean DocstrumBoundingBoxes and make relevant static methods public. Add tests. #376 2021-10-24 11:44:24 +01:00
Eliot Jones
4c36f84a0d version 0.1.5 2021-09-17 11:16:43 -04:00
Eliot Jones
16d26effc5 0.1.5-alpha002 2021-05-09 13:05:54 -04:00
Eliot Jones
b930924b9b 0.1.5-alpha001 2021-02-28 13:55:25 -04:00
Eliot Jones
237fd96f9e version 0.1.4 2020-11-29 14:02:42 -04:00
Eliot Jones
ad0fb4ec5b version 0.1.3 2020-11-15 12:08:37 -04:00
BobLd
d3b619b6ee Fix Error CS0136 and tidy up code 2020-09-29 13:50:05 +01:00
BobLd
90dacb1fcf Improve UnsupervisedReadingOrderDetector customisation: allows changing the spatial reasoning rule in use and using or not rendering order. 2020-09-26 12:29:24 +01:00
Eliot Jones
58ecfbf963 0.1.3-alpha001 2020-09-04 13:19:03 +01:00
Eliot Jones
98af575ee3 version 0.1.2 2020-07-04 16:55:14 +01:00
Eliot Jones
7d166131ad Merge pull request #186 from BobLd/dupOverlap
Implement DuplicateOverlappingTextProcessor
2020-07-01 16:09:48 +01:00
BobLd
4cda06c2fa make Letter public and flag letter as bold 2020-06-26 10:29:44 +01:00
BobLd
fe8bc0e5be Implement DuplicateOverlappingTextProcessor 2020-06-25 15:00:20 +01:00
BobLd
4b88f4adbe correct typo in section numers 2020-06-20 14:27:13 +01:00
BobLd
4f78e58195 remove unnecessary 'inheritdoc' 2020-06-20 14:24:29 +01:00
BobLd
924c0138e0 fix Docstrum's GetTranslatedPoint() to handle dXj=0 2020-06-20 14:19:23 +01:00
Eliot Jones
5fb04582a7 0.1.2-alpha003 2020-06-20 12:54:31 +01:00
BobLd
4312aa470e minor optimisations 2020-05-30 13:03:59 +01:00
BobLd
3ac26bb1bc fix bbox for TextLine and TextBlock 2020-05-30 13:03:59 +01:00
BobLd
f883b56e72 completely rework DocstrumBoundingBoxes, now handle rotated text 2020-05-30 13:03:59 +01:00
BobLd
a16f377d5a update DefaultPageSegmenter to use DlaOptions 2020-05-30 13:03:59 +01:00
BobLd
1438fec741 update RecursiveXYCut to use DlaOptions 2020-05-30 13:03:59 +01:00
BobLd
5362a335f5 update XYLeaf with word separator 2020-05-30 13:03:59 +01:00
BobLd
79b78f486a add ReadingOrderHelper 2020-05-30 13:03:59 +01:00
BobLd
8f1ab2022f update NearestNeighbourWordExtractor to use DlaOptions, stop ordering words 2020-05-30 13:03:59 +01:00
BobLd
43a68693ba allow oriented bounding box for TextBlock 2020-05-30 13:03:59 +01:00
BobLd
5b0b0a6db3 allow oriented bounding box for TextLine 2020-05-30 13:03:59 +01:00
BobLd
bb94348127 add text Separator in TextBlock and TextLine 2020-05-30 13:03:59 +01:00
BobLd
5f75205e41 rename TextDirection into TextOrientation 2020-05-30 13:03:59 +01:00
BobLd
33ee66af42 add PageSegmenterOptions abstract class 2020-05-30 13:03:59 +01:00
BobLd
dd546dcfc8 update IPageSegmenter with DlaOptions 2020-05-30 13:03:59 +01:00
BobLd
3cf7c45994 add DlaOptions abstract class 2020-05-30 13:03:59 +01:00
BobLd
a00660cd6e update nn word extractor for new clustering type 2020-05-30 13:03:59 +01:00
BobLd
78da925263 add AlmostEqualsToZero() and AlmostEquals(). 2020-05-30 13:03:59 +01:00
BobLd
08300f6a3a use IReadOnlyList<PdfPoint> i/o PdfPoint[] in KdTree 2020-05-30 13:03:59 +01:00
BobLd
ca4111ec1b better parameters for FindIndexNearest 2020-05-30 13:03:59 +01:00
BobLd
d2c2a2f592 add angle bounding functions 2020-05-30 13:03:59 +01:00
BobLd
404d6621de return grouped elements i/o grouped indexes in Clustering.NearestNeighbours 2020-05-30 13:03:59 +01:00
Eliot Jones
256c2833ab 0.1.2-alpha002 2020-05-10 16:36:14 +01:00
BobLd
bb33741552 Fix KdTree.FindNearestNeighbours(k) returning the pivot itself 2020-05-10 15:49:00 +01:00
Eliot Jones
98dd736f94 0.1.2-alpha001 2020-04-25 15:20:07 +01:00
BobLd
ae82c30a31 Merge branch 'master' of https://github.com/UglyToad/PdfPig 2020-04-25 10:10:08 +01:00