Eliot Jones
ad0fb4ec5b
version 0.1.3
2020-11-15 12:08:37 -04:00
BobLd
d3b619b6ee
Fix Error CS0136 and tidy up code
2020-09-29 13:50:05 +01:00
BobLd
90dacb1fcf
Improve UnsupervisedReadingOrderDetector customisation: allows changing the spatial reasoning rule in use and using or not rendering order.
2020-09-26 12:29:24 +01:00
Eliot Jones
58ecfbf963
0.1.3-alpha001
2020-09-04 13:19:03 +01:00
Eliot Jones
98af575ee3
version 0.1.2
2020-07-04 16:55:14 +01:00
Eliot Jones
7d166131ad
Merge pull request #186 from BobLd/dupOverlap
...
Implement DuplicateOverlappingTextProcessor
2020-07-01 16:09:48 +01:00
BobLd
4cda06c2fa
make Letter public and flag letter as bold
2020-06-26 10:29:44 +01:00
BobLd
fe8bc0e5be
Implement DuplicateOverlappingTextProcessor
2020-06-25 15:00:20 +01:00
BobLd
4b88f4adbe
correct typo in section numers
2020-06-20 14:27:13 +01:00
BobLd
4f78e58195
remove unnecessary 'inheritdoc'
2020-06-20 14:24:29 +01:00
BobLd
924c0138e0
fix Docstrum's GetTranslatedPoint() to handle dXj=0
2020-06-20 14:19:23 +01:00
Eliot Jones
5fb04582a7
0.1.2-alpha003
2020-06-20 12:54:31 +01:00
BobLd
4312aa470e
minor optimisations
2020-05-30 13:03:59 +01:00
BobLd
3ac26bb1bc
fix bbox for TextLine and TextBlock
2020-05-30 13:03:59 +01:00
BobLd
f883b56e72
completely rework DocstrumBoundingBoxes, now handle rotated text
2020-05-30 13:03:59 +01:00
BobLd
a16f377d5a
update DefaultPageSegmenter to use DlaOptions
2020-05-30 13:03:59 +01:00
BobLd
1438fec741
update RecursiveXYCut to use DlaOptions
2020-05-30 13:03:59 +01:00
BobLd
5362a335f5
update XYLeaf with word separator
2020-05-30 13:03:59 +01:00
BobLd
79b78f486a
add ReadingOrderHelper
2020-05-30 13:03:59 +01:00
BobLd
8f1ab2022f
update NearestNeighbourWordExtractor to use DlaOptions, stop ordering words
2020-05-30 13:03:59 +01:00
BobLd
43a68693ba
allow oriented bounding box for TextBlock
2020-05-30 13:03:59 +01:00
BobLd
5b0b0a6db3
allow oriented bounding box for TextLine
2020-05-30 13:03:59 +01:00
BobLd
bb94348127
add text Separator in TextBlock and TextLine
2020-05-30 13:03:59 +01:00
BobLd
5f75205e41
rename TextDirection into TextOrientation
2020-05-30 13:03:59 +01:00
BobLd
33ee66af42
add PageSegmenterOptions abstract class
2020-05-30 13:03:59 +01:00
BobLd
dd546dcfc8
update IPageSegmenter with DlaOptions
2020-05-30 13:03:59 +01:00
BobLd
3cf7c45994
add DlaOptions abstract class
2020-05-30 13:03:59 +01:00
BobLd
a00660cd6e
update nn word extractor for new clustering type
2020-05-30 13:03:59 +01:00
BobLd
78da925263
add AlmostEqualsToZero() and AlmostEquals().
2020-05-30 13:03:59 +01:00
BobLd
08300f6a3a
use IReadOnlyList<PdfPoint> i/o PdfPoint[] in KdTree
2020-05-30 13:03:59 +01:00
BobLd
ca4111ec1b
better parameters for FindIndexNearest
2020-05-30 13:03:59 +01:00
BobLd
d2c2a2f592
add angle bounding functions
2020-05-30 13:03:59 +01:00
BobLd
404d6621de
return grouped elements i/o grouped indexes in Clustering.NearestNeighbours
2020-05-30 13:03:59 +01:00
Eliot Jones
256c2833ab
0.1.2-alpha002
2020-05-10 16:36:14 +01:00
BobLd
bb33741552
Fix KdTree.FindNearestNeighbours(k) returning the pivot itself
2020-05-10 15:49:00 +01:00
Eliot Jones
98dd736f94
0.1.2-alpha001
2020-04-25 15:20:07 +01:00
BobLd
ae82c30a31
Merge branch 'master' of https://github.com/UglyToad/PdfPig
2020-04-25 10:10:08 +01:00
BobLd
c2de52423e
Make NearestNeighbours public
2020-04-25 08:40:43 +01:00
BobLd
d4210cd5d1
Make clustering algos public and use shorter names
2020-04-25 08:40:43 +01:00
BobLd
0a6ec3946b
NearestNeighbourWordExtractor:
...
- Improve results by using PointSize
- Make 'filterFunction' public for ad hoc GetWords()
- Allow text in different direction
Make Letter.PointSize public and add warning (needed for NNWordExtractor)
Remove Page.GetPointSize(Letter letter)
2020-04-20 13:09:35 +01:00
Eliot Jones
407ee5ca51
add content order text extractor and example of use
2020-04-19 17:06:34 +01:00
BobLd
ab94746252
Improve default RecursiveXYCut dominant font width and height functions
2020-04-13 10:25:34 +01:00
Eliot Jones
75c04eb81c
fix namespace using order
2020-04-05 17:58:57 +01:00
Eliot Jones
2a0a3fae69
add test for svg exporter and escape xml characters
2020-04-05 17:58:57 +01:00
Eliot Jones
2998b1b2eb
tidy up clipper related code
2020-04-05 17:58:57 +01:00
BobLd
c14646fcc0
Fix Mode()
2020-04-05 17:58:57 +01:00
BobLd
a7fe39fc32
Add SvgTextExporter
...
Modify WriteSvg to account for height
2020-04-05 17:58:57 +01:00
BobLd
43b40da5d5
Change Subpath to path where necessary
2020-04-05 17:58:57 +01:00
BobLd
ab6a0f11fc
Change name from PdfPath to PdfSubpath
2020-04-05 17:58:57 +01:00
Eliot Jones
9abe9f4b2f
#158 add strong naming to the solution
2020-04-04 16:59:51 +01:00