read last line of ignore file (#1155)

* read last line of ignore file

- do not cancel other matrix jobs if one test fails
- read all lines of the ignore list even if it doesn't end with a newline
- add ignore list for 0008 and 0009

* support missing object numbers when brute-forcing

the file 10404 (ironically) contains not found references with number 43 0
for its info dictionary. changes brute-force code so that objects can be
entirely missing

* fix test since document is now opened successfully but mediabox is broken
This commit is contained in:
Eliot Jones
2025-09-13 16:57:35 +02:00
committed by GitHub
parent c96880ac61
commit 07df6fd740
4 changed files with 27 additions and 9 deletions

View File

@@ -10,6 +10,7 @@ jobs:
build:
runs-on: ubuntu-latest
strategy:
fail-fast: false
matrix:
pair: ["0000-0001", "0002-0003", "0004-0005", "0006-0007"]
@@ -45,7 +46,7 @@ jobs:
- name: Remove unwanted test files
run: |
while read f; do
while read f || [ -n "$f" ]; do
full="corpus/extracted/$f"
if [ -f "$full" ]; then
echo "Removing $full"