The library should allow to make queries to PDF. E.g. select table rows from tables where first cell text is "Number". The library should execute the passed sequence of operations, like stages in MongoDB.
Sometimes extracting data from PDF require write a lot of boilerplate C# code.
Open a PDF
var pdfBytes = File.ReadAllBytes("my.pdf");
var pdfContainer = new PdfDocument(pdfBytes);
Define the operations sequence
var pdfql = @"
select(tables)
->filter((item) => item.GetCell(4).Text() == 'Name')
->selectMany(tableRows)
->map((item) => item.GetCell(1))";
Run the tree execution
var executor = new PdfqlExecutor();
var result = executor.ExecutePsql(pdfql, pdfContainer);