General Category => General Discussion => Topic started by: knarf on December 12, 2018, 08:21:39 AM

Title: empty cells
Post by: knarf on December 12, 2018, 08:21:39 AM
I have a table in a PDF file. One cell is empty.
Does Ivy skip empty cells?
Can that be prevented?
When I go right from the first cell to the second, it goes to the third cell and skips the second.
I tested this in the Template editor and in C#.
Title: Re: empty cells
Post by: admin on December 12, 2018, 09:07:35 AM

Unfortunately, PDFs don't have a concept of a "table". Every piece of information (line, word, sometimes every character) is a separate object, unrelated to the others. In Ivy we call them "tokens", and the Left() / Right() functions simply move to the token on the left or on the right of the current one. An empty cell contains no token, so Right() from the first cell lands on the next token that actually exists, which is the one in the third cell. Therefore, this behavior is correct.
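
To illustrate why the empty cell gets skipped, here is a minimal Python sketch of token-based navigation. It is not Ivy's API or its implementation: the Token class, the right_of() helper, and the coordinates are all made up for the example, and it only assumes that each token carries a page position.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Token:
    # Hypothetical token: a piece of text plus its position on the page.
    text: str
    x: float  # left edge
    y: float  # baseline


def right_of(current: Token, tokens: List[Token], y_tol: float = 2.0) -> Optional[Token]:
    """Return the nearest token to the right of `current` on the same line."""
    same_line = [
        t for t in tokens
        if abs(t.y - current.y) <= y_tol and t.x > current.x
    ]
    # The closest token wins; an empty cell contributes no candidate at all.
    return min(same_line, key=lambda t: t.x, default=None)


# A three-column row whose middle cell is empty: nothing exists near x=200.
row = [Token("Widget", 100.0, 500.0), Token("9.99", 300.0, 500.0)]
next_token = right_of(row[0], row)
print(next_token.text)  # 9.99
```

Since the page holds nothing at the position of the second cell, the search has no way to stop there.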

Ivy has a few algorithms that can convert tokens into real tables. They are based on statistical analysis, so results may vary depending on the complexity and "cleanliness" of the table. Please test this by selecting one of the header tokens and clicking the "Table" button in the toolbox. There are various options you can try to improve extraction, such as using the border lines, the amount of whitespace after the table, and so on. Please refer to the "Table extraction" section in the Ivy documentation for more information, examples and tutorials.
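
As a rough idea of how a table can be rebuilt from tokens so that empty cells are kept, here is a small Python sketch. It is a simple nearest-header heuristic, not Ivy's statistical algorithm, and the row_to_cells() helper, the header names, and the x positions are invented for the example.

```python
from typing import Dict, List, Tuple


def row_to_cells(
    headers: List[Tuple[str, float]],
    row_tokens: List[Tuple[str, float]],
) -> Dict[str, str]:
    """Place each (text, x) token under the header whose x position is nearest."""
    # Start with every column empty, so a column with no token stays "".
    cells = {name: "" for name, _ in headers}
    for text, x in row_tokens:
        nearest = min(headers, key=lambda h: abs(h[1] - x))[0]
        cells[nearest] = (cells[nearest] + " " + text).strip()
    return cells


# Hypothetical header tokens and one data row with an empty middle cell.
headers = [("Item", 100.0), ("Qty", 200.0), ("Price", 300.0)]
row = [("Widget", 102.0), ("9.99", 298.0)]
cells = row_to_cells(headers, row)
print(cells)  # {'Item': 'Widget', 'Qty': '', 'Price': '9.99'}
```

The header row defines the columns up front, so the empty cell shows up as an empty string instead of disappearing. Real tables with wrapped text, merged cells or misaligned columns need more than this, which is where the extraction options mentioned above come in.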