Data Extraction Powered by AI

The Data Extraction feature in Folgo lets you extract specific information from PDFs and other file formats. For example, you can select a folder containing invoices and ask Folgo to retrieve the Date, Amount, Invoice Number and any other information you would like to extract from your PDFs.

In this article 

How to use it

To use that feature, select the folder containing the files with data to extract, then click on the "Data extraction" menu in Folgo.

It's important to remember that you don't have to use the exact keywords that are present in your PDFs. For instance, if your PDF contains a date but doesn't explicitly mention the word "Date," Folgo will still be able to extract the relevant content, such as the date that is visible within the document. This means that you have flexibility in your search queries and can focus on the information you need, regardless of the specific terms used in the PDF.


In the same way as other processes within Folgo, a Google Sheet will be generated containing a comprehensive list of all the files that are located in your designated folder. Once this is done, Folgo will proceed to systematically open each of these files, extract the specific data that is necessary, and seamlessly incorporate it into your pre-existing spreadsheet.


How does it work?

The content of each file in your selected folder will be extracted and sent to a Generative AI API along with the list of keywords to extract. Folgo currently uses GPT-4, which offers the best results in most cases.

OpenAI API Terms clearly state: "We will not use Customer Content to develop or improve the Services."

As your content is stored in Google Drive and already indexed by Google, you might prefer your content to be processed via a Generative AI API made available by Google, like Gemini. Contact our support if you prefer to use that service instead of GPT-4 inside Folgo Data Extraction feature.

Did this answer your question? Thanks for the feedback There was a problem submitting your feedback. Please try again later.

Still need help? Contact Us Contact Us