Data Extraction Powered by AI

The Data Extraction feature in Folgo lets you extract specific information from PDFs and other file formats. For example, you can select a folder containing invoices and ask Folgo to retrieve the Date, Amount, Invoice Number and any other information you would like to extract from your PDFs.


How to use it

To use this feature, select the folder containing the files you want to extract data from, then click the "Data extraction" menu in Folgo.

You don't have to use the exact keywords that appear in your PDFs. For instance, if a PDF contains a date but never uses the word "Date," Folgo can still extract the date shown in the document. You can therefore name the information you need without worrying about the specific terms used in the PDF.


As with other processes in Folgo, a Google Sheet is generated listing all the files in your selected folder. Folgo then opens each file, extracts the requested data, and adds it to that spreadsheet.
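To picture the resulting sheet, here is a minimal Python sketch of one row per file with one column per requested field. It is an illustration only, not Folgo's actual code: the function name `build_sheet_rows`, the "File name" column header, and the `extract` callback are all hypothetical.

```python
from typing import Callable, Dict, List


def build_sheet_rows(
    file_names: List[str],
    fields: List[str],
    extract: Callable[[str], Dict[str, str]],
) -> List[List[str]]:
    """Return spreadsheet rows: a header row, then one row per file.

    `extract` is a hypothetical callback that returns the values found
    in a given file, keyed by field name.
    """
    # Header: the file name followed by each requested field (assumed layout).
    rows = [["File name", *fields]]
    for name in file_names:
        values = extract(name)
        # Fields that were not found in the file are left as empty cells.
        rows.append([name, *(values.get(field, "") for field in fields)])
    return rows
```

For a folder holding a single invoice where only the date was found, the result would be a header row followed by one row with the file name, the date, and an empty Amount cell.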


How does it work?

The content of each file in your selected folder is extracted and sent to a generative AI API along with the list of keywords to extract. Folgo now uses Google’s Gemini model by default, so your data is processed within Google’s ecosystem and in line with Workspace security standards.
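The step described above, sending a file's text together with a list of keywords, can be sketched roughly as follows in Python. This is a sketch under stated assumptions rather than Folgo's implementation: the prompt wording, the JSON reply format, and the names `build_extraction_prompt` and `parse_extraction_reply` are hypothetical, and the call to the AI model itself is left out.

```python
import json
from typing import Dict, List, Optional


def build_extraction_prompt(file_text: str, fields: List[str]) -> str:
    """Combine the file content and the requested fields into one prompt.

    The wording of the prompt is an assumption made for illustration.
    """
    field_list = "\n".join(f"- {field}" for field in fields)
    return (
        "Extract the following fields from the document below. "
        "Reply with a single JSON object keyed by field name, "
        "using null for any field that is missing.\n\n"
        f"Fields:\n{field_list}\n\n"
        f"Document:\n{file_text}"
    )


def parse_extraction_reply(reply: str, fields: List[str]) -> Dict[str, Optional[str]]:
    """Keep only the requested fields from a JSON reply (assumed format)."""
    data = json.loads(reply)
    # Fields absent from the reply become None; extra keys are ignored.
    return {field: data.get(field) for field in fields}
```

Because the prompt names the information wanted rather than searching for literal text, a field such as "Date" can be filled even when the document never uses that word.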

You can review Google Gemini’s Terms of Service for more details on how your content is handled.

Since your content is stored in Google Drive and already indexed by Google, Gemini is often the preferred choice for organizations concerned with keeping data processing within a single cloud provider.
