Quickstart
Work with your unstructured data at scale in under 5 minutes
Upload Your Unstructured Data
Navigate to the Roe Dataset page
Upload a file to the default dataset
For the purpose of this quickstart, we will use the same invoice PDF file as invoice.pdf
file in the invoice-extraction-example
dataset. Go to that dataset and download the file first. Then upload to the default
dataset.
View the default dataset
Upload your files
View your uploaded files
Once uploaded, your files will be listed here
View Your Data
Click on the Workspace tab to go to VolansDB.
A default table called dataset_default
has been created for you. Any new
files uploaded to the default
dataset will automatically be added to this table as well. You can check the table by clicking dataset_default
table in the table list.
Or you can run the following SQL query to view the data in a new worksheet tab:
Process Your Data
Now that you have uploaded your data, you can start processing it using the built-in functions. For example, you can extract information from the invoice PDF with the following SQL query:
The result of the extract_from
function is a reference to the async job. Once
the job is completed, you can view the extracted data in the worksheet by clicking on the Job ID.
Next Steps
Learn More About VolansDB SQL
We are based on Clickhouse, so you can use the full power of Clickhouse SQL to process your data. You can learn more about Clickhouse SQL in their official documentation.
Also check out our examples to see how you can use SQL to process your data.
Learn More About VolansDB Functions
extract_from
is just one of the many functions you can use to process your
data. We have a variety of functions available to help you process your data
with VolansDB Agents. You can learn more about the functions available in the
Functions section.