> ## Documentation Index
> Fetch the complete documentation index at: https://docs.roe-ai.com/llms.txt
> Use this file to discover all available pages before exploring further.

# Quickstart

> Work with your unstructured data at scale in under 5 minutes

## Upload Your Unstructured Data

<Steps>
  <Step title="Navigate to the Roe Dataset page">
    <Frame>
      <img src="https://mintcdn.com/roeai/wyVOyeWPONjXHsrt/images/database-quickstart-roe-dataset.png?fit=max&auto=format&n=wyVOyeWPONjXHsrt&q=85&s=4ccc25162b10902a2ee5d08e2f0433df" width="1926" height="858" data-path="images/database-quickstart-roe-dataset.png" />
    </Frame>
  </Step>

  <Step title="Upload a file to the default dataset">
    For the purpose of this quickstart, we will use the same invoice PDF file as `invoice.pdf` file in the `invoice-extraction-example` dataset. Go to that dataset and download the file first. Then upload to the `default` dataset.

    <Frame caption="View the default dataset">
      <img src="https://mintcdn.com/roeai/wyVOyeWPONjXHsrt/images/database-quickstart-upload-file-1.png?fit=max&auto=format&n=wyVOyeWPONjXHsrt&q=85&s=a91a0e4573f47661b5927b1ea7f5747d" width="1990" height="162" data-path="images/database-quickstart-upload-file-1.png" />
    </Frame>

    <br />

    <Frame caption="Upload your files">
      <img src="https://mintcdn.com/roeai/wyVOyeWPONjXHsrt/images/database-quickstart-upload-file-2.png?fit=max&auto=format&n=wyVOyeWPONjXHsrt&q=85&s=59d2612e60e2dfc9078d8bcde4cf1ff9" width="1982" height="136" data-path="images/database-quickstart-upload-file-2.png" />
    </Frame>
  </Step>

  <Step title="View your uploaded files">
    <Frame caption="Once uploaded, your files will be listed here">
      <img src="https://mintcdn.com/roeai/wyVOyeWPONjXHsrt/images/database-quickstart-uploaded-file.png?fit=max&auto=format&n=wyVOyeWPONjXHsrt&q=85&s=9faf17833fd58f832389305de3c85075" width="1988" height="382" data-path="images/database-quickstart-uploaded-file.png" />
    </Frame>
  </Step>
</Steps>

## View Your Data

Click on the Workspace tab to go to ROE SQL workspace.
A default table called `dataset_default` has been created for you. Any new
files uploaded to the `default` dataset will automatically be added to this table as well. You can check the table by clicking `dataset_default` table in the table list.

<Frame>
  <img src="https://mintcdn.com/roeai/wyVOyeWPONjXHsrt/images/database-quickstart-default-table.png?fit=max&auto=format&n=wyVOyeWPONjXHsrt&q=85&s=70e6ceee0372a7df46c004e0705edc58" width="2454" height="836" data-path="images/database-quickstart-default-table.png" />
</Frame>

Or you can run the following SQL query to view the data in a new worksheet tab:

```sql theme={null}
SELECT * FROM dataset_default;
```

<Frame>
  <img src="https://mintcdn.com/roeai/wyVOyeWPONjXHsrt/images/database-quickstart-select-default-table.png?fit=max&auto=format&n=wyVOyeWPONjXHsrt&q=85&s=f52256bc9c1637710013b1b427185c77" width="2458" height="1148" data-path="images/database-quickstart-select-default-table.png" />
</Frame>

## Process Your Data

Now that you have uploaded your data, you can start processing it using the
built-in functions. For example, you can extract information from the invoice
PDF with the following SQL query:

```sql theme={null}
-- You're shown three example worksheets for three different use cases. Take a moment to read through each one!
-- This worksheet extracts information from the invoice PDF as specified by the return format requested. Try running it!

SELECT name, file, extract_from(
  'returns following structure: {
    from: <from company name>,
    recipient: <recipient company name>,
    line_items: [
      {
        name: <item name>
        quantity: <item quantity>
        cost: <item total cost>
      },
      <more line items>
    ],
    subtotal: <amount before tax>
    tax: <tax amount>
    total: <total amount due>
  }', -- extract_from() uses the Default Extraction agent, so you don't need to specify an agent
  file
) FROM dataset_examples WHERE dataset='invoice-extraction-example'; -- You can run on all datasets in the table or a specific one (what we do here)
```

The result of the `extract_from` function is a reference to the async job. Once
the job is completed, you can view the extracted data in the worksheet by clicking on the Job ID.

<Frame>
  <img src="https://mintcdn.com/roeai/wyVOyeWPONjXHsrt/images/database-quickstart-result.png?fit=max&auto=format&n=wyVOyeWPONjXHsrt&q=85&s=a0c091864094aaf1d89a507bb5e07f30" width="2004" height="636" data-path="images/database-quickstart-result.png" />
</Frame>

## Next Steps

### Learn More About Roe SQL

We are based on clickhouse dialect, learn more about clickhouse SQL dialect in their official
[documentation](https://clickhouse.com/docs).

Also check out our [examples](/database/examples/introduction) to see how you
can use SQL to process your data.

### Learn More About Roe SQL Functions

`extract_from` is just one of the many functions you can use to process your
data. We have a variety of functions available to help you process your data
with ROE AI Agents. You can learn more about the functions available in the
[Functions](/database/functions/introduction) section.
