Spice.ai Catalog Connector
Query datasets hosted in the Spice.ai Cloud Platform. Discover available public datasets on Spicerack.
Configuration​
Create a Spice.ai Cloud Platform account and login with the CLI using spice login.
Example:
catalogs:
  - from: spice.ai:demo-org/tpch # Load tables from the `demo-org` organization's `tpch` app
    name: marketplace # Tables will be available in the "marketplace" catalog
    params:
      spiceai_api_key: ${secrets:SPICEAI_API_KEY} # Spice.ai API key to login to the organization and app
    include:
      - "tpch.part*" # include only the tables from the "tpch" schema and that start with "part"
      - "tpch.supplier" # also include the "supplier" table
This configuration would load the following tables:
sql> show tables;
+---------------+--------------+----------------+------------+
| table_catalog | table_schema | table_name     | table_type |
+---------------+--------------+----------------+------------+
| marketplace   | tpch         | part           | BASE TABLE |
| marketplace   | tpch         | partsupp       | BASE TABLE |
| marketplace   | tpch         | supplier       | BASE TABLE |
+---------------+--------------+----------------+------------+
from​
The from field specifies which organization and application to load tables from. The format is:
spice.ai/[organization]/[application][/catalog_name]
- organization: The Spice.ai organization that owns the application
- application: The specific application to load tables from
- catalog_name: (optional): A specific catalog within the application. If not specified, the application's default catalog is used
For example:
- Load the default catalog from the "tpch" application in the "demo-org" organization
catalogs:
  - from: spice.ai/demo-org/tpch
    name: marketplace
This will create tables like:
SHOW TABLES;
+---------------+--------------+----------------+------------+
| table_catalog | table_schema | table_name     | table_type |
+---------------+--------------+----------------+------------+
| marketplace   | tpch         | region         | BASE TABLE |
| marketplace   | tpch         | orders         | BASE TABLE |
| marketplace   | tpch         | part           | BASE TABLE |
| marketplace   | tpch         | supplier       | BASE TABLE |
| marketplace   | tpch         | customer       | BASE TABLE |
| marketplace   | tpch         | partsupp       | BASE TABLE |
| marketplace   | tpch         | lineitem       | BASE TABLE |
| marketplace   | tpch         | nation         | BASE TABLE |
+---------------+--------------+----------------+------------+
- Load a specific catalog named "custom_catalog" from an application
catalogs:
  - from: spice.ai/demo-org/tpch/custom_catalog
    name: marketplace
If the remote application has tables in custom_catalog like:
SHOW TABLES;
+------------------+--------------+----------------+------------+
| table_catalog    | table_schema | table_name     | table_type |
+------------------+--------------+----------------+------------+
| custom_catalog   | ice_schema1  | table1         | BASE TABLE |
| custom_catalog   | ice_schema1  | table2         | BASE TABLE |
| custom_catalog   | ice_schema2  | table1         | BASE TABLE |
+------------------+--------------+----------------+------------+
They will be available locally as:
SHOW TABLES;
+------------------+--------------+----------------+------------+
| table_catalog    | table_schema | table_name     | table_type |
+------------------+--------------+----------------+------------+
| marketplace      | ice_schema1  | table1         | BASE TABLE |
| marketplace      | ice_schema1  | table2         | BASE TABLE |
| marketplace      | ice_schema2  | table1         | BASE TABLE |
+------------------+--------------+----------------+------------+
name​
The name field defines what catalog name the tables will be available under in the local Spice instance. For example, with the following configuration:
from: spice.ai/demo-org/tpch
name: marketplace
Then tables that exist in the remote application as:
spice # Default catalog
  |- schema1
     |- table1
     |- table2
Will be available locally as:
marketplace
  |- schema1
     |- table1
     |- table2
Queries are run against the marketplace catalog, like SELECT * FROM marketplace.schema1.table1.
include​
Use the include field to specify which tables to include from the catalog. The include field supports glob patterns to match multiple tables:
- schema_name.*- Include all tables from a specific schema
- *.table_name- Include all tables with a specific name from any schema
- schema_name.table_name- Include a specific table from a schema
- schema_name.table_prefix*- Include all tables in a schema that start with a prefix
Multiple include patterns can be specified and are OR'ed together. For example:
include:
  - "tpch.part*" # Include all tables from the "tpch" schema that start with "part"
  - "tpch.supplier" # Include the "supplier" table from the "tpch" schema
params​
The following parameters are supported for configuring the connection to the Spice Cloud catalog/tables:
| Parameter Name | Definition | 
|---|---|
| spiceai_api_key | Authorization API key from the Spice.ai Cloud Platform, used to login to the specified organization and app. | 
Cookbook​
- A cookbook recipe to configure Spice Cloud as a catalog connector in Spice. Spice Cloud Catalog Connector
