Class GeoParquetDialect


  • public class GeoParquetDialect
    extends DuckDBDialect
    SQL Dialect for GeoParquet format.

    This dialect extends the base DuckDB dialect with GeoParquet-specific functionality:

    • Parsing and utilizing GeoParquet metadata from the "geo" field
    • Setting up appropriate SQL views for GeoParquet files
    • Optimizing spatial operations and bounds computations
    • Handling both local and remote (HTTP, S3) GeoParquet data access

    The dialect extracts and uses the GeoParquet specification metadata to provide improved performance for operations like bounds computation and feature access. It supports both standard GeoParquet format (1.1.0) and development versions (1.2.0-dev).

    • Constructor Detail

      • GeoParquetDialect

        public GeoParquetDialect​(JDBCDataStore dataStore,
                                 URI uri)
    • Method Detail

      • initializeConnection

        public void initializeConnection​(Connection cx)
                                  throws SQLException
        Description copied from class: SQLDialect
        Initializes a newly created database connection.

        Subclasses should override this method if there is some additional action that needs to be taken when a new connection to the database is created. The default implementation does nothing.

        Overrides:
        initializeConnection in class SQLDialect
        Parameters:
        cx - The new database connection.
        Throws:
        SQLException
      • getOptimizedBounds

        public List<ReferencedEnvelope> getOptimizedBounds​(String schema,
                                                           SimpleFeatureType featureType,
                                                           Connection cx)
                                                    throws SQLException,
                                                           IOException
        If possible, return the full dataset bounds from the geo geoparquet metadata field.

        Otherwise check if there's a bbox column even if the geoparquet metadata is not present (e.g. OvertureMaps release 2025-02-19.0), and compute the xmin,xmax,ymin,ymax bounds using aggregate functions, which will perform better since it uses the column stats.

        Finally, fall back to DuckDBDialect.getOptimizedBounds(String, SimpleFeatureType, Connection) for a more general (and slow) ST_Extent_Agg(geometry) call.

        Overrides:
        getOptimizedBounds in class DuckDBDialect
        Parameters:
        schema - The database schema, if any, or null
        featureType - The feature type containing the geometry columns whose bounds need to computed. Mind, it may be retyped and thus contain less geometry columns than the table
        Returns:
        a list of referenced envelopes (some of which may be null or empty)
        Throws:
        SQLException
        IOException
      • getGeometrySRID

        public Integer getGeometrySRID​(String schemaName,
                                       String tableName,
                                       String columnName,
                                       Connection cx)
                                throws SQLException
        Description copied from class: SQLDialect
        Returns the spatial reference system identifier (srid) for a particular geometry column.

        This method is given a direct connection to the database. The connection must not be closed. However any statements or result sets instantiated from the connection must be closed.

        In the event that the srid cannot be determined, this method should return null .

        Overrides:
        getGeometrySRID in class DuckDBDialect
        Parameters:
        schemaName - The database schema, could be null.
        tableName - The table, never null.
        columnName - The column name, never null
        cx - The database connection.
        Throws:
        SQLException