@uros-db
Contributor

@uros-db uros-db commented Oct 28, 2025

What changes were proposed in this pull request?

Introduce the catalyst-internal geospatial in-memory layout for Geometry and Geography: an SRID (integer value) header followed by a WKB (byte array) payload. Also add a core conversion API to retrieve these values from the server-side execution wrappers.
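For readers unfamiliar with this kind of layout, here is a minimal sketch of an "SRID header + WKB payload" encoding and the matching extraction. It is not the code from this PR: the class name `GeoLayoutSketch`, its method names, the 4-byte header width, and the little-endian byte order are all assumptions made for illustration.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;
import java.util.Arrays;

// Hypothetical helper for illustration only; not a class added by this PR.
final class GeoLayoutSketch {
  // Assumption: the SRID header is a 4-byte integer stored little-endian.
  static final int HEADER_SIZE = Integer.BYTES;
  static final ByteOrder HEADER_ORDER = ByteOrder.LITTLE_ENDIAN;

  // Builds the in-memory value: SRID header followed by the raw WKB bytes.
  static byte[] encode(int srid, byte[] wkb) {
    ByteBuffer buf = ByteBuffer.allocate(HEADER_SIZE + wkb.length).order(HEADER_ORDER);
    buf.putInt(srid);
    buf.put(wkb);
    return buf.array();
  }

  // Reads the SRID from the header without touching the payload.
  static int srid(byte[] value) {
    requireHeader(value);
    return ByteBuffer.wrap(value).order(HEADER_ORDER).getInt(0);
  }

  // Returns a copy of the WKB payload that follows the header.
  static byte[] wkb(byte[] value) {
    requireHeader(value);
    return Arrays.copyOfRange(value, HEADER_SIZE, value.length);
  }

  // Rejects values that are too short to contain the SRID header.
  private static void requireHeader(byte[] value) {
    if (value == null || value.length < HEADER_SIZE) {
      throw new IllegalArgumentException("value is too short to contain an SRID header");
    }
  }
}
```

Under these assumptions, `GeoLayoutSketch.encode(4326, wkb)` yields a byte array from which `srid(...)` recovers `4326` and `wkb(...)` recovers the original WKB bytes unchanged.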

Why are the changes needed?

Establish a consistent representation/encoding for geospatial types, with a clear contract with the Geometry and Geography classes. This also provides minimal, safe SRID and WKB extraction in execution and lays the groundwork for standard format support, conversions, etc.

Does this PR introduce any user-facing change?

No.

How was this patch tested?

Added the appropriate unit tests to validate SRID and WKB extraction from the in-memory data formats:

  • GeographyExecutionSuite
  • GeometryExecutionSuite

Was this patch authored or co-authored using generative AI tooling?

No.

@github-actions github-actions bot added the SQL label Oct 28, 2025
Contributor Author

@uros-db uros-db left a comment

@mkaravel @cloud-fan Please review.

@cloud-fan
Contributor

thanks, merging to master!

@cloud-fan cloud-fan closed this in 1940868 Oct 29, 2025
@dongjoon-hyun
Member

@uros-db and @cloud-fan, the PR seems to have the wrong JIRA ID.

  • SPARK-51658 SPIP: Add geospatial types in Spark
  • SPARK-54060 Introduce Geometry and Geography in-memory wrapper formats

@cloud-fan cloud-fan changed the title from "[SPARK-51658][Geo][SQL] Introduce Geometry and Geography in-memory wrapper formats" to "[SPARK-54060][Geo][SQL] Introduce Geometry and Geography in-memory wrapper formats" Oct 29, 2025
@uros-db
Contributor Author

uros-db commented Oct 29, 2025

Thank you @dongjoon-hyun @cloud-fan.

@cloud-fan
Contributor

Sorry, I was too careless. I have reopened the SPIP JIRA ticket and closed the actual ticket.

@dongjoon-hyun
Member

Thank you for correcting. :)

Yicong-Huang pushed a commit to Yicong-Huang/spark that referenced this pull request Oct 30, 2025
…apper formats

Closes apache#52761 from uros-db/geo-memory-repr.

Authored-by: Uros Bojanic <uros.bojanic@databricks.com>
Signed-off-by: Wenchen Fan <wenchen@databricks.com>