[Python] Add API to map Arrow types (including extension types) to pandas ExtensionArray instances for to_pandas conversions #18536

With the next release of pandas, it will be possible to define custom column types that back a `pandas.Series`. Since we won't be aware of every extension array, the default `to_pandas` conversion will not be able to cover all possible column types.

To enable users to create `ExtensionArray` instances from Arrow columns in the `to_pandas` conversion, we should provide a hook in the `to_pandas` call that lets them override the default conversion routines with ones that produce their `ExtensionArray` instances.

This should avoid the additional copies that occur today, where we first convert the Arrow column into a default pandas column (probably of object dtype) and the user afterwards converts it to a more efficient `ExtensionArray`. The hook will be especially useful for building `ExtensionArray`s whose storage is backed by Arrow.

The meta-issue that tracks the implementation inside of Pandas is: https://github.com/pandas-dev/pandas/issues/19696

**Reporter**: [Uwe Korn](https://issues.apache.org/jira/browse/ARROW-2428) / @xhochy
**Assignee**: [Joris Van den Bossche](https://issues.apache.org/jira/browse/ARROW-2428) / @jorisvandenbossche
#### Related issues:
- [[Python] support pandas' nullable Integer type in from_pandas](https://github.com/apache/arrow/issues/21838) (blocks)
- [[Python] Add API to map Arrow types to pandas ExtensionDtypes for to_pandas conversions](https://github.com/apache/arrow/issues/23827) (relates to)
- [[Python] Interface for converting pandas ExtensionArray / other custom array objects to pyarrow Array](https://github.com/apache/arrow/issues/21741) (relates to)
- [[Python] Provide Python API for creating user-defined data types that can survive Arrow IPC](https://github.com/apache/arrow/issues/15390) (depends upon)
#### PRs and other links:
- [GitHub Pull Request #5512](https://github.com/apache/arrow/pull/5512)

<sub>**Note**: *This issue was originally created as [ARROW-2428](https://issues.apache.org/jira/browse/ARROW-2428). Please see the [migration documentation](https://github.com/apache/arrow/issues/14542) for further details.*</sub>

