SNOW-2111386: Feature Store - Duplicate Feature Names in Feature Views Cause Issues with Generated Datasets

Hi,

I am experiencing an issue when using multiple feature views with features that share the same name, such as avg_balance, which leads to confusion in the final dataset. Here’s the scenario:

I have a spine dataframe with the following structure:

| user_id | state_id | postal_code | timestamp |

Additionally, I have three feature views corresponding to three entities: user, state, and postal_code. Each feature view contains a feature named avg_balance. When I create a dataset from these feature views, the resulting dataframe includes multiple columns with the same name (avg_balance), which makes it difficult to work with the data.

Current behavior:
The generated dataset has duplicated feature names (e.g., avg_balance), causing issues when trying to manipulate or analyze the data.

Expected behavior:
It would be helpful to add an option to automatically append a prefix or suffix to the feature names in the generated dataset to avoid these naming conflicts.

For example:

user_avg_balance

state_avg_balance

postal_code_avg_balance

This would help clarify the origin of each feature and make the dataframe easier to work with.

Suggestion:
Could an option be implemented to automatically add a prefix or suffix to features with the same name when merging datasets from multiple feature views?

Thank you for your support!



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

SNOW-2111386: Feature Store - Duplicate Feature Names in Feature Views Cause Issues with Generated Datasets #156

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

SNOW-2111386: Feature Store - Duplicate Feature Names in Feature Views Cause Issues with Generated Datasets #156

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions