Skip to content

SNOW-2111386: Feature Store - Duplicate Feature Names in Feature Views Cause Issues with Generated Datasets #156

Open
@ArtemioPadilla

Description

@ArtemioPadilla

Hi,

I am experiencing an issue when using multiple feature views with features that share the same name, such as avg_balance, which leads to confusion in the final dataset. Here’s the scenario:

I have a spine dataframe with the following structure:

| user_id | state_id | postal_code | timestamp |

Additionally, I have three feature views corresponding to three entities: user, state, and postal_code. Each feature view contains a feature named avg_balance. When I create a dataset from these feature views, the resulting dataframe includes multiple columns with the same name (avg_balance), which makes it difficult to work with the data.

Current behavior:
The generated dataset has duplicated feature names (e.g., avg_balance), causing issues when trying to manipulate or analyze the data.

Expected behavior:
It would be helpful to add an option to automatically append a prefix or suffix to the feature names in the generated dataset to avoid these naming conflicts.

For example:

user_avg_balance

state_avg_balance

postal_code_avg_balance

This would help clarify the origin of each feature and make the dataframe easier to work with.

Suggestion:
Could an option be implemented to automatically add a prefix or suffix to features with the same name when merging datasets from multiple feature views?

Thank you for your support!

Metadata

Metadata

Labels

deferredThe feature request has been defferred

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions