feat: add dataframe duolicated issue - #667 #669

RahulDas-dev · 2025-05-03T09:13:56Z

This merge request adds a new [duplicated()] method to the DataFrame class that identifies duplicate rows within a DataFrame. This functionality is essential for data cleaning and exploration workflows.

Resolve the issue - #667

Features

Identifies duplicate rows in a DataFrame based on specified columns
Returns a Series of boolean values marking duplicate entries
Supports flexible options for handling duplicates:
- keep: 'first' - Mark duplicates except for the first occurrence (default)
- keep: 'last'- Mark duplicates except for the last occurrence
- keep: false - Mark all duplicates
  Allows focusing on specific columns with the subset option

Implementation Details

Optimized to handle large datasets efficiently with a hash-based approach
Comprehensive input validation for better error handling
Well-documented with JSDoc comments and examples

// Create a DataFrame with duplicate rows const df = new DataFrame({ 'A': [1, 2, 2, 3, 3], 'B': ['a', 'b', 'b', 'c', 'c'] }); // Find duplicates keeping first occurrence (default) const dups = df.duplicated(); // Returns: [false, false, true, false, true] // Find duplicates keeping last occurrence const dupsLast = df.duplicated({ keep: 'last' }); // Returns: [false, true, false, true, false] // Find duplicates based on specific columns const dupsSubset = df.duplicated({ subset: ['B'] }); // Returns: [false, false, true, false, true]

Signed-off-by: rahuldas-dev <r.das699@gmail.com>

feat: add dataframe duolicated

5b19f23

Signed-off-by: rahuldas-dev <r.das699@gmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

feat: add dataframe duolicated issue - #667 #669

feat: add dataframe duolicated issue - #667 #669

Uh oh!

RahulDas-dev commented May 3, 2025

Labels

1 participant

Uh oh!

feat: add dataframe duolicated issue - #667 #669

Are you sure you want to change the base?

feat: add dataframe duolicated issue - #667 #669

Uh oh!

Conversation

RahulDas-dev commented May 3, 2025

Labels

1 participant