Skip to content

Commit

Permalink
update README and add summary stats func
Browse files Browse the repository at this point in the history
  • Loading branch information
jessiezhang24 committed Jan 9, 2025
1 parent 6da1d59 commit fe96d94
Show file tree
Hide file tree
Showing 2 changed files with 22 additions and 0 deletions.
1 change: 1 addition & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -13,6 +13,7 @@ Catherine Meng, Jessie Zhang, Zheng He
- **`missing_value_summary`**\
This function is to provide a summary of missing values in the dataset.
- **`get_summary_statistics`**
Generate summary statistics for specified columns or all columns if none are provided.

## Installation

Expand Down
21 changes: 21 additions & 0 deletions src/pyeda/data_summary.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,21 @@
def get_summary_statistics(df, col = None):
"""
Generate summary statistics for specified columns or all columns if none are provided.
This function will return the important statistics (e.g. mean, min, std) for numeric columns, as well as
some key metrics (e.g. count, unique) for non-numeric columns.
Parameters
----------
df : pd.DataFrame
The dataframe containing the data for analysis.
col : list or None
A list of column names for which to get statistics.
Default value is None, the function will apply for all columns.
Returns
----------
pd.DataFrame
A DataFrame with summary statistics for the specified columns.
"""
pass

0 comments on commit fe96d94

Please sign in to comment.