data_analysis_plotting_tools.AnalysisTool
Tool to facilitate data set analysis.
Module Contents
Classes
AnalysisTool | Tool to facilitate data set analysis.
- class data_analysis_plotting_tools.AnalysisTool.AnalysisTool(city_name: str, pandas_data_frame: pandas.DataFrame)
Tool to facilitate data set analysis.
- preprocess_data_set(columns_to_drop: list[str], columns_to_check: list[str], create_file: bool = False, disable_feedback: bool = False) → None
Preprocesses the data set. Specify the columns to drop, for example based on the statistical summary, and the columns to analyze.
- Parameters:
columns_to_drop (list[str]) – Names of the numeric variables to remove.
columns_to_check (list[str]) – Names of the numeric variables to analyze.
create_file (bool) – Whether to write the preprocessed data set to a CSV file.
disable_feedback (bool) – Whether to suppress console feedback, such as data set previews.
- Return type:
None.
- _clean_and_preprocess() → None
Private method. Cleans the data set.
- _remove_outliers() → None
Private method. Removes outliers from the data set.
- _fix_inconsistencies() → None
Private method. Fixes inconsistent values in the data set.
- get_statistical_summary() → pandas.DataFrame
Returns a statistical summary of the preprocessed data set.
- Parameters:
None.
- Return type:
pandas.DataFrame.
- get_data_frame() → pandas.DataFrame
Returns the preprocessed data set as a pandas DataFrame.
- Parameters:
None.
- Return type:
pandas.DataFrame.
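The methods above suggest a construct, preprocess, then read-back workflow. The following is a minimal sketch of that sequence, not the package's own example. The helper name `run_analysis` is hypothetical, and it assumes that `preprocess_data_set` should be called before either getter, which the descriptions imply but do not state. The class is passed in as `tool_cls` so the sketch does not depend on the package being importable.

```python
from typing import Any, Sequence, Tuple


def run_analysis(
    tool_cls: Any,
    city_name: str,
    frame: Any,
    columns_to_drop: Sequence[str],
    columns_to_check: Sequence[str],
) -> Tuple[Any, Any]:
    """Preprocess `frame` with an AnalysisTool-like class and return its results.

    `tool_cls` is expected to be AnalysisTool, or any class exposing the same
    constructor and methods. This helper is hypothetical and only illustrates
    the documented call signatures.
    """
    # The documented constructor takes the city name and a pandas DataFrame.
    tool = tool_cls(city_name, frame)

    # Keyword arguments mirror the documented signature. No CSV file is
    # requested and console feedback is disabled.
    tool.preprocess_data_set(
        columns_to_drop=list(columns_to_drop),
        columns_to_check=list(columns_to_check),
        create_file=False,
        disable_feedback=True,
    )

    # Both getters are documented as returning pandas.DataFrame objects.
    summary = tool.get_statistical_summary()
    cleaned = tool.get_data_frame()
    return summary, cleaned
```

With the package installed, the class would presumably be imported as `from data_analysis_plotting_tools.AnalysisTool import AnalysisTool` (inferred from the qualified name above) and passed as `tool_cls`, together with a DataFrame and column names that exist in it.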