data_analysis_plotting_tools.AnalysisTool

Tool to facilitate data set analysis.

Module Contents

Classes

AnalysisTool – Tool to facilitate data set analysis.

class data_analysis_plotting_tools.AnalysisTool.AnalysisTool(city_name: str, pandas_data_frame: pandas.DataFrame)

Tool to facilitate data set analysis.

preprocess_data_set(columns_to_drop: list[str], columns_to_check: list[str], create_file: bool = False, disable_feedback: bool = False) -> None

Preprocesses the data set. Specify the columns to drop based on the statistical summary.

Parameters:
  • columns_to_drop (list[str]) – Choose which numeric variables to remove.

  • columns_to_check (list[str]) – Choose which numeric variables to analyze.

  • create_file (bool) – Whether to write the preprocessed data set to a CSV file. Defaults to False.

  • disable_feedback (bool) – Whether to suppress console feedback, such as data set previews. Defaults to False.

Return type:

None.
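
The reference above does not show how these parameters interact, so the following is only a minimal sketch of a comparable step in plain pandas, not the package's implementation. The function name `preprocess_sketch`, the `csv_target` argument, and the reading of `columns_to_check` as "drop rows with missing values in these columns" are all assumptions made for illustration.

```python
import io

import pandas as pd


def preprocess_sketch(
    df: pd.DataFrame,
    columns_to_drop: list[str],
    columns_to_check: list[str],
    csv_target=None,
    disable_feedback: bool = False,
) -> pd.DataFrame:
    """Hypothetical stand-in; not the AnalysisTool implementation."""
    # Remove the columns the caller decided are not needed.
    result = df.drop(columns=columns_to_drop)
    # Assumption: "checking" a column means discarding rows where it is missing.
    result = result.dropna(subset=columns_to_check).reset_index(drop=True)
    # Optionally write the result as CSV to a path or file-like object.
    if csv_target is not None:
        result.to_csv(csv_target, index=False)
    # Print a short preview unless feedback is disabled.
    if not disable_feedback:
        print(result.head())
    return result


raw = pd.DataFrame(
    {
        "price": [100.0, None, 250.0],
        "area": [30.0, 45.0, 60.0],
        "listing_id": [1, 2, 3],
    }
)
buffer = io.StringIO()
cleaned = preprocess_sketch(
    raw,
    columns_to_drop=["listing_id"],
    columns_to_check=["price", "area"],
    csv_target=buffer,
    disable_feedback=True,
)
```

Passing an in-memory buffer keeps the example free of side effects; a file path would work the same way with `DataFrame.to_csv`.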

_clean_and_preprocess() -> None

Private method. Cleans the data set.

_remove_outliers() -> None

Private method. Removes outliers in the data set.
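
The outlier criterion used by this method is not documented. One common choice is the interquartile range (IQR) rule, sketched below purely as an illustration; the helper name `remove_outliers_iqr` and the multiplier `k = 1.5` are assumptions, and the actual method may use a different rule.

```python
import pandas as pd


def remove_outliers_iqr(
    df: pd.DataFrame, columns: list[str], k: float = 1.5
) -> pd.DataFrame:
    """Hypothetical helper: keep rows within [Q1 - k*IQR, Q3 + k*IQR]."""
    # Start by keeping every row, then narrow the mask column by column.
    mask = pd.Series(True, index=df.index)
    for column in columns:
        q1 = df[column].quantile(0.25)
        q3 = df[column].quantile(0.75)
        iqr = q3 - q1
        # between() is inclusive; rows with NaN in the column are excluded too.
        mask &= df[column].between(q1 - k * iqr, q3 + k * iqr)
    return df[mask].reset_index(drop=True)


sample = pd.DataFrame({"price": [10.0, 12.0, 11.0, 13.0, 12.0, 500.0]})
filtered = remove_outliers_iqr(sample, ["price"])
```

In this sample only the value 500.0 falls outside the IQR bounds, so the other five rows are kept.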

_fix_inconsistencies() -> None

Private method. Fixes inconsistent values in the data set.

get_statistical_summary() -> pandas.DataFrame

Returns a statistical summary of the preprocessed data set.

Parameters:

None.

Return type:

pandas DataFrame.
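
The layout of the returned summary is not documented. As a rough point of reference only, pandas' own `DataFrame.describe()` produces a summary of this kind; whether `get_statistical_summary()` relies on it is an assumption, and the sample data below is made up for illustration.

```python
import pandas as pd

data = pd.DataFrame(
    {"price": [100.0, 200.0, 300.0], "area": [30.0, 45.0, 60.0]}
)

# describe() reports count, mean, std, min, quartiles, and max
# for each numeric column, with one row per statistic.
summary = data.describe()
print(summary.loc[["mean", "min", "max"]])
```

Because the summary is itself a DataFrame, individual statistics can be read with `summary.loc["mean", "price"]` and similar lookups.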

get_data_frame() -> pandas.DataFrame

Returns the preprocessed data set as a pandas DataFrame.

Parameters:

None.

Return type:

pandas DataFrame.