test_setups

`analyse_scale_robustness(all_histories, multipliers)`

Function to generate an analysis of a set of simulation tests with different multipliers applied in the environment. It returns a pandas dataframe summarizing the results for each multiplier pairs. The results analyzed are the following:

convergence
steps taken
discounted rewards
extra steps taken (compared to a minimum path)
t min over t (a ratio of how optimal the path taken was)

For each result, the mean, standard deviation along with the mean and standard deviation of the successful trajectories are recorded.

Parameters:

Name	Type	Description	Default
`all_histories`	`list[SimulationHistory]`	A list of all the simulation histories to summarize	required
`multipliers`	`ndarray`	An array of the multiplier pairs used (for the y multiplier then the x multiplier)	required

Returns:

Name	Type	Description
`df`	`DataFrame`	The analysis dataframe.

Source code in olfactory_navigation/test_setups.py

def analyse_scale_robustness(all_histories: list[SimulationHistory],
                             multipliers: np.ndarray
                             ) -> pd.DataFrame:
    '''
    Function to generate an analysis of a set of simulation tests with different multipliers applied in the environment.
    It returns a pandas dataframe summarizing the results for each multiplier pairs.
    The results analyzed are the following:

    - convergence
    - steps taken
    - discounted rewards
    - extra steps taken (compared to a minimum path)
    - t min over t (a ratio of how optimal the path taken was)

    For each result, the mean, standard deviation along with the mean and standard deviation of the successful trajectories are recorded.

    Parameters
    ----------
    all_histories : list[SimulationHistory]
        A list of all the simulation histories to summarize
    multipliers : np.ndarray
        An array of the multiplier pairs used (for the y multiplier then the x multiplier)

    Returns
    -------
    df : pd.DataFrame
        The analysis dataframe.
    '''
    rows = []
    # For each simulation history and multiplier, the analysis dataframe is extracted
    for hist, multiplier in zip(all_histories, multipliers):
        df = hist.general_analysis_df

        # Then the summarized metrics are collapsed on a single row
        col_metric_dict = {'multiplier': int(multiplier)}
        for col in ['converged', 'reached_horizon', 'steps_taken', 'discounted_rewards', 'extra_steps', 't_min_over_t']:
            for metric in ['mean', 'standard_deviation', 'success_mean', 'success_standard_deviation']:
                col_metric_dict[f'{col}_{metric}'] = df.loc[metric, col]

        rows.append(col_metric_dict)

    # Creating the dataframe from all the rows
    df = pd.DataFrame(rows)

    # Removal of 4 unnecessary columns
    df = df.drop(columns=['converged_success_mean',
                          'converged_success_standard_deviation',
                          'reached_horizon_success_mean',
                          'reached_horizon_success_standard_deviation'])

    return df

`analyse_shape_robustness(all_histories, multipliers)`

Function to generate an analysis of a set of simulation tests with different multipliers applied in the environment. It returns a pandas dataframe summarizing the results for each multiplier pairs. The results analyzed are the following:

convergence
steps taken
discounted rewards
extra steps taken (compared to a minimum path)
t min over t (a ratio of how optimal the path taken was)

For each result, the mean, standard deviation along with the mean and standard deviation of the successful trajectories are recorded.

Parameters:

Name	Type	Description	Default
`all_histories`	`list[SimulationHistory]`	A list of all the simulation histories to summarize	required
`multipliers`	`ndarray`	An array of the multiplier pairs used (for the y multiplier then the x multiplier)	required

Returns:

Name	Type	Description
`df`	`DataFrame`	The analysis dataframe.

Source code in olfactory_navigation/test_setups.py

def analyse_shape_robustness(all_histories: list[SimulationHistory],
                             multipliers: np.ndarray
                             ) -> pd.DataFrame:
    '''
    Function to generate an analysis of a set of simulation tests with different multipliers applied in the environment.
    It returns a pandas dataframe summarizing the results for each multiplier pairs.
    The results analyzed are the following:

    - convergence
    - steps taken
    - discounted rewards
    - extra steps taken (compared to a minimum path)
    - t min over t (a ratio of how optimal the path taken was)

    For each result, the mean, standard deviation along with the mean and standard deviation of the successful trajectories are recorded.

    Parameters
    ----------
    all_histories : list[SimulationHistory]
        A list of all the simulation histories to summarize
    multipliers : np.ndarray
        An array of the multiplier pairs used (for the y multiplier then the x multiplier)

    Returns
    -------
    df : pd.DataFrame
        The analysis dataframe.
    '''
    rows = []
    # For each simulation history and multiplier, the analysis dataframe is extracted
    for hist, multiplier_pair in zip(all_histories, multipliers):
        df = hist.general_analysis_df

        # Then the summarized metrics are collapsed on a single row
        col_metric_dict = {'y_multiplier': multiplier_pair[0].astype(int), 'x_multiplier': multiplier_pair[1].astype(int)}
        for col in ['converged', 'reached_horizon', 'steps_taken', 'discounted_rewards', 'extra_steps', 't_min_over_t']:
            for metric in ['mean', 'standard_deviation', 'success_mean', 'success_standard_deviation']:
                col_metric_dict[f'{col}_{metric}'] = df.loc[metric, col]

        rows.append(col_metric_dict)

    # Creating the dataframe from all the rows
    df = pd.DataFrame(rows)

    # Removal of 4 unnecessary columns
    df = df.drop(columns=['converged_success_mean',
                          'converged_success_standard_deviation',
                          'reached_horizon_success_mean',
                          'reached_horizon_success_standard_deviation'])

    return df

`run_all_starts_test(agent, environment=None, time_shift=0, time_loop=True, horizon=1000, initialization_values={}, reward_discount=0.99, print_progress=True, print_stats=True, use_gpu=False)`

Function to run a test with all the available starting positions based on the environment provided (or the environmnent of the agent).

Parameters:

Name	Type	Description	Default
`agent`	`Agent`	The agent to be tested	required
`environment`	`Environment`	The environment to run the simulations in. By default, the environment linked to the agent will used. This parameter is intended if the environment needs to be modified compared to environment the agent was trained on.	`None`
`time_shift`	`int or ndarray`	The time at which to start the olfactory simulation array. It can be either a single value, or n values.	`0`
`time_loop`	`bool`	Whether to loop the time if reaching the end. (starts back at 0)	`True`
`horizon`	`int`	The amount of steps to run the simulation for before killing the remaining simulations.	`1000`
`initialization_values`	`dict`	In the case the agent is to be initialized with custom values, the paramaters to be passed on the initialize_state function can be set here.	`{}`
`reward_discount`	`float`	How much a given reward is discounted based on how long it took to get it. It is purely used to compute the Average Discount Reward (ADR) after the simulation.	`0.99`
`print_progress`	`bool`	Wheter to show a progress bar of what step the simulations are at.	`True`
`print_stats`	`bool`	Wheter to print the stats at the end of the run.	`True`
`use_gpu`	`bool`	Whether to run the simulations on the GPU or not.	`False`

Returns:

Name	Type	Description
`hist`	`SimulationHistory`	A SimulationHistory object that tracked all the positions, actions and observations.

Source code in olfactory_navigation/test_setups.py

def run_all_starts_test(agent: Agent,
                        environment: Environment = None,
                        time_shift: int | np.ndarray = 0,
                        time_loop: bool = True,
                        horizon: int = 1000,
                        initialization_values: dict = {},
                        reward_discount: float = 0.99,
                        print_progress: bool = True,
                        print_stats: bool = True,
                        use_gpu: bool = False
                        ) -> SimulationHistory:
    '''
    Function to run a test with all the available starting positions based on the environment provided (or the environmnent of the agent).

    Parameters
    ----------
    agent : Agent
        The agent to be tested
    environment : Environment, optional
        The environment to run the simulations in.
        By default, the environment linked to the agent will used.
        This parameter is intended if the environment needs to be modified compared to environment the agent was trained on.
    time_shift : int or np.ndarray, default=0
        The time at which to start the olfactory simulation array.
        It can be either a single value, or n values.
    time_loop : bool, default=True
        Whether to loop the time if reaching the end. (starts back at 0)
    horizon : int, default=1000
        The amount of steps to run the simulation for before killing the remaining simulations.
    initialization_values : dict, default={}
        In the case the agent is to be initialized with custom values,
        the paramaters to be passed on the initialize_state function can be set here.
    reward_discount : float, default=0.99
        How much a given reward is discounted based on how long it took to get it.
        It is purely used to compute the Average Discount Reward (ADR) after the simulation.
    print_progress : bool, default=True
        Wheter to show a progress bar of what step the simulations are at.
    print_stats : bool, default=True
        Wheter to print the stats at the end of the run.
    use_gpu : bool, default=False
        Whether to run the simulations on the GPU or not.

    Returns
    -------
    hist : SimulationHistory
        A SimulationHistory object that tracked all the positions, actions and observations.
    '''
    # Handle the case an specific environment is given
    environment_provided = environment is not None
    if environment_provided:
        assert environment.shape == agent.environment.shape, "The provided environment's shape doesn't match the environment has been trained on..."
    else:
        environment = agent.environment

    # Gathering starting points
    start_points = np.argwhere(environment.start_probabilities > 0)
    n = len(start_points)

    return run_test(
        agent = agent,
        n = n,
        start_points = start_points,
        environment = environment if environment_provided else None,
        time_shift = time_shift,
        time_loop = time_loop,
        horizon = horizon,
        initialization_values = initialization_values,
        reward_discount = reward_discount,
        print_progress = print_progress,
        print_stats = print_stats,
        use_gpu = use_gpu
    )

`run_n_by_cell_test(agent, cell_width=10, n_by_cell=10, environment=None, time_shift=0, time_loop=True, horizon=1000, initialization_values={}, reward_discount=0.99, print_progress=True, print_stats=True, use_gpu=False)`

Function to run a test with simulations starting in different cells across the available starting zones. A number n_by_cell determines how many simulations should start within each cell (the same position can be chosen multiple times).

Parameters:

Name	Type	Description	Default
`agent`	`Agent`	The agent to be tested	required
`cell_width`	`int`	The size of the sides of each cells to be considered.	`10`
`n_by_cell`	`int`	How many simulations should start within each cell.	`10`
`environment`	`Environment`	The environment to run the simulations in. By default, the environment linked to the agent will used. This parameter is intended if the environment needs to be modified compared to environment the agent was trained on.	`None`
`time_shift`	`int or ndarray`	The time at which to start the olfactory simulation array. It can be either a single value, or n values.	`0`
`time_loop`	`bool`	Whether to loop the time if reaching the end. (starts back at 0)	`True`
`horizon`	`int`	The amount of steps to run the simulation for before killing the remaining simulations.	`1000`
`initialization_values`	`dict`	In the case the agent is to be initialized with custom values, the paramaters to be passed on the initialize_state function can be set here.	`{}`
`reward_discount`	`float`	How much a given reward is discounted based on how long it took to get it. It is purely used to compute the Average Discount Reward (ADR) after the simulation.	`0.99`
`print_progress`	`bool`	Wheter to show a progress bar of what step the simulations are at.	`True`
`print_stats`	`bool`	Wheter to print the stats at the end of the run.	`True`
`use_gpu`	`bool`	Whether to run the simulations on the GPU or not.	`False`

Returns:

Name	Type	Description
`hist`	`SimulationHistory`	A SimulationHistory object that tracked all the positions, actions and observations.

Source code in olfactory_navigation/test_setups.py

def run_n_by_cell_test(agent: Agent,
                       cell_width: int = 10,
                       n_by_cell: int = 10,
                       environment: Environment = None,
                       time_shift: int | np.ndarray = 0,
                       time_loop: bool = True,
                       horizon: int = 1000,
                       initialization_values: dict = {},
                       reward_discount: float = 0.99,
                       print_progress: bool = True,
                       print_stats: bool = True,
                       use_gpu: bool = False
                       ) -> SimulationHistory:
    '''
    Function to run a test with simulations starting in different cells across the available starting zones.
    A number n_by_cell determines how many simulations should start within each cell (the same position can be chosen multiple times).

    Parameters
    ----------
    agent : Agent
        The agent to be tested
    cell_width : int, default=10
        The size of the sides of each cells to be considered.
    n_by_cell : int, default=10
        How many simulations should start within each cell.
    environment : Environment, optional
        The environment to run the simulations in.
        By default, the environment linked to the agent will used.
        This parameter is intended if the environment needs to be modified compared to environment the agent was trained on.
    time_shift : int or np.ndarray, default=0
        The time at which to start the olfactory simulation array.
        It can be either a single value, or n values.
    time_loop : bool, default=True
        Whether to loop the time if reaching the end. (starts back at 0)
    horizon : int, default=1000
        The amount of steps to run the simulation for before killing the remaining simulations.
    initialization_values : dict, default={}
        In the case the agent is to be initialized with custom values,
        the paramaters to be passed on the initialize_state function can be set here.
    reward_discount : float, default=0.99
        How much a given reward is discounted based on how long it took to get it.
        It is purely used to compute the Average Discount Reward (ADR) after the simulation.
    print_progress : bool, default=True
        Wheter to show a progress bar of what step the simulations are at.
    print_stats : bool, default=True
        Wheter to print the stats at the end of the run.
    use_gpu : bool, default=False
        Whether to run the simulations on the GPU or not.

    Returns
    -------
    hist : SimulationHistory
        A SimulationHistory object that tracked all the positions, actions and observations.
    '''
    # Handle the case an specific environment is given
    environment_provided = environment is not None
    if environment_provided:
        assert environment.shape == agent.environment.shape, "The provided environment's shape doesn't match the environment has been trained on..."
    else:
        environment = agent.environment

    # Gathering starting points
    cells_x = int(environment.shape[0] / cell_width)
    cells_y = int(environment.shape[1] / cell_width)

    indices = np.arange(np.prod(environment.shape), dtype=int)
    indices_grid = indices.reshape(environment.shape)
    all_chosen_indices = []

    for i in range(cells_x):
        for j in range(cells_y):
            cell_probs = environment.start_probabilities[(i*cell_width):(i*cell_width)+cell_width, (j*cell_width):(j*cell_width)+cell_width]
            if np.any(cell_probs > 0):
                cell_indices = indices_grid[(i*cell_width):(i*cell_width)+cell_width, (j*cell_width):(j*cell_width)+cell_width]
                cell_probs /= np.sum(cell_probs)

                chosen_indices = np.random.choice(cell_indices.ravel(), size=n_by_cell, replace=True, p=cell_probs.ravel()).tolist()
                all_chosen_indices += chosen_indices

    n = len(all_chosen_indices)
    start_points = np.array(np.unravel_index(all_chosen_indices, environment.shape)).T

    return run_test(
        agent = agent,
        n = n,
        start_points = start_points,
        environment = environment if environment_provided else None,
        time_shift = time_shift,
        time_loop = time_loop,
        horizon = horizon,
        initialization_values = initialization_values,
        reward_discount = reward_discount,
        print_progress = print_progress,
        print_stats = print_stats,
        use_gpu = use_gpu
    )

`test_agents(*agents, environments, simulation_time_shift=0, simulation_time_loop=True, simulation_horizon=1000, reward_discount=0.99, print_progress=False, print_stats=True, use_gpu=False, save_histories_path=None, save_result_table=True)`

A function to test multiple (trained) agents in multiple given environments.

A summary table will be generated to compare the performance of the various agents within each environment.

Parameters:

Name	Type	Description	Default
`agents`	`Agent`	The agents to test. They must be already trained.	`()`
`environments`	`list[Environment]`	The environment to test the agents in.	required
`simulation_time_shift`	`int \| ndarray`	By how many steps to shift the t0 of the environment. It can be fixed or for each starting point of the simulation (in such case the amount of starting points must be same in each environments).	`= 0`
`simulation_time_loop`	`bool`	Whether the simulation t should loop back to 0 when reaching the max t of the given environment.	`= True`
`simulation_horizon`	`int`	For how many steps the simulation should run for.	`= 1000`
`reward_discount`	`float`	The reward discount that is used to compare the cummulative discounted reward.	`= 0.99`
`print_progress`	`bool`	Whether to show a progress bar for the simulations.	`= False`
`print_stats`	`bool`	Whether to print the stats (results) after each simulation.	`= True`
`use_gpu`	`bool`	Whether to use the gpu to speedup testing.	`= False`
`save_histories_path`	`str`	If the details of the simulation histories are to be saved, a path can be provided here.	`None`
`save_result_table`	`bool`	Whether the returned table should also be saved, it will be saved at the save_histories_path if it is set.	`= True`

Returns:

Name	Type	Description
`simulations_comparison_df`	`DataFrame`	A table with as row indices (agent, environment) pairs and columns the same columns as the output of SimulationHistory.compare_all.

Source code in olfactory_navigation/test_setups.py

def test_agents(*agents: Agent,
                environments: list[Environment],
                simulation_time_shift: int | np.ndarray = 0,
                simulation_time_loop: bool = True,
                simulation_horizon: int = 1000,
                reward_discount: float = 0.99,
                print_progress: bool = False,
                print_stats: bool = True,
                use_gpu: bool = False,
                save_histories_path: str = None,
                save_result_table: bool = True
                ) -> pd.DataFrame:
    '''
    A function to test multiple (trained) agents in multiple given environments.

    A summary table will be generated to compare the performance of the various agents within each environment.

    Parameters
    ----------
    agents : Agent
        The agents to test. They must be already trained.
    environments : list[Environment]
        The environment to test the agents in.
    simulation_time_shift : int | np.ndarray, default = 0
        By how many steps to shift the t0 of the environment.
        It can be fixed or for each starting point of the simulation (in such case the amount of starting points must be same in each environments).
    simulation_time_loop : bool, default = True
        Whether the simulation t should loop back to 0 when reaching the max t of the given environment.
    simulation_horizon : int, default = 1000
        For how many steps the simulation should run for.
    reward_discount : float, default = 0.99
        The reward discount that is used to compare the cummulative discounted reward.
    print_progress : bool, default = False
        Whether to show a progress bar for the simulations.
    print_stats : bool, default = True
        Whether to print the stats (results) after each simulation.
    use_gpu : bool, default = False
        Whether to use the gpu to speedup testing.
    save_histories_path : str, optional
        If the details of the simulation histories are to be saved, a path can be provided here.
    save_result_table : bool, default = True
        Whether the returned table should also be saved, it will be saved at the save_histories_path if it is set.

    Returns
    -------
    simulations_comparison_df : pd.DataFrame
        A table with as row indices (agent, environment) pairs and columns the same columns as the output of SimulationHistory.compare_all.
    '''
    simulation_histories = []
    for i_agent, agent in enumerate(agents):
        print(f'Testing Agent {i_agent}:')

        if not agent.trained:
            print(f'[Warning] Skipping agent {i_agent} due to it not being marked as trained...')
            continue

        agent_histories = []
        for i_environment, environment in enumerate(environments):

            print(f'- Environment {i_environment}')
            hist = run_all_starts_test(agent=agent,
                                       environment=environment,
                                       time_shift=simulation_time_shift,
                                       time_loop=simulation_time_loop,
                                       horizon=simulation_horizon,
                                       reward_discount=reward_discount,
                                       print_progress=print_progress,
                                       print_stats=print_stats,
                                       use_gpu=use_gpu)

            agent_histories.append(hist)
            print('')

            # Save simulation history if requested
            if save_histories_path is not None:
                hist.save(file=f'Simualtions-agent_{i_agent}-environment_{i_environment}', folder=save_histories_path, save_analysis=False)

        simulation_histories.append(agent_histories)
        print('--------------------------------------')

    # Generating comparison result table
    all_agent_comparison_dfs = []
    for agent_simulation_histories in simulation_histories:
        agent_comparison_df = SimulationHistory.compare_all(agent_simulation_histories)
        agent_comparison_df['environment'] = [f'environment_{i}' for i in range(len(environments))]
        agent_comparison_df.set_index('environment')

        all_agent_comparison_dfs.append(agent_comparison_df)

    simulations_comparison_df: pd.DataFrame = pd.concat(all_agent_comparison_dfs, keys=[f'agent_{i}' for i in range(len(agents))], names='agent')

    # Save comparison table if needed
    if save_result_table:
        folder = './' if save_histories_path is None else save_histories_path
        file = 'Simulation_comparison-' + datetime.now().strftime('%Y%m%d_%H%M%S%f')
        simulations_comparison_df.to_csv(folder+file)

    return simulations_comparison_df

`test_scale_robustness(agent, environment=None, time_shift=0, time_loop=True, horizon=1000, initialization_values={}, reward_discount=0.99, step_percentage=20, min_percentage=20, max_percentage=200, multipliers=None, use_gpu=False, print_progress=True, print_stats=True, save=True, save_folder=None, save_analysis=True)`

Function to test the robustness of an agent in a environment where the scale of the environment's shape is altered by some percentage.

A list of multipliers will be constructed from the min_percentage to 100% and up to max_percentage values with between each percentage step_percentage values. These percentage multipliers will be applied both in the x and y direction but cropped to the largest allowed multiplier along each axis.

This complete test consists in running from all possible start positions of the original environment.

Parameters:

Name	Type	Description	Default
`agent`	`Agent`	The agent to run the shape robustness test on.	required
`environment`	`Environment`	The environment to run the test in. By default, the environment linked to the agent will used. This parameter is intended if the environment needs to be modified compared to environment the agent was trained on.	`None`
`time_shift`	`int or ndarray`	The time at which to start the olfactory simulation array. It can be either a single value, or n values.	`0`
`time_loop`	`bool`	Whether to loop the time if reaching the end. (starts back at 0)	`True`
`horizon`	`int`	The amount of steps to run the simulation for before killing the remaining simulations.	`1000`
`initialization_values`	`dict`	In the case the agent is to be initialized with custom values, the paramaters to be passed on the initialize_state function can be set here.	`{}`
`reward_discount`	`float`	How much a given reward is discounted based on how long it took to get it. It is purely used to compute the Average Discount Reward (ADR) after the simulation.	`0.99`
`step_percentage`	`int`	Starting at 100%, how much of a percentage step to do to reach the min and max percentages.	`20`
`min_percentage`	`int`	The minimum percentage of deformation to apply on the environment's odor plume.	`20`
`max_percentage`	`int`	The maximum percentage of deformation to apply on the environment's odor plume. If this value is larger than the maximum shape allowed by the margins, the largest allowed percentage will be used.	`200`
`multipliers`	`list[int]`	If provided, the step_percentage, min_percentage and max_percentage parameters will be ignored. A list of percentages of deformations to use to deforme the environment's odor plume.	`None`
`use_gpu`	`bool`	Whether to use the GPU to speed up the tests.	`False`
`print_progress`	`bool`	Whether to display a progress bar of how many test have been performed so far.	`True`
`print_stats`	`bool`	Whether to display statistics at the end of each test.	`True`
`save`	`bool`	Whether to save the results of each test to a save_folder. Each test's result will be under the name 'test_env_mult-.csv'	`True`
`save_folder`	`str`	The path to which the test results are saved. If not provided, it will automatically create a new folder './results/scale_robustness_test/'	`None`
`save_analysis`	`bool`	Whether to save the analysis of the histories. It will be saved under a file named '_analysis.csv' in the save_folder.	`True`

Returns:

Name	Type	Description
`all_histories`	`list[SimulationHistory]`	A list of SimulationHistory instances.

Source code in olfactory_navigation/test_setups.py

def test_scale_robustness(agent: Agent,
                          environment: Environment = None,
                          time_shift: int | np.ndarray = 0,
                          time_loop: bool = True,
                          horizon: int = 1000,
                          initialization_values: dict = {},
                          reward_discount: float = 0.99,
                          step_percentage: int = 20,
                          min_percentage:int = 20,
                          max_percentage:int = 200,
                          multipliers: list[int] = None,
                          use_gpu: bool = False,
                          print_progress: bool = True,
                          print_stats: bool = True,
                          save: bool = True,
                          save_folder: str = None,
                          save_analysis: bool = True
                          ) -> list[SimulationHistory]:
    '''
    Function to test the robustness of an agent in a environment where the scale of the environment's shape is altered by some percentage.

    A list of multipliers will be constructed from the min_percentage to 100% and up to max_percentage values with between each percentage step_percentage values.
    These percentage multipliers will be applied both in the x and y direction but cropped to the largest allowed multiplier along each axis.

    This complete test consists in running from all possible start positions of the original environment.

    Parameters
    ----------
    agent : Agent
        The agent to run the shape robustness test on.
    environment : Environment, optional
        The environment to run the test in.
        By default, the environment linked to the agent will used.
        This parameter is intended if the environment needs to be modified compared to environment the agent was trained on.
    time_shift : int or np.ndarray, default=0
        The time at which to start the olfactory simulation array.
        It can be either a single value, or n values.
    time_loop : bool, default=True
        Whether to loop the time if reaching the end. (starts back at 0)
    horizon : int, default=1000
        The amount of steps to run the simulation for before killing the remaining simulations.
    initialization_values : dict, default={}
        In the case the agent is to be initialized with custom values,
        the paramaters to be passed on the initialize_state function can be set here.
    reward_discount : float, default=0.99
        How much a given reward is discounted based on how long it took to get it.
        It is purely used to compute the Average Discount Reward (ADR) after the simulation.
    step_percentage : int, default=20
        Starting at 100%, how much of a percentage step to do to reach the min and max percentages.
    min_percentage : int, default=20
        The minimum percentage of deformation to apply on the environment's odor plume.
    max_percentage : int, default=200
        The maximum percentage of deformation to apply on the environment's odor plume.
        If this value is larger than the maximum shape allowed by the margins, the largest allowed percentage will be used.
    multipliers : list[int], optional
        If provided, the step_percentage, min_percentage and max_percentage parameters will be ignored.
        A list of percentages of deformations to use to deforme the environment's odor plume.
    use_gpu : bool, default=False
        Whether to use the GPU to speed up the tests.
    print_progress : bool, default=True
        Whether to display a progress bar of how many test have been performed so far.
    print_stats : bool, default=True
        Whether to display statistics at the end of each test.
    save : bool, default=True
        Whether to save the results of each test to a save_folder.
        Each test's result will be under the name 'test_env_mult-<multiplier>.csv'
    save_folder : str, optional
        The path to which the test results are saved.
        If not provided, it will automatically create a new folder './results/<timestamp>_scale_robustness_test_<environment_name>/'
    save_analysis : bool, default=True
        Whether to save the analysis of the histories.
        It will be saved under a file named '_analysis.csv' in the save_folder.

    Returns
    -------
    all_histories : list[SimulationHistory]
        A list of SimulationHistory instances.
    '''
    # Handle the case an specific environment is given
    environment_provided = environment is not None
    if environment_provided:
        assert environment.shape == agent.environment.shape, "The provided environment's shape doesn't match the environment has been trained on..."
    else:
        environment = agent.environment

    # Gathering starting points
    start_points = np.argwhere(environment.start_probabilities > 0)
    n = len(start_points)

    # Generating multipliers
    if multipliers is None:
        with np.errstate(divide='ignore'):
            low_max_mult = ((environment.margins[:,0] / environment.data_source_position) + 1)
            high_max_mult = (1 + (environment.margins[:,1] / (environment.data_shape - environment.data_source_position)))
            max_mult = np.min(np.vstack([low_max_mult, high_max_mult]), axis=0)

        multipliers = [(100 - perc_mult) for perc_mult in range(0, (100-min_percentage)+step_percentage, step_percentage)[1:]] + [perc_mult for perc_mult in range(100, min(max_percentage, int(max(max_mult)*100)), step_percentage)]
    multipliers.sort()

    # Save Folder name and creation
    if save or save_analysis:
        if save_folder is None:
            save_folder = f'./results/{datetime.now().strftime("%Y%m%d_%H%M%S")}_scale_robustness_test_' + environment.name

        if not os.path.exists(save_folder):
            os.mkdir(save_folder)

        print(f'The results will be saved to: {save_folder}\n')

    all_histories = []
    for mult in (tqdm(multipliers) if print_progress else multipliers):
        print(f'Testing on environment with scale modifier {mult}%')

        # Modifying environment
        modified_environment = environment.modify_scale(scale_factor=mult/100)

        # Running test
        hist = run_test(
            agent = agent,
            n = n,
            start_points = start_points,
            environment = modified_environment,
            time_shift = time_shift,
            time_loop = time_loop,
            horizon = horizon,
            initialization_values = initialization_values,
            reward_discount = reward_discount,
            print_progress = False,
            print_stats = print_stats,
            use_gpu = use_gpu
        )

        all_histories.append(hist)

        # Saving history
        if save:
            file_name = f'test_env_mult-{mult}'
            hist.save(file=file_name,
                      folder=save_folder,
                      save_analysis=False)

        print()

    # Analysis saving
    if save and save_analysis:
        analysis_df = analyse_scale_robustness(all_histories=all_histories, multipliers=multipliers)
        analysis_file_name = '_analysis.csv'
        analysis_df.to_csv(save_folder + '/' + analysis_file_name, index=False)
        print(f'Scale robustness analysis saved to: {save_folder}/{analysis_file_name}')

    return all_histories

`test_shape_robustness(agent, environment=None, time_shift=0, time_loop=True, horizon=1000, initialization_values={}, reward_discount=0.99, step_percentage=20, min_percentage=20, max_percentage=200, multipliers=None, use_gpu=False, print_progress=True, print_stats=True, save=True, save_folder=None, save_analysis=True)`

Function to test the robustness of an agent in a environment where the odor plume's shape is altered by some percentage.

A list of multipliers will be constructed from the min_percentage to 100% and up to max_percentage values with between each percentage step_percentage values. These percentage multipliers will be applied both in the x and y direction but cropped to the largest allowed multiplier along each axis.

For each multiplier pair, a completed test will be run. This complete test consists in running from all possible start positions of the original environment.

Parameters:

Name	Type	Description	Default
`agent`	`Agent`	The agent to run the shape robustness test on.	required
`environment`	`Environment`	The environment to run the test in. By default, the environment linked to the agent will used. This parameter is intended if the environment needs to be modified compared to environment the agent was trained on.	`None`
`time_shift`	`int or ndarray`	The time at which to start the olfactory simulation array. It can be either a single value, or n values.	`0`
`time_loop`	`bool`	Whether to loop the time if reaching the end. (starts back at 0)	`True`
`horizon`	`int`	The amount of steps to run the simulation for before killing the remaining simulations.	`1000`
`initialization_values`	`dict`	In the case the agent is to be initialized with custom values, the paramaters to be passed on the initialize_state function can be set here.	`{}`
`reward_discount`	`float`	How much a given reward is discounted based on how long it took to get it. It is purely used to compute the Average Discount Reward (ADR) after the simulation.	`0.99`
`step_percentage`	`int`	Starting at 100%, how much of a percentage step to do to reach the min and max percentages.	`20`
`min_percentage`	`int`	The minimum percentage of deformation to apply on the environment's odor plume.	`20`
`max_percentage`	`int`	The maximum percentage of deformation to apply on the environment's odor plume. If this value is larger than the maximum shape allowed by the margins, the largest allowed percentage will be used.	`200`
`multipliers`	`list[int]`	If provided, the step_percentage, min_percentage and max_percentage parameters will be ignored. A list of percentages of deformations to use to deforme the environment's odor plume.	`None`
`use_gpu`	`bool`	Whether to use the GPU to speed up the tests.	`False`
`print_progress`	`bool`	Whether to display a progress bar of how many test have been performed so far.	`True`
`print_stats`	`bool`	Whether to display statistics at the end of each test.	`True`
`save`	`bool`	Whether to save the results of each test to a save_folder. Each test's result will be under the name 'test_env_y-_x-.csv'	`True`
`save_folder`	`str`	The path to which the test results are saved. If not provided, it will automatically create a new folder './results/shape_robustness_test/'	`None`
`save_analysis`	`bool`	Whether to save the analysis of the histories. It will be saved under a file named '_analysis.csv' in the save_folder.	`True`

Returns:

Name	Type	Description
`all_histories`	`list[SimulationHistory]`	A list of SimulationHistory instances.

Source code in olfactory_navigation/test_setups.py

def test_shape_robustness(agent: Agent,
                          environment: Environment = None,
                          time_shift: int | np.ndarray = 0,
                          time_loop: bool = True,
                          horizon: int = 1000,
                          initialization_values: dict = {},
                          reward_discount: float = 0.99,
                          step_percentage: int = 20,
                          min_percentage:int = 20,
                          max_percentage:int = 200,
                          multipliers: list[int] = None,
                          use_gpu: bool = False,
                          print_progress: bool = True,
                          print_stats: bool = True,
                          save: bool = True,
                          save_folder: str = None,
                          save_analysis: bool = True
                          ) -> list[SimulationHistory]:
    '''
    Function to test the robustness of an agent in a environment where the odor plume's shape is altered by some percentage.

    A list of multipliers will be constructed from the min_percentage to 100% and up to max_percentage values with between each percentage step_percentage values.
    These percentage multipliers will be applied both in the x and y direction but cropped to the largest allowed multiplier along each axis.

    For each multiplier pair, a completed test will be run. This complete test consists in running from all possible start positions of the original environment.

    Parameters
    ----------
    agent : Agent
        The agent to run the shape robustness test on.
    environment : Environment, optional
        The environment to run the test in.
        By default, the environment linked to the agent will used.
        This parameter is intended if the environment needs to be modified compared to environment the agent was trained on.
    time_shift : int or np.ndarray, default=0
        The time at which to start the olfactory simulation array.
        It can be either a single value, or n values.
    time_loop : bool, default=True
        Whether to loop the time if reaching the end. (starts back at 0)
    horizon : int, default=1000
        The amount of steps to run the simulation for before killing the remaining simulations.
    initialization_values : dict, default={}
        In the case the agent is to be initialized with custom values,
        the paramaters to be passed on the initialize_state function can be set here.
    reward_discount : float, default=0.99
        How much a given reward is discounted based on how long it took to get it.
        It is purely used to compute the Average Discount Reward (ADR) after the simulation.
    step_percentage : int, default=20
        Starting at 100%, how much of a percentage step to do to reach the min and max percentages.
    min_percentage : int, default=20
        The minimum percentage of deformation to apply on the environment's odor plume.
    max_percentage : int, default=200
        The maximum percentage of deformation to apply on the environment's odor plume.
        If this value is larger than the maximum shape allowed by the margins, the largest allowed percentage will be used.
    multipliers : list[int], optional
        If provided, the step_percentage, min_percentage and max_percentage parameters will be ignored.
        A list of percentages of deformations to use to deforme the environment's odor plume.
    use_gpu : bool, default=False
        Whether to use the GPU to speed up the tests.
    print_progress : bool, default=True
        Whether to display a progress bar of how many test have been performed so far.
    print_stats : bool, default=True
        Whether to display statistics at the end of each test.
    save : bool, default=True
        Whether to save the results of each test to a save_folder.
        Each test's result will be under the name 'test_env_y-<y_multiplier>_x-<x_multiplier>.csv'
    save_folder : str, optional
        The path to which the test results are saved.
        If not provided, it will automatically create a new folder './results/<timestamp>_shape_robustness_test_<environment_name>/'
    save_analysis : bool, default=True
        Whether to save the analysis of the histories.
        It will be saved under a file named '_analysis.csv' in the save_folder.

    Returns
    -------
    all_histories : list[SimulationHistory]
        A list of SimulationHistory instances.
    '''
    # Handle the case an specific environment is given
    environment_provided = environment is not None
    if environment_provided:
        assert environment.shape == agent.environment.shape, "The provided environment's shape doesn't match the environment has been trained on..."
    else:
        environment = agent.environment

    # Gathering starting points
    start_points = np.argwhere(environment.start_probabilities > 0)
    n = len(start_points)

    # Generating multipliers
    if multipliers is None:
        with np.errstate(divide='ignore'):
            low_max_mult = ((environment.margins[:,0] / environment.data_source_position) + 1)
            high_max_mult = (1 + (environment.margins[:,1] / (environment.data_shape - environment.data_source_position)))
            max_mult = np.min(np.vstack([low_max_mult, high_max_mult]), axis=0)

        multipliers = [(100 - perc_mult) for perc_mult in range(0, (100-min_percentage)+step_percentage, step_percentage)[1:]] + [perc_mult for perc_mult in range(100, min(max_percentage, int(max(max_mult)*100)), step_percentage)]
    multipliers.sort()

    # Generating all combinations of multipliers
    mult_combinations = np.array(np.meshgrid(multipliers, multipliers, indexing='xy')).T.reshape(-1,2).astype(float)
    mult_combinations /= 100
    mult_combinations = mult_combinations[np.all(mult_combinations < max_mult, axis=1), :]

    # Save Folder name and creation
    if save or save_analysis:
        if save_folder is None:
            save_folder = f'./results/{datetime.now().strftime("%Y%m%d_%H%M%S")}_shape_robustness_test_' + environment.name

        if not os.path.exists(save_folder):
            os.mkdir(save_folder)

        print(f'The results will be saved to: {save_folder}\n')

    all_histories = []
    for mults in (tqdm(mult_combinations) if print_progress else mult_combinations):
        print(f'Testing on environment with height {int(mults[0]*100)}% and width {int(mults[1] * 100)}%')

        # Modifying environment
        modified_environment = environment.modify(multiplier=mults)

        # Running test
        hist = run_test(
            agent = agent,
            n = n,
            start_points = start_points,
            environment = modified_environment,
            time_shift = time_shift,
            time_loop = time_loop,
            horizon = horizon,
            initialization_values = initialization_values,
            reward_discount = reward_discount,
            print_progress = False,
            print_stats = print_stats,
            use_gpu = use_gpu
        )

        all_histories.append(hist)

        # Saving history
        if save:
            file_name = f'test_env_y-{int(mults[0]*100)}_x-{int(mults[1]*100)}'
            hist.save(file=file_name,
                      folder=save_folder,
                      save_analysis=False)

        print()

    # Analysis saving
    if save and save_analysis:
        analysis_df = analyse_shape_robustness(all_histories=all_histories, multipliers=(mult_combinations*100))
        analysis_file_name = '_analysis.csv'
        analysis_df.to_csv(save_folder + '/' + analysis_file_name, index=False)
        print(f'Shape robustness analysis saved to: {save_folder}/{analysis_file_name}')

    return all_histories

`train_and_test_agents(*agent_classes, environments, agent_thresholds=3e-06, agent_space_aware=False, agent_spacial_subdivisions=None, agent_actions=None, agent_additional_parameters=None, training_environment=None, training_parameters=None, simulation_time_shift=0, simulation_time_loop=True, simulation_horizon=1000, reward_discount=0.99, print_progress=False, print_stats=True, use_gpu=False, save_histories_path=None, save_result_table=True)`

A function to train (with a given training_environment) and test multiple agents in multiple given environments.

A summary table will be generated to compare the performance of the various agents within each environment.

Parameters:

Name	Type	Description	Default
`agent_classes`	`type[Agent]`	The classes of the agents to create, train and test.	`()`
`environments`	`list[Environment]`	The environment to test the agents in.	required
`agent_thresholds`	`float \| list[float]`	The olfactory thresholds. If an odor cue above this threshold is detected, the agent detects it, else it does not. If a list of thresholds is provided, the agent should be able to detect \|thresholds\|+1 levels of odor.	`= 3e-6`
`agent_space_aware`	`bool`	Whether the agent is aware of it's own position in space.	`= False`
`agent_spacial_subdivisions`	`ndarray`	How many spacial compartments the agent has to internally represent the space it lives in. By default, it will be as many as there are grid points in the environment.	`None`
`agent_actions`	`dict[str, ndarray] \| ndarray`	The set of action available to the agent. It should match the type of environment (ie: if the environment has layers, it should contain a layer component to the action vector, and similarly for a third dimension). Else, a dict of strings and action vectors where the strings represent the action labels. If none is provided, by default, all unit steps in all cardinal directions are included and such for all layers (if the environment has layers.)	`None`
`agent_additional_parameters`	`list[dict]`	Any additional parameters to pass over to the agent constructor. The list needs to be as long as the amount of agents provided.	`None`
`training_parameters`	`list[dict]`	Any additional parameters to pass over to the agent's training process. The list needs to be as long as the amount of agents provided.	`None`
`simulation_time_shift`	`int \| ndarray`	By how many steps to shift the t0 of the environment. It can be fixed or for each starting point of the simulation (in such case the amount of starting points must be same in each environments).	`= 0`
`simulation_time_loop`	`bool`	Whether the simulation t should loop back to 0 when reaching the max t of the given environment.	`= True`
`simulation_horizon`	`int`	For how many steps the simulation should run for.	`= 1000`
`reward_discount`	`float`	The reward discount that is used to compare the cummulative discounted reward.	`= 0.99`
`print_progress`	`bool`	Whether to show a progress bar for the simulations.	`= False`
`print_stats`	`bool`	Whether to print the stats (results) after each simulation.	`= True`
`use_gpu`	`bool`	Whether to use the gpu to speedup testing.	`= False`
`save_histories_path`	`str`	If the details of the simulation histories are to be saved, a path can be provided here.	`None`
`save_result_table`	`bool`	Whether the returned table should also be saved, it will be saved at the save_histories_path if it is set.	`= True`

Returns:

Name	Type	Description
`simulations_comparison_df`	`DataFrame`	A table with as row indices (agent, environment) pairs and columns the same columns as the output of SimulationHistory.compare_all.

Source code in olfactory_navigation/test_setups.py

def train_and_test_agents(*agent_classes: type[Agent],
                          environments: list[Environment],
                          agent_thresholds: float | list[float] = 3e-6,
                          agent_space_aware: bool = False,
                          agent_spacial_subdivisions: np.ndarray = None,
                          agent_actions: dict[str, np.ndarray] | np.ndarray = None,
                          agent_additional_parameters: list[dict] = None, # Has to be as long the agents
                          training_environment: Environment = None,
                          training_parameters: list[dict] = None, # Has to be as long the agents
                          simulation_time_shift: int | np.ndarray = 0,
                          simulation_time_loop: bool = True,
                          simulation_horizon: int = 1000,
                          reward_discount: float = 0.99,
                          print_progress: bool = False,
                          print_stats: bool = True,
                          use_gpu: bool = False,
                          save_histories_path: str = None,
                          save_result_table: bool = True
                          ) -> pd.DataFrame:
    '''
    A function to train (with a given training_environment) and test multiple agents in multiple given environments.

    A summary table will be generated to compare the performance of the various agents within each environment.

    Parameters
    ----------
    agent_classes: type[Agent]
        The classes of the agents to create, train and test.
    environments : list[Environment]
        The environment to test the agents in.
    agent_thresholds : float | list[float], default = 3e-6
        The olfactory thresholds. If an odor cue above this threshold is detected, the agent detects it, else it does not.
        If a list of thresholds is provided, the agent should be able to detect |thresholds|+1 levels of odor.
    agent_space_aware : bool, default = False
        Whether the agent is aware of it's own position in space.
    agent_spacial_subdivisions : np.ndarray, optional
        How many spacial compartments the agent has to internally represent the space it lives in.
        By default, it will be as many as there are grid points in the environment.
    agent_actions : dict[str, np.ndarray] | np.ndarray, optional
        The set of action available to the agent. It should match the type of environment (ie: if the environment has layers, it should contain a layer component to the action vector, and similarly for a third dimension).
        Else, a dict of strings and action vectors where the strings represent the action labels.
        If none is provided, by default, all unit steps in all cardinal directions are included and such for all layers (if the environment has layers.)
    agent_additional_parameters : list[dict], optional
        Any additional parameters to pass over to the agent constructor.
        The list needs to be as long as the amount of agents provided.
    training_parameters : list[dict], optional
        Any additional parameters to pass over to the agent's training process.
        The list needs to be as long as the amount of agents provided.
    simulation_time_shift : int | np.ndarray, default = 0
        By how many steps to shift the t0 of the environment.
        It can be fixed or for each starting point of the simulation (in such case the amount of starting points must be same in each environments).
    simulation_time_loop : bool, default = True
        Whether the simulation t should loop back to 0 when reaching the max t of the given environment.
    simulation_horizon : int, default = 1000
        For how many steps the simulation should run for.
    reward_discount : float, default = 0.99
        The reward discount that is used to compare the cummulative discounted reward.
    print_progress : bool, default = False
        Whether to show a progress bar for the simulations.
    print_stats : bool, default = True
        Whether to print the stats (results) after each simulation.
    use_gpu : bool, default = False
        Whether to use the gpu to speedup testing.
    save_histories_path : str, optional
        If the details of the simulation histories are to be saved, a path can be provided here.
    save_result_table : bool, default = True
        Whether the returned table should also be saved, it will be saved at the save_histories_path if it is set.

    Returns
    -------
    simulations_comparison_df : pd.DataFrame
        A table with as row indices (agent, environment) pairs and columns the same columns as the output of SimulationHistory.compare_all.
    '''
    training_stats = []
    simulation_histories = []

    # Print warning in case no training environment is set because the agent will be retrained for each environment
    if training_environment is None:
        print('[Warning] The training environment has not been provided so the agent will be re-trained for each environment provided...')

    # Loop through all agents
    for i_agent, agent_class, agent_additional_params, training_params in enumerate(zip(agent_classes, agent_additional_parameters, training_parameters)):
        print(f'Agent {i_agent} ({agent_class.__name__}):')

        if training_environment is not None:
            print(f'- Training...')

            agent: Agent = agent_class(
                environment=training_environment,
                thresholds=agent_thresholds,
                space_aware=agent_space_aware,
                spatial_subdivisions=agent_spacial_subdivisions,
                actions=agent_actions,
                **agent_additional_params)

            # Tracking
            process = psutil.Process(os.getpid())
            memory_before = process.memory_info().rss
            start_time = time.perf_counter()

            if not agent.trained:
                agent.train(**training_params)

            # End tracking
            end_time = time.perf_counter()
            memory_after = process.memory_info().rss

            # Saving tracking
            training_stats.append({'memory_used': memory_after - memory_before,
                                   'time_taken': end_time - start_time})

            # Testing trained agent on all environments
            agent_simulation_histories = []
            for i_environment, environment in enumerate(environments):
                print(f'- Testing environment {i_environment}')

                hist = run_all_starts_test(agent=agent,
                                           environment=environment,
                                           time_shift=simulation_time_shift,
                                           time_loop=simulation_time_loop,
                                           horizon=simulation_horizon,
                                           reward_discount=reward_discount,
                                           print_progress=print_progress,
                                           print_stats=print_stats,
                                           use_gpu=use_gpu)

                agent_simulation_histories.append(hist)
                print('')

                # Save simulation history if requested
                if save_histories_path is not None:
                    hist.save(file=f'Simualtions-agent_{i_agent}-environment_{i_environment}', folder=save_histories_path, save_analysis=False)

            simulation_histories.append(agent_simulation_histories)
            print('--------------------------------------')

        # No training environment
        else:
            agent_histories = []
            agent_training_stats = []

            # Loop through the environments
            for i_environment, environment in enumerate(environments):
                print(f'- Environment {i_environment}')

                agent: Agent = agent_class(
                    environment=environment,
                    thresholds=agent_thresholds,
                    space_aware=agent_space_aware,
                    spatial_subdivisions=agent_spacial_subdivisions,
                    actions=agent_actions,
                    **agent_additional_params
                    )

                # Tracking
                process = psutil.Process(os.getpid())
                memory_before = process.memory_info().rss
                start_time = time.perf_counter()

                agent.train(**training_params)

                # End tracking
                end_time = time.perf_counter()
                memory_after = process.memory_info().rss

                # Saving tracking
                agent_training_stats.append({'memory_used': memory_after - memory_before,
                                             'time_taken': end_time - start_time})

                hist = run_all_starts_test(agent=agent,
                                           time_shift=simulation_time_shift,
                                           time_loop=simulation_time_loop,
                                           horizon=simulation_horizon,
                                           reward_discount=reward_discount,
                                           print_progress=print_progress,
                                           print_stats=print_stats,
                                           use_gpu=use_gpu)

                agent_histories.append(hist)
                print('')

                # Save simulation history if requested
                if save_histories_path is not None:
                    hist.save(file=f'Simualtions-agent_{i_agent}-environment_{i_environment}', folder=save_histories_path, save_analysis=False)

            training_stats.append(agent_training_stats)
            simulation_histories.append(agent_histories)

        print('--------------------------------------')

    # Generating comparison result table
    all_agent_comparison_dfs = []
    n_environment = len(environments)
    for agent_simulation_histories, agent_training_stats in zip(simulation_histories, training_stats):
        agent_comparison_df = SimulationHistory.compare_all(agent_simulation_histories)
        agent_comparison_df['environment'] = [f'environment_{i}' for i in range(len(environments))]
        agent_comparison_df['training_memory_usage'] = [agent_training_stats['memory_used']] * n_environment if isinstance(agent_training_stats, dict) else [env_training_stats['memory_used'] for env_training_stats in agent_training_stats]
        agent_comparison_df['training_time_taken'] = [agent_training_stats['time_taken']] * n_environment if isinstance(agent_training_stats, dict) else [env_training_stats['time_taken'] for env_training_stats in agent_training_stats]
        agent_comparison_df.set_index('environment')

        all_agent_comparison_dfs.append(agent_comparison_df)

    simulations_comparison_df: pd.DataFrame = pd.concat(all_agent_comparison_dfs, keys=[f'agent_{i}' for i in range(len(agent_classes))], names='agent')

    # Save comparison table if needed
    if save_result_table:
        folder = './' if save_histories_path is None else save_histories_path
        file = 'Simulation_comparison-' + datetime.now().strftime('%Y%m%d_%H%M%S%f')
        simulations_comparison_df.to_csv(folder+file)

    return simulations_comparison_df