Context Navigation

Changes between Version 3 and Version 4 of WikiStart

Timestamp:: Mar 8, 2021, 9:31:37 AM (4 years ago)
Author:: Wouter Pasman
Comment:: --

Legend:

: Unmodified
: Added
: Removed
: Modified

WikiStart

-              v3
+              v4
+This is the public wiki of the BW4T-Matrx project.
+[[PageOutline]]
+= Welcome to BW4T-Matrx
+This is our repo for agents for BW4T running on matrx
+You can copy the code using
 {{{
 git clone https://tracinsy.ewi.tudelft.nl/BW4T-Matrx-CollaborativeAI/
 }}}
+see also https://tracinsy.ewi.tudelft.nl/trac/Matrx for our clone of matrx
+More info on matrx core http://docs.matrx-software.com/en/master/autoapi/matrx/world_builder/index.html#matrx.world_builder.WorldBuilder.add_agent
+= Usage
+== Installation
+* git clone the project as above
+* (optional) set up your virtual environment
+* pip install -r requirements.txt
+== Setup
+You can change the runsession.py or runtournament.py script that contains the setup settings for the run.
+=== runsession settings
+* Edit {{{bw4t/runsession.py}}} program with your favourite text editor
+* If you added a new TeamXXAgent, import it
+* locate the section containing the agents list, like this
+{{{
+agents = [
+    {'name':'agent1', 'botclass':RandomAgent, 'settings':{'slowdown':3, 'colorblind':True}},
+    {'name':'agent2', 'botclass':RandomAgent, 'settings':{'slowdown':2, 'shapeblind':True}},
+    {'name':'human1', 'botclass':Human, 'settings':{'slowdown':1}}
+    ]
+}}}
+* Edit this section as needed. Each line is creating one bot. Names must be unique. Botclass must match the class name of the agent you need. Slowdown X means that this agent slows by a factor X, the minimum number is 1.
+* save the file. Make sure it remains plain ASCI text and don't change the end-of-line markers.
+=== runtournament settings
+* Edit {{{bw4t/runtournament.py}}} program with your favourite text editor. Edit the agents as described with the runsession settings above.
+* Additionally there is a {{{teamsize}}} variable. Set it to the desired team size for the sessions.
+The tournament will create and run all possible teams consisting of <teamsize> agents selected from the agents list.
+== Run
+While running, do not open the csv log file that the session/tournament is writing to: doing this may block the file for writing and cause the session/tournament to halt with a write error.
+=== Run Session
+* run {{{python runsession.py}}}
+* In a web browser (eg safari or firefox) go to http://localhost:3000
+* Go to god mode and start the simulation by pressing the white triangle at the top
+* The log file is written to world1 directory.
+=== Run Tournament
+* run {{{python runsession.py}}}
+* The log file is written to worldXX directory
+* You can not use the browser to see the progress with tournaments
+= Human agent
+A human bot is controlled through the web browser. Go to http://localhost:3000 and select the humanbot you want to control. Now you can navigate the humanbot using the keyboard:
+||keyboard letter||effect||
+||WASD||move agent||
+||Q||grab an object||
+||E||to drop object||
+||R||to open a door||
+||F||to close it||
+Humans can also send and receive messages. Click on the balloons-icon at the top right. An additional area will appear at the right or at the bottom of the screen (notice, in many browsers this remains invisible because they hide the scroll bar). You can type your own messages and select agent(s) to adress to.
+= Implementing your agent
+To implement an agent for BW4T, all you need is to create a class that extends the {{{bw4t/BW4TBrain}}}.
+To do this, you must implement {{{filter_bw4t_observations}}} and {{{decide_on_bw4t_action}}}.
+If you implement __init__, you must call super().init(settings) with the original settings as first call in your init function.
+If you implement initialize, you must call super().initialize() with the original settings as first call in your initialize function.
+You cannot override get_log_data because we use it already for logging the outcomes of the game
+With that done, you can import and use your agent in runsession, check [[wiki:#Setup]].
+The following sections give some details about the two functions to implement
+== filter_bw4t_observations
+This class is called every tick, with the new State object.
+You should process here all observations, because observations may be visible only for one tick.
+Normally this function returns the state. If you return a modified state here, then decide_on_bw4t_actions will receive this modified state.
+== decide_on_bw4t_action
+This function is called every time your agent can take an action. This may not be every tick.
+The decide_on_bw4t_action receives the state as parameter.
+The agent code decides on this and returns what the next action is, in the form of a tuple {{{(action, params)}}}.
+The following actions  and params are possible in BW4T:
+||Action||required params||
+||'OpenDoorAction'||'door_range':1, 'object_id':doorId||
+||'MoveNorth' 'MoveEast' 'MoveSouth' 'MoveWest'|| ||
+||GrabObject||'object_id':blockId ||
+||DropObject||'object_id':blockId ||
+||None (do nothing)|| ||
+== BW4TBrain
+All agents for BW4T are required to extend our BW4TBrain.
+Implementing a BW4TBrain is slightly different from implementing a normal AgentBrain (the default superclass for agents):
+* decide_on_bw4t_action replaces decide_on_action. BW4T agents must not override decide_on_action
+* filter_bw4t_observations replaces filter_observations. Agents must not override filter_observations
+* agent action parameters are restricted, they can not set 'remove_range', 'grab_range', 'door_range' and 'action_duration'.
+* If an agent raises an exception, only a message is printed but the world continues running. This is because (1) the agent may be able to recover in the next round (2) other agents that are still running may be still able to complete the task. **You should however ensure such errors do not occur in the first place.**
+The following values are enforced for the parameters for all actions
+||action parameter||value||
+||grab_range||1||
+||max_objects||3||
+||action_duration||set by the agent slowdown setting||
+== State info
+When implementing your agent, you receive a State object.
+It contains a dictionary representing a gridworld state. The gridworld is a 2D grid with a set of objects/tiles, each at a given 'location' on the grid. Some tiles like walls are fixed, some tiles like bots and blocks can be moved
+The keys are unique IDs from the Matrx system, and the values again are dictionaries. The values dictionary contents depend on the type of tile. Typically agents will receive something like this:
+{{{
+dict: {
+'agent_0_in_team_0_344':
+  {
+    'isAgent': True,
+    'team': '',
+    'name': 'Agent 0 in Team 0',
+    'obj_id': 'agent_0_in_team_0_344',
+    'location': (1, 1),
+    'is_movable': True,
+    'action_set': ['MoveSouthWest', 'OpenDoorAction', 'MoveWest', 'MoveSouth', 'GrabObject', 'MoveEast', 'MoveNorthEast', 'DropObject', 'MoveNorth', 'RemoveObject', 'MoveNorthWest', 'Move', 'MoveSouthEast', 'CloseDoorAction'],
+    'carried_by': [],
+    'is_human_agent': False,
+    'is_traversable': True,
+    'class_inheritance': ['BlockWorldAgent', 'AgentBrain', 'object'],
+    'is_blocked_by_action': False,
+    'is_carrying': [],
+    'sense_capability': {<class 'matrx.objects.agent_body.AgentBody'>: 2, <class 'builder.CollectableBlock'>: 2, '*': inf},
+    'visualization': {'size': 1.0, 'shape': 1, 'colour': '#92f441', 'depth': 100, 'opacity': 1.0, 'show_busy': False},
+    'current_action': None,
+    'current_action_args': {},
+    'current_action_duration': -1000000,
+    'current_action_started_at_tick': -1000000
+  },
+  'human_0_in_team_0_345':{
+    'isAgent': True,
+    'key_action_map': {'w': 'MoveNorth', 'd': 'MoveEast', 's': 'MoveSouth', 'a': 'MoveWest', 'q': 'GrabObject', 'e': 'DropObject', 'r': 'OpenDoorAction', 'f': 'CloseDoorAction'},
+    'team': '',
+    'name': 'Human 0 in Team 0',
+    'obj_id': 'human_0_in_team_0_346',
+    'location': (3, 1),
+    'is_movable': True,
+    'action_set': ['Move', 'MoveWest', 'GrabObject', 'MoveSouthEast', 'MoveNorthWest', 'MoveNorthEast', 'MoveEast', 'MoveSouth', 'OpenDoorAction', 'RemoveObject', 'MoveSouthWest', 'DropObject', 'MoveNorth', 'CloseDoorAction'],
+    'carried_by': [],
+    'is_human_agent': True,
+    'is_traversable': False,
+    'class_inheritance': ['HumanAgentBrain', 'AgentBrain', 'object'],
+    'is_blocked_by_action': False,
+    'is_carrying': [],
+    'sense_capability': {<class 'matrx.objects.agent_body.AgentBody'>: 2, <class 'builder.CollectableBlock'>: 2, '*': inf},
+    'visualization': {'size': 1.0, 'shape': 1, 'colour': '#92f441', 'depth': 100, 'opacity': 1.0, 'show_busy': False},
+    'current_action': None,
+    'current_action_args': {},
+    'current_action_duration': -1000000,
+    'current_action_started_at_tick': -1000000
+  },
+  'world_bounds_-_wall@(23,_4)_0': {
+    'room_name': 'world_bounds',
+    'name': 'world_bounds - wall@(23, 4)',
+    'obj_id': 'world_bounds_-_wall@(23,_4)_0',
+    'location': (23, 4),
+    'is_movable': False,
+    'carried_by': [],
+    'is_traversable': False,
+    'class_inheritance': ['Wall', 'EnvObject', 'object'],
+    'visualization': {'size': 1.0, 'shape': 0, 'colour': '#000000', 'depth': 80, 'opacity': 1.0}
+  }
+  'world_bounds_-_wall@(0,_5)_0': {'room_name': 'world_bounds', ...}
+  'room_0_-_wall@(4,_4)_94': {'room_name': 'room_0', name:...}
+  'room_0_-_wall@(5,_7)_102': {
+    'room_name': 'room_0',
+    'name': 'room_0 - wall@(5, 7)',
+    'obj_id': 'room_0_-_wall@(5,_7)_102',
+    'location': (5, 7),
+    'is_movable': False,
+    'carried_by': [],
+    'is_traversable': False,
+    'class_inheritance': ['Wall', 'EnvObject', 'object'],
+    'visualization': {'size': 1.0, 'shape': 0, 'colour': '#8a8a8a', 'depth': 80, 'opacity': 1.0}
+  }
+  'room_0_-_wall@(6,_4)_101'
+  ....
+  'room_0_-_door@(7,_7)_109': {
+    'is_open': False,
+    'room_name': 'room_0',
+    'name': 'room_0 - door@(7, 7)',
+    'obj_id': 'room_0_-_door@(7,_7)_109',
+    'location': (7, 7),
+    'is_movable': False,
+    'carried_by': [],
+    'is_traversable': False,
+    'class_inheritance': ['Door', 'EnvObject', 'object'],
+    'visualization': {'size': 1.0, 'shape': 0, 'colour': '#640000', 'depth': 80, 'opacity': 1.0}
+  },
+  'room_0_area_110: {
+    'room_name': 'room_0',
+    'name': 'room_0_area',
+    'obj_id': 'room_0_area_110',
+    'location': (5, 5),
+    'is_movable': False,
+    'carried_by': [],
+    'is_traversable': True,
+    'class_inheritance': ['AreaTile', 'EnvObject', 'object'],
+    'visualization': {'size': 1.0, 'shape': 0, 'colour': '#0008ff', 'depth': 80, 'opacity': 0.1}
+  }
+  ...
+  'room_1_-_wall@(5,_7)_118': {'room_name': 'room_0', name:...}
+  ........................
+  'Drop_off_0_338', {'drop_zone_nr': 0, 'is_drop_zone': True, 'is_goal_block': False, 'is_collectable': False, 'name': 'Drop off 0', 'obj_id': 'Drop_off_0_338', 'location': (12, 21), 'is_movable': False, 'carried_by': [], 'is_traversable': True, 'class_inheritance': ['AreaTile', 'EnvObject', 'object'], 'visualization': {'size': 1.0, 'shape': 0, 'colour': '#878787', 'depth': 80, 'opacity': 1.0}}
+  'Drop_off_0_339',
+  'Drop_off_0_340',
+  'Collect_Block_341',
+  'Collect_Block_342',
+  'Collect_Block_343', {'drop_zone_nr': 0, 'is_drop_zone': False, 'is_goal_block': True, 'is_collectable': False, 'name': 'Collect Block', 'obj_id': 'Collect_Block_343', 'location': (12, 21), 'is_movable': False, 'carried_by': [], 'is_traversable': True, 'class_inheritance': ['CollectableBlock', 'EnvObject', 'object'], 'visualization': {'size': 0.5, 'shape': 1, 'colour': '#0008ff', 'depth': 85, 'opacity': 0.5}}
+  'World': {
+    'nr_ticks':0,
+    'curr_tick_timestamp':1609927984782732,
+    'grid_shape':(24,25),
+    'team_members': ['agent_0_in_team_0_344', 'human_0_in_team_0_345'],
+    'world_ID': 'world_1',
+    'vis_settings': {'vis_bg_clr': '#C2C2C2', 'vis_bg_img': None}}
+}
+}}}
+The World object is somewhat special. The other objects contain a 'obj_id', 'class_inheritance" field and other fields that make it possible to identify the type of object contain in here.
+=== Blocks
+Blocks only appear when the agent is near the block (within block_sense_range, see setup section), you get an additional item like
+{{{
+Block_in_room_6_330:{
+  'is_drop_zone': False,
+  'is_goal_block': False,
+  'is_collectable': True,
+  'name': 'Block in room_2',
+  'obj_id': 'Block_in_room_2_318',
+  'location': (20, 5),
+  'is_movable': True,
+  'carried_by': [],
+  'is_traversable': True,
+  'class_inheritance': ['CollectableBlock', 'EnvObject', 'object'],
+  'visualization': {'size': 0.5, 'shape': 1, 'colour': '#0dff00', 'depth': 80, 'opacity': 1.0}}
+}}}
+'is_goal_block' is set for the "GhostBlock" objects that are placed in the dropzone. These objects contain also a 'visualize' property with 'colour', 'shape' and 'size' values that must match the block that is to be dropped in this zone.
+If you move a block, it keeps the same name eg 'Block_in_room_6_330' even though it is not in the said room anymore.
+=== Room doors
+The room door objects can have only one roomid assigned. Therefore you can assume that all rooms connect to the corridor.
+=== State functions
+state has some utility functionality, for example
+||get_all_room_names()||the available room_name properties on the map||
+||get_room_objects('room_1') ||gives the areas (tiles) that make up room 1||
+||get_traverse_map()||returns dict {(0,0):False, (0,1):True...} where (X,Y) is the map coordinate and the boolean indicates whether the location can be traversed.||
+You can create a crude map of the environment with the get_traverse_map() like this
+{{{
+        print("Extracting the map")
+        map=state.get_traverse_map()
+        width, length = state.get_world_info()['grid_shape']
+        for y in range(length):
+            for x in range(width):
+                if map[(x,y)]:
+                    print("*",end="")
+                else:
+                    print(" ", end="")
+            print()
+}}}
+=== The object fields
+Most fields are undocumented and you will have to figure it out yourself or even reverse engineer using the matrx code. Here are a few:
+||location||a tuple (x,y) with the x,y location of the object on the grid
+||isAgent||set when the object contains info about an agent||
+||class_inheritance||This is the chain off MATRX used classes the object inherits from. Meaning that a value of [class_B, class_A, EnvObject, object] means that the dictionary with properties comes from a Class_B instance, which inherits from Class_A, which in turn inherits from MATRX’ base object class EnvObject. Finallt, EnvObject in turn inherits from ‘object’; the base Python class from which everything in Python inherits*. So with the snippet {{{state[“some_object_id”][“class_inheritance”][0]|}}} you get the string name of the class definition of the object with ID “some_object_id”.||
+= Path Planner
+There is a simple path planner available in matrx, called Navigator. It takes (x,y) coordinates as input, and it needs a state_tracker as well. You can use it like this
+{{{
+    state_tracker = StateTracker(agent_id=self.agent_id)
+    navigator = Navigator(agent_id=self.agent_id,
+            action_set=self.action_set, algorithm=Navigator.A_STAR_ALGORITHM)
+}}}
+To use the navigator you need something like
+{{{
+    navigator.reset_full()
+    navigator.add_waypoints([targetloc])
+}}}
+navigator has a is_circular option to create a circular path that loops back to the first waypoint after reaching the last. Check the python documentation for details.
+If you are not using reset_full, old path points may remain and the path may not work
+Now you can extract the next steps using
+{{{
+  self._state_tracker.update(state)
+  action=navigator.get_move_action(state_tracker)
+or
+}}}
+It will take the current agent location as start point and navigate to the waypoint you can add later. The StateTracker contains some processed map so better
+Check the PatrollingAgent example to see how to use it.
+FAIK it can only reach reachable places, it will not consider opening doors and refuse planning if the end point can not be reached. Once a door is open, path planning will work
+= Duration of actions
+BW4TAgents can not actively use duration of actions. This section is only here to explain how duration of actions actually works.
+In the gridworld/BW4T world, all time is measured in ticks. Actions and messages are all executed at a single tick. These ticks are independent from the wall time clock and thus this world can be slowed down without changing the behaviour. In a typical simulation, around 10 ticks are executed in a second, but this may differ depending on the CPU speed and load.
+In grid_world, all actions you can also set 'action_duration'. The minimum is 1, if you set it smaller matrx will change it to 1. This field indicates after how many ticks the action will actually be performed, so 10 means that nothing happens the coming 9 ticks and the action will then be performed in tick 10 from now.
+In BW4T agents can NOT change the action_duration, because BW4T uses the action_duration field to control the overall speed of an agent. Your agent will raise an exception if you try to set action_duration.
+'duration' works like this: once an agent decides to do an action by returning this from the decide_on_action function, e.g. move north, with an action duration of 5, that agent will be "busy" for 4 ticks, and do the action on the 5th. The agent being busy is indicated with the gears animation in the frontend.
+What this means for the agent is that when the agent is busy with an action, the agent can only perceive the world, but not decide on a new action: only filter_observations is called for agents that are busy with an action, and decide_on_action is skipped. In that manner the situation you propose is impossible, as the agent is blocked from deciding on any new actions once it has initiated an action that takes multiple ticks.
+= The next required block
+The BW4T world only accepts the placed blocks if only the correct blocks are placed in the right places in the right order. So it is not acceptable if other blocks are placed on the dropzone as well, nor is placement accepted if the order of placement was incorrect.
+Since agents only can see the blocks when they are very close to the blocks, agents can usually not determine the order in which the blocks were placed, thus not whether the placement order is correct. So if all blocks are there and the game is not completed, the only solution then is to re-place all the blocks.
+= Block Shapes
+The blocks have a number of options for their shape
+||shape number||shape||
+||img field||as in the image data?||
+||0||rectangle||
+||1||triangle||
+||2||circle||
+= Communication
+A basic communication mechanism is provided to all agents.
+Use this to send a message:
+{{{
+    msg = Message(content='Delivering block'), from_id='me' )
+    self.send_message(msg)
+}}}
+This example message is not too informative but you can extend it to make it more useful. The from_id is a required field but can contain anything. It probably is useful to put the agent ID there. Check the matrx.messages.Message class for more details.
+If you do not specify {{{to_id}}}, your message will be broadcasted to all agents, including yourself.
+Received messages appear some time later (probably the next tick) in {{{self.received_messages}}} list. The received messages remain in the list only one tick so you need to scan them every tick if you don't want to miss messages. (but see [https://github.com/matrx-software/matrx/issues/240 issue])
+= Log files
+In the world_1 directory  a log file is written for each session. It's  in the CSV file format, with ";" as the column separator. Its first line contains the column information. Each row contains the info for one tick.
+||Column||meaning||
+||done||contains True if the goal has been achieved, else False. ||
+||Agent*_msgs||contain the number of messages sent by the agent in the previous tick||
+||Agent*_acts||The action done by the agent in this tick||
+||tick_nr||The tick value for this row||
+= Running the tests
+To run the tests using eclipse pyDev,
+* Set the PYTHONPATH, add the root of project as source folder
+* All test functions must have names test_XXX. Otherwise it don't work
+With that, right click on the project and select 'run as / python unit test'.