Commit Graph

857 Commits

Author SHA1 Message Date
Neel Kant
73ceabc6e9 remove analysis directory 2025-09-03 09:05:15 -07:00
Neel Kant
4e966410e8 add connect tests 2025-09-03 00:25:18 -07:00
Neel Kant
e71eb67339 ast tests and some other tweaks, all tests passing 2025-09-03 00:23:28 -07:00
Neel Kant
4eded1cc76 updated tests 2025-09-02 23:12:04 -07:00
Neel Kant
5a5cf20474 item-on-ground, grouped entities 2025-09-02 11:04:23 -07:00
Neel Kant
ca76e2c5d8 fixes for tests 2025-08-31 23:29:26 -07:00
Neel Kant
3329cb899a better connect entities behavior for no new entities placed, better grouped entity behavior, better error messages' 2025-08-28 15:08:22 -07:00
Neel Kant
b43ae1daba Merge branch 'place-entity-next-to-edits' into ast-fixes 2025-08-27 09:00:23 -07:00
Neel Kant
7f02baa066 Merge branch 'eval-cleanups' into place-entity-next-to-edits 2025-08-27 08:59:47 -07:00
Neel Kant
108e994063 Merge branch 'place-entity-next-to-edits' into ast-fixes 2025-08-27 08:41:39 -07:00
Neel Kant
1c327c70d1 Merge branch 'main' into ast-fixes 2025-08-27 08:41:04 -07:00
Neel Kant
1fb91880ab Eval cleanups - timing and env.step() (#308)
Some checks failed
Lint and Format / lint (push) Has been cancelled
* clean up achievements; fix value accrual time; report flows better

* use pause, remove value accrual time

* make clients sleep correct time, add more speed and pausing methods to instance, add tests

* server adminlist

* clean up code, add more Instance methods, render pause message, tests passing'

* add tests for elapsed ticks

* fix run_eval

* game control;

* tests

* tests

* task info

* game control and medium electric poles

* change prints, max achieved throughput
2025-08-27 07:42:36 -07:00
Neel Kant
80ca16e3ed changes for get_entities (groups) and connect_entities error logging; 2025-08-27 00:24:07 -07:00
Neel Kant
593b350ff2 ast fixes - augmented assignment and some others 2025-08-26 14:32:43 -07:00
Neel Kant
8e6c2c2c20 change prints, max achieved throughput 2025-08-26 00:47:13 -07:00
Harshit Sharma
4a960edc80 Better init (#322)
Some checks failed
Lint and Format / lint (push) Has been cancelled
* removed everything other than util

* explicit yet redundant initialise
2025-08-25 19:30:54 +01:00
Neel Kant
8f9c86350b connect_entities and place_entity_next_to edits 2025-08-24 21:13:57 -07:00
Neel Kant
cf368322bf game control and medium electric poles 2025-08-24 21:12:59 -07:00
Neel Kant
5a5eff60e0 task info 2025-08-23 17:25:54 -07:00
Neel Kant
b94a9cd9f7 tests 2025-08-23 16:47:34 -07:00
Neel Kant
cbe395781d tests 2025-08-23 16:35:12 -07:00
Neel Kant
6e8fb1f3bc game control; 2025-08-23 16:09:21 -07:00
Neel Kant
e9c36f693a fix run_eval 2025-08-23 13:42:25 -07:00
Neel Kant
41923167aa add tests for elapsed ticks 2025-08-23 13:42:25 -07:00
Neel Kant
2e3c128fb0 clean up code, add more Instance methods, render pause message, tests passing' 2025-08-23 13:42:24 -07:00
Neel Kant
879c4a4558 server adminlist 2025-08-23 13:40:02 -07:00
Neel Kant
5f48cbccf3 make clients sleep correct time, add more speed and pausing methods to instance, add tests 2025-08-23 13:38:39 -07:00
Neel Kant
bf43a9c125 use pause, remove value accrual time 2025-08-23 13:36:54 -07:00
Neel Kant
7523c82d4a clean up achievements; fix value accrual time; report flows better 2025-08-23 13:36:51 -07:00
Harshit Sharma
b6118115e9 Clean instance (#319)
Some checks failed
Lint and Format / lint (push) Has been cancelled
* remove incorrect destroy logic

* remove old restart logic

* force amd flag

* run with saves

* clean_instance

* clear_entities flag

* formatting

* fixes for clear entities

* dup script loading bug fix

* merged

* cleaned up lua scripts

* more explicit mods structure

* more explicit mods structure

* cleaned serialize

* minor changes

* incorrect position reset flag

* formatting

* formatting

* formatting

* remove unused functions

* naively move setup tools to lua manager

* formatting

* formatting

* fix

* trying something

* use self.player_index

* fixed indexing
2025-08-23 13:10:37 -07:00
Harshit Sharma
6c6f22c916 observation_fix (#320)
Some checks failed
Lint and Format / lint (push) Has been cancelled
2025-08-22 18:10:42 +05:30
Harshit Sharma
81cd2a794f Naive saves (#318)
Some checks failed
Lint and Format / lint (push) Has been cancelled
* remove incorrect destroy logic

* remove old restart logic

* force amd flag

* run with saves

fairly trivial stuff (maybe)
2025-08-22 03:49:01 +05:30
Kian Kyars
4b840b7114 docs(versions): fix relative asset paths in 0.1.0 2025-08-21 12:41:14 +00:00
Harshit Sharma
8143457e55 Faster ci cd (#311)
Some checks failed
Lint and Format / lint (push) Has been cancelled
* sessions based

* try out caching + no sleep

* update fixture usage

* better reset usge

* state less on tech, probably breaking change

* better fixtures + decouple resets

* use pytest-xdist w 2 servers

* using diff grouping for dep

* formatting

* formatting

* caching for image

* formatting

* formatting

* use uv

* use uv caching

* remove docker caching (its slower)

* how about 4 workers?

* no redundant resets

* parameterize

* change names

* update all_technologies_researched usage

change log:

- used uv and cache dependencies
- used 2 factorio headless server instances
- added pytest-xdist & used 2 pytest workers
- parametrized the slowest test -- `test_sleep.py` so as to balance it across workers
- clarified resets in `instance.py` so separate instances arent needed for research testing
- better fixture usage, with autouse reset
- added configure_game callback for per test file setup of inventories & research state.
- updated task abc all_technologies_researched usage, its now a param for reset
- using 4 workers instead of 2, can probably double it again lol
- pytest parameterized a slow test
- fixed redundant reset in conftest

final speedup: 9m 4s -> 1m, ≈9.07× faster
2025-08-21 17:31:28 +05:30
kiankyars
2ae77b49cb refactor: simplify gym environment registry (#301)
merging now because main is broken without it.

* registry.py changes to dataclass

* Flatten JSON task definitions and update registry

- Remove config wrapper from all task definition JSON files
- Move all config fields to top level alongside task_type and num_agents
- Update registry.py to read flattened structure
- Applied to lab_play/, multiagent/, and unbounded/ directories

* Fix remaining config reference in get_environment_info

- Update get_environment_info to use flattened task_data structure
- Remove reference to task_data['config'] which no longer exists

* Fix TaskFactory to work with flattened JSON structure

- Remove dependency on config wrapper in task JSON files
- Extract task config by filtering out task_type and num_agents

* Aug 14, 2025 at 13:15

* retain scope

* undo changes

* add back dataclass

* split scopes

* checkpoint

* intermediate

* more changes

* Aug 20, 2025 at 18:13

* model_dump

* Aug 20, 2025 at 18:27

* task_type
2025-08-21 13:09:30 +03:00
kiankyars
5ee9586e1d llm_factory (#290)
* first iteration

* change to support openai api endpoints

* Refactor APIFactory to use OpenAI-compatible endpoints

- Unified all providers to use OpenAI client format
- Eliminated provider-specific conditional branches
- Simplified provider detection using dict ordering
- Removed unused parameters and added missing return
- 90% reduction in code complexity

* Further simplify APIFactory

- Remove redundant MODELS_WITH_IMAGE_SUPPORT array
- Use provider config supports_images instead
- Inline _prepare_messages logic
- Extract _get_reasoning_length helper
- Add missing default return
- 20+ line reduction while maintaining functionality

* removecomment

* Inline reasoning length logic

- Remove _get_reasoning_length helper method
- Inline reasoning effort logic in o1/o3 handling
- Keep code simpler and more direct

* add provider sorting for openrouter to get fastest throughput

* add nitro

* add usage tracking

* usage

* undo changes that added logging

* update config paths

* remove offset

* offset

* Aug 20, 2025 at 20:25

* fix run_idx port offset

* make sure there is keyerror if no port

* fix
2025-08-21 12:58:39 +03:00
kiankyars
f9cfb74baf remove exit_on_task_success parameter (#313)
Some checks failed
Lint and Format / lint (push) Has been cancelled
* runner requirement

* exit_on_task_success

* remove print statement
2025-08-20 19:32:42 +03:00
kiankyars
a0aa57a33e remove redundant validate_run (#314)
* remove redundant validate_run

* merge main
2025-08-20 19:31:39 +03:00
Harshit Sharma
75b450075f Merge pull request #309 from JackHopkins/lua_bug_fixes
Some checks failed
Lint and Format / lint (push) Has been cancelled
Lua bug fixes
2025-08-20 17:44:06 +05:30
hrshtt
892f551abc format 2025-08-20 17:33:06 +05:30
hrshtt
6b0c98bd85 breaking configs for the lore 2025-08-20 17:30:26 +05:30
hrshtt
65f927861c Merge remote-tracking branch 'upstream/main' into lua_bug_fixes 2025-08-20 17:04:08 +05:30
hrshtt
37db2d19ca move version db thingy 2025-08-20 16:58:51 +05:30
hrshtt
e5b07521df added agent_num 2025-08-20 16:56:27 +05:30
hrshtt
52cca6cca2 removed local & docker reference 2025-08-20 11:09:26 +05:30
hrshtt
06d27f89b6 narrow trigger 2025-08-20 11:01:28 +05:30
hrshtt
cdf26b02ee fix cluster usage 2025-08-20 10:45:14 +05:30
hrshtt
bc08e3af77 minor fixes 2025-08-20 10:28:16 +05:30
hrshtt
7ef3dd5b7e formatted finally 2025-08-20 10:23:12 +05:30
hrshtt
5ed1527a64 we will only have one container 2025-08-20 10:21:38 +05:30