GitHub - georgysavva/appsim: AppSim: A Learned World Model for an App API

AppSim: Todoist World-Models

This repository contains the code to:

collect realistic Todoist request/response trajectories via AppWorld + OpenAI models,
train a causal language model to predict the next Todoist API response given a history of req/res pairs,
evaluate both trained models and ChatGPT-family models on the same trajectories.

It uses Hydra for configuration, Hugging Face Transformers for training (optionally with LoRA), and scripts for generation and evaluation.

Installation

cd /home/georgy/repos/appsim
conda env create -f env.yml -n appsim
conda activate appsim
pip install -r requirements.txt
pip install -e .

Environment:

Set your OpenAI API key: export OPENAI_API_KEY=...

Data format

Trajectories are plain-text files with alternating lines of req: and res::

req:create_project(name='Alpha', color='red', description='...', is_favorite=False)
res:{'message': 'Project created.', 'project_id': 280}
req:show_project(project_id=280)
res:{'name': 'Alpha', 'color': 'red', ... 'project_id': 280, ...}

Models are trained to predict each next res: given the history so far.

Collect Todoist trajectories (via AppWorld + OpenAI)

python scripts/collect_todoist_data.py \
  --save_dir /abs/path/to/data/todoist/raw \
  --num_trajectories 50 \
  --trajectory_base_length 50 \
  --max_appworld_retry 7 \
  --start_trajectory_id 0 \
  --chatgpt_model gpt-4o-mini \
  --openai_api_key "$OPENAI_API_KEY"

Outputs (per --save_dir):

trajectory_0.txt, trajectory_1.txt, ...
stats/trajectory_0.json (aggregate counts)
state/trajectory_0.json (final state snapshot)

Train a world model

python src/main \
  experiment_name=my_run \
  common.project_storage_base_path=/abs/path/for/outputs \
  dataset.path=/abs/path/to/data/todoist

Generate with a trained model

python scripts/trained_model_generate.py \
  --trajectory_path /abs/path/to/data/todoist/test/trajectory_0.txt \
  --checkpoint_path /abs/path/for/outputs/runs/<run_name>/checkpoint-<step> \
  --temperature 0.1 \
  --max_new_tokens 128 \
  --output_dir output/

Generate with ChatGPT-family models

python scripts/chatgpt_generate.py \
  --trajectory_path /abs/path/to/data/todoist/test/trajectory_0.txt \
  --chatgpt_model gpt-4.1 \
  --temperature 0.1 \
  --output_dir output/ \
  --openai_api_key "$OPENAI_API_KEY"

Evaluate a generated trajectory

python scripts/evaluate_trajectory.py \
  --generated_trajectory_path output/gpt-4.1-temp_0.1/trajectory_0.txt \
  --gt_trajectory_path /abs/path/to/data/todoist/test/trajectory_0.txt

Name		Name	Last commit message	Last commit date
Latest commit History 188 Commits
config		config
scripts		scripts
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
env.yml		env.yml
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

AppSim: Todoist World-Models

Installation

Data format

Collect Todoist trajectories (via AppWorld + OpenAI)

Train a world model

Generate with a trained model

Generate with ChatGPT-family models

Evaluate a generated trajectory

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

License

georgysavva/appsim

Folders and files

Latest commit

History

Repository files navigation

AppSim: Todoist World-Models

Installation

Data format

Collect Todoist trajectories (via AppWorld + OpenAI)

Train a world model

Generate with a trained model

Generate with ChatGPT-family models

Evaluate a generated trajectory

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages