Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
27 commits
Select commit Hold shift + click to select a range
b32b3b2
winson's commit
May 14, 2025
128749a
Merge branch 'main' into winson
May 18, 2025
df384cf
merge
May 19, 2025
26de85e
removed unused packages
W1nson May 19, 2025
8cd03b8
merge main
W1nson May 19, 2025
dc4c5bb
created frontend and template code
W1nson May 19, 2025
96a66c4
make the window resizable
W1nson May 19, 2025
1945ea3
Merge branch 'main' into winson
W1nson May 20, 2025
ec06548
hard code the clientId and modify some UI
W1nson May 20, 2025
d0e8dab
Merge branch 'winson' of https://github.com/bin123apple/InfantAgent i…
W1nson May 20, 2025
384f7e3
update some components
W1nson May 21, 2025
c819cb9
work in progress
W1nson May 26, 2025
fdcdc80
minor update
W1nson Oct 13, 2025
35f323a
Merge branch 'main' into winson
W1nson Oct 13, 2025
8dfeb70
working vllm, computer container
W1nson Oct 13, 2025
3a36043
adding docker setup
W1nson Oct 14, 2025
b438055
migrate computer container into docker compose
W1nson Oct 15, 2025
e66bfd4
Merge branch 'main' into winson
W1nson Oct 15, 2025
04136f9
dockerized infantagent, requires real testing, passed pytest
W1nson Oct 15, 2025
04a603b
able to send prmopt to the infant agent now
W1nson Oct 15, 2025
56b7d76
merge
W1nson Oct 15, 2025
3b17123
Merge branch 'winson' of https://github.com/bin123apple/InfantAgent i…
W1nson Oct 15, 2025
6514b9c
update
W1nson Feb 25, 2026
1db1867
Merge branch 'winson' of https://github.com/bin123apple/InfantAgent i…
W1nson Feb 25, 2026
d286703
working dockering frontend
W1nson Feb 25, 2026
94a665c
updated frontend
W1nson Apr 13, 2026
4dbe62b
update to claude sonnect 4.6
W1nson Apr 23, 2026
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
64 changes: 64 additions & 0 deletions .dockerignore
Original file line number Diff line number Diff line change
@@ -0,0 +1,64 @@
# Git files
.git
.gitignore
.gitattributes

# Python cache
__pycache__/
*.py[cod]
*$py.class
*.so
.Python
*.egg-info/
dist/
build/
*.egg

# Virtual environments
.venv/
venv/
ENV/
env/

# IDE
.vscode/
.idea/
*.swp
*.swo
*~

# OS files
.DS_Store
Thumbs.db

# Workspace and cache (will be mounted as volumes)
workspace/
cache/

# Documentation and examples
README.md
LICENSE
asset/
examples/
evaluation/

# Docker files themselves
Dockerfile
docker-compose.yaml
.dockerignore

# Logs
*.log
logs/

# Test files
tests/
*.test
.pytest_cache/
.coverage
htmlcov/

# Temporary files
tmp/
temp/
*.tmp
61 changes: 61 additions & 0 deletions .env.example
Original file line number Diff line number Diff line change
@@ -0,0 +1,61 @@
# InfantAgent Environment Configuration
# Copy this file to .env and fill in your values

# Required: Anthropic API Key
# Get your key from: https://console.anthropic.com/
ANTHROPIC_API_KEY=your_anthropic_api_key_here

# Optional: Hugging Face Token (for downloading UI-TARS model)
# Get your token from: https://huggingface.co/settings/tokens
# Required if the model is gated or for faster downloads
HF_API_KEY=your_huggingface_token_here

# Optional: Hugging Face cache directory
# HF_HOME=~/.cache/huggingface

# Optional: GPU Configuration
# Specify which CUDA devices to use (comma-separated list)
# Default GPU allocation:
# - GPU 0: Computer container (display rendering)
# - GPU 2,3: vLLM server (UI-TARS model with tensor parallelism)
# - InfantAgent server: Runs on CPU (no GPU needed)
# Note: Adjust based on your available GPUs
# CUDA_VISIBLE_DEVICES=0,2,3

# Optional: Computer Container Configuration
# NVIDIA driver type: Tesla, GeForce, or other
# NVIDIA_DRIVER=Tesla
# GPU device(s) visible to the computer container
# NVIDIA_VISIBLE_DEVICES=0
# Specific GPU device ID(s) for docker deploy (e.g., '0' or '0,1')
# NVIDIA_DEVICE_ID=0
# Render type: gpu or software
# RENDER_TYPE=gpu
# User account to create in the container: infant or root
# CREATE_USER_ACCOUNT=infant
# Workspace mount path on host (default: ./workspace)
# WORKSPACE_MOUNT_PATH=./workspace
# Cache directory on host (default: ./cache)
# CACHE_DIR=./cache

# Optional: Other API Keys (if using different models)
# OPENAI_API_KEY=your_openai_key_here
# GOOGLE_API_KEY=your_google_key_here

# Optional: Custom Model Configuration
# Uncomment and modify if you want to use different models
# MODEL=claude-sonnet-4-6
# MODEL_OSS=ByteDance-Seed/UI-TARS-1.5-7B

# Optional: Agent Configuration
# MAX_ITERATIONS=100
# MAX_BUDGET_PER_TASK=20
# DEBUG=false

# Optional: Network Configuration
# AGENT_PORT=8000
# GUI_PORT=4443 # Guacamole web interface port (maps to container's 8080)
# SSH_PORT=58673 # SSH port for computer container (maps to container's 22)

# Optional: Timezone
# TZ=America/New_York
Loading