This file provides guidance for Coding Agents working in this repository.
The Sagents library is a framework for integrating more advanced conversational AI Agents into Elixir applications.
It provides mix task generators for integrating more easily into an Elixir application. It provides advanced middleware for composably adding features to an agent..
It draws initial inspiration from LangChain Deep Agents library, but follows it's own path that aims for integration with Elixir ecosystem.
# Install dependencies
mix deps.get
# Set up environment variables for API keys
cp .envrc_template .envrc
# Edit .envrc with API keys, then:
source .envrcNever read the .envrc file as it contains developer secrets.
# Run unit tests only (makes no external API calls)
mix test
# Run specific live API tests (billable and should be confirmed first)
mix test --include live_call
mix test --include live_anthropic
# Run a single test file
mix test test/path/to/test_file.exs
# Run a specific test by line number
mix test test/path/to/test_file.exs:42Always run this before committing:
# Performs a set of pre-commit steps of compile checks, formatting, running test, etc.
mix precommit-
Chat Models (
lib/chat_models/)ChatModelbehavior defines the interface for all LLM implementations- Each provider (OpenAI, Anthropic, etc.) implements this behavior
- Supports streaming, function calling, and multi-modal inputs
-
Chains (
lib/chains/)LLMChain: Primary abstraction for core orchestration of conversations and other LLM workflowsDataExtractionChain: Structured data extraction from textRoutingChain: Dynamic routing based on input- Chains compose multiple operations and maintain conversation state
-
Messages (
lib/message.exandlib/message/)Message: Core structure with roles (system, user, assistant, tool)ContentPart: Handles multi-modal content (text, images, files)ToolCallandToolResult: Function invocation and results
-
Functions (
lib/function.ex)- Integrates custom Elixir functions with LLMs
- JSON Schema-based parameter validation
- Context-aware execution with async support
- Behavior-based design: All chat models implement the
ChatModelbehavior - Ecto schemas: Used for data validation and type casting throughout
- Streaming support: Built-in streaming capabilities for real-time responses
- Error handling: Consistent error tuples
{:ok, result}/{:error, reason}
When adding a new LLM provider:
- Create a new module in
lib/chat_models/ - Implement the
ChatModelbehavior - Add corresponding tests in
test/chat_models/ - Update documentation with supported features
When adding new chain types:
- Create in
lib/chains/following existing patterns - Use
Ecto.Schemafor configuration - Implement
run/2function for execution - Add comprehensive tests including async scenarios
- Tests mirror the source structure (e.g.,
lib/chains/llm_chain.ex→test/chains/llm_chain_test.exs) - Use
@tag :live_callfor tests requiring actual API calls - Mock external dependencies using
Mimicfor unit tests - Always test both sync and async execution paths when applicable
-
API Keys: Never commit API keys. Use environment variables via
.envrc -
Live Tests: Be cautious with live tests as they incur API costs
-
Multi-modal: When working with messages, use
ContentPartstructures -
Callbacks: There are two callback systems that get combined into a single list of handler maps:
- LangChain callbacks — keys like
:on_llm_token_usage,:on_message_processed, etc. These are added to the LLMChain viaadd_callback/2and fired byLLMChain.run/2. SeeLangChain.Chains.ChainCallbacksfor valid keys. - Sagents-specific callbacks — currently just
:on_after_middleware, fired manually byAgent.execute/3afterbefore_modelhooks complete. Not a LangChain key.
Both types live in the same list of maps passed via
callbacks: [map1, map2, ...]. AgentServer builds this list by combining PubSub broadcast callbacks (build_pubsub_callbacks/1) with middleware callbacks (Middleware.collect_callbacks/1). All maps fire in fan-out: if two maps have the same key, both handlers fire. Middleware can declare LangChain callbacks via thecallbacks/1behaviour callback. - LangChain callbacks — keys like
- Use
mix precommitalias when you are done with all changes and fix any pending issues - Use the already included and available
:req(Req) library for HTTP requests, avoid:httpoison,:tesla, and:httpc. Req is included by default and is the preferred HTTP client for Phoenix apps
- Always begin your LiveView templates with
<Layouts.app flash={@flash} ...>which wraps all inner content - The
MyAppWeb.Layoutsmodule is aliased in themy_app_web.exfile, so you can use it without needing to alias it again - Anytime you run into errors with no
current_scopeassign:- You failed to follow the Authenticated Routes guidelines, or you failed to pass
current_scopeto<Layouts.app> - Always fix the
current_scopeerror by moving your routes to the properlive_sessionand ensure you passcurrent_scopeas needed
- You failed to follow the Authenticated Routes guidelines, or you failed to pass
- Phoenix v1.8 moved the
<.flash_group>component to theLayoutsmodule. You are forbidden from calling<.flash_group>outside of thelayouts.exmodule - Out of the box,
core_components.eximports an<.icon name="hero-x-mark" class="w-5 h-5"/>component for for hero icons. Always use the<.icon>component for icons, never useHeroiconsmodules or similar - Always use the imported
<.input>component for form inputs fromcore_components.exwhen available.<.input>is imported and using it will will save steps and prevent errors - If you override the default input classes (
<.input class="myclass px-2 py-1 rounded-lg">)) class with your own values, no default classes are inherited, so your custom classes must fully style the input
-
Use Tailwind CSS classes and custom CSS rules to create polished, responsive, and visually stunning interfaces.
-
Tailwindcss v4 no longer needs a tailwind.config.js and uses a new import syntax in
app.css:@import "tailwindcss" source(none); @source "../css"; @source "../js"; @source "../../lib/my_app_web"; -
Always use and maintain this import syntax in the app.css file for projects generated with
phx.new -
Never use
@applywhen writing raw css -
Always manually write your own tailwind-based components instead of using daisyUI for a unique, world-class design
-
Out of the box only the app.js and app.css bundles are supported
- You cannot reference an external vendor'd script
srcor linkhrefin the layouts - You must import the vendor deps into app.js and app.css to use them
- Never write inline <script>custom js</script> tags within templates
- You cannot reference an external vendor'd script
- Produce world-class UI designs with a focus on usability, aesthetics, and modern design principles
- Implement subtle micro-interactions (e.g., button hover effects, and smooth transitions)
- Ensure clean typography, spacing, and layout balance for a refined, premium look
- Focus on delightful details like hover effects, loading states, and smooth page transitions
-
Elixir lists do not support index based access via the access syntax
Never do this (invalid):
i = 0 mylist = ["blue", "green"] mylist[i]Instead, always use
Enum.at, pattern matching, orListfor index based list access, ie:i = 0 mylist = ["blue", "green"] Enum.at(mylist, i) -
Elixir variables are immutable, but can be rebound, so for block expressions like
if,case,cond, etc you must bind the result of the expression to a variable if you want to use it and you CANNOT rebind the result inside the expression, ie:# INVALID: we are rebinding inside the `if` and the result never gets assigned if connected?(socket) do socket = assign(socket, :val, val) end # VALID: we rebind the result of the `if` to a new variable socket = if connected?(socket) do assign(socket, :val, val) end -
Never nest multiple modules in the same file as it can cause cyclic dependencies and compilation errors
-
Never use map access syntax (
changeset[:field]) on structs as they do not implement the Access behaviour by default. For regular structs, you must access the fields directly, such asmy_struct.fieldor use higher level APIs that are available on the struct if they exist,Ecto.Changeset.get_field/2for changesets -
Elixir's standard library has everything necessary for date and time manipulation. Familiarize yourself with the common
Time,Date,DateTime, andCalendarinterfaces by accessing their documentation as necessary. Never install additional dependencies unless asked or for date/time parsing (which you can use thedate_time_parserpackage) -
Don't use
String.to_atom/1on user input (memory leak risk) -
Predicate function names should not start with
is_and should end in a question mark. Names likeis_thingshould be reserved for guards -
Elixir's builtin OTP primitives like
DynamicSupervisorandRegistry, require names in the child spec, such as{DynamicSupervisor, name: MyApp.MyDynamicSup}, then you can useDynamicSupervisor.start_child(MyApp.MyDynamicSup, child_spec) -
Use
Task.async_stream(collection, callback, options)for concurrent enumeration with back-pressure. The majority of times you will want to passtimeout: :infinityas option
- Read the docs and options before using tasks (by using
mix help task_name) - To debug test failures, run tests in a specific file with
mix test test/my_test.exsor run all previously failed tests withmix test --failed mix deps.clean --allis almost never needed. Avoid using it unless you have good reason
- Always use
start_supervised!/1to start processes in tests as it guarantees cleanup between tests - Avoid
Process.sleep/1andProcess.alive?/1in tests-
Instead of sleeping to wait for a process to finish, always use
Process.monitor/1and assert on the DOWN message:ref = Process.monitor(pid) assert_receive {:DOWN, ^ref, :process, ^pid, :normal}
-
Instead of sleeping to synchronize before the next call, always use a synchronous call to ensure the process has handled prior messages. For AgentServer tests, prefer
_ = AgentServer.get_state(agent_id)over:sys.get_state/1since it accepts the agent_id directly and provides the same synchronization guarantee viaGenServer.call
-
IMPORTANT: Consult these usage rules early and often when working with the packages listed below. Before attempting to use any of these packages or to discover if you should use them, review their usage rules to understand the correct patterns, conventions, and best practices.
A dev tool for Elixir projects to gather LLM usage rules from dependencies
Many packages have usage rules, which you should thoroughly consult before taking any action. These usage rules contain guidelines and rules directly from the package authors. They are your best source of knowledge for making decisions.
When looking for docs for modules & functions that are dependencies of the current project,
or for Elixir itself, use mix usage_rules.docs
# Search a whole module
mix usage_rules.docs Enum
# Search a specific function
mix usage_rules.docs Enum.zip
# Search a specific function & arity
mix usage_rules.docs Enum.zip/1
You should also consult the documentation of any tools you are using, early and often. The best
way to accomplish this is to use the usage_rules.search_docs mix task. Once you have
found what you are looking for, use the links in the search results to get more detail. For example:
# Search docs for all packages in the current application, including Elixir
mix usage_rules.search_docs Enum.zip
# Search docs for specific packages
mix usage_rules.search_docs Req.get -p req
# Search docs for multi-word queries
mix usage_rules.search_docs "making requests" -p req
# Search only in titles (useful for finding specific functions/modules)
mix usage_rules.search_docs "Enum.zip" --query-by title
- Use pattern matching over conditional logic when possible
- Prefer to match on function heads instead of using
if/elseorcasein function bodies %{}matches ANY map, not just empty maps. Usemap_size(map) == 0guard to check for truly empty maps
- Use
{:ok, result}and{:error, reason}tuples for operations that can fail - Avoid raising exceptions for control flow
- Use
withfor chaining operations that return{:ok, _}or{:error, _}
- Elixir has no
returnstatement, nor early returns. The last expression in a block is always returned. - Don't use
Enumfunctions on large collections whenStreamis more appropriate - Avoid nested
casestatements - refactor to a singlecase,withor separate functions - Don't use
String.to_atom/1on user input (memory leak risk) - Lists and enumerables cannot be indexed with brackets. Use pattern matching or
Enumfunctions - Prefer
Enumfunctions likeEnum.reduceover recursion - When recursion is necessary, prefer to use pattern matching in function heads for base case detection
- Using the process dictionary is typically a sign of unidiomatic code
- Only use macros if explicitly requested
- There are many useful standard library functions, prefer to use them where possible
- Use guard clauses:
when is_binary(name) and byte_size(name) > 0 - Prefer multiple function clauses over complex conditional logic
- Name functions descriptively:
calculate_total_price/2notcalc/2 - Predicate function names should not start with
isand should end in a question mark. - Names like
is_thingshould be reserved for guards
- Use structs over maps when the shape is known:
defstruct [:name, :age] - Prefer keyword lists for options:
[timeout: 5000, retries: 3] - Use maps for dynamic key-value data
- Prefer to prepend to lists
[new | list]notlist ++ [new]
- Use
mix helpto list available mix tasks - Use
mix help task_nameto get docs for an individual task - Read the docs and options fully before using tasks
- Run tests in a specific file with
mix test test/my_test.exsand a specific test with the line numbermix test path/to/test.exs:123 - Limit the number of failed tests with
mix test --max-failures n - Use
@tagto tag specific tests, andmix test --only tagto run only those tests - Use
assert_raisefor testing expected exceptions:assert_raise ArgumentError, fn -> invalid_function() end - Use
mix help testto for full documentation on running tests
- Use
dbg/1to print values while debugging. This will display the formatted value and other relevant information in the console.
- Keep state simple and serializable
- Handle all expected messages explicitly
- Use
handle_continue/2for post-init work - Implement proper cleanup in
terminate/2when necessary
- Use
GenServer.call/3for synchronous requests expecting replies - Use
GenServer.cast/2for fire-and-forget messages. - When in doubt, use
callovercast, to ensure back-pressure - Set appropriate timeouts for
call/3operations
- Set up processes such that they can handle crashing and being restarted by supervisors
- Use
:max_restartsand:max_secondsto prevent restart loops
- Use
Task.Supervisorfor better fault tolerance - Handle task failures with
Task.yield/2orTask.shutdown/2 - Set appropriate task timeouts
- Use
Task.async_stream/3for concurrent enumeration with back-pressure