-
-
Notifications
You must be signed in to change notification settings - Fork 53
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: ctrl+k to pull a model while keeping any changes #178
Conversation
sammcj
commented
Mar 7, 2025
- ctrl+k to pull a models updates without destroying your changes to parameters
- hopefully a fix for 🐞 Bug: edit some model give "error updating model : error writing file : /tmp/tom_himanen/deepseek-r1-roo-cline-tools:70b" #175
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Auto Pull Request Review from LlamaPReview
1. Overview
1.1 PR Summary
- Business value and requirements alignment: This PR introduces a new feature that allows users to pull model updates while preserving their custom configurations. This aligns with the requirement to provide users with more control over their model management, ensuring that their customizations are not lost during updates.
- Key components modified: The PR modifies
README.md
,app_model.go
,cspell.json
,keymap.go
, andoperations.go
. The primary changes are inapp_model.go
andoperations.go
, where the new logic for pulling models while preserving configurations is implemented. - Impact assessment: The new feature adds complexity to the model management process by introducing a new interaction flow (
ctrl+k
). This requires careful handling of user configurations and robust error management to ensure data integrity. - System dependencies and integration impacts: The changes interact with the Ollama API for pulling models and require logic to extract, store, and re-apply model configurations. This involves reading and potentially modifying local model storage or configuration files.
1.2 Architecture Changes
- System design modifications: The PR introduces a new user interaction flow for model updates, adding a layer of complexity in model management by differentiating between destructive and non-destructive pull operations.
- Component interactions: The new feature interacts with the Ollama API for pulling models and requires logic to extract, store, and re-apply model configurations. This involves reading and potentially modifying local model storage or configuration files.
- Integration points: The integration points include the Ollama API for model pulling and the local file system for managing model configurations.
2. Detailed Technical Analysis
2.1 Code Logic Deep-Dive
Core Logic Changes
- operations.go - startPullModelPreserveConfig
- Submitted PR Code:
// startPullModelPreserveConfig pulls a model while preserving user-modified configuration
func (m *AppModel) startPullModelPreserveConfig(modelName string) tea.Cmd {
return func() tea.Msg {
ctx := context.Background()
// Step 1: Extract current parameters and template before pulling
logging.InfoLogger.Printf("Extracting parameters for model %s before pulling", modelName)
currentParams, currentTemplate, systemPrompt, err := getModelParamsWithSystem(modelName, m.client)
if err != nil {
logging.ErrorLogger.Printf("Error extracting parameters for model %s: %v", modelName, err)
return pullErrorMsg{fmt.Errorf("failed to extract parameters: %v", err)}
}
// Step 2: Pull the updated model
logging.InfoLogger.Printf("Pulling updated model: %s", modelName)
req := &api.PullRequest{Name: modelName}
err = m.client.Pull(ctx, req, func(resp api.ProgressResponse) error {
m.pullProgress = float64(resp.Completed) / float64(resp.Total)
return nil
})
if err != nil {
return pullErrorMsg{err}
}
// Step 3: Apply the saved configuration back to the updated model
logging.InfoLogger.Printf("Restoring configuration for model: %s", modelName)
// Create request with base fields
createReq := &api.CreateRequest{
Model: modelName, // The model to update
From: modelName, // Use the same model name as base (it's now been updated)
}
// Add template if it exists
if currentTemplate != "" {
createReq.Template = currentTemplate
}
// Add system prompt if it exists
if systemPrompt != "" {
createReq.System = systemPrompt
}
// Add parameters if any were found
if len(currentParams) > 0 {
// Convert map[string]string to map[string]any
parameters := make(map[string]any)
for k, v := range currentParams {
// Try to convert numeric values
if floatVal, err := strconv.ParseFloat(v, 64); err == nil {
parameters[k] = floatVal
} else if intVal, err := strconv.Atoi(v); err == nil {
parameters[k] = intVal
} else {
parameters[k] = v
}
}
createReq.Parameters = parameters
}
// Apply the configuration
err = m.client.Create(ctx, createReq, func(resp api.ProgressResponse) error {
return nil
})
if err != nil {
return pullErrorMsg{fmt.Errorf("failed to restore configuration: %v", err)}
}
return pullSuccessMsg{modelName}
}
}
-
Analysis:
- The
startPullModelPreserveConfig
function attempts to preserve user configurations during a model pull. It extracts parameters, template, and system prompt before pulling the model and then attempts to re-apply them usingclient.Create
. - Potential Issue 1: Incomplete Error Handling: While errors are checked after
getModelParamsWithSystem
,client.Pull
, andclient.Create
, the error handling is basic. IfgetModelParamsWithSystem
fails, the function returns, preventing the pull and re-application. However, ifclient.Pull
succeeds butclient.Create
fails, the function also returns apullErrorMsg
. This means that if configuration re-application fails after a successful pull, the user might be left with an updated model but without their configurations, and the error messagefailed to restore configuration
might be misleading as it doesn't clearly indicate that the pull itself was successful. - Potential Issue 2: Overwriting Existing Model: The
client.Create
call uses the samemodelName
for bothModel
andFrom
fields inCreateRequest
. This effectively attempts to re-create the model with the same name, potentially overwriting the just-pulled updated model with a new "layer" of configuration. While this achieves the goal of re-applying config, it's not explicitly clear if this is the most efficient or intended way to update an existing model's configuration. It might be more intuitive to have a dedicated "update configuration" API call if Ollama provides one, or clarify in comments thatCreate
is being used to layer configurations. - Potential Issue 3: Data Type Conversion in Parameters: The code attempts to convert parameter values to
float64
andint
before re-applying them. This is good for numerical parameters. However, it assumes that all parameters that can be converted to numbers should be. This might not always be the case. Some parameters might be intended as strings even if they look like numbers. A more robust approach would be to either have a predefined list of parameters that should be treated as numbers or to avoid automatic type conversion and treat all extracted parameter values as strings during re-application, relying on Ollama to handle type conversion based on the Modelfile schema.
- The
-
LlamaPReview Suggested Improvements:
// startPullModelPreserveConfig pulls a model while preserving user-modified configuration
func (m *AppModel) startPullModelPreserveConfig(modelName string) tea.Cmd {
return func() tea.Msg {
ctx := context.Background()
var originalError error // To track original error for better error reporting
// Step 1: Extract current parameters and template before pulling
logging.InfoLogger.Printf("Extracting parameters for model %s before pulling", modelName)
currentParams, currentTemplate, systemPrompt, err := getModelParamsWithSystem(modelName, m.client)
if err != nil {
logging.ErrorLogger.Printf("Error extracting parameters for model %s: %v", modelName, err)
return pullErrorMsg{fmt.Errorf("failed to extract parameters: %v", err)}
}
// Step 2: Pull the updated model
logging.InfoLogger.Printf("Pulling updated model: %s", modelName)
req := &api.PullRequest{Name: modelName}
err = m.client.Pull(ctx, req, func(resp api.ProgressResponse) error {
m.pullProgress = float64(resp.Completed) / float64(resp.Total)
return nil
})
if err != nil {
originalError = err // Capture pull error
logging.ErrorLogger.Printf("Error pulling model %s: %v", modelName, err)
return pullErrorMsg{fmt.Errorf("failed to pull updated model: %v", err)} // More specific error message
}
// Step 3: Apply the saved configuration back to the updated model
logging.InfoLogger.Printf("Restoring configuration for model: %s", modelName)
// Create request with base fields
createReq := &api.CreateRequest{
Model: modelName, // The model to update
From: modelName, // Use the same model name as base (it's now been updated)
}
// Add template if it exists
if currentTemplate != "" {
createReq.Template = currentTemplate
}
// Add system prompt if it exists
if systemPrompt != "" {
createReq.System = systemPrompt
}
// Add parameters if any were found
if len(currentParams) > 0 {
// Convert map[string]string to map[string]any
parameters := make(map[string]any)
for k, v := range currentParams {
// Keep all parameters as strings to avoid potential type mismatch issues.
parameters[k] = v
}
createReq.Parameters = parameters
}
// Apply the configuration
err = m.client.Create(ctx, createReq, func(resp api.ProgressResponse) error {
return nil
})
if err != nil {
logging.ErrorLogger.Printf("Error restoring configuration for model %s: %v", modelName, err)
if originalError != nil { // If pull also failed, indicate pull failure in message
return pullErrorMsg{fmt.Errorf("failed to restore configuration after pulling model: %v (Original pull error: %v)", err, originalError)}
} else {
return pullErrorMsg{fmt.Errorf("failed to restore configuration: %v (Model update may have succeeded but config not reapplied)", err)} // More informative message
}
}
return pullSuccessMsg{modelName}
}
}
- Improvement rationale:
- Improved Error Reporting: Added a
originalError
variable to track if the pull itself failed. The error message in case ofclient.Create
failure is enhanced to indicate whether the model update might have succeeded even if configuration re-application failed. This provides more clarity to the user about the state of their model. - Simplified Parameter Handling: Removed the automatic type conversion for parameters and now keeps them as strings. This avoids potential issues with incorrect type assumptions and relies on Ollama's API to handle parameter types correctly. This is a safer and more predictable approach.
- Clarity on Overwriting: While the code still uses
client.Create
for re-application, the comments and logging already indicate this is a "restore configuration" step. For further clarity, a comment could be added within theclient.Create
block to explicitly state thatCreate
is being used to layer configurations on top of the pulled model. However, this might be an architectural discussion beyond the scope of this PR review and could be considered as a future improvement if a dedicated "update config" API becomes available in Ollama.
- Improved Error Reporting: Added a
Core Logic Changes
- operations.go - getModelParamsWithSystem
- Submitted PR Code:
// getModelParamsWithSystem extracts parameters, template, and system prompt from a model's modelfile
func getModelParamsWithSystem(modelName string, client *api.Client) (map[string]string, string, string, error) {
logging.InfoLogger.Printf("Getting parameters and system prompt for model: %s\n", modelName)
ctx := context.Background()
req := &api.ShowRequest{Name: modelName}
resp, err := client.Show(ctx, req)
if err != nil {
logging.ErrorLogger.Printf("Error getting modelfile for %s: %v\n", modelName, err)
return nil, "", "", err
}
output := []byte(resp.Modelfile)
lines := strings.Split(strings.TrimSpace(string(output)), "\n")
params := make(map[string]string)
var template string
var system string
inTemplate := false
inMultilineTemplate := false
var templateLines []string
for _, line := range lines {
trimmed := strings.TrimSpace(line)
// Handle TEMPLATE directive
if strings.HasPrefix(trimmed, "TEMPLATE") {
if strings.Contains(trimmed, `"""`) {
// Multi-line template
templateContent := strings.TrimPrefix(trimmed, "TEMPLATE ")
templateContent = strings.TrimSpace(templateContent)
if strings.HasPrefix(templateContent, `"""`) {
templateContent = strings.TrimPrefix(templateContent, `"""`)
}
inTemplate = true
inMultilineTemplate = true
if templateContent != "" {
templateLines = append(templateLines, templateContent)
}
} else {
// Single-line template
template = strings.TrimPrefix(trimmed, "TEMPLATE ")
template = strings.Trim(template, `"`)
}
} else if inTemplate {
if inMultilineTemplate && strings.HasSuffix(trimmed, `"""`) {
line = strings.TrimSuffix(line, `"""`)
if line != "" {
templateLines = append(templateLines, line)
}
inTemplate = false
inMultilineTemplate = false
} else {
templateLines = append(templateLines, line)
}
} else if strings.HasPrefix(trimmed, "SYSTEM") {
system = strings.TrimPrefix(trimmed, "SYSTEM ")
// Remove surrounding quotes if present
system = strings.Trim(system, `"`)
} else if strings.HasPrefix(trimmed, "PARAMETER") {
parts := strings.SplitN(trimmed, " ", 3)
if len(parts) >= 3 {
key := parts[1]
value := strings.TrimSpace(parts[2])
params[key] = value
}
}
}
if len(templateLines) > 0 {
template = strings.Join(templateLines, "\n")
}
return params, template, system, nil
}
-
Analysis:
- The
getModelParamsWithSystem
function parses a Modelfile string to extractPARAMETER
,TEMPLATE
, andSYSTEM
directives. - Potential Issue 1: Inconsistent Template Handling: The code handles single-line and multi-line templates differently. For single-line templates, it removes surrounding quotes using
strings.Trim(template,
"). However, for multi-line templates, it does not perform similar quote removal from each line within
templateLines. If a multi-line template in a Modelfile uses quotes within the
"""` block, these quotes will be preserved in the extracted template, which might not be intended. It's important to ensure consistent handling of quotes for both single-line and multi-line templates. - Potential Issue 2: Lack of Robustness in Directive Parsing: The parsing logic relies on simple
strings.HasPrefix
andstrings.SplitN
. It might be vulnerable to variations in Modelfile syntax, such as extra whitespace around directives or inconsistent casing (though Modelfile directives are typically uppercase). While perfect Modelfile validation might be overkill, the parsing could be made slightly more robust by trimming whitespace more aggressively before checking prefixes and potentially converting directive names to uppercase for case-insensitive matching. - Potential Issue 3: Missing Directive Handling: The code only extracts
PARAMETER
,TEMPLATE
, andSYSTEM
. Modelfiles can contain other directives likeFROM
,ADAPTER
,LICENSE
,DISTRIBUTION
, etc. While this function is specifically designed to extract configuration parameters, it might be beneficial to either:- Document clearly that this function only extracts these three directives and will ignore others.
- If there's a possibility of needing to preserve other directives in the future (even if not re-applied), the function could be extended to at least identify and store unknown directives, even if their values are not parsed. This would make the function more forward-compatible with potential Modelfile extensions.
- The
-
LlamaPReview Suggested Improvements:
// getModelParamsWithSystem extracts parameters, template, and system prompt from a model's modelfile
func getModelParamsWithSystem(modelName string, client *api.Client) (map[string]string, string, string, error) {
logging.InfoLogger.Printf("Getting parameters and system prompt for model: %s\n", modelName)
ctx := context.Background()
req := &api.ShowRequest{Name: modelName}
resp, err := client.Show(ctx, req)
if err != nil {
logging.ErrorLogger.Printf("Error getting modelfile for %s: %v\n", modelName, err)
return nil, "", "", err
}
output := []byte(resp.Modelfile)
lines := strings.Split(strings.TrimSpace(string(output)), "\n")
params := make(map[string]string)
var template string
var system string
inTemplate := false
inMultilineTemplate := false
var templateLines []string
for _, line := range lines {
trimmed := strings.TrimSpace(line)
trimmedUpper := strings.ToUpper(trimmed) // Convert to uppercase for case-insensitive matching
// Handle TEMPLATE directive
if strings.HasPrefix(trimmedUpper, "TEMPLATE") {
templateContent := strings.TrimPrefix(trimmed, "TEMPLATE ") // Keep original case for content
templateContent = strings.TrimSpace(templateContent)
if strings.Contains(templateContent, `"""`) {
// Multi-line template
if strings.HasPrefix(templateContent, `"""`) {
templateContent = strings.TrimPrefix(templateContent, `"""`)
}
inTemplate = true
inMultilineTemplate = true
if templateContent != "" {
templateLines = append(templateLines, templateContent)
}
} else {
// Single-line template - Consistent quote removal
template = strings.Trim(templateContent, `"`) // Trim quotes consistently
}
} else if inTemplate {
if inMultilineTemplate && strings.HasSuffix(trimmed, `"""`) {
line = strings.TrimSuffix(line, `"""`)
// No quote removal from individual lines in multiline template - preserve as is.
if line != "" {
templateLines = append(templateLines, line)
}
inTemplate = false
inMultilineTemplate = false
} else {
templateLines = append(templateLines, line)
}
} else if strings.HasPrefix(trimmedUpper, "SYSTEM") {
systemContent := strings.TrimPrefix(trimmed, "SYSTEM ") // Keep original case for content
// Remove surrounding quotes if present - Consistent quote removal
system = strings.Trim(systemContent, `"`) // Trim quotes consistently
} else if strings.HasPrefix(trimmedUpper, "PARAMETER") {
parts := strings.SplitN(trimmed, " ", 3)
if len(parts) >= 3 {
key := strings.TrimSpace(parts[1]) // Trim key and value
value := strings.TrimSpace(parts[2])
params[key] = value
}
}
// Optionally handle other directives here if future preservation is needed.
// else if strings.HasPrefix(trimmedUpper, "FROM") { ... }
}
if len(templateLines) > 0 {
template = strings.Join(templateLines, "\n")
}
return params, template, system, nil
}
- Improvement rationale:
- Consistent Quote Removal: Modified template and system prompt handling to use
strings.Trim(content,
")
for both single-line templates and system prompts, ensuring consistent removal of surrounding quotes. Multi-line template lines are now preserved as is, without individual quote trimming within the lines, as this is generally how multi-line string literals are intended to be handled. - Case-Insensitive Directive Matching: Converted directive prefix checks to be case-insensitive by using
strings.ToUpper(trimmed)
andstrings.HasPrefix(trimmedUpper, "DIRECTIVE")
. This makes the parsing more robust against variations in Modelfile casing. - Improved Whitespace Handling: Added
strings.TrimSpace
to thekey
andvalue
when parsingPARAMETER
directives to handle potential leading/trailing whitespace around parameter names and values, improving robustness. - Future Directive Handling (Comment): Added a comment suggesting how to extend the function to handle other Modelfile directives if future requirements include preserving or processing them. This addresses the "Missing Directive Handling" point by providing a clear path for future extensibility and documenting the current scope.
- Consistent Quote Removal: Modified template and system prompt handling to use
2.2 Implementation Quality
- Code organization and structure: The code is well-organized, with clear separation of concerns. The
startPullModelPreserveConfig
function handles the main logic for pulling a model while preserving configurations, andgetModelParamsWithSystem
is responsible for parsing the Modelfile. - Design patterns usage: The code follows a clear pattern of extracting configurations, pulling the model, and then re-applying the configurations. This pattern is easy to follow and understand.
- Error handling approach: The error handling is basic but present. Errors are logged and returned, but there is room for improvement in providing more informative error messages and handling edge cases.
- Resource management: The code efficiently manages resources by using temporary files and ensuring they are cleaned up after use.
3. Critical Findings
3.1 Potential Issues
-
🔴 Critical Issues
-
Incomplete Error Handling: The current error handling in
startPullModelPreserveConfig
does not clearly indicate the state of the model if configuration re-application fails after a successful pull. This could lead to user confusion and potential data loss.- Impact: Users might be left with an updated model but without their configurations, leading to unexpected behavior.
- Recommendation: Improve error reporting to clearly indicate the state of the model and the configuration re-application process.
-
Overwriting Existing Model: The use of
client.Create
to re-apply configurations might not be the most efficient or intended way to update an existing model's configuration. This could lead to unintended overwriting of the model.- Impact: Potential data loss or unintended model behavior due to overwriting.
- Recommendation: Consider using a dedicated "update configuration" API call if available, or clarify in comments that
Create
is being used to layer configurations.
-
Data Type Conversion in Parameters: The automatic type conversion of parameters might not always be appropriate. Some parameters might be intended as strings even if they look like numbers.
- Impact: Incorrect parameter types could lead to unexpected model behavior.
- Recommendation: Avoid automatic type conversion and treat all extracted parameter values as strings during re-application, relying on Ollama to handle type conversion based on the Modelfile schema.
-
-
🟡 Warnings
-
Inconsistent Template Handling: The current handling of single-line and multi-line templates is inconsistent. Multi-line templates do not have consistent quote removal, which might lead to unexpected behavior.
- Potential risks: Incorrect template handling could lead to unexpected model behavior.
- Suggested improvements: Ensure consistent quote removal for both single-line and multi-line templates.
-
Lack of Robustness in Directive Parsing: The parsing logic relies on simple
strings.HasPrefix
andstrings.SplitN
, which might be vulnerable to variations in Modelfile syntax.- Potential risks: Incorrect parsing could lead to missed configurations or incorrect model behavior.
- Suggested improvements: Make the parsing more robust by trimming whitespace more aggressively and converting directive names to uppercase for case-insensitive matching.
-
Missing Directive Handling: The code only extracts
PARAMETER
,TEMPLATE
, andSYSTEM
. Other directives in the Modelfile are ignored.- Potential risks: Ignoring other directives might lead to incomplete configuration preservation.
- Suggested improvements: Document clearly that this function only extracts these three directives and will ignore others. Consider extending the function to handle other directives if future requirements include preserving or processing them.
-
3.2 Code Quality Concerns
- Maintainability aspects: The code is generally maintainable, but there is room for improvement in error handling and robustness.
- Readability issues: The code is readable, but some areas could benefit from more detailed comments and documentation.
- Performance bottlenecks: No significant performance bottlenecks were identified.
4. Security Assessment
- Authentication/Authorization impacts: The changes do not impact authentication or authorization.
- Data handling concerns: The code handles model configurations, which might contain sensitive data. Ensure that the configurations are stored and transmitted securely.
- Input validation: The code does not perform input validation on the Modelfile content. Ensure that the Modelfile content is validated before processing to prevent potential security risks.
- Security best practices: Follow security best practices for handling and storing model configurations.
- Potential security risks: The editing of Modelfiles in a temp directory warrants a quick security check to ensure no unintended file exposure.
- Mitigation strategies: Ensure that temporary files are securely managed and cleaned up after use.
- Security testing requirements: Perform security testing to ensure that model configurations are handled securely and that there are no unintended file exposures.
5. Testing Strategy
5.1 Test Coverage
- Unit test analysis: The PR includes unit tests for the new functionality. Ensure that the tests cover various configuration scenarios and edge cases.
- Integration test requirements: Perform integration tests to verify the end-to-end flow of pulling a model with config preservation, ensuring the Ollama API interactions and local state management are correct.
- Edge cases coverage: Test edge cases such as invalid Modelfile content, partial configuration extraction, and errors during model pulling.
5.2 Test Recommendations
Suggested Test Cases
// Sample test code for unit tests
func TestGetModelParamsWithSystem(t *testing.T) {
// Test cases for getModelParamsWithSystem function
// Include tests for various Modelfile contents and edge cases
}
func TestStartPullModelPreserveConfig(t *testing.T) {
// Test cases for startPullModelPreserveConfig function
// Include tests for successful and failed configuration extraction, model pulling, and configuration re-application
}
- Coverage improvements: Ensure that the tests cover all possible scenarios and edge cases.
- Performance testing needs: Perform performance testing to ensure that the configuration preservation process does not introduce significant delays.
6. Documentation & Maintenance
- Documentation updates needed: Update the documentation to reflect the new feature and its usage. Include detailed instructions on how to use the
ctrl+k
key to pull a model while preserving configurations. - Long-term maintenance considerations: Consider the long-term maintenance of the new feature, including potential changes in the Modelfile format and the need to update the configuration extraction logic accordingly.
- Technical debt and monitoring requirements: Monitor the feature for any issues or errors and address them promptly to minimize technical debt.
7. Deployment & Operations
- Deployment impact and strategy: The changes require careful deployment to ensure that the new feature does not introduce any issues. Perform thorough testing in a staging environment before deploying to production.
- Key operational considerations: Monitor the feature for any issues or errors during operation and address them promptly to ensure smooth user experience.
8. Summary & Recommendations
8.1 Key Action Items
- Improve error handling in
startPullModelPreserveConfig
to provide more informative error messages and handle edge cases. - Consider using a dedicated "update configuration" API call if available, or clarify in comments that
Create
is being used to layer configurations. - Avoid automatic type conversion of parameters and treat all extracted parameter values as strings during re-application.
- Ensure consistent quote removal for both single-line and multi-line templates.
- Make the parsing logic more robust by trimming whitespace more aggressively and converting directive names to uppercase for case-insensitive matching.
- Document clearly that the
getModelParamsWithSystem
function only extractsPARAMETER
,TEMPLATE
, andSYSTEM
directives and will ignore others. Consider extending the function to handle other directives if future requirements include preserving or processing them.
8.2 Future Considerations
- Technical evolution path: Consider the long-term maintenance of the new feature, including potential changes in the Modelfile format and the need to update the configuration extraction logic accordingly.
- Business capability evolution: The new feature provides users with more control over their model management, ensuring that their customizations are not lost during updates. This aligns with the business requirement to provide a better user experience.
- System integration impacts: The changes interact with the Ollama API for pulling models and require logic to extract, store, and re-apply model configurations. Ensure that the integration points are well-documented and maintained.
💡 Help Shape LlamaPReview
How's this review format working for you? Vote in our Github Discussion Polls to help us improve your review experience!