source/web-app/poet/generate-sequences.rst
Lines changed: 72 additions & 15 deletions
@@ -23,22 +23,22 @@ A note on the *Random seed* setting: this determines the state of the random num
23
23
Generating a sequence
24
24
---------------------
25
25
26
-
Navigate to the tool by opening the **PoET** dropdown menu, then selecting **Generate Sequences.**
26
+
Navigate to the tool by opening the **PoET** dropdown menu, then selecting **Generate Sequences.** You can choose the model used to run the job. We recommend using PoET-2 for most use cases.
27
27
28
-
Set your parameters to control sampling behavior. In particular, **temperature**, **top-p**, and **top-k** provide the ability to focus sampling around highly likely sequences. We recommend that you use either top-p or top-k on a given job, not both.
**Build Custom Context**: If you want to create a custom context, please refer to `Creating a Context <./prompts.rst#creating-a-context>`_.
38
38
39
-
If you have an existing prompt, you may select it. Alternatively, add your custom MSA to the **Prompt Definition** field. You can input the sequence(s) directly, or upload an existing .fa, .fasta, or .csv file.
39
+
**Build from MSA**: If you have an existing prompt, you may select it. Alternatively, add your custom MSA to the **Prompt Definition** field. You can input the sequence(s) directly, or upload an existing .fa, .fasta, or .csv file.
40
40
41
-
If you do not have an existing MSA, input your target protein and select **Single Sequence**. OpenProtein will generate an MSA using a homology search against Uniref using mmseqs2 with default settings from ColabFold, then use the MSA to create a prompt. Please note that if you select **Single Sequence** when multiple sequences are entered, sequences after the first are ignored.
41
+
If you already have an MSA, select **Upload MSA**. If you do not, select **Run Homology Search Using a Seed Sequence**: OpenProtein will generate an MSA using a homology search against UniRef using MMseqs2 with the default settings from ColabFold, then use the MSA to create a prompt. Please note that if multiple sequences are entered for the seed, sequences after the first are ignored.
42
42
43
43
Choose the number of prompts to ensemble. Select 1 to sample a single prompt, or increase the diversity of generated outputs by ensembling over 2-15 prompts. We suggest using 3-5 prompts.
44
44
@@ -50,23 +50,80 @@ Set sampling method fields. We suggest you start with the default settings, then
You're ready to generate a custom sequence! Select **Run.** The job may take a few minutes depending on how busy the service is, how long your sequences are, and how many sequences you want to score.
53
+
54
+
Step 3: Sampling Settings
55
+
^^^^^^^^^^^^^^^^^^^^^^^^^^
56
+
57
+
Set your parameters to control sampling behavior. In particular, **temperature**, **top-p**, and **top-k** provide the ability to focus sampling around highly likely sequences. We recommend that you use either top-p or top-k on a given job, not both. You can also choose the default structure prediction model used to predict structures for the generated sequences after the job completes.
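
The effect of these three settings can be illustrated with a minimal Python sketch. It is not the PoET implementation: the function names, the toy logits, and the order in which the filters are applied are assumptions made for illustration only.

```python
import math
import random


def filter_distribution(logits, temperature=1.0, top_k=None, top_p=None):
    """Return renormalized token probabilities after temperature scaling
    and optional top-k / top-p truncation (illustrative sketch only)."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    if top_k is not None and top_k < 1:
        raise ValueError("top_k must be at least 1")
    if top_p is not None and not (0 < top_p <= 1):
        raise ValueError("top_p must be in (0, 1]")

    # Temperature scaling followed by a numerically stable softmax.
    scaled = {tok: value / temperature for tok, value in logits.items()}
    peak = max(scaled.values())
    weights = {tok: math.exp(value - peak) for tok, value in scaled.items()}
    total = sum(weights.values())
    ranked = sorted(
        ((tok, w / total) for tok, w in weights.items()),
        key=lambda pair: pair[1],
        reverse=True,
    )

    # top-k: keep only the k most likely tokens.
    if top_k is not None:
        ranked = ranked[:top_k]

    # top-p: keep the smallest prefix whose cumulative probability reaches top_p.
    if top_p is not None:
        kept, cumulative = [], 0.0
        for tok, prob in ranked:
            kept.append((tok, prob))
            cumulative += prob
            if cumulative >= top_p:
                break
        ranked = kept

    norm = sum(prob for _, prob in ranked)
    return {tok: prob / norm for tok, prob in ranked}


def sample_token(probs, rng):
    """Draw one token from the filtered distribution."""
    tokens = list(probs)
    return rng.choices(tokens, weights=[probs[t] for t in tokens], k=1)[0]


# Hypothetical logits for four amino acids at one position.
toy_logits = {"A": 2.0, "G": 1.0, "L": 0.0, "W": -1.0}
print(filter_distribution(toy_logits, top_k=2))
print(sample_token(filter_distribution(toy_logits, temperature=0.5), random.Random(0)))
```

Lower temperatures and smaller top-k or top-p values concentrate probability on the most likely residues, which is why combining top-p and top-k in one job tends to be redundant.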
You're ready to generate a custom sequence! Click **Run.** The job may take a few minutes depending on how busy the service is, how long your sequences are, and how many sequences you want to score.
54
64
55
65
A 400 (Bad request) error code may be due to the following:
- Re-upload the prompt and try again. Refer to the article about `prompts <./prompts.rst>`_. Ensure the minimum and maximum similarity parameters are not filtering out all sequences in the prompt.
78
+
* - Invalid user input in align service
79
+
- Ensure your input does not include:
80
+
81
+
- a top_p value greater than 1
82
+
- an invalid amino acid character
83
+
- a maximum similarity lower than the minimum similarity
84
+
If necessary, refer to the article on `sampling parameters <./prompts.rst#prompt-sampling-definitions>`_.
85
+
* - Invalid MSA (not aligned, etc.)
86
+
- - Make sure your MSAs are aligned, and rebuild the MSA if necessary.
87
+
- If you have uploaded a pre-computed MSA, confirm that the formatting is correct and that all sequences are of equal length (use gap tokens “-”).
88
+
- If you are building from a seed sequence, try rebuilding the MSA.
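
The checks listed in the table above can be run locally before submitting a job. The following Python sketch is a hypothetical pre-flight helper, not part of the OpenProtein.AI service or client. It assumes the alphabet is the 20 standard amino acids plus the gap token, which may be stricter than what the service accepts, and the parameter names are illustrative.

```python
# Assumption: 20 standard amino acids; the service may accept other symbols.
VALID_RESIDUES = set("ACDEFGHIKLMNPQRSTVWY")
GAP = "-"


def check_msa(sequences):
    """Return a list of problems found in an MSA; an empty list means it passed."""
    if not sequences:
        return ["MSA is empty"]
    problems = []
    width = len(sequences[0])
    for index, seq in enumerate(sequences):
        # Aligned sequences must all have the same length (gaps included).
        if len(seq) != width:
            problems.append(
                f"sequence {index} has length {len(seq)}, expected {width}"
            )
        # Anything outside the assumed alphabet is reported.
        bad = sorted(set(seq.upper()) - VALID_RESIDUES - {GAP})
        if bad:
            problems.append(
                f"sequence {index} contains invalid characters: {''.join(bad)}"
            )
    return problems


def check_sampling_inputs(top_p, min_similarity, max_similarity):
    """Return a list of problems with the (hypothetically named) parameters."""
    problems = []
    if top_p is not None and top_p > 1:
        problems.append("top_p must not exceed 1")
    if max_similarity < min_similarity:
        problems.append("maximum similarity is below minimum similarity")
    return problems


print(check_msa(["MKV-L", "MKVAL"]))
print(check_msa(["MKVL", "MKVAL"]))
print(check_sampling_inputs(1.2, 0.8, 0.3))
```

Returning a list of messages rather than raising on the first failure lets you see every issue in one pass before re-uploading.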
89
+
59
90
60
91
Please contact `OpenProtein.AI support <https://www.openprotein.ai/contact>`_ if the suggested solutions don't resolve the issue.
61
92
62
93
Interpreting your results
63
94
-------------------------
64
95
65
-
Your results are presented as a table, with each generated sequence being assigned a log-likelihood score. This score indicates the fitness of the generated sequence compared to the prompt. The higher the log-likelihood score, the more fit the sequence.
96
+
The PoET history page allows you to view and access past jobs, sorted by created date, job type, and status. Clicking the job ID will take you to the results page for that job.
97
+
98
+
**Results Table**
99
+
100
+
Your results are presented in a table, with each generated sequence assigned a log-likelihood score. This score reflects how well the generated sequence fits the prompt: a higher score indicates a better fit.
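
To make the direction of the score concrete, here is a small Python sketch. It assumes the score is the sum of per-residue log probabilities, as is typical for autoregressive sequence models; the source does not state exactly how PoET computes it, and the probabilities and sequence names below are invented for illustration.

```python
import math


def sequence_log_likelihood(residue_probabilities):
    """Sum of log probabilities, one per residue (assumed scoring scheme)."""
    return sum(math.log(p) for p in residue_probabilities)


def rank_by_score(scored_sequences):
    """Sort (name, log_likelihood) pairs from best fit to worst fit."""
    return sorted(scored_sequences, key=lambda item: item[1], reverse=True)


# Hypothetical per-residue probabilities for two short sequences.
confident = sequence_log_likelihood([0.9, 0.8, 0.9])
uncertain = sequence_log_likelihood([0.2, 0.1, 0.3])
print(rank_by_score([("uncertain", uncertain), ("confident", confident)]))
```

Log-likelihoods are negative, so a higher score means a value closer to zero.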
101
+
102
+
Right-click any sequence to access local fitness landscapes through the **Run Substitution Analysis** menu. You can also sort your results and export them using the **Export** button.
Once the structure prediction job completes, the structure viewer will appear on the right side of the page. The prediction model can be changed via the **Structure Predictor** dropdown located above the viewer. Hover over a sequence in the results table to preview and compare its structure against the query structure. The viewer also displays key metrics, including:
110
+
111
+
- **pLDDT (predicted Local Distance Difference Test)**: A per-residue confidence score (commonly scaled from 0–100 or 0.0–1.0) indicating how reliable each residue's predicted position is.
112
+
113
+
- **RMSD (Root Mean Square Deviation)**: A measure of structural similarity between two molecules, typically comparing backbone atoms. Lower RMSD values indicate greater structural similarity.
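
As a rough illustration of the RMSD metric, the Python sketch below computes it for two equally sized lists of 3D coordinates. It assumes the structures are already superimposed and the atoms are paired in order; it does not perform the optimal alignment step that structure comparison tools typically apply first, and it is not the viewer's implementation.

```python
import math


def rmsd(coords_a, coords_b):
    """Root mean square deviation between two lists of paired (x, y, z) points.

    Assumes the two structures are already superimposed.
    """
    if not coords_a or len(coords_a) != len(coords_b):
        raise ValueError("coordinate lists must be non-empty and of equal length")
    squared_sum = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        squared_sum += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(squared_sum / len(coords_a))


# Hypothetical backbone atom positions, in angstroms.
reference = [(0.0, 0.0, 0.0), (0.0, 0.0, 0.0)]
predicted = [(1.0, 0.0, 0.0), (0.0, 1.0, 0.0)]
print(rmsd(reference, predicted))
```

Identical coordinate sets give an RMSD of 0, and the value grows as the paired atoms move further apart.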
114
+
115
+
Click a sequence to expand the structure viewer, which will overlay the results table. In this view, you can examine detailed metrics and sequence-to-prompt alignment for the selected sequence.
By default, the query structure is used as the reference structure for structural comparisons. You can change it by selecting an alternative structure via the **Edit Reference Sequence** button, accessible from **Settings** in the results table.
66
123
67
-
Access local fitness landscapes using the substitution analysis buttons for each sequence. You can sort your results and download them using the **Export** button.
Note: The PoET history page allows you to view and access past jobs, sorted by created date, job type, and status. Clicking the job ID will take you to the results page for that job.
source/web-app/poet/rank-sequences.rst
Lines changed: 1 addition & 1 deletion
@@ -52,7 +52,7 @@ A 400 (Bad request) error code may be due to the following:
52
52
* - Invalid prompt in PoET service
53
53
- Re-upload the prompt and try again. Refer to the article about `prompts <./prompts.rst>`_. Ensure the minimum and maximum similarity parameters are not filtering out all sequences in the prompt.
source/web-app/poet/substitution-analysis.rst
Lines changed: 2 additions & 1 deletion
@@ -46,7 +46,8 @@ A 400 (Bad request) error code may be due to the following:
46
46
* - Invalid prompt in PoET service
47
47
- Re-upload the prompt and try again. Refer to the article about `prompts <./prompts.rst>`_. Ensure the minimum and maximum similarity parameters are not filtering out all sequences in the prompt.