Commit 0406bcc

Merge remote-tracking branch 'origin/phap/update-poet' into develop
2 parents 6647e1b + 0577d48 commit 0406bcc

10 files changed: 76 additions, 17 deletions

.gitignore

Lines changed: 1 addition & 0 deletions
@@ -6,3 +6,4 @@
 # pixi environments
 .pixi
 *.egg-info
+.DS_Store
Six binary files (15.7 KB, 594 KB, 358 KB, 112 KB, 55.1 KB, 58.7 KB); contents not shown.

source/web-app/poet/generate-sequences.rst

Lines changed: 72 additions & 15 deletions
@@ -23,22 +23,22 @@ A note on the *Random seed* setting: this determines the state of the random num
 Generating a sequence
 ---------------------
 
-Navigate to the tool by opening the **PoET** dropdown menu, then selecting **Generate Sequences.**
+Navigate to the tool by opening the **PoET** dropdown menu, then selecting **Generate Sequences.** You can choose the model used to run the job. We recommend using PoET-2 for most use cases.
 
-Set your parameters to control sampling behavior. In particular, **temperature**, **top-p**, and **top-k** provide the ability to focus sampling around highly likely sequences. We recommend that you use either top-p or top-k on a given job, not both.
+Step 1: Prompt Query
+^^^^^^^^^^^^^^^^^^^^^
 
-.. image:: ../../_static/tools/poet/sampling-parameters.png
-:alt: Sampling Parameters
+Refer to `Creating a Query <./prompts.rst#creating-a-query>`_ to learn about Prompt Query.
 
-You can choose the model used to run the job. We recommend using PoET-2 for most use cases.
 
-.. image:: ../../_static/tools/poet/choose-model.png
-:alt: Choose Model
+Step 2: Prompt Context
+^^^^^^^^^^^^^^^^^^^^^^^
 
+**Build Custom Context**: If you want to create a custom context, please refer to `Creating a Context <./prompts.rst#creating-a-context>`_.
 
-If you have an existing prompt, you may select it. Alternatively, add your custom MSA to the **Prompt Definition** field. You can input the sequence(s) directly, or upload an existing .fa, .fasta, or .csv file.
+**Build from MSA**: If you have an existing prompt, you may select it. Alternatively, add your custom MSA to the **Prompt Definition** field. You can input the sequence(s) directly, or upload an existing .fa, .fasta, or .csv file.
 
-If you do not have an existing MSA, input your target protein and select **Single Sequence**. OpenProtein will generate an MSA using a homology search against Uniref using mmseqs2 with default settings from ColabFold, then use the MSA to create a prompt. Please note that if you select **Single Sequence** when multiple sequences are entered, sequences after the first are ignored.
+If you do not have an existing MSA, you can select **Upload MSA**. If you select **Run Homology Search Using a Seed Sequence**, OpenProtein will generate an MSA using a homology search against Uniref using mmseqs2 with default settings from ColabFold, then use the MSA to create a prompt. Please note that when multiple sequences are entered, sequences after the first are ignored.
 
 Choose the number of prompts to ensemble. Select 1 to sample a single prompt, or increase the diversity of generated outputs by ensembling over 2-15 prompts. We suggest using 3-5 prompts.
 
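The **Build from MSA** text in this hunk accepts uploaded .fa or .fasta files, and the troubleshooting table later in the same file asks that pre-computed MSA rows be of equal length, padded with "-" gap tokens. A minimal local pre-check might look like the sketch below. The helper names `parse_fasta` and `msa_problems` are hypothetical and are not part of any OpenProtein.AI API; the sketch only compares row lengths and does not judge alignment quality.

```python
def parse_fasta(text):
    """Parse FASTA text into a list of (name, sequence) tuples."""
    records = []
    name, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if name is not None:
                records.append((name, "".join(chunks)))
            name, chunks = line[1:].strip(), []
        elif name is not None:
            chunks.append(line)
    if name is not None:
        records.append((name, "".join(chunks)))
    return records


def msa_problems(records):
    """Return a list of problems; an empty list means all rows share one length."""
    if not records:
        return ["no sequences found"]
    width = len(records[0][1])  # the first row defines the expected width
    problems = []
    for name, seq in records:
        if len(seq) != width:
            problems.append(f"{name}: length {len(seq)} != {width}")
    return problems


# Hypothetical three-row alignment; the last row is missing a gap token.
example = ">seed\nMKT-AYI\n>hit1\nMKTQAYI\n>hit2\nMKTAYI\n"
print(msa_problems(parse_fasta(example)))
```

Running a check like this before uploading may help avoid the "Invalid MSA" case listed in the troubleshooting table, although the service's own validation rules are not documented here and could be stricter.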
@@ -50,23 +50,80 @@ Set sampling method fields. We suggest you start with the default settings, then
 .. image:: ../../_static/tools/poet/sampling-methods.png
 :alt: Sampling Methods
 
-You're ready to generate a custom sequence! Select **Run.** The job may take a few minutes depending on how busy the service is, how long your sequences are, and how many sequences you want to score.
+
+Step 3: Sampling Settings
+^^^^^^^^^^^^^^^^^^^^^^^^^^
+
+Set your parameters to control sampling behavior. In particular, **temperature**, **top-p**, and **top-k** provide the ability to focus sampling around highly likely sequences. We recommend that you use either top-p or top-k on a given job, not both. You can choose the default structure prediction model to generate the sequence structures after the job completes.
+
+.. image:: ../../_static/tools/poet/sampling-parameters.png
+:alt: Sampling Parameters
+
+
+You're ready to generate a custom sequence! Click **Run.** The job may take a few minutes depending on how busy the service is, how long your sequences are, and how many sequences you want to score.
 
 A 400 (Bad request) error code may be due to the following:
 
-.. image:: ../../_static/tools/poet/results-table.png
-:alt: Results Table
+.. list-table::
+:header-rows: 1
+:widths: 20 20
+:align: left
+
+* - Issue description
+- Solution
+* - Invalid PoET Job or Parent
+- Re-enter prompt and try again.
+* - Invalid prompt in PoET service
+- Reupload prompt and try again. Refer to the article about `prompts <./prompts.rst>`_. Ensure minimum and maximum similarity parameters are not filtering out all sequences in prompt.
+* - Invalid user input in align service
+- Ensure you don't have
+
+- a top_p>1
+- a non-valid amino acid
+- Maximum similarity < minimum similarity
+If necessary, refer to the article on `sampling parameters <./prompts.rst#prompt-sampling-definitions>`_.
+* - Invalid MSA (not aligned, etc)
+- - Make sure your MSAs are aligned and rebuild MSA if necessary.
+- If you have uploaded pre-computed MSA, confirm that formatting is correct and sequences are of equal length (use gap tokens “-”).
+- If you are building from a seed sequence, try rebuilding the MSA
+
 
 Please contact `OpenProtein.AI support <https://www.openprotein.ai/contact>`_ if the suggested solutions don't resolve the issue.
 
 Interpreting your results
 -------------------------
 
-Your results are presented as a table, with each generated sequence being assigned a log-likelihood score. This score indicates the fitness of the generated sequence compared to the prompt. The higher the log-likelihood score, the more fit the sequence.
+The PoET history page allows you to view and access past jobs, sorted by created date, job type, and status. Clicking the job ID will take you to the results page for that job.
+
+**Results Table**
+
+Your results are presented in a table, with each generated sequence assigned a log-likelihood score. This score reflects how well the generated sequence fits the prompt: a higher score indicates a better fit.
+
+Right-click any sequence to access local fitness landscapes through the **Run Substitution Analysis** menu. You can also sort your results and export them using the **Export** button.
+
+.. image:: ../../_static/tools/poet/results-table.png
+:alt: Results Table
+
+**Structure Prediction and Comparison**
+
+Once the structure prediction job completes, the structure viewer will appear on the right side of the page. The prediction model can be changed via the **Structure Predictor** dropdown located above the viewer. Hover over a sequence in the results table to preview and compare its structure against the query structure. The viewer also displays key metrics, including:
+
+- **pLDDT (predicted Local Distance Difference Test)**: A per-residue confidence score (commonly scaled from 0–100 or 0.0–1.0) indicating how reliable each residue's predicted position is.
+
+- **RMSD (Root Mean Square Deviation)**: A measure of structural similarity between two molecules, typically comparing backbone atoms. Lower RMSD values indicate greater structural similarity.
+
+Click a sequence to expand the structure viewer, which will overlay the results table. In this view, you can examine detailed metrics and sequence-to-prompt alignment for the selected sequence.
+
+.. image:: ../../_static/tools/poet/results-table-expanded.png
+:alt: Sequence structure viewer
+
+**Change Reference Structure**
+
+By default, the query structure is used as the reference structure for structural comparisons. You can change it by selecting an alternative structure via the **Edit Reference Sequence** button, accessible from the results table **Settings** in the results table.
 
-Access local fitness landscapes using the substitution analysis buttons for each sequence. You can sort your results and download them using the **Export** button.
+.. image:: ../../_static/tools/poet/results-table-ref-structure.png
+:alt: Change reference structure
 
-Note: The PoET history page allows you to view and access past jobs, sorted by created date, job type, and status. Clicking the job ID will take you to the results page for that job.
 
 Fine-tuning your results
 ------------------------
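
The "Sampling Settings" text in this hunk says that **temperature**, **top-p**, and **top-k** focus sampling around highly likely sequences. As a rough illustration of what these three knobs conventionally do to a single next-token distribution, here is a generic sketch. It is not PoET's implementation, and `filter_probs` and `sample` are hypothetical names chosen for this example.

```python
import math
import random


def filter_probs(logits, temperature=1.0, top_k=None, top_p=None):
    """Return {token_index: probability} after temperature, top-k, and top-p filtering."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    # Temperature rescales the logits: values below 1 sharpen the distribution.
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    weights = [math.exp(x - peak) for x in scaled]  # subtract the max for stability
    total = sum(weights)
    probs = [w / total for w in weights]

    # Rank token indices from most to least likely.
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    if top_k is not None:
        ranked = ranked[:top_k]  # keep only the k most likely tokens
    if top_p is not None:
        kept, cumulative = [], 0.0
        for i in ranked:
            kept.append(i)
            cumulative += probs[i]
            if cumulative >= top_p:  # smallest prefix whose mass reaches top_p
                break
        ranked = kept

    # Renormalize the surviving probabilities so they sum to 1.
    norm = sum(probs[i] for i in ranked)
    return {i: probs[i] / norm for i in ranked}


def sample(logits, rng, **kwargs):
    """Draw one token index from the filtered distribution."""
    dist = filter_probs(logits, **kwargs)
    tokens = list(dist)
    return rng.choices(tokens, weights=[dist[t] for t in tokens], k=1)[0]


logits = [2.0, 1.0, 0.0, -1.0]  # made-up scores for four tokens
print(sorted(filter_probs(logits, top_k=2)))
print(sample(logits, random.Random(0), top_p=0.9))
```

Both filters prune the same ranked list, so applying them together makes it harder to tell which one is shaping the output. That is one plausible reading of the recommendation to use either top-p or top-k on a given job, not both.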

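The RMSD bullet added in this hunk describes a structural-similarity measure over corresponding atoms. A minimal sketch of the usual formula (the square root of the mean squared distance between paired atoms) follows. It assumes the two structures are already superimposed and that atoms are paired by index; how the viewer aligns structures or selects atoms is not stated in the text, and `rmsd` is a hypothetical helper name.

```python
import math


def rmsd(coords_a, coords_b):
    """RMSD between two equal-length lists of (x, y, z) points, paired by index."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("need two non-empty coordinate lists of equal length")
    squared = sum(
        (ax - bx) ** 2 + (ay - by) ** 2 + (az - bz) ** 2
        for (ax, ay, az), (bx, by, bz) in zip(coords_a, coords_b)
    )
    return math.sqrt(squared / len(coords_a))


# Made-up coordinates: the second structure is the first shifted by 1 along z.
reference = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
shifted = [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0)]
print(rmsd(reference, shifted))
```

Identical coordinate sets give an RMSD of 0, consistent with the statement that lower values indicate greater structural similarity.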
source/web-app/poet/rank-sequences.rst

Lines changed: 1 addition & 1 deletion
@@ -52,7 +52,7 @@ A 400 (Bad request) error code may be due to the following:
 * - Invalid prompt in PoET service
 - Reupload prompt and try again. Refer to the article about `prompts <./prompts.rst>`_. Ensure minimum and maximum similarity parameters are not filtering out all sequences in prompt.
 * - Invalid user input in align service
-- Ensure you dont have
+- Ensure you don't have
 
 - a top_p>1
 - a non-valid amino acid

source/web-app/poet/substitution-analysis.rst

Lines changed: 2 additions & 1 deletion
@@ -46,7 +46,8 @@ A 400 (Bad request) error code may be due to the following:
 * - Invalid prompt in PoET service
 - Reupload prompt and try again. Refer to the article about `prompts <./prompts.rst>`_. Ensure minimum and maximum similarity parameters are not filtering out all sequences in prompt.
 * - Invalid user input in align service
-- Ensure you don’t have
+- Ensure you don't have
+
 - a top_p>1
 - a non-valid amino acid
 - Maximum similarity < minimum similarity

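The "Invalid user input in align service" row touched here lists three conditions: a top_p above 1, a non-valid amino acid, and a maximum similarity below the minimum similarity. A client-side pre-check for those three conditions could be sketched as below. The function name `check_inputs`, its parameter names, and the 20-letter alphabet are assumptions made for illustration; the service may accept additional tokens or enforce further rules.

```python
# Assumed alphabet: the 20 standard amino acid one-letter codes.
VALID_AMINO_ACIDS = set("ACDEFGHIKLMNPQRSTVWY")


def check_inputs(sequence, top_p=None, min_similarity=None, max_similarity=None):
    """Return a list of error strings for the three conditions in the table row."""
    errors = []
    if top_p is not None and top_p > 1:
        errors.append("top_p must not exceed 1")
    # Collect every character that is not in the assumed alphabet.
    invalid = sorted(set(sequence.upper()) - VALID_AMINO_ACIDS)
    if invalid:
        errors.append("non-valid amino acid(s): " + ", ".join(invalid))
    if (
        min_similarity is not None
        and max_similarity is not None
        and max_similarity < min_similarity
    ):
        errors.append("maximum similarity is below minimum similarity")
    return errors


print(check_inputs("MKTAYI", top_p=0.9, min_similarity=0.3, max_similarity=0.9))
print(check_inputs("MKTB", top_p=1.5))
```

An empty list means none of the three listed conditions was detected; it does not guarantee that the request will be accepted.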