Skip to content

Commit a6b611e

Browse files
committed
Merge pull request #273 from SeqWare/feature/seqware_1967
Feature/seqware 1967
2 parents 5e08db7 + 6d0eff1 commit a6b611e

File tree

9 files changed

+111
-1553
lines changed

9 files changed

+111
-1553
lines changed

seqware-distribution/docs/site/content/docs/6-pipeline.md

+7-5
Original file line numberDiff line numberDiff line change
@@ -15,8 +15,12 @@ monitoring workflows.
1515

1616
<img width="600" src="/assets/images/seqware_hpc_oozie.png"/>
1717

18-
We currently support one workflow languages (Java) and four
19-
workflow engines (oozie, oozie-sge, whitestar, and whitestar-sge). Our current recommended combination is
18+
We currently support one workflow language (Java) and four
19+
workflow engines (oozie, oozie-sge, whitestar, and whitestar-sge).
20+
21+
(Previously, we also supported Pegasus/Condor/Globus as a workflow engine).
22+
23+
Our current recommended combination is
2024
Java workflows with the Oozie-sge engine.
2125

2226
* Oozie uses the Hadoop Workflow Scheduler to schedule steps in workflows on the Hadoop ecosystem (JobTrackers and TaskTrackers).
@@ -74,8 +78,6 @@ run on a randomly chosen cluster node.
7478

7579
* [Java Workflows](/docs/6-pipeline/java-workflows/)
7680
: This is our newer workflow language that is much simpler than the FTL and more expressive. We recommend this for all new workflow development.
77-
* [Deprecated FTL Workflows](/docs/6-pipeline/legacy-ftl-workflows/)
78-
: This is our older, more verbose workflow language based on FreeMarker. This provides information on the structure of a typical workflow descriptor written in this language. FTL is deprecated, do not use it for new workflows.
7981
* [Workflow Bundle Conventions](/docs/6-pipeline/workflow_bundles/)
8082
: We rely on a bundle format for packaging up and exchanging workflows. This document describes the format and directory structure.
8183
* [Workflow Config Files](/docs/6-pipeline/config_files/)
@@ -88,7 +90,7 @@ run on a randomly chosen cluster node.
8890
## Developer Tutorials
8991

9092
* [Debugging, Troubleshooting, & Restarting Workflow](/docs/6-pipeline/debug-workflows/)
91-
: A guide to debugging, troubleshooting, and restarting failed workflows for both workflow engines (Pegasus and Oozie).
93+
: A guide to debugging, troubleshooting, and restarting failed workflows for the Oozie workflow engine.
9294

9395
* [Developing in Partial SeqWare Environments with Whitestar](/docs/6-pipeline/partial_environments/)
9496
: A guide to developing without dependencies such as Oozie, SGE/Hadoop, and metadata (via the SeqWare webservice).

seqware-distribution/docs/site/content/docs/6-pipeline/connecting-to-cluster.md

-42
Original file line numberDiff line numberDiff line change
@@ -51,23 +51,6 @@ From the SeqWare perspective you will need to tell SeqWare which HDFS/MapReduce
5151
cluster to talk to, see the Oozie and Hadoop sections of the [SeqWare
5252
Configuration Guide](/docs/6-pipeline/user-configuration/).
5353

54-
## Pegasus Workflow Engine
55-
56-
How to connect the Pipeline's Pegasus Workflow Engine to a real SGE cluster.
57-
58-
### Importing the VM
59-
60-
This really depends on your organization and how they want to run a virtualized
61-
server. For information on VirtualBox see their
62-
[website](https://www.virtualbox.org/). For information on importing our
63-
VirtualBox image into KVM see [this
64-
link](http://cheznick.net/main/content/converting-a-virtual-machine-from-virtualbox-to-kvm).
65-
If you use Xen as your server virtualization see [this
66-
link](http://roymic.blogspot.ca/2012/02/how-to-convert-virtual-box-image-to-xen.html).
67-
You can also just use VirtualBox to run your virtual SeqWare server, in which
68-
case you will find the command line tools useful, see [this
69-
link](http://www.ubuntugeek.com/how-to-control-virtual-machines-virtualbox-using-vboxmanage.html).
70-
7154
### Install the Corresponding SGE Version
7255

7356
This is really up to your local sysadmin. You will need to use a common
@@ -77,31 +60,6 @@ Consult the [GridEngine
7760
wiki](http://wiki.gridengine.info/wiki/index.php/Main_Page) for more
7861
information about obtaining and configuring SGE.
7962

80-
### Configuring GRAM on the VM
81-
82-
This is the key step in the whole process. The GRAM layer from the Globus Toolkit
83-
actually handles the submission of workflow step to the cluster as a job. This is
84-
done via a Perl module that knows how to talk with SGE. We need to modify that
85-
Perl module so it knows 1) where the SGE tools are located and 2) the settings particular
86-
to the cluster being connected to.
87-
88-
#### Perl Submission Module
89-
90-
On the VM, this Perl module lives in: <tt>/usr/share/perl5/vendor_perl/Globus/GRAM/JobManager/sge.pm</tt>.
91-
92-
You may have seen this document mentioned in the [Install from Scratch](/docs/2a-installation-from-scratch/) guide where we patched it. Now, you need to ensure you setup the parallel environment (<tt>-pe</tt>) and max memory (<tt>-l vf=</tt>) correctly for your new cluster. Annoyingly, the parameters used for this are actually configurable in SGE so you will need to discuss with your cluster Sys Admin to see what is right for your environment. See the code snippet patches below.
93-
94-
<%= render '/includes/setup_gram_1/' %>
95-
96-
<%= render '/includes/setup_gram_2/' %>
97-
98-
You will also need to modify the following settings files:
99-
100-
/etc/sysconfig/gridengine
101-
/etc/globus/globus-sge.conf
102-
103-
These contain file paths and cluster name parameters that you need to update to point to the correct values for the real cluster.
104-
10563
### Testing
10664

10765
Finally you can submit and run a workflow just as you normally do following the

seqware-distribution/docs/site/content/docs/6-pipeline/debug-workflows.md

-15
Original file line numberDiff line numberDiff line change
@@ -433,18 +433,3 @@ Fix the error, rebuild, and relaunch. The workflow should now proceed onto the n
433433
-----------------------------------------------------------------------
434434

435435
Fix the two errors and your workflow should now run to completion.
436-
437-
438-
## The Pegasus Engine - Unsupported
439-
440-
The Pegasus engine is deprecated in 1.0.X and is no longer supported in 11.0.
441-
The Pegasus Engine used <tt>pegasus-status</tt> as the primary command for monitoring the status of workflows.
442-
443-
### Debugging Workflows
444-
445-
<%= render '/includes/debug/pegasus_debug/' %>
446-
447-
### Restarting Workflows
448-
449-
<%= render '/includes/debug/pegasus_restart/' %>
450-

seqware-distribution/docs/site/content/docs/6-pipeline/legacy-ftl-workflows.md

-104
This file was deleted.

0 commit comments

Comments
 (0)