documentation improvements

neoakris · neoakris · commit 9431351de274 · 2025-05-06T12:30:58.000-04:00
diff --git a/README.md b/README.md
@@ -2,9 +2,9 @@
 
 ## What is Easy EKS?
 An opinionated bundling of automation & Infrastructure as code that aims to:
-1. Make it easy to provision EKS clusters that are production ready by default.
-2. Maintain a heavily standardized opinionated set of IaC, which makes automation easier.
-3. Apply a helm like design pattern to AWS CDK.
+1. Make it easy to provision EKS clusters that are nearly production ready by default.
+2. Maintain a heavily standardized opinionated set of IaC, which makes automation maintainable.
+3. Apply useful design patterns from Helm and Kustomize to IaC based on AWS CDK.
 
 ## What is the current status of Easy EKS?
 Pre-Alpha
diff --git a/docs/09_Project_Goals_and_Target_Audience/Project_Goals.md b/docs/09_Project_Goals_and_Target_Audience/Project_Goals.md
@@ -1,5 +1,5 @@
 # Easy EKS's Project Goals
-Easy EKS = EKS + Apps + Config + Infrastructure as Code + Automation + Docs.
+Easy EKS = EKS + Kube Apps + Config + Infrastructure as Code + Automation + Docs.
 
 ## 1. Make a standardized baseline distribution of EKS
 * Standardization is a prerequisite for:
diff --git a/docs/README.md b/docs/README.md
@@ -1,34 +1,110 @@
 # Easy EKS (Pre-Alpha)
 
-## What problem does this solve?
+## What is Easy EKS?
+**Here are 3 useful answers to that question based on different perspectives:**  
+(Each answer is clarified further in later sections on this page.)
+1. **A Solution:**
+   Setting up and learning how to implement EKS according to best practices is often said to be hard,
+   so much so that it crosses the threshold of being problematically difficult and a barrier to
+   adoption. From this perspective Easy EKS can be seen as a solution to EKS's difficulty problem.
+2. **Summarized Technical Description:**  
+   An opinionated bundling of automation & IaC (Infrastructure as Code) that aims to:
+   1. Make it easy to provision EKS clusters that are nearly production ready by default.
+   2. Maintain a heavily standardized opinionated set of IaC, which makes automation maintainable.
+   3. Apply useful design patterns from Helm and Kustomize to IaC based on AWS CDK.
+3. **General Description:**  
+   A user experience optimized approach to EKS that aims to make using EKS `simpler`, `accessible`,
+   and `enjoyable`.
+
+-------------------------------------------------------------------------------------------------------
+
+### What problem does Easy EKS solve?
 * EKS is a double-edged sword:
   * Good: 
     * It simplifies the setup of a Kubernetes Cluster on AWS.
     * It works great after it's set up.
-  * Bad: 
-    * Set up is left to end users who have a high risk of setting it up poorly, taking months, or
-      both.
+  * Bad:
+    * EKS by itself is far from being production ready by default. It's more like the virtual
+      equivalent of receiving a custom PC build project when you wanted a push button server.
     * It has a terrible FTUX (first time user experience) and OOTB (out of the box) UX (user
-      experience).
-* Easy EKS is a solution to EKS's problems related to its slow and flawed set up process, FTUX, and
-  OOTB UX.
+      experience), because end users are left to figure out how to re-invent a production ready setup,
+      and there's a high risk that they'll set it up poorly, need months to figure it out, or both.
+* Easy EKS can be seen as a solution to EKS's problems related to its slow and flawed set up process,
+  FTUX, and OOTB UX:
   * https://www.reddit.com/r/aws/comments/qpw36d/aws_eks_rant/
   * https://www.reddit.com/r/devops/comments/y5am95/why_is_eks_and_aws_in_general_so_much_more/
   * https://matduggan.com/aws-eks/
 
-## What is Easy EKS?
-* Easy EKS is a user experience optimized approach to EKS, where using it becomes `simpler`, `accessible`, and `enjoyable`.
+-------------------------------------------------------------------------------------------------------
+
+### What Specific Technical Benefits does Easy EKS Offer?
+* **Currently Available in Pre-Alpha:**
+  1. `Useful elements of Helm's design pattern are used:`
+     * A nice feature of Helm over, say, Kustomize, Terraform, or common CDK/Pulumi design patterns is
+       that it's intuitively clear which parts of the IaC are fine to change and which shouldn't be.
+     * Configuration input parameters have sensible defaults, but can be overridden.
+     * Some IaC complexity can be hidden, which allows users to focus on well organized config, which
+       in turn significantly lowers cognitive overhead and improves ease of management and accessibility.
+     * Supports the deployment of Multiple Instances: It's very easy to have multiple clusters per
+       environment (dev1-eks, dev2-eks, etc.)
+     * Helm popularized a convention of mixing config values with
+       [heavy commentary](https://artifacthub.io/packages/helm/prometheus-community/prometheus?modal=values)
+       which improves accessibility and general user experience, by explaining what a config flag will
+       do and documenting commented out examples of alternative possible values with correct syntax.
+  1. `Useful elements of Kustomize's design pattern are used:`
+     * Kustomize popularized the [config overlay design pattern](https://kubectl.docs.kubernetes.io/guides/introduction/kustomize/#2-create-variants-using-overlays),
+       which offers multiple advantages:
+       * It allows config shared between multiple environments to be deduplicated, which makes it much
+         easier to avoid unwanted config drift between environments and improves maintainability.
+       * It keeps the config well organized, which makes it easier to quickly navigate.
+  1. `Two well configured AWS VPCs`
+     * The VPCs are dualstack (IPv4/IPv6), and EKS clusters use IPv6 mode to eliminate the problem of
+       running out of IPs.
+     * fck-nat, the (f)easible (c)ost (k)onfigurable NAT, is an alternative to AWS's Managed NAT GW
+       that's an order of magnitude cheaper.
+     * lower-envs-vpc defaults to 1 fck-NAT instance
+     * higher-envs-vpc defaults to 2 fck-NAT instances, and can optionally be set to 3 AWS Managed NAT
+       GWs.
+     * node-local-dns-cache and S3 Gateway endpoints are also enabled by default.
+  1. `Heavily cost optimized:`
+     * Easy EKS gives the benefits of EKS's Auto Mode (and more), without Auto Mode's additional costs.
+     * The baseline cost of a dev cluster is under $100/month.
+       * EKS control plane cost is $73/month.
+       * lower-envs-vpc's fck-NAT defaults to $3.06/month, and is meant to be shared by multiple clusters.
+       * 2x t4g.small spot baseline nodes are $10.22/month
+       * karpenter's lower-envs default config is weighted to prefer spot based ARM bottlerocket nodes.
+  1. `UX optimizations:`
+     * EKS clusters have useful tags.
+     * Name tags of EC2 instances are nicely organized.
+     * IAM admins are given EKS viewer access by default for both the EKS web console and kubectl.
+     * kubectl onboarding is streamlined.
+  1. `Production Readiness optimizations:`
+     * kubernetes secrets stored in etcd get KMS encrypted by default.
+     * EKS Addons are all installed by default.
+     * CoreDNS's config is optimized by default in terms of node affinity and autoscaling.
+     * AWS Load Balancer Controller is installed by default and configured using eks-pod-identity-agent,
+       which means it doubles as a great IaC reference for pod level IAM rights.
+     * Karpenter is installed by default and preconfigured to provision spot, on-demand, AMD, or ARM
+       bottlerocket based worker nodes.
+* **Planned for Alpha:**
+  1. `The default storage class is preconfigured to provide kms encrypted gp3 ebs volumes.`
+  1. `Additional streamlining of kubectl access onboarding`
+  1. `Metric Level Observability`
+  1. `Log Level Observability`
+  1. `Standardize Variable Naming Conventions`
 
 -------------------------------------------------------------------------------------------------------
 
 ### Simpler EKS
 1. **Deployment <u>and baseline configuration</u> are both automated:**
-   * `cdk` is used to automate the provisioning of production ready EKS Clusters.
-2. **The administrative overhead associated with managing multiple clusters is minimized:**
-   * A `kustomize inspired` design pattern is used to make the deployment and management over time of multiple clusters much easier.
-3. **Complexity is simplified, by shielding the end user engineers from unnecessary complexity that's practical to hide away:**
+   * `cdk` is used to automate the provisioning of nearly production ready EKS Clusters.
+2. **The administrative overhead associated with managing multiple clusters is lower:**
+   * A `kustomize inspired` design pattern is used to make the deployment and management over time of
+     multiple clusters much easier.
+3. **Complexity is simplified, by shielding the end user engineers from unnecessary complexity that can be practically hidden away:**
    * A `helm inspired` design pattern to abstract away complexity.
-     * helm hides complexity in templatized yaml files, and helm values.yaml files, which represent sane default values of input parameters to feed into the templating engine.
+     * helm hides complexity in templatized yaml files, and helm values.yaml files, which represent
+       sane default values of input parameters to feed into the templating engine.
      * Here's an example of how helm allows end uesrs to see a significantly simplified interface:
        * A 15 line long `kps.helm-values.yaml` file (of values representing overrides of
          kube-prometheus-stack helm chart's default input parameters)
@@ -39,7 +115,8 @@
        * /lib/ (a cdk library)
        * /.flox/ (a recommended, yet optional method of automating dev shell dependencies with `flox activate`)
      * Easy EKS presents a simplified workflow to end users:
-       * Edit /config/ (which is an intuitive and simplified end user interface inspired by kustomize and helm values)
+       * Edit /config/ (which is an intuitive and simplified end user interface inspired by kustomize
+         and helm values)
        * `cdk list`
        * `cdk deploy dev1-eks`
 
@@ -57,21 +134,20 @@
 -------------------------------------------------------------------------------------------------------
 
 ### Enjoyable EKS
-
 * User Experience is what makes cars enjoyable products. The same is true for Easy EKS.
   * Cars have complexity,
     * But it's the car maker that deals with the complexity. 
     * You the end user get a simplifed turn key user experience.
     * It's designed to be intuitive, learning how to drive isn't hard.
   * Easy EKS has complexity,
     * But you will be shielded from the majority of the complexity, it's abstrated away where practical.
-    * `You get to enjoy a turn key, batteries included, production ready user experience`.
+    * `You get to enjoy a turn key, batteries included, nearly production ready user experience`.
     * It's designed to be intuitive, and even FTUX (first time user experience) and OUX (onboarding UX)
       are prioritized to make it easy to learn.
 * You can enjoy:
   * Being able to get meaningful work done quick:
     * Learn the basics within a day.
-    * Deploy a cluster in under an hour, with a production ready baseline configuration.
+    * Deploy a cluster in under an hour, with a nearly production ready baseline configuration.
     * Develop working proficiency in under a week. 
   * Not having to think through engineering toil:
     * Instead of choices, that make engineer's stress over identifying the best chocie.
@@ -86,24 +162,3 @@
     * ADR's (Architectural Decision Records) are available to verify reasoning behind all choices.
       * This isn't just a platform that claims to follow best practices.
       * It's a platform that includes justifications of why it's practices are best practices.
-
--------------------------------------------------------------------------------------------------------
-
-## Why Easy EKS Exists
-| **Basic Functionality you'd expect to see, for normal usage and production readiness:** | **GCP's GKE AutoPilot:**<br> (a point of reference of what good looks like) | **AWS EKS:**<br> (The default out of the box user experience is a collection of dumb problems to have)                                                                                                                                           | **Easy EKS**  <br> (Smart solutions to dumb problems that make EKS easier, brought to you by doit.com)                                                                                 |
-|-----------------------------------------------------------------------------------------|-----------------------------------------------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
-| A well configured VPC                                                                   | Default VPC ships with Cloud NAT                                            | Default VPC doesn't ship with a NAT GW, and Managed NAT GW is so bad [(link 1)](https://www.lastweekinaws.com/blog/the-aws-managed-nat-gateway-is-unpleasant-and-not-recommended/), that fck-NAT exists [(link 2)](https://fck-nat.dev/stable/). | Ships with fck-NAT (order of magnitude cheaper), and dualstack VPC for IPv6 based EKS, which eliminates potential problem of running out of IPs.                                       |
-| Optimized DNS                                                                           | DNS is optimized by default via Node Local DNS Cache and Cloud DNS          | Ships with nothing. A relatively easy install won't be Fault Tolerant, won't have a dns auto-scaler, nor node-local-dns-cache. Figuring out production grade optimizations takes days.                                                           | Alpha ships with Node Local DNS Cache, core dns autoscaler, and anti affinity rules for increased fault tolerance.<br> Planned for Beta: verify/optimize core dns autoscaler's config. |
-| Easily populate ~/.kube/config for Kubectl Access                                       | A blue connect button at the top of the Web GUI, shows a command.           | Access tends to be a multistep process, so you look up docs for something that should be trivially easy.                                                                                                                                         | When cdk eks blueprints finishes, it outputs a config command.                                                                                                                         |
-| Teammates can easily access to kubectl and Web Console                                  | GCP IAM roles map to GKE's rbac rights by default.                           | In general, access needs to be explicitly configured per cluster, nuanced limitations make it hard.                                                                                                                                              | Pragmatic workarounds to access limitations are set by default to make access easier.                                                                                                  |
-| Metric Level Observability                                                              | Ships with preconfigured working dashboards                                 | Ships with nothing, figuring out how to set up takes days.                                                                                                                                                                                       | PLANNED (alpha)                                                                                                                                                                        |
-| Log Level Observability                                                                 | Ships with intuitive centralized logging                                    | Ships with nothing, figuring out how to set up takes days.                                                                                                                                                                                       | PLANNED (alpha)                                                                                                                                                                        |
-| Automatically Provisions storage for stateful workloads                                 | Ships with a preconfigured storageclass                                     | Ships with broken implementation, fixing is relatively easy, but how/why is this not a default functionality baked into the platform?                                                                                                            | Ships with KMS Encrypted EBS storageclass                                                                                                                                              |
-| Automatically Provision Load Balancers for Ingress                                      | Ships with GKE Ingress Controller and GKE's Gateway API controller          | Ships with nothing, and the solution: AWS Load Balancer Controller, is considered a 3rd party add-on, with a complex installation that can take days to figure out.                                                                              | Ships with AWS Load Balancer Controller                                                                                                                                                |
-| Pod Level IAM Identity                                                                  | Ships with Workload Identity (pod level IAM roles)                          | Ships with nothing, making it work is relatively easy, seems reasonable to have this be a default baked into the platform.                                                                                                                       | Ships with Amazon EKS Pod Identity Agent                                                                                                                                               |
-| Worker Node Autoscaling                                                                 | Ships with NAP (Node Auto Provisioner)                                      | Ships with nothing, figuring out how to install cluster autoscaler or karpenter.sh can take days.                                                                                                                                                | Ships with Karpenter.sh (Note: currently an outdated version to avoid compatibility issues, waiting for Karpenter 1.2.x / stable version planned for alpha)                            |
-
--------------------------------------------------------------------------------------------------------
-
-## How do I get started?
-[Check the docs page](/docs)
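
Note on the config pattern described in the new docs/README.md text: the added bullets describe Helm-style defaults that can be overridden, combined with Kustomize-style overlays that deduplicate config shared between environments. A minimal sketch of that layering idea in TypeScript follows. It is an illustration only, not Easy EKS's actual code: `ClusterConfig`, its field names, `applyOverlays`, and the overlay values are all hypothetical. Only the cluster names (`dev1-eks`, `dev2-eks`) and the default of one fck-NAT instance for lower environments come from the text above.

```typescript
// Hypothetical config shape. Field names are illustrative assumptions,
// not the schema used by Easy EKS.
interface ClusterConfig {
  natInstanceCount: number;
  preferSpot: boolean;
  tags: Record<string, string>;
}

// An overlay overrides only the fields it mentions.
type Overlay = Partial<ClusterConfig>;

// Helm-like layer: sensible defaults that any later layer may override.
const defaults: ClusterConfig = {
  natInstanceCount: 1,
  preferSpot: true,
  tags: { managedBy: "cdk" },
};

// Kustomize-like layering: apply overlays left to right. Later overlays win,
// and tags are merged key by key rather than replaced wholesale.
function applyOverlays(base: ClusterConfig, ...overlays: Overlay[]): ClusterConfig {
  return overlays.reduce<ClusterConfig>(
    (acc, overlay) => ({
      ...acc,
      ...overlay,
      tags: { ...acc.tags, ...(overlay.tags ?? {}) },
    }),
    base,
  );
}

// Shared by every lower environment, so it is written once.
const lowerEnvs: Overlay = { tags: { tier: "lower" } };

// Per-cluster overlays hold only what differs from the shared layers.
const dev1: Overlay = { tags: { cluster: "dev1-eks" } };
const dev2: Overlay = { preferSpot: false, tags: { cluster: "dev2-eks" } };

const dev1Config = applyOverlays(defaults, lowerEnvs, dev1);
const dev2Config = applyOverlays(defaults, lowerEnvs, dev2);

console.log(JSON.stringify({ dev1Config, dev2Config }, null, 2));
```

Because each cluster's overlay lists only its differences, a change to `defaults` or `lowerEnvs` reaches every cluster built from them, which is the drift-avoidance benefit the README attributes to the overlay pattern.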