This control plane base pipeline is effectively limited to all, and only, those components of EKS that are managed by AWS. Deployment, version changes, and removal of the associated resources belong to AWS under the shared-responsibility model of IaaS vendor-managed services. The pipeline owner directs only *when* such changes occur, by specifying version changes in the environment configuration or through similar practices for notifying AWS of a change to be made.
A typical Engineering Platform release pipeline for the underlying cluster control plane instances will have the following cluster roles. At scale, each role may include multiple clusters. Note that the platform customer namespaces are limited to targeted roles, all of which amount to production from the platform product team's point of view.
- authentication mode = `API`
- infrastructure access configuration via `access_entries`
- control plane logging default = "api", "audit", "authenticator", "controllerManager", "scheduler"
- control plane internals encrypted using an AWS-managed KMS key
- arm-based Managed Node Group for a dedicated management pool with specific toleration requirements (see the toleration sketch after this list)
- eks addons:
  - vpc-cni
  - coredns
  - kube-proxy
  - aws-ebs-csi-driver
    - default storage class provisioned, by convention = `$cluster_name-ebs-csi-storage-class` (see the StorageClass sketch after this list)
  - aws-efs-csi-driver
    - efs file system created
    - default storage class provisioned, by convention = `$cluster_name-efs-csi-storage-class`
    - filesystem-id stored in 1Password and made discoverable via the platforms/clusters API
  - karpenter
    - sqs and eventbridge deployed
    - arm and amd NodePool resources defined
      - target desired architecture with `kubernetes.io/arch` = "arm64" | "amd64" (see the NodePool sketch after this list)
- psk-system namespace created
- admin ClusterRoleBinding created for the ThoughtWorks-DPS/twdps-core-labs-team claim (see the ClusterRoleBinding sketch after this list)
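The taint on the dedicated management pool is not named in this document, so the following is a purely hypothetical sketch of the toleration a management workload would carry; the `dedicated=management:NoSchedule` taint, pod name, and image are all assumptions:

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: example-management-workload   # hypothetical workload
  namespace: psk-system
spec:
  tolerations:
    # assumes the management MNG is tainted dedicated=management:NoSchedule
    - key: dedicated
      operator: Equal
      value: management
      effect: NoSchedule
  nodeSelector:
    # schedule onto the arm-based management pool
    kubernetes.io/arch: arm64
  containers:
    - name: app
      image: public.ecr.aws/docker/library/busybox:latest
      command: ["sleep", "infinity"]
```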
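A minimal sketch of the conventional EBS-backed storage class; the `my-cluster` name prefix and the parameter choices are assumptions, not values taken from this pipeline. The EFS class follows the same naming convention with `provisioner: efs.csi.aws.com`.

```yaml
apiVersion: storage.k8s.io/v1
kind: StorageClass
metadata:
  # by convention: $cluster_name-ebs-csi-storage-class
  name: my-cluster-ebs-csi-storage-class
provisioner: ebs.csi.aws.com
volumeBindingMode: WaitForFirstConsumer
parameters:
  type: gp3
  encrypted: "true"
```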
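A minimal sketch of an arm-targeted Karpenter NodePool using the Karpenter v1 API; the NodePool and EC2NodeClass names are assumptions. The amd pool is identical except for `values: ["amd64"]`.

```yaml
apiVersion: karpenter.sh/v1
kind: NodePool
metadata:
  name: arm64   # assumed name
spec:
  template:
    spec:
      nodeClassRef:
        group: karpenter.k8s.aws
        kind: EC2NodeClass
        name: default   # assumed EC2NodeClass name
      requirements:
        # pin the pool to a single architecture
        - key: kubernetes.io/arch
          operator: In
          values: ["arm64"]
```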
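And a sketch of the admin binding; the binding name, and the assumption that the ThoughtWorks-DPS/twdps-core-labs-team claim surfaces as a Kubernetes group of the same name, are illustrative only.

```yaml
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRoleBinding
metadata:
  name: twdps-core-labs-team-admin   # assumed name
roleRef:
  apiGroup: rbac.authorization.k8s.io
  kind: ClusterRole
  name: cluster-admin
subjects:
  - apiGroup: rbac.authorization.k8s.io
    kind: Group
    name: ThoughtWorks-DPS/twdps-core-labs-team
```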
See implementation notes.
upgrade kubernetes and addon versions
Change `eks_version` in the environments json to initiate an upgrade to a new EKS version. Addons will automatically update to the correct, latest version with each pipeline run.
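As a hedged illustration, the relevant entry might look like the following; only the `eks_version` key appears in this document, so the surrounding structure and the version value shown are assumptions:

```json
{
  "eks_version": "1.31"
}
```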
management node group
The `taint` step results in the MNG nodes updating to the correct, latest patch version.
general data plane nodes
Karpenter-managed NodePools will automatically update to the correct, latest patch version each week.
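How the weekly refresh is wired up is not shown here; one common way to achieve it with the Karpenter v1 API is a node lifetime of 168h, sketched below as an assumption rather than a statement of what this pipeline actually configures:

```yaml
apiVersion: karpenter.sh/v1
kind: NodePool
metadata:
  name: amd64   # assumed name
spec:
  template:
    spec:
      nodeClassRef:
        group: karpenter.k8s.aws
        kind: EC2NodeClass
        name: default   # assumed EC2NodeClass name
      requirements:
        - key: kubernetes.io/arch
          operator: In
          values: ["amd64"]
      # recycle nodes weekly; replacements launch with the latest patched AMI
      expireAfter: 168h
```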
TODO
- observability solution to replace Datadog not yet implemented
- currently the "taint" logic for refreshing management node group nodes is driven by a value in the environment file, which means it is simply on or off. The reason is that when creating a new cluster there are no node groups to taint, so a command to do so would fail; you must set the value to true or false in the code based on the cluster (or cluster role, if scaled). A better solution would be a test that can determine that the cluster does not yet exist and thereby skip the taint step successfully.