aws deployment #60
Conversation
Summary of Changes
Hello @dentiny, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!
This pull request introduces the foundational Kubernetes configurations required to deploy the moonlink application and its associated data server on AWS. It defines a deployment for the moonlink service and an Nginx data server, along with a service to expose these components via a LoadBalancer, enabling external access and readiness checks.
Highlights
- Kubernetes Deployment for Moonlink: Defines a Kubernetes Deployment named moonlink-deployment that includes both the moonlink_service container and an nginx container. It configures resource requests/limits, container ports (3030, 3031, 5050), and a readiness probe for the moonlink_service.
- Nginx Data Server Configuration: Integrates an nginx container within the moonlink-deployment to serve data plane requests, mounting a shared empty directory for content.
- Kubernetes Service for Moonlink: Creates a Kubernetes Service of type LoadBalancer to expose the moonlink-deployment. This service maps external ports to internal container ports for data (8080->80), REST API (3030->3030), TCP RPC (3031->3031), and readiness checks (5050->5050); see the sketch after this list.
- Shared Volume for Inter-Container Communication: Establishes a shared emptyDir volume named shared-cache-directory to facilitate data exchange between the moonlink_service and nginx containers.
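For orientation, here is a minimal sketch of a Service matching the mappings above. The Service name, port names, and the app: moonlink selector label are assumptions (the full manifest is not reproduced on this page), so treat it as an illustration rather than the manifest from this PR.

```yaml
apiVersion: v1
kind: Service
metadata:
  name: moonlink-service  # Assumed name; not taken from the PR.
spec:
  type: LoadBalancer
  selector:
    app: moonlink  # Assumed label; must match the Deployment's pod template labels.
  ports:
    # Data plane requests, served by the nginx container.
    - name: data
      protocol: TCP
      port: 8080
      targetPort: 80
    # REST API server.
    - name: rest
      protocol: TCP
      port: 3030
      targetPort: 3030
    # TCP RPC server.
    - name: rpc
      protocol: TCP
      port: 3031
      targetPort: 3031
    # Readiness checks (see the review comment below about exposing this publicly).
    - name: readiness
      protocol: TCP
      port: 5050
      targetPort: 5050
```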
Using Gemini Code Assist
The full guide for Gemini Code Assist can be found on our documentation page; here are some quick tips.
Invoking Gemini
You can request assistance from Gemini at any point in your pull request by creating an issue comment (i.e. a comment on the pull request page) using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands.
| Feature | Command | Description |
|---|---|---|
| Code Review | /gemini review | Performs a code review for the current pull request in its current state. |
| Pull Request Summary | /gemini summary | Provides a summary of the current pull request in its current state. |
| Comment | @gemini-code-assist | Responds in comments when explicitly tagged, both in issue comments and review comments. |
| Help | /gemini help | Displays a list of available commands. |
Customization
To customize the Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.
Limitations & Feedback
Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counterproductive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving feedback about your experience with Gemini Code Assist for GitHub and other Google products, sign up here.
You can also get AI-powered code generation, chat, and code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.
Footnotes
¹ Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double-check it and use code with caution.
Code Review
This pull request introduces Kubernetes deployment and service configurations for an AWS production environment. My review focuses on ensuring these configurations adhere to best practices for production systems, covering aspects like high availability, security, image management, and health checks. I've identified several areas for improvement, such as increasing the replica count for redundancy, using an appropriate container registry for AWS, adding a liveness probe for better health monitoring, and avoiding mutable image tags to ensure deployment stability.
```yaml
        # Data server, which serves data plane requests.
        - name: nginx
          image: nginx:latest
```
Using the :latest tag for a production container image is a critical risk. The latest tag is mutable and can point to different versions of the image over time. This can lead to unpredictable behavior and makes it difficult to track which version of the image is running or to perform rollbacks. Please use a specific, immutable image tag (e.g., nginx:1.25.3) to ensure your deployments are deterministic and reliable.
Suggested change:

```yaml
          image: nginx:1.25.3
```
```yaml
metadata:
  name: moonlink-deployment
spec:
  replicas: 1
```
A single replica means the service has no redundancy: if the pod or its node fails, the application is unavailable until it is rescheduled. For a production deployment, increase the replica count (e.g., to 2 or more) so the workload can tolerate pod and node failures.
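As a rough sketch of that direction (the app: moonlink label is an assumption and must match whatever labels the pod template actually uses), one could raise the replica count and optionally add a PodDisruptionBudget so voluntary disruptions such as node drains never take every replica down at once:

```yaml
# In the Deployment spec, run more than one replica:
#   spec:
#     replicas: 2
# Optionally, a PodDisruptionBudget keeps at least one pod available
# during voluntary disruptions such as node drains.
apiVersion: policy/v1
kind: PodDisruptionBudget
metadata:
  name: moonlink-pdb
spec:
  minAvailable: 1
  selector:
    matchLabels:
      app: moonlink  # Assumed label; must match the Deployment's pod template labels.
```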
```yaml
      containers:
        # Moonlink deployment, which serves REST API and TCP rpc requests.
        - name: moonlink-deployment
          image: us-west1-docker.pkg.dev/coral-ring-465417-r0/moonlink-standalone-experiment/moonlink-deployment:478dc9ee-cc19-48f9-b741-21e6e8b27de7
```
The Docker image is being pulled from Google Artifact Registry (us-west1-docker.pkg.dev). For a deployment on AWS, it is highly recommended to use AWS Elastic Container Registry (ECR). This will reduce cross-cloud data transfer costs and improve image pull latency, leading to faster pod startups and better performance.
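As an illustration only: an ECR-hosted image is referenced with the pattern <account-id>.dkr.ecr.<region>.amazonaws.com/<repository>:<tag>. The account ID, region, and repository name below are placeholders rather than values from this PR, and the image would first have to be pushed (or replicated) to that ECR repository.

```yaml
        - name: moonlink-deployment
          # Placeholder account ID, region, and repository name; substitute the real ones.
          image: 123456789012.dkr.ecr.us-west-2.amazonaws.com/moonlink-deployment:478dc9ee-cc19-48f9-b741-21e6e8b27de7
```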
```yaml
        - name: moonlink-deployment
          image: us-west1-docker.pkg.dev/coral-ring-465417-r0/moonlink-standalone-experiment/moonlink-deployment:478dc9ee-cc19-48f9-b741-21e6e8b27de7
          command: ["/app/moonlink_service"]
          args:
            - "/tmp/moonlink"
            - "--no-rest-api"
            - "--tcp-port=3031"
          # TODO(hjiang): Revisit before release and production deployment, use a dummy guaranteed QoS for now.
          resources:
            requests:
              cpu: "500m"
              memory: "1Gi"
            limits:
              cpu: "500m"
              memory: "1Gi"
          env:
          volumeMounts:
            - name: shared-cache-directory
              mountPath: /tmp/moonlink
          ports:
            # Port number for REST API server.
            - containerPort: 3030
            # Port number for TCP rpc server.
            - containerPort: 3031
            # Port number for readiness probe.
            - containerPort: 5050
          readinessProbe:
            httpGet:
              path: /ready
              port: 5050
            initialDelaySeconds: 2
            periodSeconds: 5
            timeoutSeconds: 1
            failureThreshold: 3
```
The moonlink-deployment container has a readinessProbe but is missing a livenessProbe. A liveness probe is crucial for production workloads as it allows Kubernetes to detect if an application is in an unrecoverable state (e.g., deadlocked) and restart it automatically. Please add a livenessProbe to improve the reliability of the service.
You can often use the same endpoint as the readiness probe, for example:
```yaml
livenessProbe:
  httpGet:
    path: /ready
    port: 5050
  initialDelaySeconds: 15
  periodSeconds: 20
```
```yaml
    # Exposed port for readiness check.
    - name: readiness
      protocol: TCP
      port: 5050
      targetPort: 5050
```
Exposing the readiness probe port (5050) on a public-facing LoadBalancer is not recommended. This endpoint is intended for internal cluster health checks. Exposing it publicly is unnecessary and could present a minor security risk by revealing information about your internal services. The readiness probe will function correctly without being exposed in the service definition. Please remove this port from the service.
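A minimal sketch of the Service's ports section with that entry dropped; the port names are assumptions and the remaining mappings mirror the ones summarized above. The kubelet probes the pod IP on port 5050 directly, so readiness checks keep working without a Service entry.

```yaml
  ports:
    # Data plane requests, served by the nginx container.
    - name: data
      protocol: TCP
      port: 8080
      targetPort: 80
    # REST API server.
    - name: rest
      protocol: TCP
      port: 3030
      targetPort: 3030
    # TCP RPC server.
    - name: rpc
      protocol: TCP
      port: 3031
      targetPort: 3031
    # (No entry for port 5050; the readiness probe does not need to be exposed.)
```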
Summary
Briefly explain what this PR does.
Related Issues
Closes # or links to related issues.
Changes
Checklist