Skip to content

Merge test to main (#113) #114

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 3 commits into from
Jun 17, 2025
Merged

Merge test to main (#113) #114

merged 3 commits into from
Jun 17, 2025

Conversation

eberrigan
Copy link
Collaborator

No description provided.

@eberrigan eberrigan requested a review from Copilot June 17, 2025 02:01
Copilot

This comment was marked as outdated.

@eberrigan eberrigan changed the title Fix terraform cycle (#113) Merge test to main (#113) Jun 17, 2025
@eberrigan eberrigan requested a review from Copilot June 17, 2025 04:33
Copy link

@Copilot Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

A Terraform update that removes an explicit EIP dependency, increases the VM root volume size, and adjusts Docker GPU runtime flags.

  • Remove explicit depends_on for EIP association in the allocator server
  • Increase root block device volume_size from 40 to 80
  • Add --runtime=nvidia to Docker run commands alongside --gpus all

Reviewed Changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

File Description
lablink-allocator/main.tf Removed the depends_on block for the EIP association
lablink-allocator/lablink-allocator-service/terraform/main.tf Updated root volume size and added GPU runtime flag to Docker run
Comments suppressed due to low confidence (1)

lablink-allocator/main.tf:78

  • Removing the explicit depends_on on the EIP association may cause the server instance to be created before the Elastic IP is attached, leading to race conditions. Consider re-adding the depends_on or referencing the EIP in the network interface block to enforce correct resource ordering.
depends_on = [

@@ -94,7 +94,7 @@ resource "aws_instance" "lablink_vm" {
vpc_security_group_ids = [aws_security_group.lablink_sg_.id]
key_name = aws_key_pair.lablink_key_pair.key_name
root_block_device {
volume_size = 40
volume_size = 80
Copy link
Preview

Copilot AI Jun 17, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

[nitpick] Hardcoding the volume size reduces flexibility; consider parameterizing this value via a Terraform variable or input to allow easier adjustments without changing code.

Suggested change
volume_size = 80
volume_size = var.volume_size

Copilot uses AI. Check for mistakes.

Comment on lines +116 to +119
docker run -dit --runtime=nvidia --gpus all -e ALLOCATOR_HOST=${var.allocator_ip} ${var.image_name}
else
echo "Cloning repository: $TUTORIAL_REPO_TO_CLONE"
docker run -dit --gpus all -e ALLOCATOR_HOST=${var.allocator_ip} -e TUTORIAL_REPO_TO_CLONE=${var.repository} ${var.image_name}
docker run -dit --runtime=nvidia --gpus all -e ALLOCATOR_HOST=${var.allocator_ip} -e TUTORIAL_REPO_TO_CLONE=${var.repository} ${var.image_name}
Copy link
Preview

Copilot AI Jun 17, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

[nitpick] Using both --runtime=nvidia and --gpus all flags may be redundant depending on Docker version; pick one method to enable GPU support and ensure compatibility.

Copilot uses AI. Check for mistakes.

@eberrigan eberrigan merged commit 673142a into main Jun 17, 2025
3 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant