Eric Meehan eric
  • Joined on 2024-12-02
eric opened issue DevOps/software-infrastructure#13 2025-03-08 19:53:37 +00:00
Disk pressure on alpha-worker-0
eric commented on issue DevOps/software-infrastructure#10 2025-03-08 19:50:01 +00:00
NVIDIA Tesla T4

The T4 appears to be available within Kubernetes; however, the underlying VM is running low on storage, preventing containers from being deployed.

eric closed issue DevOps/software-infrastructure#10 2025-03-08 19:50:01 +00:00
NVIDIA Tesla T4
eric commented on issue DevOps/ansible-role-eom#26 2025-03-08 17:23:16 +00:00
Deploy LocalAI

This deployment was straight forward, but resulted in a pod with pending status. More work is needed to utilize GPUs in K8s.

eric pushed to main at DevOps/ansible-role-eom 2025-03-08 17:00:08 +00:00
38e9886155 Luanti
eric opened issue DevOps/ansible-role-eom#26 2025-03-08 16:57:40 +00:00
Deploy LocalAI
eric commented on issue DevOps/software-infrastructure#10 2025-03-08 16:37:33 +00:00
NVIDIA Tesla T4

This installation guide provides more detailed information.

eric commented on issue DevOps/software-infrastructure#10 2025-03-08 16:30:13 +00:00
NVIDIA Tesla T4

Documentation for scheduled GPUs in Kubernetes.

eric opened issue DevOps/software-infrastructure#12 2025-03-08 16:28:45 +00:00
GPU passthrough in main playbook
eric commented on issue DevOps/software-infrastructure#10 2025-03-08 16:27:48 +00:00
NVIDIA Tesla T4
eric opened issue DevOps/software-infrastructure#11 2025-03-08 16:27:28 +00:00
GPU passthrough on reboot
eric commented on issue DevOps/software-infrastructure#10 2025-03-08 16:25:49 +00:00
NVIDIA Tesla T4

GPU passthrough to alpha-worker-0 was surprisingly smooth. Steps 1-5 are now complete. A separate issue should be made for automating the steps taken during this installation and for reconnecting…

eric commented on issue DevOps/software-infrastructure#10 2025-03-08 15:02:30 +00:00
NVIDIA Tesla T4

The following needs to be done:

  1. Uninstall Nvidia drivers from T640
  2. Update grub
  3. Enable VFIO drivers
  4. Setup device passthrough using virt-manager
  5. Install Nvidia drivers on…
eric opened issue DevOps/ansible-role-eom#25 2025-03-07 16:16:55 +00:00
Create a deployment for Luanti
eric opened issue DevOps/software-infrastructure#10 2025-03-07 16:03:53 +00:00
NVIDIA Tesla T4
eric commented on issue DevOps/software-infrastructure#3 2025-03-07 16:00:42 +00:00
NVIDIA RTX A6000 on PowerEdge T640

The A6000 has been uninstalled and replaced with a Tesla T4.

eric closed issue DevOps/software-infrastructure#3 2025-03-07 16:00:42 +00:00
NVIDIA RTX A6000 on PowerEdge T640
eric closed issue DevOps/software-infrastructure#8 2025-03-07 16:00:36 +00:00
PowerEdge R720
eric commented on issue DevOps/software-infrastructure#8 2025-03-07 16:00:35 +00:00
PowerEdge R720

The server is configured, but its use is not yet determined. Much of the above was not done.