
InferX AI Function Platform (Lambda Function for Inference)

    -- Serve tens of models in one box with ultra-fast (<2 sec) cold start (contact: support@inferx.net)


Pods

| Tenant | Namespace | Pod name | State | Node name | Req GPU count | Req GPU vRAM | Type | Standby (GPU) | Standby (pageable) | Standby (pinned) | Allocated GPU vRAM (MB) | Allocated GPU slots |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| public | bigcode | public/bigcode/starcoder2-3b/242/1175 | Standby | node2 | 1 | 13800 MB | Restore | Blob: 12806 MB | Blob: 1298 MB | Blob: 7680 MB | 0 | {} |
| public | bigcode | public/bigcode/starcoder2-7b/244/1176 | Standby | node2 | 2 | 13800 MB | Restore | Blob: 25140 MB | Blob: 1534 MB | Blob: 8192 MB | 0 | {} |