[CI][Test] Fix uvm_test.py when a machine has only 1 GPU #1549

shintaro-iwasaki · 2023-01-18T21:42:40Z

The following skipIf does not work correctly when a machine has only one GPU.

@unittest.skipIf(*gpu_unavailable or torch.cuda.device_count() < 2)
def test_uvm_to_device(self, sizes: List[int], uvm_op) -> None:
  [...]

This patch corrects the condition.

Details

First, *gpu_unavailable is defined as follows.

gpu_unavailable: Tuple[bool, str] = (
    not torch.cuda.is_available() or torch.cuda.device_count() == 0,
    "CUDA is not available or no GPUs detected",
)

So the skipIf is expanded as follows when there is only one CUDA device.

@unittest.skipIf(*gpu_unavailable or torch.cuda.device_count() < 2)
->
@unittest.skipIf((False, "CUDA is not ...") or True)

Because (False, "abc") or True is (False, "abc") in Python,

@unittest.skipIf((False, "CUDA is not ...") or True)
->
@unittest.skipIf(False, "CUDA is not ...")

It is False, so this unit test is not skipped. This UVM test seems failing occasionally because the machine does not have two GPUs, which annoys FBGEMM developers.

netlify · 2023-01-18T21:42:44Z

✅ Deploy Preview for pytorch-fbgemm-docs canceled.

Name	Link
🔨 Latest commit	`138fc7b`
🔍 Latest deploy log	https://app.netlify.com/sites/pytorch-fbgemm-docs/deploys/63c86937e0dca700094a05e3

shintaro-iwasaki · 2023-01-19T01:21:33Z

Now we see fbgemm_gpu/test/uvm_test.py::UvmTest::test_uvm_to_device SKIPPED on a single-GPU machine.

facebook-github-bot · 2023-01-19T01:21:46Z

@shintaro-iwasaki has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot · 2023-01-19T16:55:25Z

@shintaro-iwasaki merged this pull request in 0237a8a.

shintaro-iwasaki added the bug Something isn't working label Jan 18, 2023

facebook-github-bot added the cla signed label Jan 18, 2023

shintaro-iwasaki force-pushed the siwasaki/pr/fix_uvm_test_error branch from faeac0c to 66d1b3c Compare January 18, 2023 21:47

[CI][Test] Fix uvm_test.py when a machine has only 1 GPU

138fc7b

shintaro-iwasaki force-pushed the siwasaki/pr/fix_uvm_test_error branch from 66d1b3c to 138fc7b Compare January 18, 2023 21:48

facebook-github-bot closed this in 0237a8a Jan 19, 2023

facebook-github-bot added the Merged label Jan 19, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[CI][Test] Fix uvm_test.py when a machine has only 1 GPU #1549

[CI][Test] Fix uvm_test.py when a machine has only 1 GPU #1549

Uh oh!

shintaro-iwasaki commented Jan 18, 2023

Uh oh!

netlify bot commented Jan 18, 2023 •

edited

Loading

Uh oh!

shintaro-iwasaki commented Jan 19, 2023

Uh oh!

facebook-github-bot commented Jan 19, 2023

Uh oh!

facebook-github-bot commented Jan 19, 2023

Uh oh!

Uh oh!

[CI][Test] Fix uvm_test.py when a machine has only 1 GPU #1549

[CI][Test] Fix uvm_test.py when a machine has only 1 GPU #1549

Uh oh!

Conversation

shintaro-iwasaki commented Jan 18, 2023

Details

Uh oh!

netlify bot commented Jan 18, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Deploy Preview for pytorch-fbgemm-docs canceled.

Uh oh!

shintaro-iwasaki commented Jan 19, 2023

Uh oh!

facebook-github-bot commented Jan 19, 2023

Uh oh!

facebook-github-bot commented Jan 19, 2023

Uh oh!

Uh oh!

netlify bot commented Jan 18, 2023 •

edited

Loading