fix: passing cache tests when WDK config is enabled #332

leon-xd · 2025-04-04T01:10:01Z

Overview

Currently, our automated test matrix is not testing bindings when the WDK configuration is enabled at the project level. Locally, @wmmc88 discovered that the following wdk-macros tests failed when enabling the KMDF configuration:

tests::get_wdf_function_info_map::valid_input_no_cache
tests::get_wdf_function_info_map::valid_input_cache_exists
tests::inputs::generate_derived_ast_fragments::valid_input
tests::inputs::generate_derived_ast_fragments::valid_input_with_no_arguments.

These all test the feature I developed in #295 .

Root cause

These tests failed with configuration turned on because of the wdk crate. We conditionally compile the abstractions contained in the wdf mod based on whether KMDF/UMDF is enabled in our config. Within these abstractions, we use the call_unsafe_wdf_function_binding macro several times to call the bindings to the WDK functions.

Generating and saving the cache is triggered by the call_unsafe_wdf_function_binding macro. I used the scratch crate in order to determine the location of our saved cache -- by creating a scratch directory called wdk_macros_ast_fragments, we can write, save, and read the cache in a location that is accessible throughout the build process.

However, turning on a WDK configuration conflicted with assumptions made when writing tests for the cache generation. When all root WDK configurations were disabled, there was no call to the call_unsafe_wdf_function_binding macro, since the methods that used this macro in the wdk crate weren't compiled. Therefore, tests were written expecting a completely clean scratch directory. This worked fine until this configuration was enabled, as the call_unsafe_wdf_function_binding macro was called, and the cache was generated before tests ran. This triggered an assertion which checked for a completely clean environment, causing the failure of each of the tests.

Solution

In order to fix the root cause of this issue, I clean out the environment before each test is ran. However, more issues arose thinking about this more.

This problem broke a few assumptions I made when initially writing the tests, leading me to rewrite a portion of both the test and cache generation code. My old implementation included a test.lock that tests relied on for synchronous access to the scratch directory, as acquiring the lock used in the actual code resulted in a double acquire, and hence a deadlock. However, because now I know other crates can mutate the scratch directory, having separate locks for tests and implementation is not a clear enough guarantee to prevent a data race (even though Cargo runs build & test separately, this didn't feel like a strong enough invariant to prevent).

This lead me to conditionally compile two different versions of the function get_wdf_function_info_map. The cfg(test) version assumes an lock has already been obtained, and the cfg(not(test) version uses either a shared or exclusive lock depending on whether it needs to read or write to the cache. Now, the tests obtain the same lock as the actual code, ensuring that no data race exists between the wdk crate and the wdk-macros tests.

Other issues

cargo clippy --all-features also had not been run with the configuration turned on, and many warnings prevented its completion when run locally. While this does not explicitly stop any of the pipeline checks we have today, when we eventually turn this on in our pipelines, these clippy warnings will have to be dealt with. Any #[allow] is a result of me silencing these warnings pre-emptively.

Update 4-10

Instead of conditionally compiling two different versions of get_wdf_function_info_map, I instead conditionally compile two different locations for the scratch crate. This allows us to keep most of the old logic.

I also raised #331 as a result of tinkering with running cargo build/clippy --all-features with the UMDF config enabled.

Copilot

Copilot reviewed 4 out of 4 changed files in this pull request and generated no comments.

Comments suppressed due to low confidence (2)

crates/wdk-macros/src/lib.rs:464

Consider using RAII (e.g., a guard type) for the exclusive file lock to ensure the lock is always released even if the closure panics.

FileExt::unlock(&flock).to_syn_result(span, "unable to unlock file lock")?;

crates/wdk-macros/src/lib.rs:482

Consider using RAII (e.g., a guard type) for the shared file lock to guarantee that the lock is released even if the closure panics.

FileExt::unlock(&flock).to_syn_result(span, "unable to unlock file lock")?;

wmmc88

In the new logic, you are conditionally changing the locking behavior of a single lock, depending on if the code executing was compiled for test or normally. I think this results in fairly messy code. It is not immediately obvious that what's going on here is:

If cache doesnt exist and not in test, the lock is exclusively held (preventing tests from running)
If the cache does exist and not in test, the lock is non-exclusively held (preventing tests from running but allowing normal proc-macro execution
If the code is in test, regardless of if the cache exists or not, the lock is exclusively held and normal proc-macro execution is blocked

I think the reason the code feels unclear is largely because of how much the logic diverges between test-path and normal path, and I think we want to minimize that as much as possible. One way to do this is to revert the logic to how it was originally, but just have an extra test-specific lock. The normal path can acquire this "test" lock non-exclusively and the normal path does nothing.

crates/wdk-macros/src/lib.rs

leon-xd added 4 commits April 3, 2025 11:57

fixed cache generation bugs

5a0b586

Passing clippy and fmt

9280edf

Recommented KMDF configuration

226c1b7

renamed with_file_lock_clean_env

8a5a300

leon-xd requested review from Copilot and wmmc88 and removed request for Copilot April 4, 2025 01:12

Copilot AI reviewed Apr 4, 2025

View reviewed changes

wmmc88 requested changes Apr 7, 2025

View reviewed changes

leon-xd added 5 commits April 7, 2025 15:40

Added second scratch directory

51fb3c7

added const to driver_entry_stub for 1.86 nightly clippy

7e81d38

Removed link to WdfFunctions in doc

8e94269

Replace comments in cargo.toml

b99850e

Removes outdated shared lock comment

4382d65

leon-xd requested a review from wmmc88 April 9, 2025 18:41

leon-xd added 2 commits April 10, 2025 11:12

removed second cache clean & went back to old logic

83567b9

pass fmt

30f093d

leon-xd requested a review from gurry April 10, 2025 22:40

gurry reviewed Apr 16, 2025

View reviewed changes

crates/wdk-macros/src/lib.rs Outdated Show resolved Hide resolved

crates/wdk-macros/src/lib.rs Outdated Show resolved Hide resolved

leon-xd added 2 commits April 16, 2025 11:58

Addressed Gurinder's comments

f940add

fixed formatting

b4df8e5

wmmc88 reviewed Apr 17, 2025

View reviewed changes

crates/wdk-macros/src/lib.rs Show resolved Hide resolved

reverted latent code

5063f5d

leon-xd requested review from wmmc88 and gurry April 17, 2025 03:27

wmmc88 approved these changes Apr 17, 2025

View reviewed changes

gurry approved these changes Apr 18, 2025

View reviewed changes

leon-xd added this pull request to the merge queue Apr 18, 2025

Merged via the queue into microsoft:main with commit 790c824 Apr 18, 2025
62 checks passed

leon-xd deleted the fix-cache-tests branch April 18, 2025 05:08

leon-xd mentioned this pull request Apr 18, 2025

chore: april 2025 release #343

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: passing cache tests when WDK config is enabled #332

fix: passing cache tests when WDK config is enabled #332

Uh oh!

leon-xd commented Apr 4, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

wmmc88 left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

fix: passing cache tests when WDK config is enabled #332

fix: passing cache tests when WDK config is enabled #332

Uh oh!

Conversation

leon-xd commented Apr 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Overview

Root cause

Solution

Other issues

Update 4-10

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Uh oh!

wmmc88 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

leon-xd commented Apr 4, 2025 •

edited

Loading