Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Order of projects in solution file affects build results #6198

Closed
chtenb opened this issue Feb 25, 2021 · 19 comments
Closed

Order of projects in solution file affects build results #6198

chtenb opened this issue Feb 25, 2021 · 19 comments
Assignees
Labels
has-repro There is a sample project, or steps to reproduce the issue.

Comments

@chtenb
Copy link

chtenb commented Feb 25, 2021

Visual Studio Version: 16.8.6

Summary:

The order in which projects are listed in the solution (.sln) file affects the results of the building process. In my case, for some orders the build results are correct, and for others they are not. It seems to me that this is a bug and the order of the projects should not matter at all, since the correct references are set and the derived build order seems valid in both cases.

Moreover, this problem only occurs when executing MSBuild Rubjerg.Graphviz.sln from the commandline (Developer Command Prompt for VS 2019). From within Visual Studio the build always succeeds. MSBuild version is the newest: 16.8.3.61104.

Context:

I encountered this problem in a very small opensource project, so it's easy for other people to reproduce. The solution (Rubjerg.Graphviz.sln) consists of a single .NET library (Rubjerg.Graphviz.csproj) with a dependency on a single C++ library (GraphvizWrapper.vcxproj). The remaining projects are test projects.

Steps to Reproduce:

  1. Check out the master branch of https://github.com/Rubjerg/Graphviz.NetWrapper

  2. Restore the solution nuget packages: nuget restore Rubjerg.Graphviz.sln.

  3. Build the solution on a command prompt: MSBuild Rubjerg.Graphviz.sln.

  4. Run the unittests on a command prompt: packages\nunit.consolerunner\3.12.0\tools\nunit3-console.exe Rubjerg.Graphviz.Test\bin\x64\Debug\net48\Rubjerg.Graphviz.Test.dll

  5. Notice that the build succeeds and the tests pass. This is also indicated by the github workflow run on the latest commit of the master branch.

  6. Clear all build artifacts (e.g. by running git clean -dfx in the root of the repository)

  7. Change the solution file according to the diff shown in this reproduction scenario

  8. Repeat step 2, 3 and 4

Expected Behavior:
Succeeding build and succeeding unit tests.

Actual Behavior:
Notice that after step 2 the C++ DLLs are not present in the output directories of the managed projects. If you execute step 3, you will notice the System.DllNotFoundExceptions. This is also indicated by the github workflow run on the reproduction scenario.

User Impact:
All clients of our project run into this problem and have to meddle with the project order in the solution file to fix it.

@jjmew jjmew transferred this issue from dotnet/project-system Feb 26, 2021
@benvillalobos benvillalobos self-assigned this Mar 3, 2021
@benvillalobos
Copy link
Member

Team Triage: Verify the repro works & gather some info, then add the untriaged label.

@benvillalobos benvillalobos added under-investigation initial-investigation Perform initial investigation, apply untriaged label when done. and removed under-investigation labels Mar 3, 2021
@benvillalobos
Copy link
Member

So I got it to repro and captured binlogs of each build: solutions.zip

Unfortunately I couldn't repro after that. The tests seemed to work for me with the updated solution. Are you still seeing this with an updated msbuild?

It actually looks like you may have fixed the issue with https://github.com/Rubjerg/Graphviz.NetWrapper/pull/35/files?

The general idea here is you want to use ProjectReference to ensure certain projects are built first. When msbuild encounters a .sln file, it creates a metaproj file containing various ProjectReferences. The order of these references happen to be the same order as what it discovered in the .sln file.

@chtenb
Copy link
Author

chtenb commented Apr 29, 2021

I just re-ran the workflow in the reproduction PR for you, and still the same error occurs: https://github.com/Rubjerg/Graphviz.NetWrapper/pull/33/checks?check_run_id=2465253939

You should be able to reproduce by checking out that PR which is also referred to in the OP.

https://github.com/Rubjerg/Graphviz.NetWrapper/pull/35/files doesn't seem relevant, because the reproduction PR actually branches off from that commit.

image

@benvillalobos
Copy link
Member

Using our latest preview Microsoft (R) Build Engine version 16.10.0-preview-21253-02+fa96a2a81 for .NET Framework I can't reproduce this behavior. There's a chance this could have been fixed in the meantime. Could you send the full output of msbuild --version so I can try on that specific version? Along with a specific commit hash (even if latest/master reproduces the behavior).

@benvillalobos benvillalobos added the needs-more-info Issues that need more info to continue investigation. label May 21, 2021
@chtenb
Copy link
Author

chtenb commented May 22, 2021

On github the build action reports:

> Run msbuild Rubjerg.Graphviz.sln /p:Configuration=Release

Microsoft (R) Build Engine version 16.9.0+5e4b48a27 for .NET Framework
Copyright (C) Microsoft Corporation. All rights reserved.

@benvillalobos benvillalobos added author-responded Author responded, needs response from dev team. and removed needs-more-info Issues that need more info to continue investigation. labels May 24, 2021
@v-codyguan v-codyguan added needs-more-info Issues that need more info to continue investigation. and removed author-responded Author responded, needs response from dev team. labels Sep 29, 2021
@v-codyguan
Copy link

v-codyguan commented Sep 29, 2021

@chtenb I tried to reproduce this issue on 16.8.6 with MSBuild 16.8.3+39993bd9d, and I couldn't reproduce this issue. Could you please help to check the repro steps in case I missed something? And to reproduce this issue, we tried to delete 'Rubjerg.Graphviz.Test.dll' manually from directories after step 3, then execute step 4, and we still couldn't get this issue reproduce, does it make sense?
Repro steps:

  1. Clone and check out the master branch of https://github.com/Rubjerg/Graphviz.NetWrapper
  2. Open the solution to restore the packages.
  3. Executing 'MSBuild Rubjerg.Graphviz.sln' on Developer Command Prompt for VS 2019.
  4. Executing 'packages\nunit.consolerunner\3.12.0\tools\nunit3-console.exe'.
  5. Executing 'git clean -dfx'.
  6. Change the solution file.
  7. Repeat step 2, 3 and 4.
    image

@chtenb
Copy link
Author

chtenb commented Oct 7, 2021

@v-codyguan It looks like you forgot to pass the test.dll to the testrunner. The full test command of step 4 should be:

packages\nunit.consolerunner\3.12.0\tools\nunit3-console.exe Rubjerg.Graphviz.Test\bin\x64\Debug\net48\Rubjerg.Graphviz.Test.dll

Otherwise your steps look fine. I just repro'd with version 16.11.1+3e40a09f8.

@ghost ghost added the author-responded Author responded, needs response from dev team. label Oct 7, 2021
@v-codyguan v-codyguan added has-repro There is a sample project, or steps to reproduce the issue. and removed needs-more-info Issues that need more info to continue investigation. author-responded Author responded, needs response from dev team. labels Oct 8, 2021
@v-codyguan
Copy link

Hi @chtenb thank you for sharing! We can reproduce this issue and contact our engineer in time. Thanks!

@v-codyguan
Copy link

Hi @benvillalobos We verified this issue on 16.8.6 with MSBuild 16.8.3+39993bd9d and 16.11.4 with 16.11.1+3e40a09f8, this issue reproduced after passing the test.dll to the testrunner in step 4. Could you help to take a look? If you need repro machine, please feel free to contact us!
Actual

@benvillalobos
Copy link
Member

benvillalobos commented Nov 4, 2021

This one's a doozy. I don't fully understand the issue here, but I'm fairly certain it has to do with incremental builds. Marking as needs-attention for bug triage meeting.

note to self: Relevant path where the DLL's need to exist: Graphviz.NetWrapper\Rubjerg.Graphviz.Test\bin\x64\Debug\net48

Side note: Passing /graph to your msbuild invocation should resolve the issue. It did for me

@benvillalobos benvillalobos added needs-attention Needs a member of the team to respond. and removed initial-investigation Perform initial investigation, apply untriaged label when done. labels Nov 4, 2021
@chtenb
Copy link
Author

chtenb commented Nov 5, 2021

Side note: Passing /graph to your msbuild invocation should resolve the issue. It did for me

Interesting, I didn't know about that parameter. Any reason why that isn't on by default? And can Visual Studio pass it as well?

@benvillalobos
Copy link
Member

Any reason why that isn't on by default?

I don't know the details, just that there are many reasons unfortunately

You should be able to set up Command Line Arguments in the properties of your project. You could specify /graph there.

@benvillalobos benvillalobos removed the needs-attention Needs a member of the team to respond. label Mar 9, 2022
@benvillalobos
Copy link
Member

Dropping some context on this before it all disappears: Judging by the following symptoms:

  • Changing order of build causes a failure
  • /graph fixes it
  • A build output isn't present at the right time

It's gotta be a build ordering issue, or that a certain project isn't copying the bits to the right place where another project may have been doing that (this is controlled by ReferenceOutputAssembly (see #7986 for notes on that).

So this could be a transitive reference issue, which I'm not too sure of a good solution for.

I think the simplest solution here is for your test project to reference your vcxproj and set ReferenceOutputAssembly=true for it so it can copy the output dll to the right place. This shouldn't cause overbuilds.

Airing out another workaround, you can create a target in your test project that looks for your cpp dll and copies it to where it needs to be in order for this test to run.

Sorry this one took a while to get back to 😅

@chtenb
Copy link
Author

chtenb commented Nov 21, 2022

Thanks for coming back to this! Unfortunately the two workarounds you proposed are not more convenient for me than using a project order in the solution that works. Having all dependent projects reference this C++ project or add a custom target is unfeasible for large solutions and generally undesirable for a project that is meant to be pretty much plug-and-play.

So this could be a transitive reference issue, which I'm not too sure of a good solution for.

Based on my anecdotal experience with build problems I'd definitely think so. Transitive copying behavior (or the lack thereof) has always caused headaches in my projects, especially when C++ projects were involved.

I hoped that I had provided a tangible problem in this area with this github issue + example project, for which the solution or workaround hopefully could be made an example of for how to solve this nagging problem once and for all :)

@KirillOsenkov
Copy link
Member

See if #9709 is related

@JaynieBai JaynieBai self-assigned this Oct 24, 2024
@JaynieBai
Copy link
Member

JaynieBai commented Oct 30, 2024

Tested this issue with MSBuild main branch merged #10836. Still has the error occasionally ----> "System.DllNotFoundException : Unable to load DLL 'gvc.dll': The specified module could not be found" when change the project order in solution Rubjerg.Graphviz.sln
Image
Following the step 2 to step 4 at the first time
Image

@JanKrivanek
Copy link
Member

Discussed offline - next steps:

  • Collect binlogs of a build of a success and failed case
  • Attach the binlogs to this bug
  • Try to see if there is a difference between the two binlogs in how they copy the missing dll (gvc.dll)
  • Put your findings (even if they are inconclusive) here, ping team for more help if needed

@JaynieBai
Copy link
Member

This is success build logmsbuildSuccess.binlog.txt

After changed the solution based on this reproduction scenario and generated the failed binlog.
msbuildFail.binlog.txt

The differences are None items in the project Rubjerg.Graphviz don't include the binaries (including gvc.dll) generated form native project.

Image

@SimaTian
Copy link
Member

SimaTian commented Jan 9, 2025

Hello, building on the work by @JaynieBai and thanks to pointers from @JanKrivanek I have prepared a fix pr for the original repository. This I did because I believe that this isn't a bug per-se, just a weird and somewhat poorly documented behavior of MSBuild.

For the completeness sake I'm copying a part of my analysis included in the PR:

Basically the issue is that if the item isn't within a target, it is resolved during Evaluation phase - e.g. as soon as the project file is loaded into the MSBuild, but ProjectReference that is setting up the project dependency ordering is a Target so this happens:

  • MSBuild first loads a project (including evaluation) as specified by the .sln file
    this is affected by the original project ordering change that resulted in the build failing
  • Then it evaluates the targets related to project references, see here.
    -specifically ResolveProjectReferences target
  • this builds the referenced projects - however the item is already processed by this point, resulting in an empty copy

By moving the item into a custom target and forcing order by setting AfterTargets, the files are moved correctly. Tests pass.

While I'm not completely sure I got everything correct, this is the conclusion I arrived after going through the MSBuildism blogpost, crosschecking with MSBuild documentation and experimenting on my machine.
It might be worth updating the documentation somewhere, if anyone has an idea where, it would be nice (since this behavior is a confluence of several different places)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
has-repro There is a sample project, or steps to reproduce the issue.
Projects
None yet
Development

No branches or pull requests

7 participants