Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

add xml_find_function_calls() helper to source expressions #2357

Merged
merged 25 commits into from
Dec 13, 2023

Conversation

AshesITR
Copy link
Collaborator

@AshesITR AshesITR commented Nov 27, 2023

fixes #2350
fixes #2338

  • Implement helper
  • Document
  • Add NEWS bullet
  • Use in our linters
  • Compare performance
  • Add section in custom linters vignette

Filing for early feedback and discussion

@codecov-commenter
Copy link

codecov-commenter commented Nov 27, 2023

Codecov Report

Attention: 6 lines in your changes are missing coverage. Please review.

Comparison is base (d1491c2) 98.64% compared to head (3766ac6) 98.53%.

❗ Current head 3766ac6 differs from pull request most recent head 451f409. Consider uploading reports for the commit 451f409 to get more accurate results

Files Patch % Lines
R/make_linter_from_xpath.R 57.14% 6 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main    #2357      +/-   ##
==========================================
- Coverage   98.64%   98.53%   -0.12%     
==========================================
  Files         125      125              
  Lines        5629     5609      -20     
==========================================
- Hits         5553     5527      -26     
- Misses         76       82       +6     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@MichaelChirico
Copy link
Collaborator

this should probably get a section in the custom linters vignette, I don't see that on your TODO right now

@AshesITR
Copy link
Collaborator Author

so far 118s on main vs 113s here for

devtools::load_all()
system.time(lintr::lint_package())

So 5 secs savings ATM with only some linters converted.

@AshesITR
Copy link
Collaborator Author

We have quite a few //SYMBOL_FUNCTION_CALL[text() = ...]/path/1 | //SYMBOL_FUNCTION_CALL[text() = ...]/path/2 XPaths.
ATM, I convert them using self:: and the union of the two function calls examined.

It might be worth splitting these into to separate paths with separate calls to xml_find_function_call(). Not sure about the performance gains of that.

@AshesITR
Copy link
Collaborator Author

@MichaelChirico PTAL for another round.
The technical part of this PR is done I think.

If you approve of the changes so far, I would flesh out the documentation.

@MichaelChirico
Copy link
Collaborator

@MichaelChirico PTAL for another round. The technical part of this PR is done I think.

If you approve of the changes so far, I would flesh out the documentation.

I was also hoping for the vignette change to serve as a high-level overview for guiding the review 😅

It would help to leave that in the main PR comment anyway for future reference on such a big change.

For now, I'll try reading the tests and see if that suffices.

@MichaelChirico
Copy link
Collaborator

so far 118s on main vs 113s here for

devtools::load_all()
system.time(lintr::lint_package())

So 5 secs savings ATM with only some linters converted.

Overall change looks pretty good, I think we're close enough to ready that we can re-run the benchmark + start on user-facing docs.

@AshesITR AshesITR force-pushed the feature/2350-xpath-cache branch from 600370b to d874444 Compare December 12, 2023 21:05
@AshesITR
Copy link
Collaborator Author

Another performance timing on lintr itself is 117s [main] vs 101s [here] => 14% Speedup 🥳

LMK if you want more comparisons before merging. Otherwise, I think it's good to go.

@AshesITR AshesITR marked this pull request as ready for review December 12, 2023 21:21
NEWS.md Outdated Show resolved Hide resolved
NEWS.md Outdated Show resolved Hide resolved
Co-authored-by: Michael Chirico <chiricom@google.com>
MichaelChirico
MichaelChirico previously approved these changes Dec 13, 2023
Copy link
Collaborator

@MichaelChirico MichaelChirico left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Might be worth benchmarking on base R as well, here's a snippet I tried:

system.time(withr::with_dir(
  "~/svn/r-devel/src/library",
  for (pkg in list.files()) if (file.exists(file.path(pkg, "DESCRIPTION")))
    lint_package(pkg, linters = linters)
))

But linters includes cyclocomp_linter() which probably isn't a good idea, wheels were spinning interminably. So maybe a benchmark on a "reasonable" set of linters instead of the full suite.

Approved now if you want to forego that.

@MichaelChirico
Copy link
Collaborator

OK, I'm seeing 27% improvement on r-devel using our suite (minus cyclocomp_linter()):

Setup

get_linters <- \() modify_defaults(
  eval(parse(text = read.dcf(".lintr", "linters"))),
  cyclocomp_linter = NULL
)
linters <- get_linters(); system.time(withr::with_dir(
  "~/svn/r-devel/src/library",
  for (pkg in list.files()) if (file.exists(file.path(pkg, "DESCRIPTION")))
    lint_package(pkg, linters = linters)
))

Timings

# main
    user   system  elapsed 
2239.148  327.508 2666.586 

# current branch
    user   system  elapsed 
1828.817   88.002 1949.443 

@MichaelChirico MichaelChirico merged commit 538440d into main Dec 13, 2023
21 checks passed
@MichaelChirico MichaelChirico deleted the feature/2350-xpath-cache branch December 13, 2023 15:00
MichaelChirico added a commit that referenced this pull request Dec 13, 2023
Forgot to remove this when I updated the NEWS in #2357
@AshesITR
Copy link
Collaborator Author

Thanks @MichaelChirico for taking this to the finish line 👍🏻

MichaelChirico added a commit that referenced this pull request Dec 13, 2023
Forgot to remove this when I updated the NEWS in #2357

Co-authored-by: AshesITR <alexander.rosenstock@web.de>
Co-authored-by: Indrajeet Patil <patilindrajeet.science@gmail.com>
@MichaelChirico MichaelChirico mentioned this pull request Mar 20, 2024
18 tasks
#' @param function_names Character vector, names of functions whose calls to examine..
#' @export
# nolint next: object_length.
make_linter_from_function_xpath <- function(function_names,
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

looks like this snuck in without test coverage

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Re-Use (and cache?) common XPaths Investigate performance of backport_linter()
3 participants