-
Notifications
You must be signed in to change notification settings - Fork 13.5k
[MLGO] Upstream the corpus extraction tooling #72319
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[MLGO] Upstream the corpus extraction tooling #72319
Conversation
✅ With the latest revision this PR passed the Python code formatter. |
After this lands, my plan is to work on getting CI up and running, both to run testing and also to publish the package. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm assuming the files were drop-in copied from google/ml-compiler-opt, so no need to comment there; also that we would subsequently delete them from there and depend on this package.
They were mostly copied. I modified My plan was to do exactly as you mention and delete them from the |
Would it be also possible to remove the dependency on Abseil? None of the existing scripts in LLVM use it and I don't think we should be introducing this dependency. It looks like Abseil is only used for flag parsing, logging and testing; those should be straightforward to replace with standard libraries like |
Yes. My plan was to remove the dependency on abseil as well. My plan was to get this landed with all the infrastructure setup and the code basically just directly copied and then remove the abseil dependency in a follow-up patch so that the different pieces get reviewed appropriately. |
Ah, if you can drop the abseil dependency, the dependency problem for tests goes away. May be worth doing the abseil dropping bit in this patch, too? |
I believe this is ready to go now. All the tests have been converted to lit-style tests and everything has been wired up into a I talked with Mircea about the absl dependency. It's not in the tests anymore and the plan is to refactor in a subsequent commit the rest of the utilities that use absl to the generic python versions. For now the tests have been gated to actually having |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for adding license info!
This patch upstreams some of the MLGO utilities, particularly the corpus extraction tooling, into LLVM proper. The motivation for this patch is available in the RFC.
https://discourse.llvm.org/t/rfc-upstreaming-elements-of-the-mlgo-tooling/74939