ci : add coreml job that converts base.en to coreml [no ci] #2981

Merged 3 commits into ggml-org:master on Apr 1, 2025

Conversation

danbev (Collaborator) commented Apr 1, 2025

This commit adds a new job to the CI pipeline that downloads the base.en model and converts it to CoreML format. The CoreML model is then packed into a zip file and uploaded as an artifact.

This will only be done for pushes to master, releases, or pre-releases.

Refs: #2783


I've run this on my fork and it produces the following release:
https://github.com/danbev/whisper.cpp/releases/tag/b2368

I've only included one model, but this will exercise the model conversion scripts and should give early feedback if anything breaks. More models could be added for publishing later.
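For reference, the conversion steps the job performs can be sketched roughly as follows. The script names are the helper scripts shipped in the whisper.cpp repo; the exact paths and zip layout used by the actual workflow may differ.

```shell
# Sketch of the CI job's steps (assumed paths; see the workflow for the real ones).
./models/download-ggml-model.sh base.en       # fetch the ggml base.en model
./models/generate-coreml-model.sh base.en     # convert to a CoreML .mlmodelc bundle
zip -r ggml-base.en-encoder.mlmodelc.zip models/ggml-base.en-encoder.mlmodelc
```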

ggerganov (Member) left a comment

For now one model is OK to test the conversion.

Not sure if there is a need to create an artifact - it would result in a lot of data (80 MB) that is almost always the same.

So I think it's better to remove the artifact from this CI. We can add an additional ggml-ci that generates a CoreML model and even runs a transcription with it. It can run on the ggml-100-mac-m4 node.

danbev (Collaborator, Author) commented Apr 1, 2025

> So I think it's better to remove the artifact from this CI.

Just wanted to make sure that I'm not misunderstanding anything. Should we not include this job at all or should we just skip the publishing of the artifact?

ggerganov (Member):

Just skip the publishing of the artifact. The job is good to have.

danbev (Collaborator, Author) commented Apr 1, 2025

Running the CI for this as I've modified build.yml. I think the build is taking a little longer than before because the tests have been enabled. I did not consider this when I enabled them. I'll take a closer look though.

Only the tests labeled with gh will run in CI which currently are only test-whisper-cli-tiny and test-whisper-cli-tiny.en:

set(TEST_TARGET test-whisper-cli-tiny)
add_test(NAME ${TEST_TARGET}
    COMMAND $<TARGET_FILE:whisper-cli>
    -m ${PROJECT_SOURCE_DIR}/models/for-tests-ggml-tiny.bin -l fr
    -f ${PROJECT_SOURCE_DIR}/samples/jfk.wav)
set_tests_properties(${TEST_TARGET} PROPERTIES LABELS "tiny;gh")

set(TEST_TARGET test-whisper-cli-tiny.en)
add_test(NAME ${TEST_TARGET}
    COMMAND $<TARGET_FILE:whisper-cli>
    -m ${PROJECT_SOURCE_DIR}/models/for-tests-ggml-tiny.en.bin
    -f ${PROJECT_SOURCE_DIR}/samples/jfk.wav)
set_tests_properties(${TEST_TARGET} PROPERTIES LABELS "tiny;en;gh")
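The label-based selection above can be exercised with ctest's `-L` flag, which matches test labels against a regex. A minimal invocation, assuming the build directory is `./build` (the actual CI may differ):

```shell
# Run only tests whose LABELS property matches "gh"
ctest --test-dir build -L gh --output-on-failure
```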

But this can still take a long time, for example ubuntu-22-gcc (Debug, linux/ppc64le):

Test project /workspace
    Start 1: test-whisper-cli-tiny
1/2 Test #1: test-whisper-cli-tiny ............   Passed  784.02 sec
    Start 2: test-whisper-cli-tiny.en
2/2 Test #2: test-whisper-cli-tiny.en .........   Passed  763.74 sec
100% tests passed, 0 tests failed out of 2
Label Time Summary:
en      = 763.74 sec*proc (1 test)
gh      = 1547.75 sec*proc (2 tests)
tiny    = 1547.75 sec*proc (2 tests)
Total Test time (real) = 1547.80 sec

ggerganov (Member) commented Apr 1, 2025

Btw, to fix the thread sanitizer warnings here: https://github.com/ggerganov/whisper.cpp/actions/runs/14195883603/job/39770792600?pr=2979, we have to build with GGML_OPENMP=OFF.

Here is llama.cpp config:

https://github.com/ggml-org/llama.cpp/blob/3fd072a54001a908c54e81fd2e82b682ecfdd475/.github/workflows/build.yml#L295-L306

The reason is that OpenMP threads are causing false positives when the thread sanitizer is enabled, so we simply disable them for sanitizer tests.
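A rough sketch of such a sanitizer configuration, modeled on the linked llama.cpp workflow. `GGML_OPENMP` is the ggml CMake option mentioned above; the sanitizer option name and build directory here are assumptions, not taken from the actual workflow:

```shell
# Configure a Debug build with OpenMP disabled and ThreadSanitizer enabled
cmake -B build \
      -DCMAKE_BUILD_TYPE=Debug \
      -DGGML_OPENMP=OFF \
      -DWHISPER_SANITIZE_THREAD=ON
cmake --build build
ctest --test-dir build -L gh --output-on-failure
```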

@danbev danbev merged commit 04b9508 into ggml-org:master Apr 1, 2025
50 checks passed