Skip to content

Incompatible with latest exllamav2: too many values to unpack (expected 3) #1

Open
@davideuler

Description

@davideuler

When I run the API server, error shows on calling api:

exllamav2-openai-server/server.py", line 156, in _gen_single_token
    token, prob, eos = ExLlamaV2Sampler.sample(logits, gen_settings, self.sequence_ids[:1, :], random.random(), self.tokenizer, prefix_token)
ValueError: too many values to unpack (expected 3)

I fixed it by update the server.py, although I need to hack some of the exllamav2 source code to make the code works together.
vim server.py

        #token, prob, eos = ExLlamaV2Sampler.sample(logits, gen_settings, self.sequence_ids[:1, :], random.random(), self.tokenizer, prefix_token)
        result = ExLlamaV2Sampler.sample(logits, gen_settings, self.sequence_ids[:1, :], random.random(), self.tokenizer, None)
        token, _, _, prob, eos =  result

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions