Reasoning without using the think function

Hi, i want to use Qwen3_0.6B model in 8255 device, i exported pte model and run it on device successfully. Now i want to disable the "think" function to verify something, how can i achieve it ?
I use the following command and get outputs.txt:
./qnn_llama_runner_ndk27 --decoder_model_version qwen3 --tokenizer_path tokenizer.json --model_path hybrid_llama_qnn.pte --prompt "who are you" --seq_len 512 --eval_mode 1 --temperature 0.8 && cat outputs.txt

<img width="2498" height="488" alt="Image" src="https://github.com/user-attachments/assets/173d3f93-9657-4678-ac96-2b22151c8a5c" />

cc @cccclai @winskuo-quic @shewu-quic @haowhsu-quic @DannyYuyang-quic @cbilgin

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Reasoning without using the think function #16392

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Reasoning without using the think function #16392

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions