gaoqianshen commited on
Commit
e58ac95
·
verified ·
1 Parent(s): 11c9ecb

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +113 -0
README.md ADDED
@@ -0,0 +1,113 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ language:
4
+ - en
5
+ tags:
6
+ - multimodal
7
+ - vision-language
8
+ - reasoning
9
+ - math
10
+ - ocr
11
+ - gui-grounding
12
+ - computer-use
13
+ - chain-of-thought
14
+ - llama-cpp
15
+ - gguf-my-repo
16
+ base_model: microsoft/Phi-4-reasoning-vision-15B
17
+ pipeline_tag: image-text-to-text
18
+ model-index:
19
+ - name: Phi-4-Reasoning-Vision-15B
20
+ results:
21
+ - task:
22
+ type: visual-question-answering
23
+ dataset:
24
+ name: AI2D
25
+ type: ai2d
26
+ metrics:
27
+ - type: accuracy
28
+ value: 84.8
29
+ - task:
30
+ type: visual-question-answering
31
+ dataset:
32
+ name: ChartQA
33
+ type: chartqa
34
+ metrics:
35
+ - type: accuracy
36
+ value: 83.3
37
+ - task:
38
+ type: visual-question-answering
39
+ dataset:
40
+ name: MathVista (MINI)
41
+ type: mathvista
42
+ metrics:
43
+ - type: accuracy
44
+ value: 75.2
45
+ - task:
46
+ type: visual-question-answering
47
+ dataset:
48
+ name: MMMU
49
+ type: mmmu
50
+ metrics:
51
+ - type: accuracy
52
+ value: 54.3
53
+ - task:
54
+ type: visual-question-answering
55
+ dataset:
56
+ name: OCRBench
57
+ type: ocrbench
58
+ metrics:
59
+ - type: accuracy
60
+ value: 76.0
61
+ - task:
62
+ type: visual-question-answering
63
+ dataset:
64
+ name: ScreenSpot-V2
65
+ type: screenspot-v2
66
+ metrics:
67
+ - type: accuracy
68
+ value: 88.2
69
+ ---
70
+
71
+ # gaoqianshen/Phi-4-reasoning-vision-15B-Q8_0-GGUF
72
+ This model was converted to GGUF format from [`microsoft/Phi-4-reasoning-vision-15B`](https://huggingface.co/microsoft/Phi-4-reasoning-vision-15B) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
73
+ Refer to the [original model card](https://huggingface.co/microsoft/Phi-4-reasoning-vision-15B) for more details on the model.
74
+
75
+ ## Use with llama.cpp
76
+ Install llama.cpp through brew (works on Mac and Linux)
77
+
78
+ ```bash
79
+ brew install llama.cpp
80
+
81
+ ```
82
+ Invoke the llama.cpp server or the CLI.
83
+
84
+ ### CLI:
85
+ ```bash
86
+ llama-cli --hf-repo gaoqianshen/Phi-4-reasoning-vision-15B-Q8_0-GGUF --hf-file phi-4-reasoning-vision-15b-q8_0.gguf -p "The meaning to life and the universe is"
87
+ ```
88
+
89
+ ### Server:
90
+ ```bash
91
+ llama-server --hf-repo gaoqianshen/Phi-4-reasoning-vision-15B-Q8_0-GGUF --hf-file phi-4-reasoning-vision-15b-q8_0.gguf -c 2048
92
+ ```
93
+
94
+ Note: You can also use this checkpoint directly through the [usage steps](https://github.com/ggerganov/llama.cpp?tab=readme-ov-file#usage) listed in the Llama.cpp repo as well.
95
+
96
+ Step 1: Clone llama.cpp from GitHub.
97
+ ```
98
+ git clone https://github.com/ggerganov/llama.cpp
99
+ ```
100
+
101
+ Step 2: Move into the llama.cpp folder and build it with `LLAMA_CURL=1` flag along with other hardware-specific flags (for ex: LLAMA_CUDA=1 for Nvidia GPUs on Linux).
102
+ ```
103
+ cd llama.cpp && LLAMA_CURL=1 make
104
+ ```
105
+
106
+ Step 3: Run inference through the main binary.
107
+ ```
108
+ ./llama-cli --hf-repo gaoqianshen/Phi-4-reasoning-vision-15B-Q8_0-GGUF --hf-file phi-4-reasoning-vision-15b-q8_0.gguf -p "The meaning to life and the universe is"
109
+ ```
110
+ or
111
+ ```
112
+ ./llama-server --hf-repo gaoqianshen/Phi-4-reasoning-vision-15B-Q8_0-GGUF --hf-file phi-4-reasoning-vision-15b-q8_0.gguf -c 2048
113
+ ```