forked from Const-me/Whisper
-
Notifications
You must be signed in to change notification settings - Fork 0
/
jfk-large-vega8.txt
47 lines (47 loc) · 2.62 KB
/
jfk-large-vega8.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
CPU Tasks
LoadModel 1.57347 seconds
RunComplete 9.46787 seconds
Run 9.40671 seconds
Callbacks 292.1 microseconds, 4 calls, 73.025 microseconds average
Spectrogram 12.2725 milliseconds, 3 calls, 4.09083 milliseconds average
Sample 3.5692 milliseconds, 27 calls, 132.193 microseconds average
Encode 7.26322 seconds
Decode 2.14291 seconds
DecodeStep 2.13933 seconds, 27 calls, 79.2343 milliseconds average
GPU Tasks
LoadModel 991.351 milliseconds
Run 9.36883 seconds
Encode 7.35144 seconds
EncodeLayer 6.25822 seconds, 32 calls, 195.569 milliseconds average
Decode 2.01739 seconds
DecodeStep 2.01737 seconds, 27 calls, 74.7173 milliseconds average
DecodeLayer 1.88943 seconds, 864 calls, 2.18684 milliseconds average
Compute Shaders
mulMatTiledEx 5.37127 seconds, 320 calls, 16.7852 milliseconds average
mulMatTiled 1.17596 seconds, 385 calls, 3.05444 milliseconds average
mulMatByRowTiled 878.04 milliseconds, 8346 calls, 105.205 microseconds average
mulMatByRowTiledEx 460.074 milliseconds, 1664 calls, 276.486 microseconds average
softMaxFixed 288.221 milliseconds, 896 calls, 321.675 microseconds average
convolutionMain2Fixed 201.063 milliseconds
norm 141.073 milliseconds, 2684 calls, 52.5606 microseconds average
matReshapePanels 138.851 milliseconds, 193 calls, 719.436 microseconds average
addRepeatGelu 89.1783 milliseconds, 898 calls, 99.3077 microseconds average
copyConvert 83.2232 milliseconds, 1856 calls, 44.8401 microseconds average
scaleInPlace 77.8363 milliseconds, 896 calls, 86.8709 microseconds average
fmaRepeat1 77.8123 milliseconds, 2684 calls, 28.9912 microseconds average
addRepeatEx 76.9018 milliseconds, 2656 calls, 28.954 microseconds average
addRepeat 66.8479 milliseconds, 960 calls, 69.6332 microseconds average
copyTranspose 62.5101 milliseconds, 1792 calls, 34.8829 microseconds average
addRepeatScale 40.5807 milliseconds, 1728 calls, 23.4842 microseconds average
convolutionMain 39.8186 milliseconds
softMaxLong 32.0594 milliseconds, 27 calls, 1.18739 milliseconds average
softMax 15.9281 milliseconds, 864 calls, 18.4353 microseconds average
diagMaskInf 12.6164 milliseconds, 864 calls, 14.6023 microseconds average
convolutionPrep1 5.4486 milliseconds, 2 calls, 2.7243 milliseconds average
convolutionPrep2 4.0996 milliseconds, 2 calls, 2.0498 milliseconds average
add 883.4 microseconds
addRows 73.4 microseconds, 27 calls, 2.71852 microseconds average
Memory Usage
Model 892.591 KB RAM, 2.8815 GB VRAM
Context 1.98376 MB RAM, 1.13447 GB VRAM
Total 2.85543 MB RAM, 4.01597 GB VRAM