0s autopkgtest [22:00:10]: starting date and time: 2025-07-07 22:00:10+0000 0s autopkgtest [22:00:10]: git checkout: 508d4a25 a-v-ssh wait_for_ssh: demote "ssh connection failed" to a debug message 0s autopkgtest [22:00:10]: host juju-7f2275-prod-proposed-migration-environment-15; command line: /home/ubuntu/autopkgtest/runner/autopkgtest --output-dir /tmp/autopkgtest-work.w5rc29ko/out --timeout-copy=6000 --setup-commands /home/ubuntu/autopkgtest-cloud/worker-config-production/setup-canonical.sh --apt-pocket=proposed=src:glibc --apt-upgrade ggml --timeout-short=300 --timeout-copy=20000 --timeout-build=20000 --env=ADT_TEST_TRIGGERS=glibc/2.41-9ubuntu1 -- ssh -s /home/ubuntu/autopkgtest/ssh-setup/nova -- --flavor autopkgtest-cpu2-ram4-disk20-s390x --security-groups autopkgtest-juju-7f2275-prod-proposed-migration-environment-15@sto01-s390x-1.secgroup --name adt-questing-s390x-ggml-20250707-220010-juju-7f2275-prod-proposed-migration-environment-15-dcac3358-3866-45f1-9911-a717e2eb1c01 --image adt/ubuntu-questing-s390x-server --keyname testbed-juju-7f2275-prod-proposed-migration-environment-15 --net-id=net_prod-autopkgtest-workers-s390x -e TERM=linux --mirror=http://ftpmaster.internal/ubuntu/ 5s Creating nova instance adt-questing-s390x-ggml-20250707-220010-juju-7f2275-prod-proposed-migration-environment-15-dcac3358-3866-45f1-9911-a717e2eb1c01 from image adt/ubuntu-questing-s390x-server-20250707.img (UUID 141e4834-1e59-4811-9d22-ee73c0890197)... 153s autopkgtest [22:02:43]: testbed dpkg architecture: s390x 155s autopkgtest [22:02:45]: testbed apt version: 3.1.3 156s autopkgtest [22:02:46]: @@@@@@@@@@@@@@@@@@@@ test bed setup 157s autopkgtest [22:02:47]: testbed release detected to be: None 159s autopkgtest [22:02:49]: updating testbed package index (apt update) 160s Get:1 http://ftpmaster.internal/ubuntu questing-proposed InRelease [249 kB] 160s Hit:2 http://ftpmaster.internal/ubuntu questing InRelease 160s Hit:3 http://ftpmaster.internal/ubuntu questing-updates InRelease 160s Hit:4 http://ftpmaster.internal/ubuntu questing-security InRelease 160s Get:5 http://ftpmaster.internal/ubuntu questing-proposed/multiverse Sources [15.8 kB] 160s Get:6 http://ftpmaster.internal/ubuntu questing-proposed/universe Sources [182 kB] 160s Get:7 http://ftpmaster.internal/ubuntu questing-proposed/main Sources [28.2 kB] 161s Get:8 http://ftpmaster.internal/ubuntu questing-proposed/main s390x Packages [46.7 kB] 161s Get:9 http://ftpmaster.internal/ubuntu questing-proposed/universe s390x Packages [154 kB] 161s Get:10 http://ftpmaster.internal/ubuntu questing-proposed/multiverse s390x Packages [5272 B] 161s Fetched 681 kB in 2s (434 kB/s) 165s Reading package lists... 166s autopkgtest [22:02:56]: upgrading testbed (apt dist-upgrade and autopurge) 166s Reading package lists... 167s Building dependency tree... 167s Reading state information... 168s Calculating upgrade... 168s The following packages will be upgraded: 168s apparmor libapparmor1 libc-bin libc-dev-bin libc6 libc6-dev libnss-systemd 168s libpam-systemd libsystemd-shared libsystemd0 libudev1 libxml2-16 locales 168s systemd systemd-cryptsetup systemd-resolved systemd-sysv systemd-timesyncd 168s udev 169s 19 upgraded, 0 newly installed, 0 to remove and 0 not upgraded. 169s Need to get 19.4 MB of archives. 169s After this operation, 54.3 kB of additional disk space will be used. 169s Get:1 http://ftpmaster.internal/ubuntu questing-proposed/main s390x libc6-dev s390x 2.41-9ubuntu1 [1678 kB] 169s Get:2 http://ftpmaster.internal/ubuntu questing-proposed/main s390x libc-dev-bin s390x 2.41-9ubuntu1 [23.0 kB] 169s Get:3 http://ftpmaster.internal/ubuntu questing-proposed/main s390x libc6 s390x 2.41-9ubuntu1 [2896 kB] 169s Get:4 http://ftpmaster.internal/ubuntu questing-proposed/main s390x libc-bin s390x 2.41-9ubuntu1 [672 kB] 169s Get:5 http://ftpmaster.internal/ubuntu questing/main s390x libsystemd0 s390x 257.7-1ubuntu1 [538 kB] 169s Get:6 http://ftpmaster.internal/ubuntu questing/main s390x libnss-systemd s390x 257.7-1ubuntu1 [179 kB] 169s Get:7 http://ftpmaster.internal/ubuntu questing/main s390x systemd-sysv s390x 257.7-1ubuntu1 [11.9 kB] 169s Get:8 http://ftpmaster.internal/ubuntu questing/main s390x systemd-resolved s390x 257.7-1ubuntu1 [333 kB] 169s Get:9 http://ftpmaster.internal/ubuntu questing/main s390x libpam-systemd s390x 257.7-1ubuntu1 [263 kB] 169s Get:10 http://ftpmaster.internal/ubuntu questing/main s390x libsystemd-shared s390x 257.7-1ubuntu1 [2335 kB] 170s Get:11 http://ftpmaster.internal/ubuntu questing/main s390x systemd s390x 257.7-1ubuntu1 [3018 kB] 170s Get:12 http://ftpmaster.internal/ubuntu questing/main s390x systemd-timesyncd s390x 257.7-1ubuntu1 [41.5 kB] 170s Get:13 http://ftpmaster.internal/ubuntu questing/main s390x systemd-cryptsetup s390x 257.7-1ubuntu1 [128 kB] 170s Get:14 http://ftpmaster.internal/ubuntu questing/main s390x udev s390x 257.7-1ubuntu1 [1416 kB] 170s Get:15 http://ftpmaster.internal/ubuntu questing/main s390x libudev1 s390x 257.7-1ubuntu1 [207 kB] 170s Get:16 http://ftpmaster.internal/ubuntu questing/main s390x libapparmor1 s390x 4.1.1-0ubuntu3 [57.1 kB] 170s Get:17 http://ftpmaster.internal/ubuntu questing/main s390x libxml2-16 s390x 2.14.4+dfsg-0exp1 [628 kB] 170s Get:18 http://ftpmaster.internal/ubuntu questing-proposed/main s390x locales all 2.41-9ubuntu1 [4253 kB] 171s Get:19 http://ftpmaster.internal/ubuntu questing/main s390x apparmor s390x 4.1.1-0ubuntu3 [762 kB] 173s Preconfiguring packages ... 174s Fetched 19.4 MB in 2s (9313 kB/s) 176s (Reading database ... (Reading database ... 5% (Reading database ... 10% (Reading database ... 15% (Reading database ... 20% (Reading database ... 25% (Reading database ... 30% (Reading database ... 35% (Reading database ... 40% (Reading database ... 45% (Reading database ... 50% (Reading database ... 55% (Reading database ... 60% (Reading database ... 65% (Reading database ... 70% (Reading database ... 75% (Reading database ... 80% (Reading database ... 85% (Reading database ... 90% (Reading database ... 95% (Reading database ... 100% (Reading database ... 85970 files and directories currently installed.) 176s Preparing to unpack .../libc6-dev_2.41-9ubuntu1_s390x.deb ... 176s Unpacking libc6-dev:s390x (2.41-9ubuntu1) over (2.41-6ubuntu2) ... 176s Preparing to unpack .../libc-dev-bin_2.41-9ubuntu1_s390x.deb ... 176s Unpacking libc-dev-bin (2.41-9ubuntu1) over (2.41-6ubuntu2) ... 176s Preparing to unpack .../libc6_2.41-9ubuntu1_s390x.deb ... 177s Unpacking libc6:s390x (2.41-9ubuntu1) over (2.41-6ubuntu2) ... 178s Setting up libc6:s390x (2.41-9ubuntu1) ... 179s (Reading database ... (Reading database ... 5% (Reading database ... 10% (Reading database ... 15% (Reading database ... 20% (Reading database ... 25% (Reading database ... 30% (Reading database ... 35% (Reading database ... 40% (Reading database ... 45% (Reading database ... 50% (Reading database ... 55% (Reading database ... 60% (Reading database ... 65% (Reading database ... 70% (Reading database ... 75% (Reading database ... 80% (Reading database ... 85% (Reading database ... 90% (Reading database ... 95% (Reading database ... 100% (Reading database ... 85970 files and directories currently installed.) 179s Preparing to unpack .../libc-bin_2.41-9ubuntu1_s390x.deb ... 179s Unpacking libc-bin (2.41-9ubuntu1) over (2.41-6ubuntu2) ... 179s Setting up libc-bin (2.41-9ubuntu1) ... 180s (Reading database ... (Reading database ... 5% (Reading database ... 10% (Reading database ... 15% (Reading database ... 20% (Reading database ... 25% (Reading database ... 30% (Reading database ... 35% (Reading database ... 40% (Reading database ... 45% (Reading database ... 50% (Reading database ... 55% (Reading database ... 60% (Reading database ... 65% (Reading database ... 70% (Reading database ... 75% (Reading database ... 80% (Reading database ... 85% (Reading database ... 90% (Reading database ... 95% (Reading database ... 100% (Reading database ... 85970 files and directories currently installed.) 180s Preparing to unpack .../libsystemd0_257.7-1ubuntu1_s390x.deb ... 180s Unpacking libsystemd0:s390x (257.7-1ubuntu1) over (257.6-1ubuntu1) ... 180s Setting up libsystemd0:s390x (257.7-1ubuntu1) ... 180s (Reading database ... (Reading database ... 5% (Reading database ... 10% (Reading database ... 15% (Reading database ... 20% (Reading database ... 25% (Reading database ... 30% (Reading database ... 35% (Reading database ... 40% (Reading database ... 45% (Reading database ... 50% (Reading database ... 55% (Reading database ... 60% (Reading database ... 65% (Reading database ... 70% (Reading database ... 75% (Reading database ... 80% (Reading database ... 85% (Reading database ... 90% (Reading database ... 95% (Reading database ... 100% (Reading database ... 85970 files and directories currently installed.) 180s Preparing to unpack .../libnss-systemd_257.7-1ubuntu1_s390x.deb ... 180s Unpacking libnss-systemd:s390x (257.7-1ubuntu1) over (257.6-1ubuntu1) ... 181s Preparing to unpack .../systemd-sysv_257.7-1ubuntu1_s390x.deb ... 181s Unpacking systemd-sysv (257.7-1ubuntu1) over (257.6-1ubuntu1) ... 182s Preparing to unpack .../systemd-resolved_257.7-1ubuntu1_s390x.deb ... 182s Unpacking systemd-resolved (257.7-1ubuntu1) over (257.6-1ubuntu1) ... 182s Preparing to unpack .../libpam-systemd_257.7-1ubuntu1_s390x.deb ... 182s Unpacking libpam-systemd:s390x (257.7-1ubuntu1) over (257.6-1ubuntu1) ... 182s Preparing to unpack .../libsystemd-shared_257.7-1ubuntu1_s390x.deb ... 183s Unpacking libsystemd-shared:s390x (257.7-1ubuntu1) over (257.6-1ubuntu1) ... 184s Setting up libsystemd-shared:s390x (257.7-1ubuntu1) ... 184s (Reading database ... (Reading database ... 5% (Reading database ... 10% (Reading database ... 15% (Reading database ... 20% (Reading database ... 25% (Reading database ... 30% (Reading database ... 35% (Reading database ... 40% (Reading database ... 45% (Reading database ... 50% (Reading database ... 55% (Reading database ... 60% (Reading database ... 65% (Reading database ... 70% (Reading database ... 75% (Reading database ... 80% (Reading database ... 85% (Reading database ... 90% (Reading database ... 95% (Reading database ... 100% (Reading database ... 85970 files and directories currently installed.) 184s Preparing to unpack .../systemd_257.7-1ubuntu1_s390x.deb ... 184s Unpacking systemd (257.7-1ubuntu1) over (257.6-1ubuntu1) ... 186s Preparing to unpack .../systemd-timesyncd_257.7-1ubuntu1_s390x.deb ... 186s Unpacking systemd-timesyncd (257.7-1ubuntu1) over (257.6-1ubuntu1) ... 186s Preparing to unpack .../systemd-cryptsetup_257.7-1ubuntu1_s390x.deb ... 186s Unpacking systemd-cryptsetup (257.7-1ubuntu1) over (257.6-1ubuntu1) ... 186s Preparing to unpack .../udev_257.7-1ubuntu1_s390x.deb ... 186s Unpacking udev (257.7-1ubuntu1) over (257.6-1ubuntu1) ... 187s Preparing to unpack .../libudev1_257.7-1ubuntu1_s390x.deb ... 187s Unpacking libudev1:s390x (257.7-1ubuntu1) over (257.6-1ubuntu1) ... 188s Setting up libudev1:s390x (257.7-1ubuntu1) ... 188s (Reading database ... (Reading database ... 5% (Reading database ... 10% (Reading database ... 15% (Reading database ... 20% (Reading database ... 25% (Reading database ... 30% (Reading database ... 35% (Reading database ... 40% (Reading database ... 45% (Reading database ... 50% (Reading database ... 55% (Reading database ... 60% (Reading database ... 65% (Reading database ... 70% (Reading database ... 75% (Reading database ... 80% (Reading database ... 85% (Reading database ... 90% (Reading database ... 95% (Reading database ... 100% (Reading database ... 85971 files and directories currently installed.) 188s Preparing to unpack .../libapparmor1_4.1.1-0ubuntu3_s390x.deb ... 188s Unpacking libapparmor1:s390x (4.1.1-0ubuntu3) over (4.1.1-0ubuntu2) ... 188s Preparing to unpack .../libxml2-16_2.14.4+dfsg-0exp1_s390x.deb ... 188s Unpacking libxml2-16:s390x (2.14.4+dfsg-0exp1) over (2.14.3+dfsg-0exp3) ... 188s Preparing to unpack .../locales_2.41-9ubuntu1_all.deb ... 188s Unpacking locales (2.41-9ubuntu1) over (2.41-6ubuntu2) ... 190s Preparing to unpack .../apparmor_4.1.1-0ubuntu3_s390x.deb ... 193s Unpacking apparmor (4.1.1-0ubuntu3) over (4.1.1-0ubuntu2) ... 196s Setting up libapparmor1:s390x (4.1.1-0ubuntu3) ... 196s Setting up libxml2-16:s390x (2.14.4+dfsg-0exp1) ... 196s Setting up systemd (257.7-1ubuntu1) ... 197s /usr/lib/tmpfiles.d/legacy.conf:14: Duplicate line for path "/run/lock", ignoring. 197s /usr/lib/tmpfiles.d/legacy.conf:14: Duplicate line for path "/run/lock", ignoring. 200s Setting up locales (2.41-9ubuntu1) ... 202s Generating locales (this might take a while)... 209s en_US.UTF-8... done 209s Generation complete. 209s Setting up apparmor (4.1.1-0ubuntu3) ... 209s Installing new version of config file /etc/apparmor.d/lsusb ... 209s Installing new version of config file /etc/apparmor.d/mbsync ... 212s Reloading AppArmor profiles 222s Setting up systemd-cryptsetup (257.7-1ubuntu1) ... 222s Setting up systemd-timesyncd (257.7-1ubuntu1) ... 225s systemd-time-wait-sync.service is a disabled or a static unit not running, not starting it. 225s Setting up udev (257.7-1ubuntu1) ... 228s Creating group 'clock' with GID 987. 230s Setting up libc-dev-bin (2.41-9ubuntu1) ... 230s Setting up systemd-resolved (257.7-1ubuntu1) ... 233s Setting up systemd-sysv (257.7-1ubuntu1) ... 234s Setting up libnss-systemd:s390x (257.7-1ubuntu1) ... 234s Setting up libpam-systemd:s390x (257.7-1ubuntu1) ... 234s Setting up libc6-dev:s390x (2.41-9ubuntu1) ... 234s Processing triggers for libc-bin (2.41-9ubuntu1) ... 234s Processing triggers for man-db (2.13.1-1) ... 241s Processing triggers for dbus (1.16.2-2ubuntu1) ... 241s Processing triggers for shared-mime-info (2.4-5build2) ... 245s Processing triggers for procps (2:4.0.4-8ubuntu2) ... 245s Processing triggers for initramfs-tools (0.149ubuntu1) ... 245s update-initramfs: Generating /boot/initrd.img-6.15.0-3-generic 246s W: No lz4 in /usr/bin:/sbin:/bin, using gzip 270s Using config file '/etc/zipl.conf' 270s Building bootmap in '/boot' 270s Adding IPL section 'ubuntu' (default) 270s Preparing boot device for LD-IPL: vda (0000). 270s Done. 272s Reading package lists... 273s Building dependency tree... 273s Reading state information... 273s Solving dependencies... 274s 0 upgraded, 0 newly installed, 0 to remove and 0 not upgraded. 275s autopkgtest [22:04:45]: rebooting testbed after setup commands that affected boot 331s autopkgtest [22:05:41]: testbed running kernel: Linux 6.15.0-3-generic #3-Ubuntu SMP Wed Jun 4 07:31:50 UTC 2025 335s autopkgtest [22:05:45]: @@@@@@@@@@@@@@@@@@@@ apt-source ggml 341s Get:1 http://ftpmaster.internal/ubuntu questing/universe ggml 0.0~git20250618.9e4bee1-1 (dsc) [2567 B] 341s Get:2 http://ftpmaster.internal/ubuntu questing/universe ggml 0.0~git20250618.9e4bee1-1 (tar) [1404 kB] 341s Get:3 http://ftpmaster.internal/ubuntu questing/universe ggml 0.0~git20250618.9e4bee1-1 (diff) [10.2 kB] 342s gpgv: Signature made Fri Jun 20 09:29:08 2025 UTC 342s gpgv: using RSA key 419F7E9A47C3AB9519F1B0A339953A37DE4EB36B 342s gpgv: Can't check signature: No public key 342s dpkg-source: warning: cannot verify inline signature for ./ggml_0.0~git20250618.9e4bee1-1.dsc: no acceptable signature found 344s autopkgtest [22:05:54]: testing package ggml version 0.0~git20250618.9e4bee1-1 346s libggml-hip SKIP Test lists explicitly supported architectures, but the current architecture s390x isn't listed. 346s autopkgtest [22:05:56]: build not needed 350s libggml-hip SKIP Test lists explicitly supported architectures, but the current architecture s390x isn't listed. 350s autopkgtest [22:06:00]: test libggml-cpu: preparing testbed 350s Reading package lists... 351s Building dependency tree... 351s Reading state information... 351s Solving dependencies... 352s The following NEW packages will be installed: 352s libggml libggml-cpu libgomp1 352s 0 upgraded, 3 newly installed, 0 to remove and 0 not upgraded. 352s Need to get 809 kB of archives. 352s After this operation, 2641 kB of additional disk space will be used. 352s Get:1 http://ftpmaster.internal/ubuntu questing/main s390x libgomp1 s390x 15.1.0-8ubuntu1 [154 kB] 353s Get:2 http://ftpmaster.internal/ubuntu questing/universe s390x libggml-cpu s390x 0.0~git20250618.9e4bee1-1 [348 kB] 353s Get:3 http://ftpmaster.internal/ubuntu questing/universe s390x libggml s390x 0.0~git20250618.9e4bee1-1 [307 kB] 355s Fetched 809 kB in 1s (959 kB/s) 355s Selecting previously unselected package libgomp1:s390x. 359s (Reading database ... (Reading database ... 5% (Reading database ... 10% (Reading database ... 15% (Reading database ... 20% (Reading database ... 25% (Reading database ... 30% (Reading database ... 35% (Reading database ... 40% (Reading database ... 45% (Reading database ... 50% (Reading database ... 55% (Reading database ... 60% (Reading database ... 65% (Reading database ... 70% (Reading database ... 75% (Reading database ... 80% (Reading database ... 85% (Reading database ... 90% (Reading database ... 95% (Reading database ... 100% (Reading database ... 85973 files and directories currently installed.) 359s Preparing to unpack .../libgomp1_15.1.0-8ubuntu1_s390x.deb ... 359s Unpacking libgomp1:s390x (15.1.0-8ubuntu1) ... 359s Selecting previously unselected package libggml-cpu:s390x. 359s Preparing to unpack .../libggml-cpu_0.0~git20250618.9e4bee1-1_s390x.deb ... 359s Unpacking libggml-cpu:s390x (0.0~git20250618.9e4bee1-1) ... 359s Selecting previously unselected package libggml:s390x. 359s Preparing to unpack .../libggml_0.0~git20250618.9e4bee1-1_s390x.deb ... 359s Unpacking libggml:s390x (0.0~git20250618.9e4bee1-1) ... 359s Setting up libgomp1:s390x (15.1.0-8ubuntu1) ... 359s Setting up libggml-cpu:s390x (0.0~git20250618.9e4bee1-1) ... 359s Setting up libggml:s390x (0.0~git20250618.9e4bee1-1) ... 359s Processing triggers for libc-bin (2.41-9ubuntu1) ... 363s autopkgtest [22:06:13]: test libggml-cpu: /usr/libexec/ggml/runtests 363s autopkgtest [22:06:13]: test libggml-cpu: [----------------------- 364s dpkg-architecture: warning: cannot determine CC system type, falling back to default (native compilation) 364s 0.00 1.00 2.00 364s Testing 1 devices 364s 364s Backend 1/1: CPU 364s Skipping CPU backend 364s 1/1 backends passed 364s OK 364s load_model: ggml tensor size = 336 bytes 364s load_model: backend buffer size = 0.09 MB 364s 364s Performing test: 364s ggml_conv_1d_transpose (4): PASSED 364s ggml_conv_1d_transpose (5): PASSED 364s ggml_conv_1d_transpose (10): PASSED 364s ggml_conv_1d_transpose (14): PASSED 364s ggml_conv_1d_transpose (12): PASSED 364s ggml_conv_1d_transpose (18): PASSED 364s ggml_conv_1d_transpose (24): PASSED 364s ggml_conv_1d_transpose (50688): PASSED 364s main: compute buffer size: 0.19 MB 364s load_model: ggml tensor size = 336 bytes 364s load_model: backend buffer size = 0.00 MB 364s 364s Performing test: 364s ggml_im2col (240): PASSED 364s ggml_conv1d (80): PASSED 364s main: compute buffer size: 0.00 MB 364s load_model: ggml tensor size = 336 bytes 364s load_model: backend buffer size = 0.00 MB 364s 364s Performing test: 364s ggml_im2col (4320): PASSED 364s ggml_conv2d (480): PASSED 364s main: compute buffer size: 0.02 MB 364s ggml_conv_2d_dw(channels=3, kernel=1x1, stride=1, pad=0, dilation=1, layout=WHCN): PASSED 364s ggml_conv_2d_dw(channels=3, kernel=1x1, stride=1, pad=0, dilation=1, layout=CWHN): PASSED 364s ggml_conv_2d_dw(channels=42, kernel=3x3, stride=2, pad=1, dilation=1, layout=WHCN): PASSED 364s ggml_conv_2d_dw(channels=42, kernel=3x3, stride=2, pad=1, dilation=1, layout=CWHN): PASSED 364s ggml_conv_2d_dw(channels=8, kernel=5x5, stride=1, pad=2, dilation=2, layout=WHCN): PASSED 364s ggml_conv_2d_dw(channels=8, kernel=5x5, stride=1, pad=2, dilation=2, layout=CWHN): PASSED 364s Testing dup on i16 -> i16 copy 364s Testing dup on i32 -> i32 copy 364s Testing dup on f16 -> f16 copy 364s Testing dup on f16 -> f32 copy 364s Testing dup on f32 -> f16 copy 364s Testing dup on f32 -> f32 copy 364s main: compute buffer size: 0.00 MB 364s 364s Performing gemm_f16_out_f32 test: 364s 1224.0f,1023.0f,1158.0f,1259.0f,1359.0f,1194.0f,1535.0f,1247.0f,1185.0f,1029.0f,889.0f,1182.0f,955.0f,1179.0f,1147.0f,1048.0f, 364s 1216.0f,1087.0f,1239.0f,1361.0f,1392.0f,1260.0f,1247.0f,1563.0f,1167.0f,1052.0f,942.0f,1214.0f,1045.0f,1134.0f,1264.0f,1126.0f, 364s 1125.0f,966.0f,1079.0f,1333.0f,1287.0f,1101.0f,1185.0f,1167.0f,1368.0f,990.0f,967.0f,1121.0f,971.0f,1086.0f,1130.0f,980.0f, 364s 999.0f,902.0f,1020.0f,1056.0f,1076.0f,929.0f,1029.0f,1052.0f,990.0f,1108.0f,823.0f,989.0f,759.0f,1041.0f,1003.0f,870.0f, 364s gemm_mult (64): PASSED 364s load_model: ggml tensor size = 336 bytes 364s load_model: backend buffer size = 3584 bytes 364s Matrix A: [36, 4] 364s Matrix B: [36, 16] 364s 364s Performing ggml_mul_mat test: 364s 1224.0 1023.0 1158.0 1259.0 1359.0 1194.0 1535.0 1247.0 1185.0 1029.0 889.0 1182.0 955.0 1179.0 1147.0 1048.0 364s 1216.0 1087.0 1239.0 1361.0 1392.0 1260.0 1247.0 1563.0 1167.0 1052.0 942.0 1214.0 1045.0 1134.0 1264.0 1126.0 364s 1125.0 966.0 1079.0 1333.0 1287.0 1101.0 1185.0 1167.0 1368.0 990.0 967.0 1121.0 971.0 1086.0 1130.0 980.0 364s 999.0 902.0 1020.0 1056.0 1076.0 929.0 1029.0 1052.0 990.0 1108.0 823.0 989.0 759.0 1041.0 1003.0 870.0 364s ggml_mul_mat (64): PASSED 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s test-mul-mat0: iter:0/500 364s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 364s x0: [4, 1, 1, 1] 364s 0.435 -0.717 0.214 -0.967 364s 364s x1: [4, 3, 1, 1] 364s 0.823 -0.605 -0.330 0.536 364s -0.444 0.108 -0.045 0.258 364s -0.270 0.027 0.904 0.832 364s 364s y: [1, 3, 1, 1] 364s 0.202 364s -0.530 364s -0.748 364s testing: mul_mat, [4, 1, 2, 1] = [4, 4, 2, 1] * [4, 1, 2, 1] 364s x0: [4, 4, 2, 1] 364s 0.026 0.678 0.225 -0.408 364s 0.275 0.049 -0.013 0.946 364s -0.415 0.543 0.053 0.540 364s -0.200 0.783 -0.433 -0.295 364s 364s x1: [4, 1, 2, 1] 364s -0.514 -0.726 0.608 -0.687 364s 364s y: [4, 1, 2, 1] 364s -0.088 -0.834 -0.519 -0.527 364s testing: mul_mat, [2, 4, 2, 4] = [4, 2, 2, 4] * [4, 4, 2, 4] 364s x0: [4, 2, 2, 4] 364s 0.755 0.496 0.258 -0.929 364s 0.496 0.666 0.851 0.747 364s 364s x1: [4, 4, 2, 4] 364s 0.804 0.702 -0.467 0.080 364s -0.250 0.520 0.025 0.335 364s 0.063 -0.921 -0.125 0.864 364s 0.862 0.442 -0.431 0.477 364s 364s y: [2, 4, 2, 4] 364s 0.761 0.529 364s -0.236 0.495 364s -1.244 -0.044 364s 0.315 0.711 364s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 364s x0: [4, 1, 1, 1] 364s 0.065 -0.825 -0.479 0.755 364s 364s x1: [4, 2, 1, 1] 364s 0.254 0.461 -0.343 0.481 364s -0.596 0.842 0.370 0.306 364s 364s y: [1, 2, 1, 1] 364s 0.163 364s -0.679 364s testing: mul_mat, [1, 4, 2, 1] = [1, 1, 2, 1] * [1, 4, 2, 1] 364s x0: [1, 1, 2, 1] 364s -0.423 364s 364s x1: [1, 4, 2, 1] 364s 0.372 364s -0.813 364s -0.777 364s -0.277 364s 364s y: [1, 4, 2, 1] 364s -0.158 364s 0.344 364s 0.329 364s 0.117 364s testing: mul_mat, [2, 1, 2, 4] = [1, 2, 2, 4] * [1, 1, 2, 4] 364s x0: [1, 2, 2, 4] 364s 0.530 364s 0.398 364s 364s x1: [1, 1, 2, 4] 364s -0.620 364s 364s y: [2, 1, 2, 4] 364s -0.329 -0.247 364s test-mul-mat0: iter:1/500 364s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 364s x0: [1, 1, 1, 1] 364s -0.418 364s 364s x1: [1, 4, 1, 1] 364s 0.729 364s -0.015 364s -0.073 364s 0.698 364s 364s y: [1, 4, 1, 1] 364s -0.305 364s 0.006 364s 0.030 364s -0.292 364s testing: mul_mat, [1, 1, 2, 1] = [1, 1, 2, 1] * [1, 1, 2, 1] 364s x0: [1, 1, 2, 1] 364s -0.722 364s 364s x1: [1, 1, 2, 1] 364s -0.639 364s 364s y: [1, 1, 2, 1] 364s 0.461 364s testing: mul_mat, [2, 1, 2, 1] = [1, 2, 2, 1] * [1, 1, 2, 1] 364s x0: [1, 2, 2, 1] 364s -0.644 364s -0.556 364s 364s x1: [1, 1, 2, 1] 364s -0.015 364s 364s y: [2, 1, 2, 1] 364s 0.010 0.008 364s testing: mul_mat, [2, 2, 1, 1] = [1, 2, 1, 1] * [1, 2, 1, 1] 364s x0: [1, 2, 1, 1] 364s 0.864 364s 0.817 364s 364s x1: [1, 2, 1, 1] 364s -0.724 364s -0.279 364s 364s y: [2, 2, 1, 1] 364s -0.625 -0.591 364s -0.241 -0.228 364s testing: mul_mat, [1, 1, 2, 1] = [2, 1, 2, 1] * [2, 1, 2, 1] 364s x0: [2, 1, 2, 1] 364s -0.211 0.318 364s 364s x1: [2, 1, 2, 1] 364s 0.244 0.674 364s 364s y: [1, 1, 2, 1] 364s 0.162 364s testing: mul_mat, [1, 2, 2, 1] = [1, 1, 2, 1] * [1, 2, 2, 1] 364s x0: [1, 1, 2, 1] 364s -0.423 364s 364s x1: [1, 2, 2, 1] 364s -0.698 364s -0.855 364s 364s y: [1, 2, 2, 1] 364s 0.295 364s 0.362 364s test-mul-mat0: iter:2/500 364s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 364s x0: [1, 2, 1, 1] 364s 0.524 364s -0.025 364s 364s x1: [1, 1, 1, 1] 364s -0.469 364s 364s y: [2, 1, 1, 1] 364s -0.246 0.012 364s testing: mul_mat, [3, 2, 1, 1] = [1, 3, 1, 1] * [1, 2, 1, 1] 364s x0: [1, 3, 1, 1] 364s 0.035 364s -0.584 364s 0.115 364s 364s x1: [1, 2, 1, 1] 364s -0.685 364s 0.766 364s 364s y: [3, 2, 1, 1] 364s -0.024 0.401 -0.079 364s 0.027 -0.448 0.088 364s testing: mul_mat, [2, 3, 1, 1] = [1, 2, 1, 1] * [1, 3, 1, 1] 364s x0: [1, 2, 1, 1] 364s -0.348 364s 0.459 364s 364s x1: [1, 3, 1, 1] 364s -0.148 364s 0.660 364s -0.211 364s 364s y: [2, 3, 1, 1] 364s 0.051 -0.068 364s -0.230 0.303 364s 0.073 -0.097 364s testing: mul_mat, [4, 2, 1, 1] = [1, 4, 1, 1] * [1, 2, 1, 1] 364s x0: [1, 4, 1, 1] 364s 0.795 364s -0.728 364s -0.178 364s -0.989 364s 364s x1: [1, 2, 1, 1] 364s 0.277 364s 0.970 364s 364s y: [4, 2, 1, 1] 364s 0.220 -0.202 -0.049 -0.274 364s 0.771 -0.706 -0.173 -0.959 364s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 364s x0: [4, 1, 1, 1] 364s 0.442 -0.902 -0.102 0.973 364s 364s x1: [4, 1, 1, 1] 364s 0.567 0.549 -0.413 -0.771 364s 364s y: [1, 1, 1, 1] 364s -0.952 364s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 364s x0: [1, 2, 1, 1] 364s -0.801 364s -0.234 364s 364s x1: [1, 4, 1, 1] 364s 0.416 364s -0.578 364s -0.052 364s 0.730 364s 364s y: [2, 4, 1, 1] 364s -0.333 -0.097 364s 0.463 0.135 364s 0.042 0.012 364s -0.585 -0.171 364s test-mul-mat0: iter:3/500 364s testing: mul_mat, [1, 4, 1, 1] = [3, 1, 1, 1] * [3, 4, 1, 1] 364s x0: [3, 1, 1, 1] 364s 0.777 -0.047 -0.660 364s 364s x1: [3, 4, 1, 1] 364s -0.897 -0.893 -0.085 364s 0.562 0.384 -0.115 364s -0.762 0.179 0.157 364s 0.060 0.190 -0.276 364s 364s y: [1, 4, 1, 1] 364s -0.600 364s 0.495 364s -0.704 364s 0.220 364s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 364s x0: [3, 1, 1, 1] 364s -0.533 0.660 -0.860 364s 364s x1: [3, 1, 1, 1] 364s 0.219 0.051 0.238 364s 364s y: [1, 1, 1, 1] 364s -0.287 364s testing: mul_ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s mat, [4, 1, 1, 3] = [3, 4, 1, 3] * [3, 1, 1, 3] 364s x0: [3, 4, 1, 3] 364s 0.630 0.318 -0.927 364s -0.485 0.557 0.252 364s 0.672 -0.384 -0.558 364s -0.604 0.225 -0.781 364s 364s x1: [3, 1, 1, 3] 364s -0.802 0.847 -0.661 364s 364s y: [4, 1, 1, 3] 364s 0.377 0.694 -0.496 1.191 364s testing: mul_mat, [1, 4, 1, 1] = [3, 1, 1, 1] * [3, 4, 1, 1] 364s x0: [3, 1, 1, 1] 364s -0.694 -0.934 -0.873 364s 364s x1: [3, 4, 1, 1] 364s -0.376 0.292 -0.988 364s 0.066 0.688 0.237 364s 0.285 0.037 -0.199 364s -0.276 0.438 0.604 364s 364s y: [1, 4, 1, 1] 364s 0.851 364s -0.895 364s -0.059 364s -0.745 364s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 364s x0: [1, 3, 1, 1] 364s 0.136 364s -0.998 364s -0.989 364s 364s x1: [1, 3, 1, 1] 364s 0.371 364s -0.625 364s 0.238 364s 364s y: [3, 3, 1, 1] 364s 0.050 -0.371 -0.367 364s -0.085 0.623 0.618 364s 0.032 -0.237 -0.235 364s testing: mul_mat, [1, 1, 1, 3] = [3, 1, 1, 3] * [3, 1, 1, 3] 364s x0: [3, 1, 1, 3] 364s 0.770 -0.627 -0.686 364s 364s x1: [3, 1, 1, 3] 364s -0.390 -0.477 0.311 364s 364s y: [1, 1, 1, 3] 364s -0.214 364s test-mul-mat0: iter:4/500 364s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 364s x0: [2, 3, 1, 1] 364s -0.011 0.105 364s 0.878 0.599 364s 0.628 0.189 364s 364s x1: [2, 2, 1, 1] 364s 0.865 -0.534 364s 0.853 0.103 364s 364s y: [3, 2, 1, 1] 364s -0.066 0.440 0.443 364s 0.001 0.811 0.555 364s testing: mul_mat, [3, 3, 2, 1] = [2, 3, 2, 1] * [2, 3, 2, 1] 364s x0: [2, 3, 2, 1] 364s -0.666 0.591 364s 0.730 0.746 364s 0.329 -0.175 364s 364s x1: [2, 3, 2, 1] 364s 0.314 0.991 364s 0.872 -0.351 364s 0.749 0.178 364s 364s y: [3, 3, 2, 1] 364s 0.376 0.969 -0.070 364s -0.788 0.375 0.348 364s -0.393 0.680 0.215 364s testing: mul_mat, [4, 3, 2, 3] = [2, 4, 2, 3] * [2, 3, 2, 3] 364s x0: [2, 4, 2, 3] 364s 0.905 0.898 364s 0.798 0.534 364s -0.333 0.073 364s -0.562 -0.045 364s 364s x1: [2, 3, 2, 3] 364s -0.934 0.402 364s 0.036 0.665 364s 0.030 -0.775 364s 364s y: [4, 3, 2, 3] 364s -0.485 -0.531 0.340 0.507 364s 0.630 0.384 0.037 -0.050 364s -0.668 -0.390 -0.067 0.018 364s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 364s x0: [2, 1, 1, 1] 364s 0.931 -0.689 364s 364s x1: [2, 4, 1, 1] 364s 0.203 0.541 364s 0.774 0.867 364s -0.654 -0.104 364s -0.025 0.590 364s 364s y: [1, 4, 1, 1] 364s -0.184 364s 0.124 364s -0.537 364s -0.430 364s testing: mul_mat, [3, 2, 2, 1] = [1, 3, 2, 1] * [1, 2, 2, 1] 364s x0: [1, 3, 2, 1] 364s -0.048 364s -0.454 364s 0.893 364s 364s x1: [1, 2, 2, 1] 364s -0.414 364s 0.764 364s 364s y: [3, 2, 2, 1] 364s 0.020 0.188 -0.370 364s -0.037 -0.347 0.683 364s testing: mul_mat, [2, 1, 2, 3] = [3, 2, 2, 3] * [3, 1, 2, 3] 364s x0: [3, 2, 2, 3] 364s -0.859 -0.007 -0.414 364s -0.213 -0.127 -0.512 364s 364s x1: [3, 1, 2, 3] 364s 0.437 0.849 -0.632 364s 364s y: [2, 1, 2, 3] 364s -0.120 0.123 364s test-mul-mat0: iter:5/500 364s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 364s x0: [4, 1, 1, 1] 364s 0.174 -0.662 0.169 -0.047 364s 364s x1: [4, 2, 1, 1] 364s -0.728 0.178 -0.884 0.779 364s 0.891 -0.888 0.850 -0.062 364s 364s y: [1, 2, 1, 1] 364s -0.431 364s 0.890 364s testing: mul_mat, [4, 1, 4, 1] = [4, 4, 4, 1] * [4, 1, 4, 1] 364s x0: [4, 4, 4, 1] 364s 0.290 -0.113 -0.462 0.406 364s -0.334 -0.571 0.518 -0.484 364s 0.367 -0.968 0.690 0.705 364s 0.202 -0.357 0.336 0.054 364s 364s x1: [4, 1, 4, 1] 364s 0.631 0.852 0.053 0.165 364s 364s y: [4, 1, 4, 1] 364s 0.129 -0.750 -0.440 -0.150 364s testing: mul_mat, [4, 4, 4, 2] = [4, 4, 4, 2] * [4, 4, 4, 2] 364s x0: [4, 4, 4, 2] 364s -0.642 -0.280 -0.114 -0.243 364s 0.295 -0.799 -0.349 0.739 364s 0.215 -0.792 0.612 0.499 364s -0.202 -0.266 -0.212 -0.456 364s 364s x1: [4, 4, 4, 2] 364s -0.521 0.619 -0.810 0.493 364s -0.446 -0.653 0.875 0.522 364s -0.807 0.962 0.691 -0.317 364s 0.385 -0.087 -0.131 0.308 364s 364s y: [4, 4, 4, 2] 364s 0.134 -0.002 -0.851 -0.113 364s 0.243 0.471 1.217 -0.159 364s 0.246 -1.481 -0.671 -0.095 364s -0.283 0.456 0.225 -0.167 364s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 364s x0: [4, 2, 1, 1] 364s 0.843 -0.821 0.937 -0.991 364s 0.857 0.642 0.018 -0.624 364s 364s x1: [4, 4, 1, 1] 364s 0.204 0.671 0.127 -0.596 364s 0.606 0.345 -0.857 0.925 364s -0.050 -0.231 -0.284 0.862 364s 0.834 -0.794 0.802 0.751 364s 364s y: [2, 4, 1, 1] 364s 0.329 0.979 364s -1.492 0.150 364s -0.971 -0.733 364s 1.361 -0.249 364s testing: mul_mat, [1, 4, 4, 1] = [2, 1, 4, 1] * [2, 4, 4, 1] 364s x0: [2, 1, 4, 1] 364s -0.704 0.851 364s 364s x1: [2, 4, 4, 1] 364s -0.426 0.255 364s -0.476 0.497 364s -0.927 0.444 364s -0.299 0.744 364s 364s y: [1, 4, 4, 1] 364s 0.516 364s 0.758 364s 1.030 364s 0.843 364s testing: mul_mat, [3, 2, 4, 2] = [1, 3, 4, 2] * [1, 2, 4, 2] 364s x0: [1, 3, 4, 2] 364s 0.231 364s 0.035 364s -0.242 364s 364s x1: [1, 2, 4, 2] 364s -0.554 364s -0.774 364s 364s y: [3, 2, 4, 2] 364s -0.128 -0.019 0.134 364s -0.179 -0.027 0.188 364s test-mul-mat0ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s : iter:6/500 364s testing: mul_mat, [1, 2, 1, 1] = [2, 1, 1, 1] * [2, 2, 1, 1] 364s x0: [2, 1, 1, 1] 364s 0.047 -0.506 364s 364s x1: [2, 2, 1, 1] 364s -0.741 -0.974 364s -0.710 0.490 364s 364s y: [1, 2, 1, 1] 364s 0.458 364s -0.282 364s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 364s x0: [2, 2, 1, 1] 364s 0.688 -0.529 364s -0.166 0.479 364s 364s x1: [2, 1, 1, 1] 364s -0.551 0.083 364s 364s y: [2, 1, 1, 1] 364s -0.423 0.131 364s testing: mul_mat, [4, 2, 1, 2] = [2, 4, 1, 2] * [2, 2, 1, 2] 364s x0: [2, 4, 1, 2] 364s -0.103 -0.083 364s -0.395 0.173 364s 0.752 0.030 364s 0.944 0.308 364s 364s x1: [2, 2, 1, 2] 364s -0.046 -0.815 364s -0.073 0.882 364s 364s y: [4, 2, 1, 2] 364s 0.073 -0.123 -0.059 -0.294 364s -0.066 0.182 -0.029 0.202 364s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 364s x0: [2, 2, 1, 1] 364s 0.035 -0.145 364s 0.848 0.568 364s 364s x1: [2, 4, 1, 1] 364s 0.728 0.067 364s 0.169 -0.801 364s 0.902 -0.352 364s 0.153 -0.913 364s 364s y: [2, 4, 1, 1] 364s 0.016 0.656 364s 0.122 -0.312 364s 0.083 0.565 364s 0.138 -0.389 364s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 364s x0: [2, 4, 1, 1] 364s 0.638 -0.746 364s -0.054 -0.666 364s 0.045 0.552 364s 0.582 0.851 364s 364s x1: [2, 2, 1, 1] 364s -0.722 0.411 364s -0.535 0.194 364s 364s y: [4, 2, 1, 1] 364s -0.767 -0.234 0.194 -0.071 364s -0.486 -0.100 0.083 -0.146 364s testing: mul_mat, [4, 2, 1, 2] = [4, 4, 1, 2] * [4, 2, 1, 2] 364s x0: [4, 4, 1, 2] 364s -0.820 -0.237 -0.220 0.417 364s 0.569 0.735 0.865 0.814 364s 0.719 -0.716 0.963 -0.772 364s -0.355 0.709 0.115 -1.000 364s 364s x1: [4, 2, 1, 2] 364s 0.114 -0.417 -0.504 -0.613 364s -0.937 -0.776 0.455 0.232 364s 364s y: [4, 2, 1, 2] 364s -0.139 -1.177 0.368 0.219 364s 0.948 -0.521 0.141 -0.398 364s test-mul-mat0: iter:7/500 364s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 364s x0: [2, 2, 1, 1] 364s 0.538 0.979 364s -0.487 0.538 364s 364s x1: [2, 3, 1, 1] 364s -0.610 -0.745 364s -0.798 0.170 364s -0.879 -0.835 364s 364s y: [2, 3, 1, 1] 364s -1.057 -0.104 364s -0.263 0.480 364s -1.290 -0.021 364s testing: mul_mat, [4, 2, 2, 1] = [2, 4, 2, 1] * [2, 2, 2, 1] 364s x0: [2, 4, 2, 1] 364s -0.685 0.494 364s -0.301 0.461 364s 0.655 0.635 364s -0.212 0.385 364s 364s x1: [2, 2, 2, 1] 364s -0.711 0.129 364s 0.602 -0.177 364s 364s y: [4, 2, 2, 1] 364s 0.551 0.273 -0.384 0.200 364s -0.499 -0.263 0.282 -0.196 364s testing: mul_mat, [2, 4, 2, 4] = [2, 2, 2, 4] * [2, 4, 2, 4] 364s x0: [2, 2, 2, 4] 364s -0.939 0.612 364s -0.225 0.137 364s 364s x1: [2, 4, 2, 4] 364s -0.755 -0.865 364s -0.326 -0.776 364s -0.352 -0.787 364s -0.487 0.777 364s 364s y: [2, 4, 2, 4] 364s 0.179 0.052 364s -0.169 -0.033 364s -0.152 -0.029 364s 0.933 0.216 364s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 364s x0: [2, 2, 1, 1] 364s 0.270 0.090 364s 0.194 -0.650 364s 364s x1: [2, 2, 1, 1] 364s 0.498 -0.052 364s 0.721 0.609 364s 364s y: [2, 2, 1, 1] 364s 0.130 0.130 364s 0.250 -0.256 364s testing: mul_mat, [4, 2, 2, 1] = [2, 4, 2, 1] * [2, 2, 2, 1] 364s x0: [2, 4, 2, 1] 364s 0.226 0.064 364s -0.114 0.987 364s 0.138 -0.882 364s 0.092 -0.940 364s 364s x1: [2, 2, 2, 1] 364s 0.853 0.949 364s -0.609 -0.320 364s 364s y: [4, 2, 2, 1] 364s 0.254 0.839 -0.719 -0.813 364s -0.158 -0.246 0.198 0.244 364s testing: mul_mat, [2, 2, 2, 4] = [4, 2, 2, 4] * [4, 2, 2, 4] 364s x0: [4, 2, 2, 4] 364s 0.218 0.277 -0.239 0.011 364s 0.639 -0.276 0.482 0.000 364s 364s x1: [4, 2, 2, 4] 364s 0.054 0.003 -0.782 -0.752 364s -0.907 -0.432 -0.899 -0.958 364s 364s y: [2, 2, 2, 4] 364s 0.192 -0.343 364s -0.113 -0.894 364s test-mul-mat0: iter:8/500 364s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 364s x0: [4, 4, 1, 1] 364s -0.236 0.357 0.853 0.689 364s 0.244 -0.226 0.368 -0.607 364s -0.700 0.613 0.360 -0.539 364s 0.642 0.786 -0.464 -0.819 364s 364s x1: [4, 3, 1, 1] 364s -0.942 0.863 0.800 -0.918 364s 0.023 0.488 -0.465 -0.347 364s 0.065 -0.130 0.936 0.159 364s 364s y: [4, 3, 1, 1] 364s 0.580 0.427 1.971 0.454 364s -0.467 -0.065 0.302 0.898 364s 0.846 0.293 0.126 -0.624 364s testing: mul_mat, [1, 4, 3, 1] = [4, 1, 3, 1] * [4, 4, 3, 1] 364s x0: [4, 1, 3, 1] 364s -0.407 0.816 -0.495 -0.683 364s 364s x1: [4, 4, 3, 1] 364s -0.570 -0.994 -0.761 -0.708 364s 0.807 -0.680 0.315 0.295 364s -0.145 0.968 -0.640 0.726 364s 0.904 0.519 0.908 -0.332 364s 364s y: [1, 4, 3, 1] 364s 0.281 364s -1.241 364s 0.670 364s -0.167 364s testing: mul_mat, [2, 1, 3, 1] = [4, 2, 3, 1] * [4, 1, 3, 1] 364s x0: [4, 2, 3, 1] 364s 0.261 0.015 0.994 0.497 364s -0.289 0.964 -0.910 -0.473 364s 364s x1: [4, 1, 3, 1] 364s 0.793 0.951 -0.692 0.192 364s 364s y: [2, 1, 3, 1] 364s -0.371 1.226 364s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 364s x0: [4, 4, 1, 1] 364s 0.855 0.772 0.375 -0.457 364s 0.682 -0.482 -0.903 -0.065 364s 0.991 -0.712 -0.760 0.356 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s 0.819 -0.821 -0.565 0.457 364s 364s x1: [4, 2, 1, 1] 364s -0.409 0.232 -0.930 -0.924 364s -0.076 -0.735 0.861 -0.815 364s 364s y: [4, 2, 1, 1] 364s -0.098 0.508 -0.194 -0.423 364s 0.063 -0.422 -0.497 -0.319 364s testing: mul_mat, [3, 4, 3, 1] = [4, 3, 3, 1] * [4, 4, 3, 1] 364s x0: [4, 3, 3, 1] 364s -0.008 -0.415 -0.724 -0.490 364s 0.517 -0.077 -0.429 0.153 364s 0.254 -0.390 -0.600 0.546 364s 364s x1: [4, 4, 3, 1] 364s 0.423 -0.772 -0.321 0.103 364s 0.432 0.855 0.693 -0.336 364s 0.925 0.769 0.588 -0.810 364s 0.630 0.773 0.471 0.486 364s 364s y: [3, 4, 3, 1] 364s 0.498 0.431 0.657 364s -0.696 -0.191 -0.823 364s -0.356 0.042 -0.860 364s -0.905 0.139 -0.159 364s testing: mul_mat, [2, 4, 3, 1] = [3, 2, 3, 1] * [3, 4, 3, 1] 364s x0: [3, 2, 3, 1] 364s 0.883 -0.871 -0.332 364s -0.993 -0.009 0.099 364s 364s x1: [3, 4, 3, 1] 364s -0.758 0.377 -0.608 364s -0.358 0.887 0.545 364s -0.812 -0.911 -0.296 364s 0.108 -0.339 -0.030 364s 364s y: [2, 4, 3, 1] 364s -0.796 0.690 364s -1.271 0.402 364s 0.175 0.786 364s 0.401 -0.107 364s test-mul-mat0: iter:9/500 364s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 364s x0: [3, 4, 1, 1] 364s 0.515 -0.231 0.660 364s 0.522 -0.102 -0.349 364s -0.810 0.996 -0.522 364s -0.683 0.132 -0.661 364s 364s x1: [3, 2, 1, 1] 364s 0.029 0.566 0.464 364s -0.917 0.597 0.344 364s 364s y: [4, 2, 1, 1] 364s 0.191 -0.205 0.298 -0.252 364s -0.383 -0.660 1.158 0.478 364s testing: mul_mat, [2, 4, 2, 1] = [3, 2, 2, 1] * [3, 4, 2, 1] 364s x0: [3, 2, 2, 1] 364s -0.073 -0.203 -0.981 364s 0.243 0.928 -0.642 364s 364s x1: [3, 4, 2, 1] 364s -0.148 0.942 -0.505 364s 0.086 -0.735 -0.962 364s -0.876 0.624 -0.652 364s -0.659 -0.463 -0.675 364s 364s y: [2, 4, 2, 1] 364s 0.314 1.162 364s 1.087 -0.044 364s 0.576 0.784 364s 0.805 -0.158 364s testing: mul_mat, [4, 2, 2, 1] = [3, 4, 2, 1] * [3, 2, 2, 1] 364s x0: [3, 4, 2, 1] 364s 0.314 -0.609 -0.412 364s -0.949 0.191 -0.871 364s -0.023 0.988 -0.852 364s -0.779 0.916 -0.493 364s 364s x1: [3, 2, 2, 1] 364s 0.305 0.760 0.240 364s 0.646 -0.704 0.565 364s 364s y: [4, 2, 2, 1] 364s -0.466 -0.354 0.539 0.340 364s 0.398 -1.240 -1.191 -1.427 364s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 364s x0: [3, 2, 1, 1] 364s -0.182 -0.505 -0.615 364s -0.531 0.039 0.001 364s 364s x1: [3, 4, 1, 1] 364s 0.464 -0.622 -0.588 364s 0.932 -0.524 -0.100 364s 0.630 0.790 0.291 364s -0.782 0.841 -0.518 364s 364s y: [2, 4, 1, 1] 364s 0.592 -0.271 364s 0.157 -0.515 364s -0.692 -0.303 364s 0.036 0.447 364s testing: mul_mat, [3, 3, 2, 1] = [2, 3, 2, 1] * [2, 3, 2, 1] 364s x0: [2, 3, 2, 1] 364s -0.868 -0.392 364s 0.575 -0.524 364s 0.499 0.129 364s 364s x1: [2, 3, 2, 1] 364s 0.355 -0.828 364s -0.638 -0.147 364s -0.906 -0.386 364s 364s y: [3, 3, 2, 1] 364s 0.016 0.639 0.071 364s 0.612 -0.290 -0.338 364s 0.938 -0.319 -0.502 364s testing: mul_mat, [2, 2, 2, 1] = [3, 2, 2, 1] * [3, 2, 2, 1] 364s x0: [3, 2, 2, 1] 364s 0.146 -0.936 0.434 364s -0.729 -0.100 0.632 364s 364s x1: [3, 2, 2, 1] 364s 0.058 -0.282 -0.812 364s -0.903 0.103 0.190 364s 364s y: [2, 2, 2, 1] 364s -0.080 -0.527 364s -0.146 0.768 364s test-mul-mat0: iter:10/500 364s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 364s x0: [1, 2, 1, 1] 364s 0.335 364s -0.781 364s 364s x1: [1, 1, 1, 1] 364s 0.501 364s 364s y: [2, 1, 1, 1] 364s 0.168 -0.391 364s testing: mul_mat, [1, 2, 2, 1] = [1, 1, 2, 1] * [1, 2, 2, 1] 364s x0: [1, 1, 2, 1] 364s -0.403 364s 364s x1: [1, 2, 2, 1] 364s 0.488 364s 0.432 364s 364s y: [1, 2, 2, 1] 364s -0.197 364s -0.174 364s testing: mul_mat, [2, 1, 2, 1] = [1, 2, 2, 1] * [1, 1, 2, 1] 364s x0: [1, 2, 2, 1] 364s 0.150 364s 0.113 364s 364s x1: [1, 1, 2, 1] 364s 0.190 364s 364s y: [2, 1, 2, 1] 364s 0.029 0.021 364s testing: mul_mat, [4, 2, 1, 1] = [1, 4, 1, 1] * [1, 2, 1, 1] 364s x0: [1, 4, 1, 1] 364s 0.044 364s 0.110 364s 0.978 364s -0.317 364s 364s x1: [1, 2, 1, 1] 364s -0.988 364s -0.900 364s 364s y: [4, 2, 1, 1] 364s -0.043 -0.108 -0.966 0.313 364s -0.039 -0.099 -0.880 0.285 364s testing: mul_mat, [1, 1, 2, 1] = [4, 1, 2, 1] * [4, 1, 2, 1] 364s x0: [4, 1, 2, 1] 364s 0.567 -0.366 0.766 -0.111 364s 364s x1: [4, 1, 2, 1] 364s -0.405 -0.762 -0.279 0.848 364s 364s y: [1, 1, 2, 1] 364s -0.258 364s testing: mul_mat, [4, 4, 2, 1] = [1, 4, 2, 1] * [1, 4, 2, 1] 364s x0: [1, 4, 2, 1] 364s 0.052 364s 0.622 364s 0.810 364s 0.030 364s 364s x1: [1, 4, 2, 1] 364s 0.840 364s 0.453 364s 0.935 364s -0.009 364s 364s y: [4, 4, 2, 1] 364s 0.044 0.522 0.681 0.025 364s 0.024 0.282 0.367 0.014 364s 0.049 0.581 0.758 0.028 364s -0.000 -0.006 -0.008 -0.000 364s test-mul-mat0: iter:11/500 364s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 364s x0: [1, 2, 1, 1] 364s -0.943 364s -0.795 364s 364s x1: [1, 4, 1, 1] 364s -0.741 364s -0.642 364s 0.168 364s -0.107 364s 364s y: [2, 4, 1, 1] 364s 0.698 0.589 364s 0.605 0.510 364s -0.159 -0.134 364s 0.101 0.085 364s testing: mul_mat, [3, 2, 3, 1] = [1, 3, 3, ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s 1] * [1, 2, 3, 1] 364s x0: [1, 3, 3, 1] 364s 0.562 364s -0.719 364s 0.811 364s 364s x1: [1, 2, 3, 1] 364s -0.225 364s 0.543 364s 364s y: [3, 2, 3, 1] 364s -0.127 0.162 -0.183 364s 0.305 -0.390 0.440 364s testing: mul_mat, [4, 3, 3, 2] = [1, 4, 3, 2] * [1, 3, 3, 2] 364s x0: [1, 4, 3, 2] 364s -0.183 364s 0.060 364s -0.406 364s -0.621 364s 364s x1: [1, 3, 3, 2] 364s -0.934 364s 0.198 364s 0.162 364s 364s y: [4, 3, 3, 2] 364s 0.171 -0.056 0.379 0.580 364s -0.036 0.012 -0.080 -0.123 364s -0.030 0.010 -0.066 -0.100 364s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 364s x0: [1, 1, 1, 1] 364s 0.574 364s 364s x1: [1, 4, 1, 1] 364s 0.630 364s 0.313 364s -0.805 364s -0.996 364s 364s y: [1, 4, 1, 1] 364s 0.362 364s 0.180 364s -0.462 364s -0.572 364s testing: mul_mat, [2, 1, 3, 1] = [1, 2, 3, 1] * [1, 1, 3, 1] 364s x0: [1, 2, 3, 1] 364s -0.671 364s 0.975 364s 364s x1: [1, 1, 3, 1] 364s -0.007 364s 364s y: [2, 1, 3, 1] 364s 0.005 -0.007 364s testing: mul_mat, [3, 1, 3, 2] = [2, 3, 3, 2] * [2, 1, 3, 2] 364s x0: [2, 3, 3, 2] 364s 0.340 -0.347 364s 0.035 0.230 364s 0.130 0.135 364s 364s x1: [2, 1, 3, 2] 364s 0.128 0.413 364s 364s y: [3, 1, 3, 2] 364s -0.100 0.099 0.072 364s test-mul-mat0: iter:12/500 364s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 364s x0: [4, 2, 1, 1] 364s 0.152 0.027 -0.478 0.091 364s -0.574 -0.496 -0.936 -0.044 364s 364s x1: [4, 4, 1, 1] 364s -0.841 -0.556 0.506 -0.775 364s 0.857 0.212 0.139 0.568 364s -0.824 0.181 -0.308 0.561 364s -0.302 0.541 0.686 0.128 364s 364s y: [2, 4, 1, 1] 364s -0.455 0.319 364s 0.121 -0.752 364s 0.078 0.646 364s -0.348 -0.743 364s testing: mul_mat, [2, 2, 1, 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 364s x0: [4, 2, 1, 1] 364s 0.944 0.575 0.129 0.083 364s 0.144 0.306 -0.736 0.835 364s 364s x1: [4, 2, 1, 1] 364s 0.139 -0.733 -0.380 0.561 364s 0.274 -0.588 0.719 0.718 364s 364s y: [2, 2, 1, 1] 364s -0.293 0.544 364s 0.072 -0.070 364s testing: mul_mat, [1, 2, 1, 4] = [4, 1, 1, 4] * [4, 2, 1, 4] 364s x0: [4, 1, 1, 4] 364s -0.950 0.302 0.044 0.306 364s 364s x1: [4, 2, 1, 4] 364s -0.134 -0.038 0.377 -0.447 364s -0.910 0.256 0.705 0.117 364s 364s y: [1, 2, 1, 4] 364s -0.005 364s 1.009 364s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 364s x0: [4, 1, 1, 1] 364s 0.119 -0.999 -0.125 -0.152 364s 364s x1: [4, 1, 1, 1] 364s -0.812 -0.149 -0.527 -0.693 364s 364s y: [1, 1, 1, 1] 364s 0.224 364s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 364s x0: [1, 1, 1, 1] 364s 0.795 364s 364s x1: [1, 4, 1, 1] 364s -0.162 364s -0.242 364s -0.160 364s 0.511 364s 364s y: [1, 4, 1, 1] 364s -0.129 364s -0.192 364s -0.127 364s 0.407 364s testing: mul_mat, [4, 1, 1, 4] = [1, 4, 1, 4] * [1, 1, 1, 4] 364s x0: [1, 4, 1, 4] 364s 0.000 364s 0.952 364s 0.616 364s -0.296 364s 364s x1: [1, 1, 1, 4] 364s 0.561 364s 364s y: [4, 1, 1, 4] 364s 0.000 0.534 0.346 -0.166 364s test-mul-mat0: iter:13/500 364s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 364s x0: [3, 3, 1, 1] 364s 0.138 -0.208 0.291 364s -0.245 0.496 -0.390 364s 0.032 -0.598 -0.283 364s 364s x1: [3, 3, 1, 1] 364s 0.400 0.009 0.158 364s -0.089 -0.484 -0.047 364s -0.528 0.324 0.792 364s 364s y: [3, 3, 1, 1] 364s 0.099 -0.155 -0.037 364s 0.075 -0.200 0.300 364s 0.090 -0.019 -0.435 364s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 364s x0: [3, 3, 1, 1] 364s 0.332 -0.266 0.873 364s -0.660 0.891 -0.216 364s -0.144 -0.156 0.256 364s 364s x1: [3, 3, 1, 1] 364s 0.098 -0.372 -0.404 364s 0.352 0.705 -0.053 364s 0.912 0.081 -0.584 364s 364s y: [3, 3, 1, 1] 364s -0.221 -0.308 -0.059 364s -0.118 0.408 -0.174 364s -0.228 -0.403 -0.293 364s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 364s x0: [3, 4, 1, 1] 364s 0.326 0.214 -0.797 364s 0.954 0.810 0.555 364s 0.659 -0.243 0.467 364s -0.260 0.173 0.940 364s 364s x1: [3, 3, 1, 1] 364s -0.820 -0.364 -0.405 364s 0.318 0.428 0.887 364s -0.927 -0.076 -0.503 364s 364s y: [4, 3, 1, 1] 364s -0.023 -1.302 -0.641 -0.230 364s -0.511 1.142 0.519 0.824 364s 0.083 -1.225 -0.827 -0.245 364s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 364s x0: [3, 4, 1, 1] 364s -0.294 -0.170 -0.852 364s -0.666 0.944 -0.090 364s -0.161 -0.044 0.854 364s 0.630 -0.955 0.702 364s 364s x1: [3, 4, 1, 1] 364s -0.928 0.907 0.812 364s -0.588 0.798 -0.404 364s 0.268 -0.358 0.852 364s 0.448 0.278 -0.553 364s 364s y: [4, 4, 1, 1] 364s -0.573 1.402 0.803 -0.880 364s 0.382 1.182 -0.286 -1.416 364s -0.744 -0.594 0.700 1.109 364s 0.293 0.014 -0.557 -0.372 364s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 364s x0: [4, 2, 1, 1] 364s -0.870 -0.492 0.816 -0.886 364s 0.369 0.408 0.274 -0.851 364s 364s x1: [4, 3, 1, 1] 364s -0.431 -0.389 0.169 0.309 364s 0.784 0.109 0.380 0.691 364s -0.079 0.792 0.488 0.517 364s 364s y: [2, 3, 1, 1] 364s 0.431 -0.534 364s -1.038 -0.150 364s -0.381 -0.012 364s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 364s x0: [2, 4, 1, 1] 364s 0.907 0.486 364s -0.822 0.274 364s 0.489 -0.815 364s 0.076 -0.134 364s 364s x1: [2, 4, 1, 1] 364s -0.887 0.744 364s -0.021 -0.943 364s -0.301 0.024 364s -0.795 0.609 364s 364s y: [4, 4, 1, 1] 364s -0.443 0.933 -1.040 -0.167 364s -0.477 -0.241 0.758 0.124 364s -0.261 0.254 -0.167 -0.026 364s -0.425 0.820 -0.885 -0.142 364s test-mul-mat0: iter:14/500 364s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 364s x0: [1, 1, 1, 1] 364s -0.417 364s 364s x1: [1, 1, 1, 1] 364s -0.377 364s 364s y: [1, 1, 1, 1] 364s 0.157 364s testing: mul_mat, [1, 1, 4, 1] = [1, 1, 4, 1] * [1, 1, 4, 1] 364s x0: [1, 1, 4, 1] 364s -0.896 364s 364s x1: [1, 1, 4, 1] 364s 0.992 364s 364s y: [1, 1, 4, 1] 364s -0.888 364s testing: mul_mat, [1, 1, 4, 4] = [1, 1, 4, 4] * [1, 1, 4, 4] 364s x0: [1, 1, 4, 4] 364s 0.346 364s 364s x1: [1, 1, 4, 4] 364s -0.064 364s 364s y: [1, 1, 4, 4] 364s -0.022 364s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 364s x0: [1, 4, 1, 1] 364s 0.951 364s -0.753 364s -0.738 364s 0.734 364s 364s x1: [1, 1, 1, 1] 364s -0.909 364s 364s y: [4, 1, 1, 1] 364s -0.865 0.684 0.670 -0.667 364s testing: mul_mat, [4, 1, 4, 1] = [4, 4, 4, 1] * [4, 1, 4, 1] 364s x0: [4, 4, 4, 1] 364s 0.902 0.986 -0.705 0.584 364s 0.581 -0.796 0.321 0.746 364s -0.340 0.896 -0.988 0.124 364s -0.569 -0.939 -0.458 -0.827 364s 364s x1: [4, 1, 4, 1] 364s -0.418 -0.410 -0.407 -0.598 364s 364s y: [4, 1, 4, 1] 364s -0.844 -0.493 0.102 1.304 364s testing: mul_mat, [4, 4, 4, 4] = [4, 4, 4, 4] * [4, 4, 4, 4] 364s x0: [4, 4, 4, 4] 364s -0.478 0.013 -0.869 0.974 364s -0.615 0.129 -0.643 0.983 364s -0.833 0.941 -0.785 0.662 364s -0.938 0.475 0.312 -0.355 364s 364s x1: [4, 4, 4, 4] 364s -0.266 -0.658 -0.041 -0.169 364s -0.771 0.925 0.223 0.437 364s -0.097 -0.844 -0.772 0.626 364s -0.786 0.193 -0.312 0.797 364s 364s y: [4, 4, 4, 4] 364s -0.011 -0.061 -0.478 -0.017 364s 0.612 0.879 1.627 1.078 364s 1.316 1.063 0.307 -0.772 364s 1.425 1.492 1.608 0.449 364s test-mul-mat0: iter:15/500 364s testing: mul_mat, [4, 4, 1, 1] = [1, 4, 1, 1] * [1, 4, 1, 1] 364s x0: [1, 4, 1, 1] 364s 0.574 364s 0.840 364s 0.803 364s 0.902 364s 364s x1: [1, 4, 1, 1] 364s 0.623 364s 0.952 364s 0.923 364s 0.902 364s 364s y: [4, 4, 1, 1] 364s 0.358 0.524 0.501 0.562 364s 0.546 0.799 0.764 0.858 364s 0.530 0.775 0.741 0.832 364s 0.518 0.757 0.724 0.813 364s testing: mul_mat, [1, 4, 2, 1] = [1, 1, 2, 1] * [1, 4, 2, 1] 364s x0: [1, 1, 2, 1] 364s -0.895 364s 364s x1: [1, 4, 2, 1] 364s 0.097 364s -0.027 364s 0.511 364s -0.836 364s 364s y: [1, 4, 2, 1] 364s -0.087 364s 0.024 364s -0.457 364s 0.749 364s testing: mul_mat, [2, 1, 2, 3] = [1, 2, 2, 3] * [1, 1, 2, 3] 364s x0: [1, 2, 2, 3] 364s -0.061 364s -0.019 364s 364s x1: [1, 1, 2, 3] 364s 0.624 364s 364s y: [2, 1, 2, 3] 364s -0.038 -0.012 364s testing: mul_mat, [3, 2, 1, 1] = [1, 3, 1, 1] * [1, 2, 1, 1] 364s x0: [1, 3, 1, 1] 364s 0.292 364s 0.607 364s -0.197 364s 364s x1: [1, 2, 1, 1] 364s 0.781 364s 0.444 364s 364s y: [3, 2, 1, 1] 364s 0.228 0.475 -0.154 364s 0.130 0.269 -0.087 364s testing: mul_mat, [4, 1, 2, 1] = [3, 4, 2, 1] * [3, 1, 2, 1] 364s x0: [3, 4, 2, 1] 364s 0.186 0.683 0.017 364s -0.472 0.533 0.813 364s -0.720 0.430 0.269 364s -0.509 -0.336 -0.547 364s 364s x1: [3, 1, 2, 1] 364s -0.021 -0.061 0.296 364s 364s y: [4, 1, 2, 1] 364s -0.040 0.218 0.069 -0.131 364s testing: mul_mat, [2, 3, 2, 3] = [4, 2, 2, 3] * [4, 3, 2, 3] 364s x0: [4, 2, 2, 3] 364s -0.190 -0.626 -0.456 -0.768 364s 0.574 0.385 -0.937 -0.483 364s 364s x1: [4, 3, 2, 3] 364s -0.949 -0.888 0.317 -0.927 364s 0.156 -0.873 0.719 -0.658 364s -0.345 0.999 -0.167 -0.662 364s 364s y: [2, 3, 2, 3] 364s 1.304 -0.736 364s 0.695 -0.602 364s 0.025 0.663 364s test-mul-mat0: iter:16/500 364s testing: mul_mat, [2, 2, 1, 1] = [1, 2, 1, 1] * [1, 2, 1, 1] 364s x0: [1, 2, 1, 1] 364s -0.366 364s 0.518 364s 364s x1: [1, 2, 1, 1] 364s -0.666 364s 0.232 364s 364s y: [2, 2, 1, 1] 364s 0.243 -0.345 364s -0.085 0.120 364s testing: mul_mat, [3, 2, 3, 1] = [1, 3, 3, 1] * [1, 2, 3, 1] 364s x0: [1, 3, 3, 1] 364s 0.666 364s 0.093 364s 0.475 364s 364s x1: [1, 2, 3, 1] 364s 0.291 364s -0.654 364s 364s y: [3, 2, 3, 1] 364s 0.194 0.027 0.138 364s -0.436 -0.061 -0.311 364s testing: mul_mat, [3, 3, 3, 4] = [1, 3, 3, 4] * [1, 3, 3, 4] 364s x0: [1, 3, 3, 4] 364s 0.405 364s -0.384 364s -0.712 364s 364s x1: [1, 3, 3, 4] 364s -0.680 364s 0.379 364s -0.392 364s 364s y: [3, 3, 3, 4] 364s -0.275 0.261 0.484 364s 0.154 -0.146 -0.270 364s -0.159 0.151 0.279 364s testing: mul_mat, [4, 3, 1, 1] = [1, 4, 1, 1] * [1, 3, 1, 1] 364s x0: [1, 4, 1, 1] 364s 0.393 364s -0.758 364s -0.179 364s 0.518 364s 364s x1: [1, 3, 1, 1] 364s -0.728 364s 0.255 364s 0.353 364s 364s y: [4, 3, 1, 1] 364s -0.287 0.552 0.130 -0.377 364s 0.100 -0.194 -0.046 0.132 364s 0.139 -0.268 -0.063 0.183 364s testing: mul_mat, [1, 1, 3, 1] = [4, 1, 3, 1] * [4, 1, 3, 1] 364s x0: [4, 1, 3, 1] 364s -0.173 -0.932 0.600 0.040 364s 364s x1: [4, 1, 3, 1] 364s 0.816 -0.378 -0.178 0.743 364s 364s y: [1, 1, 3, 1] 364s 0.134 364s testing: mul_mat, [4, 4, 3, 4] = [1, 4, 3, 4] * [1, 4, 3, 4] 364s x0: [1, 4, 3, 4] 364s -0.021 364s -0.828 364s 0.722 364s -0.925 364s 364s x1: [1, 4, 3, 4] 364s 0.553 364s -0.904 364s 0.178 364s 0.794 364s 364s y: [4, 4, 3, 4] 364s -0.012 -0.458 0.399 -0.511 364s 0.019 0.749 -0.652 0.836 364s -0.004 -0.147 0.128 -0.164 364s -0.017 -0.658 0.573 -0.735 364s test-mul-mat0: iter:17/500 364s testing: mul_mat, [1, 2, 1, 1] = [1, 1, 1, 1] * [1, 2, 1, 1] 364s x0: [1, 1, 1, 1] 364s -0.004 364s 364s x1: [1, 2, 1, 1] 364s -0.143 364s 0.623 364s 364s y: [1, 2, 1, 1] 364s 0.001 364s -0.003 364s testing: mul_mat, [3, 1, 1, 1] = [1, 3, 1, 1] * [1, 1, 1, 1] 364s x0: [1, 3, 1, 1] 364s 0.075 364s -0.077 364s 0.254 364s 364s x1: [1, 1, 1, 1] 364s 0.532 364s 364s y: [3, 1, 1, 1] 364s 0.040 -0.041 0.135 364s testing: mul_mat, [1, 3, 1, 4] = [1, 1, 1, 4] * [1, 3, 1, 4] 364s x0: [1, 1, 1, 4] 364s 0.189 364s 364s x1: [1, 3, 1, 4] 364s -0.048 364s 0.293 364s -0.636 364s 364s y: [1, 3, 1, 4] 364s -0.009 364s 0.055 364s -0.120 364s testing: mul_mat, [3, 1, 1, 1] = [1, 3, 1, 1] * [1, 1, 1, 1] 364s x0: [1, 3, 1, 1] 364s -0.897 364s 0.536 364s 0.068 364s 364s x1: [1, 1, 1, 1] 364s -0.888 364s 364s y: [3, 1, 1, 1] 364s 0.797 -0.476 -0.061 364s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 364s x0: [3, 1, 1, 1] 364s -0.309 -0.404 0.032 364s 364s x1: [3, 1, 1, 1] 364s -0.039 0.159 -0.720 364s 364s y: [1, 1, 1, 1] 364s -0.075 364s testing: mul_mat, [2, 3, 1, 4] = [1, 2, 1, 4] * [1, 3, 1, 4] 364s x0: [1, 2, 1, 4] 364s 0.903 364s 0.410 364s 364s x1: [1, 3, 1, 4] 364s 0.614 364s 0.851 364s 0.984 364s 364s y: [2, 3, 1, 4] 364s 0.555 0.251 364s 0.769 0.348 364s 0.889 0.403 364s test-mul-mat0: iter:18/500 364s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 364s x0: [2, 4, 1, 1] 364s -0.471 0.499 364s -0.190 -0.857 364s 0.350 -0.206 364s 0.050 0.565 364s 364s x1: [2, 1, 1, 1] 364s 0.838 -0.098 364s 364s y: [4, 1, 1, 1] 364s -0.443 -0.075 0.313 -0.013 364s testing: mul_mat, [4, 4, 3, 1] = [2, 4, 3, 1] * [2, 4, 3, 1] 364s x0: [2, 4, 3, 1] 364s -0.885 0.793 364s -0.832 -0.091 364s -0.157 0.733 364s -0.763 -0.464 364s 364s x1: [2, 4, 3, 1] 364s 0.328 0.693 364s -0.196 -0.454 364s -0.430 -0.475 364s 0.917 -0.267 364s 364s y: [4, 4, 3, 1] 364s 0.259 -0.337 0.456 -0.572 364s -0.187 0.204 -0.302 0.360 364s 0.004 0.401 -0.281 0.549 364s -1.023 -0.739 -0.340 -0.575 364s testing: mul_mat, [4, 4, 3, 4] = [2, 4, 3, 4] * [2, 4, 3, 4] 364s x0: [2, 4, 3, 4] 364s 0.570 0.862 364s -0.545 0.675 364s -0.919 -0.964 364s 0.703 -0.567 364s 364s x1: [2, 4, 3, 4] 364s -0.523 0.997 364s 0.954 0.796 364s 0.302 0.604 364s -0.386 0.417 364s 364s y: [4, 4, 3, 4] 364s 0.561 0.958 -0.481 -0.933 364s 1.230 0.017 -1.645 0.220 364s 0.693 0.243 -0.860 -0.131 364s 0.139 0.492 -0.047 -0.508 364s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 364s x0: [2, 4, 1, 1] 364s 0.032 0.120 364s 0.142 -0.502 364s -0.080 -0.993 364s -0.282 0.433 364s 364s x1: [2, 4, 1, 1] 364s -0.147 -0.647 364s -0.666 -0.222 364s -0.214 0.791 364s 0.982 -0.533 364s 364s y: [4, 4, 1, 1] 364s -0.082 0.304 0.655 -0.239 364s -0.048 0.017 0.274 0.092 364s 0.088 -0.427 -0.768 0.403 364s -0.033 0.407 0.450 -0.508 364s testing: mul_mat, [2, 2, 3, 1] = [4, 2, 3, 1] * [4, 2, 3, 1] 364s x0: [4, 2, 3, 1] 364s -0.932 0.539 -0.086 -0.388 364s -0.093 -0.434 -0.028 0.243 364s 364s x1: [4, 2, 3, 1] 364s -0.955 -0.670 -0.602 0.494 364s -0.733 0.067 -0.514 0.107 364s 364s y: [2, 2, 3, 1] 364s 0.388 0.517 364s 0.722 0.080 364s testing: mul_mat, [4, 4, 3, 4] = [2, 4, 3, 4] * [2, 4, 3, 4] 364s x0: [2, 4, 3, 4] 364s -0.468 -0.049 364s -0.193 0.995 364s 0.338 0.117 364s 0.320 0.332 364s 364s x1: [2, 4, 3, 4] 364s 0.295 -0.212 364s -0.734 -0.428 364s 0.936 -0.544 364s -0.010 -0.995 364s 364s y: [4, 4, 3, 4] 364s -0.128 -0.268 0.075 0.024 364s 0.364 -0.284 -0.298 -0.377 364s -0.411 -0.722 0.253 0.119 364s 0.053 -0.989 -0.119 -0.333 364s test-mul-mat0: iter:19/500 364s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 364s x0: [3, 2, 1, 1] 364s 0.122 -0.698 -0.011 364s -0.949 0.958 0.025 364s 364s x1: [3, 3, 1, 1] 364s -0.022 0.587 0.161 364s -0.016 -0.000 0.189 364s 0.294 -0.191 0.480 364s 364s y: [2, 3, 1, 1] 364s -0.414 0.586 364s -0.004 0.019 364s 0.164 -0.450 364s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 364s x0: [3, 4, 1, 1] 364s -0.263 -0.148 0.480 364s -0.741 0.941 -0.639 364s 0.918 0.928 0.340 364s 0.504 0.088 -0.676 364s 364s x1: [3, 2, 1, 1] 364s 0.294 0.556 0.789 364s 0.579 -0.284 0.486 364s 364s y: [4, 2, 1, 1] 364s 0.219 -0.198 1.054 -0.336 364s 0.123 -1.007 0.433 -0.062 364s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 364s x0: [3, 4, 1, 1] 364s 0.574 0.084 0.359 364s -0.711 -0.430 0.614 364s 0.026 0.422 0.094 364s 0.285 0.364 0.456 364s 364s x1: [3, 4, 1, 1] 364s -0.496 -0.722 0.618 364s 0.313 0.7ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s 57 0.281 364s -0.565 -0.940 -0.730 364s -0.514 -0.983 0.295 364s 364s y: [4, 4, 1, 1] 364s -0.123 1.043 -0.259 -0.122 364s 0.344 -0.375 0.354 0.492 364s -0.665 0.357 -0.480 -0.836 364s -0.272 0.969 -0.400 -0.369 364s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 364s x0: [3, 3, 1, 1] 364s -0.526 -0.509 -0.935 364s -0.712 0.583 -0.333 364s 0.444 0.224 -0.417 364s 364s x1: [3, 4, 1, 1] 364s 0.202 0.291 -0.205 364s -0.294 -0.621 0.119 364s 0.210 -0.343 -0.263 364s -0.477 -0.586 -0.982 364s 364s y: [3, 4, 1, 1] 364s -0.063 0.094 0.240 364s 0.359 -0.193 -0.319 364s 0.309 -0.262 0.126 364s 1.467 0.325 0.066 364s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 364s x0: [3, 2, 1, 1] 364s 0.315 -0.427 -0.338 364s 0.543 0.694 0.783 364s 364s x1: [3, 3, 1, 1] 364s -0.646 0.238 -0.803 364s 0.380 -0.340 0.292 364s -0.336 -0.977 -0.253 364s 364s y: [2, 3, 1, 1] 364s -0.034 -0.814 364s 0.166 0.198 364s 0.397 -1.058 364s testing: mul_mat, [4, 3, 1, 1] = [2, 4, 1, 1] * [2, 3, 1, 1] 364s x0: [2, 4, 1, 1] 364s 0.706 0.932 364s -0.292 0.795 364s 0.731 0.956 364s 0.289 -0.484 364s 364s x1: [2, 3, 1, 1] 364s -0.649 0.399 364s -0.694 -0.234 364s 0.418 0.264 364s 364s y: [4, 3, 1, 1] 364s -0.086 0.507 -0.092 -0.381 364s -0.708 0.016 -0.731 -0.087 364s 0.541 0.089 0.558 -0.007 364s test-mul-mat0: iter:20/500 364s testing: mul_mat, [4, 3, 1, 1] = [1, 4, 1, 1] * [1, 3, 1, 1] 364s x0: [1, 4, 1, 1] 364s -0.248 364s -0.941 364s 0.191 364s -0.705 364s 364s x1: [1, 3, 1, 1] 364s 0.853 364s -0.995 364s -0.807 364s 364s y: [4, 3, 1, 1] 364s -0.212 -0.803 0.163 -0.602 364s 0.247 0.936 -0.190 0.702 364s 0.200 0.759 -0.154 0.569 364s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 364s x0: [1, 2, 1, 1] 364s -0.644 364s -0.278 364s 364s x1: [1, 4, 1, 1] 364s -0.368 364s -0.115 364s -0.043 364s -0.585 364s 364s y: [2, 4, 1, 1] 364s 0.237 0.102 364s 0.074 0.032 364s 0.028 0.012 364s 0.376 0.163 364s testing: mul_mat, [2, 2, 1, 3] = [1, 2, 1, 3] * [1, 2, 1, 3] 364s x0: [1, 2, 1, 3] 364s -0.232 364s -0.373 364s 364s x1: [1, 2, 1, 3] 364s -0.998 364s 0.774 364s 364s y: [2, 2, 1, 3] 364s 0.231 0.372 364s -0.179 -0.289 364s testing: mul_mat, [2, 2, 1, 1] = [1, 2, 1, 1] * [1, 2, 1, 1] 364s x0: [1, 2, 1, 1] 364s 0.771 364s 0.860 364s 364s x1: [1, 2, 1, 1] 364s 0.918 364s 0.856 364s 364s y: [2, 2, 1, 1] 364s 0.708 0.790 364s 0.660 0.736 364s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 364s x0: [2, 4, 1, 1] 364s -0.302 0.725 364s -0.160 0.863 364s -0.093 0.746 364s 0.330 -0.039 364s 364s x1: [2, 1, 1, 1] 364s -0.361 0.648 364s 364s y: [4, 1, 1, 1] 364s 0.579 0.618 0.517 -0.145 364s testing: mul_mat, [4, 2, 1, 3] = [4, 4, 1, 3] * [4, 2, 1, 3] 364s x0: [4, 4, 1, 3] 364s -0.847 0.720 -0.208 0.701 364s 0.974 0.829 -0.771 0.790 364s -0.132 -0.765 -0.359 -0.826 364s -0.984 -0.813 -0.003 0.942 364s 364s x1: [4, 2, 1, 3] 364s -0.781 -0.532 -0.037 0.993 364s -0.546 -0.795 0.473 0.149 364s 364s y: [4, 2, 1, 3] 364s 0.982 -0.389 -0.297 2.137 364s -0.104 -1.439 0.388 1.322 364s test-mul-mat0: iter:21/500 364s testing: mul_mat, [3, 4, 1, 1] = [2, 3, 1, 1] * [2, 4, 1, 1] 364s x0: [2, 3, 1, 1] 364s 0.203 -0.189 364s 0.891 0.963 364s -0.883 -0.541 364s 364s x1: [2, 4, 1, 1] 364s -0.887 -0.508 364s 0.935 -0.440 364s -0.616 -0.833 364s 0.310 0.521 364s 364s y: [3, 4, 1, 1] 364s -0.084 -1.279 1.058 364s 0.273 0.409 -0.588 364s 0.033 -1.352 0.995 364s -0.036 0.778 -0.555 364s testing: mul_mat, [2, 3, 4, 1] = [2, 2, 4, 1] * [2, 3, 4, 1] 364s x0: [2, 2, 4, 1] 364s 0.819 -0.006 364s -0.783 0.783 364s 364s x1: [2, 3, 4, 1] 364s -0.447 0.658 364s 0.215 0.934 364s 0.428 -0.388 364s 364s y: [2, 3, 4, 1] 364s -0.371 0.865 364s 0.170 0.563 364s 0.353 -0.639 364s testing: mul_mat, [4, 2, 4, 4] = [2, 4, 4, 4] * [2, 2, 4, 4] 364s x0: [2, 4, 4, 4] 364s -0.077 0.364 364s 0.857 0.085 364s -0.751 0.231 364s 0.344 -0.290 364s 364s x1: [2, 2, 4, 4] 364s 0.408 -0.148 364s -0.019 -0.958 364s 364s y: [4, 2, 4, 4] 364s -0.085 0.337 -0.341 0.183 364s -0.347 -0.098 -0.206 0.271 364s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 364s x0: [2, 4, 1, 1] 364s -0.813 0.748 364s 0.386 0.038 364s 0.452 -0.327 364s -0.950 0.986 364s 364s x1: [2, 4, 1, 1] 364s -0.238 -0.631 364s -0.254 -0.834 364s 0.709 -0.256 364s -0.111 0.988 364s 364s y: [4, 4, 1, 1] 364s -0.278 -0.116 0.098 -0.395 364s -0.418 -0.129 0.158 -0.581 364s -0.768 0.264 0.404 -0.926 364s 0.829 -0.005 -0.373 1.079 364s testing: mul_mat, [1, 2, 4, 1] = [4, 1, 4, 1] * [4, 2, 4, 1] 364s x0: [4, 1, 4, 1] 364s 0.348 -0.949 -0.048 -0.628 364s 364s x1: [4, 2, 4, 1] 364s -0.080 -0.249 0.656 -0.543 364s 0.694 0.024 0.224 0.179 364s 364s y: [1, 2, 4, 1] 364s 0.517 364s 0.096 364s testing: mul_mat, [3, 4, 4, 4] = [1, 3, 4, 4] * [1, 4, 4, 4] 364s x0: [1, 3, 4, 4] 364s 0.977 364s -0.652 364s -0.000 364s 364s x1: [1, 4, 4, 4] 364s -0.984 364s -0.035 364s 0.451 364s -0.412 364s 364s y: [3, 4, 4, 4] 364s -0.ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s 962 0.641 0.000 364s -0.034 0.023 0.000 364s 0.441 -0.294 -0.000 364s -0.403 0.268 0.000 364s test-mul-mat0: iter:22/500 364s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 364s x0: [4, 2, 1, 1] 364s -0.019 -0.533 0.959 0.615 364s -0.650 -0.993 -0.403 0.233 364s 364s x1: [4, 1, 1, 1] 364s 0.789 -0.229 -0.211 -0.203 364s 364s y: [2, 1, 1, 1] 364s -0.220 -0.248 364s testing: mul_mat, [3, 2, 3, 1] = [4, 3, 3, 1] * [4, 2, 3, 1] 364s x0: [4, 3, 3, 1] 364s -0.802 -0.976 -0.098 -0.795 364s -0.378 -0.865 0.806 0.241 364s 0.104 -0.630 0.070 -0.911 364s 364s x1: [4, 2, 3, 1] 364s 0.602 -0.381 -0.031 -0.436 364s 0.829 -0.014 -0.339 0.430 364s 364s y: [3, 2, 3, 1] 364s 0.239 -0.028 0.697 364s -0.960 -0.471 -0.320 364s testing: mul_mat, [1, 3, 3, 4] = [4, 1, 3, 4] * [4, 3, 3, 4] 364s x0: [4, 1, 3, 4] 364s -0.791 0.304 -0.679 -0.247 364s 364s x1: [4, 3, 3, 4] 364s -0.394 -0.986 0.033 0.710 364s -0.616 -0.898 0.799 0.415 364s -0.398 0.431 0.398 -0.171 364s 364s y: [1, 3, 3, 4] 364s -0.186 364s -0.430 364s 0.218 364s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 364s x0: [4, 2, 1, 1] 364s 0.624 0.659 0.558 -0.280 364s 0.138 -0.216 0.089 0.294 364s 364s x1: [4, 1, 1, 1] 364s -0.979 0.924 -0.711 0.276 364s 364s y: [2, 1, 1, 1] 364s -0.476 -0.317 364s testing: mul_mat, [1, 4, 3, 1] = [2, 1, 3, 1] * [2, 4, 3, 1] 364s x0: [2, 1, 3, 1] 364s 0.959 0.901 364s 364s x1: [2, 4, 3, 1] 364s 0.954 -0.136 364s -0.668 0.139 364s 0.761 0.018 364s 0.717 -0.535 364s 364s y: [1, 4, 3, 1] 364s 0.792 364s -0.515 364s 0.746 364s 0.206 364s testing: mul_mat, [4, 2, 3, 4] = [1, 4, 3, 4] * [1, 2, 3, 4] 364s x0: [1, 4, 3, 4] 364s 0.835 364s -0.764 364s 0.253 364s -0.117 364s 364s x1: [1, 2, 3, 4] 364s 0.002 364s 0.485 364s 364s y: [4, 2, 3, 4] 364s 0.001 -0.001 0.000 -0.000 364s 0.405 -0.371 0.123 -0.057 364s test-mul-mat0: iter:23/500 364s testing: mul_mat, [2, 3, 1, 1] = [1, 2, 1, 1] * [1, 3, 1, 1] 364s x0: [1, 2, 1, 1] 364s -0.234 364s 0.299 364s 364s x1: [1, 3, 1, 1] 364s -0.912 364s -0.581 364s -0.047 364s 364s y: [2, 3, 1, 1] 364s 0.213 -0.273 364s 0.136 -0.174 364s 0.011 -0.014 364s testing: mul_mat, [4, 2, 3, 1] = [1, 4, 3, 1] * [1, 2, 3, 1] 364s x0: [1, 4, 3, 1] 364s 0.990 364s -0.327 364s 0.616 364s -0.184 364s 364s x1: [1, 2, 3, 1] 364s -0.267 364s -0.010 364s 364s y: [4, 2, 3, 1] 364s -0.264 0.087 -0.164 0.049 364s -0.010 0.003 -0.006 0.002 364s testing: mul_mat, [3, 4, 3, 1] = [1, 3, 3, 1] * [1, 4, 3, 1] 364s x0: [1, 3, 3, 1] 364s 0.828 364s -0.529 364s -0.212 364s 364s x1: [1, 4, 3, 1] 364s 0.681 364s -0.994 364s -0.587 364s -0.129 364s 364s y: [3, 4, 3, 1] 364s 0.564 -0.360 -0.144 364s -0.824 0.526 0.211 364s -0.486 0.310 0.124 364s -0.107 0.068 0.027 364s testing: mul_mat, [1, 3, 1, 1] = [1, 1, 1, 1] * [1, 3, 1, 1] 364s x0: [1, 1, 1, 1] 364s 0.321 364s 364s x1: [1, 3, 1, 1] 364s 0.682 364s -0.170 364s 0.052 364s 364s y: [1, 3, 1, 1] 364s 0.219 364s -0.054 364s 0.017 364s testing: mul_mat, [4, 1, 3, 1] = [1, 4, 3, 1] * [1, 1, 3, 1] 364s x0: [1, 4, 3, 1] 364s 0.431 364s -0.180 364s 0.903 364s -0.699 364s 364s x1: [1, 1, 3, 1] 364s 0.499 364s 364s y: [4, 1, 3, 1] 364s 0.215 -0.090 0.451 -0.349 364s testing: mul_mat, [3, 1, 3, 1] = [4, 3, 3, 1] * [4, 1, 3, 1] 364s x0: [4, 3, 3, 1] 364s 0.986 0.736 0.166 0.467 364s 0.855 -0.331 0.489 0.961 364s -0.388 -0.898 -0.995 -0.855 364s 364s x1: [4, 1, 3, 1] 364s 0.165 0.211 -0.708 0.953 364s 364s y: [3, 1, 3, 1] 364s 0.645 0.641 -0.363 364s test-mul-mat0: iter:24/500 364s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 364s x0: [4, 2, 1, 1] 364s 0.404 -0.566 0.859 -0.299 364s -0.828 -0.041 -0.710 0.863 364s 364s x1: [4, 1, 1, 1] 364s 0.135 0.464 0.928 0.862 364s 364s y: [2, 1, 1, 1] 364s 0.331 -0.046 364s testing: mul_mat, [3, 2, 3, 1] = [4, 3, 3, 1] * [4, 2, 3, 1] 364s x0: [4, 3, 3, 1] 364s -0.859 -0.966 0.605 0.100 364s -0.676 0.468 0.246 0.866 364s 0.220 0.535 -0.638 -0.077 364s 364s x1: [4, 2, 3, 1] 364s -0.854 0.541 0.752 -0.711 364s -0.504 0.703 0.068 0.986 364s 364s y: [3, 2, 3, 1] 364s 0.595 0.400 -0.323 364s -0.107 1.539 0.145 364s testing: mul_mat, [3, 3, 3, 3] = [4, 3, 3, 3] * [4, 3, 3, 3] 364s x0: [4, 3, 3, 3] 364s -0.440 -0.372 0.374 -0.476 364s -0.564 -0.652 -0.125 -0.939 364s 0.894 0.852 0.873 0.627 364s 364s x1: [4, 3, 3, 3] 364s 0.418 -0.048 0.836 -0.362 364s -0.513 -0.802 0.560 0.090 364s -0.455 0.249 0.527 -0.776 364s 364s y: [3, 3, 3, 3] 364s 0.319 0.031 0.835 364s 0.691 0.658 -0.597 364s 0.674 0.757 -0.220 364s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 364s x0: [4, 3, 1, 1] 364s -0.835 -0.869 0.079 -0.775 364s 0.154 0.779 0.926 0.315 364s -0.862 0.357 -0.211 -0.047 364s 364s x1: [4, 3, 1, 1] 364s -0.026 0.825 -0.533 -0.107 364s 0.383 -0.790 -0.711 -0.670 364s -0.722 -0.806 -0.162 -0.605 364s 364s y: [3, 3, 1, 1] 364s -0.654 0.111 0.434 364s 0.830 -1.425 -0.431 364s 1.759 -1.080 0.397 364s testingggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 364s : mul_mat, [3, 4, 3, 1] = [3, 3, 3, 1] * [3, 4, 3, 1] 364s x0: [3, 3, 3, 1] 364s 0.811 -0.280 0.918 364s -0.649 0.695 0.370 364s 0.981 0.848 0.994 364s 364s x1: [3, 4, 3, 1] 364s -0.360 0.538 0.994 364s 0.042 -0.863 0.191 364s -0.984 0.962 0.658 364s -0.091 0.345 0.868 364s 364s y: [3, 4, 3, 1] 364s 0.470 0.976 1.092 364s 0.451 -0.557 -0.501 364s -0.463 1.551 0.505 364s 0.626 0.620 1.065 364s testing: mul_mat, [1, 3, 3, 3] = [3, 1, 3, 3] * [3, 3, 3, 3] 364s x0: [3, 1, 3, 3] 364s 0.987 0.471 0.533 364s 364s x1: [3, 3, 3, 3] 364s 0.097 0.917 -0.544 364s 0.419 0.728 -0.194 364s 0.400 -0.552 -0.498 364s 364s y: [1, 3, 3, 3] 364s 0.238 364s 0.654 364s -0.131 364s test-mul-mat0: iter:25/500 364s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 364s x0: [2, 1, 1, 1] 364s -0.546 0.533 364s 364s x1: [2, 3, 1, 1] 364s 0.619 -0.924 364s -0.011 -0.384 364s 0.819 -0.509 364s 364s y: [1, 3, 1, 1] 364s -0.831 364s -0.199 364s -0.718 364s testing: mul_mat, [1, 1, 4, 1] = [2, 1, 4, 1] * [2, 1, 4, 1] 364s x0: [2, 1, 4, 1] 364s -0.570 0.968 364s 364s x1: [2, 1, 4, 1] 364s -0.015 -0.330 364s 364s y: [1, 1, 4, 1] 364s -0.310 364s testing: mul_mat, [3, 1, 4, 2] = [2, 3, 4, 2] * [2, 1, 4, 2] 364s x0: [2, 3, 4, 2] 364s -0.215 0.696 364s 0.567 -0.777 364s -0.077 0.230 364s 364s x1: [2, 1, 4, 2] 364s -0.540 0.981 364s 364s y: [3, 1, 4, 2] 364s 0.799 -1.068 0.268 364s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 364s x0: [2, 2, 1, 1] 364s 0.644 -0.700 364s -0.538 -0.802 364s 364s x1: [2, 3, 1, 1] 364s 0.140 -0.646 364s 0.199 -0.586 364s 0.540 -0.504 364s 364s y: [2, 3, 1, 1] 364s 0.542 0.443 364s 0.538 0.362 364s 0.701 0.114 364s testing: mul_mat, [3, 2, 4, 1] = [2, 3, 4, 1] * [2, 2, 4, 1] 364s x0: [2, 3, 4, 1] 364s -0.790 0.350 364s 0.199 0.553 364s -0.780 0.419 364s 364s x1: [2, 2, 4, 1] 364s -0.916 -0.964 364s 0.988 -0.509 364s 364s y: [3, 2, 4, 1] 364s 0.386 -0.715 0.310 364s -0.958 -0.085 -0.984 364s testing: mul_mat, [3, 2, 4, 2] = [3, 3, 4, 2] * [3, 2, 4, 2] 364s x0: [3, 3, 4, 2] 364s -0.757 0.676 -0.939 364s 0.070 -0.062 0.023 364s 0.120 -0.027 0.094 364s 364s x1: [3, 2, 4, 2] 364s 0.106 0.464 0.810 364s -0.535 -0.333 0.414 364s 364s y: [3, 2, 4, 2] 364s -0.526 -0.002 0.076 364s -0.209 -0.007 -0.016 364s test-mul-mat0: iter:26/500 364s testing: mul_mat, [2, 2, 1, 1] = [1, 2, 1, 1] * [1, 2, 1, 1] 364s x0: [1, 2, 1, 1] 364s -0.898 364s 0.042 364s 364s x1: [1, 2, 1, 1] 364s -0.796 364s 0.779 364s 364s y: [2, 2, 1, 1] 364s 0.715 -0.033 364s -0.699 0.033 364s testing: mul_mat, [3, 2, 2, 1] = [1, 3, 2, 1] * [1, 2, 2, 1] 364s x0: [1, 3, 2, 1] 364s 0.887 364s -0.312 364s 0.885 364s 364s x1: [1, 2, 2, 1] 364s -0.339 364s -0.020 364s 364s y: [3, 2, 2, 1] 364s -0.300 0.106 -0.300 364s -0.018 0.006 -0.018 364s testing: mul_mat, [3, 3, 2, 3] = [1, 3, 2, 3] * [1, 3, 2, 3] 364s x0: [1, 3, 2, 3] 364s 0.644 364s -0.225 364s 0.912 364s 364s x1: [1, 3, 2, 3] 364s 0.762 364s -0.301 364s -0.960 364s 364s y: [3, 3, 2, 3] 364s 0.491 -0.171 0.695 364s -0.194 0.068 -0.275 364s -0.619 0.216 -0.876 364s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 364s x0: [1, 3, 1, 1] 364s -0.097 364s 0.179 364s -0.122 364s 364s x1: [1, 3, 1, 1] 364s 0.582 364s -0.048 364s -0.208 364s 364s y: [3, 3, 1, 1] 364s -0.056 0.104 -0.071 364s 0.005 -0.009 0.006 364s 0.020 -0.037 0.025 364s testing: mul_mat, [2, 1, 2, 1] = [3, 2, 2, 1] * [3, 1, 2, 1] 364s x0: [3, 2, 2, 1] 364s -0.012 0.049 -0.144 364s -0.448 -0.303 -0.987 364s 364s x1: [3, 1, 2, 1] 364s 0.726 -0.901 0.708 364s 364s y: [2, 1, 2, 1] 364s -0.155 -0.751 364s testing: mul_mat, [2, 3, 2, 3] = [2, 2, 2, 3] * [2, 3, 2, 3] 364s x0: [2, 2, 2, 3] 364s -0.401 0.140 364s 0.231 -0.403 364s 364s x1: [2, 3, 2, 3] 364s -0.023 -0.713 364s -0.280 -0.857 364s -0.363 -0.699 364s 364s y: [2, 3, 2, 3] 364s -0.090 0.282 364s -0.007 0.281 364s 0.048 0.198 364s test-mul-mat0: iter:27/500 364s testing: mul_mat, [3, 2, 1, 1] = [1, 3, 1, 1] * [1, 2, 1, 1] 364s x0: [1, 3, 1, 1] 364s 0.520 364s -0.683 364s -0.196 364s 364s x1: [1, 2, 1, 1] 364s -0.079 364s 0.086 364s 364s y: [3, 2, 1, 1] 364s -0.041 0.054 0.015 364s 0.045 -0.059 -0.017 364s testing: mul_mat, [2, 3, 2, 1] = [1, 2, 2, 1] * [1, 3, 2, 1] 364s x0: [1, 2, 2, 1] 364s -0.837 364s -0.426 364s 364s x1: [1, 3, 2, 1] 364s -0.883 364s -0.845 364s 0.375 364s 364s y: [2, 3, 2, 1] 364s 0.740 0.377 364s 0.707 0.360 364s -0.314 -0.160 364s testing: mul_mat, [1, 2, 2, 2] = [1, 1, 2, 2] * [1, 2, 2, 2] 364s x0: [1, 1, 2, 2] 364s -0.244 364s 364s x1: [1, 2, 2, 2] 364s -0.607 364s -0.313 364s 364s y: [1, 2, 2, 2] 364s 0.148 364s 0.077 364s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 364s x0: [1, 4, 1, 1] 364s 0.125 364s -0.990 364s 0.095 364s 0.442 364s 364s x1: [1, 1, 1, 1] 364s -0.654 364s 364s y: [4, 1, 1, 1] 364s -0.082 0.647 -0.062 -0.289 364s testing: mul_mat, [1, 1, 2, 1] = [4, 1, 2, 1] * [4, 1, 2, 1] 364s x0: [4, 1, 2, 1] 364s 0.847 0.672 0.993 -0.759 364s 364s x1: [4, 1, 2, 1] 364s -0.186 0.211 0.597 -0.811 364s 364s y: [1, 1, 2, 1] 364s 1.193 364s testiggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 364s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ng: mul_mat, [1, 4, 2, 2] = [1, 1, 2, 2] * [1, 4, 2, 2] 365s x0: [1, 1, 2, 2] 365s -0.288 365s 365s x1: [1, 4, 2, 2] 365s -0.319 365s -0.722 365s -0.916 365s 0.917 365s 365s y: [1, 4, 2, 2] 365s 0.092 365s 0.208 365s 0.264 365s -0.264 365s test-mul-mat0: iter:28/500 365s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 365s x0: [4, 3, 1, 1] 365s -0.901 0.335 -0.198 -0.703 365s 0.045 -0.958 0.938 0.568 365s -0.506 0.176 -0.721 -0.406 365s 365s x1: [4, 3, 1, 1] 365s 0.988 -0.927 -0.410 0.488 365s 0.713 -0.802 -0.831 0.992 365s -0.717 -0.914 0.026 0.636 365s 365s y: [3, 3, 1, 1] 365s -1.463 0.825 -0.566 365s -1.444 0.583 -0.306 365s -0.112 1.228 -0.074 365s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 365s x0: [4, 2, 1, 1] 365s 0.875 -0.542 -0.563 -0.099 365s -0.906 0.094 -0.000 0.429 365s 365s x1: [4, 3, 1, 1] 365s -0.289 -0.735 0.155 -0.294 365s -0.903 0.387 -0.306 -0.830 365s 0.978 -0.818 0.884 -0.824 365s 365s y: [2, 3, 1, 1] 365s 0.087 0.067 365s -0.745 0.498 365s 0.883 -1.316 365s testing: mul_mat, [4, 2, 1, 3] = [4, 4, 1, 3] * [4, 2, 1, 3] 365s x0: [4, 4, 1, 3] 365s 0.493 0.513 -0.726 -0.414 365s -0.488 0.703 -0.518 0.810 365s -0.823 -0.580 0.045 -0.782 365s -0.148 0.456 -0.461 0.878 365s 365s x1: [4, 2, 1, 3] 365s 0.896 0.297 -0.526 0.938 365s 0.235 -0.959 -0.568 -0.589 365s 365s y: [4, 2, 1, 3] 365s 0.587 0.805 -1.666 1.069 365s 0.280 -0.971 0.797 -0.727 365s testing: mul_mat, [3, 4, 1, 1] = [4, 3, 1, 1] * [4, 4, 1, 1] 365s x0: [4, 3, 1, 1] 365s 0.536 -0.214 -0.543 -0.021 365s -0.462 -0.619 0.268 0.612 365s 0.634 -0.136 -0.687 -0.699 365s 365s x1: [4, 4, 1, 1] 365s 0.113 -0.540 0.330 -0.080 365s -0.913 -0.740 0.580 0.261 365s 0.894 -0.024 -0.844 -0.416 365s -0.016 0.231 -0.233 -0.495 365s 365s y: [3, 4, 1, 1] 365s -0.002 0.322 -0.026 365s -0.651 1.195 -1.059 365s 0.951 -0.880 1.441 365s 0.079 -0.502 0.465 365s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 365s x0: [3, 3, 1, 1] 365s -0.396 0.371 -0.093 365s 0.872 -0.623 -0.085 365s -0.425 0.156 -0.210 365s 365s x1: [3, 4, 1, 1] 365s 0.380 -0.465 0.413 365s 0.839 0.866 -0.667 365s 0.927 -0.874 0.912 365s 0.187 -0.980 -0.112 365s 365s y: [3, 4, 1, 1] 365s -0.361 0.585 -0.321 365s 0.052 0.250 -0.082 365s -0.776 1.275 -0.722 365s -0.428 0.783 -0.209 365s testing: mul_mat, [2, 3, 1, 3] = [3, 2, 1, 3] * [3, 3, 1, 3] 365s x0: [3, 2, 1, 3] 365s 0.605 0.325 0.273 365s 0.978 0.298 -0.021 365s 365s x1: [3, 3, 1, 3] 365s 0.693 0.295 0.654 365s -0.850 -0.437 0.967 365s 0.129 -0.824 -0.733 365s 365s y: [2, 3, 1, 3] 365s 0.694 0.752 365s -0.392 -0.982 365s -0.390 -0.104 365s test-mul-mat0: iter:29/500 365s testing: mul_mat, [3, 1, 1, 1] = [3, 3, 1, 1] * [3, 1, 1, 1] 365s x0: [3, 3, 1, 1] 365s -0.667 0.356 -0.250 365s 0.397 0.892 -0.645 365s 0.375 0.217 0.654 365s 365s x1: [3, 1, 1, 1] 365s -0.275 0.647 0.976 365s 365s y: [3, 1, 1, 1] 365s 0.170 -0.162 0.675 365s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 365s x0: [3, 2, 1, 1] 365s 0.248 -0.800 -0.480 365s -0.845 0.910 -0.478 365s 365s x1: [3, 3, 1, 1] 365s -0.352 -0.804 0.102 365s -0.515 0.141 0.680 365s -0.503 -0.646 -0.234 365s 365s y: [2, 3, 1, 1] 365s 0.507 -0.483 365s -0.567 0.239 365s 0.504 -0.050 365s testing: mul_mat, [1, 2, 1, 2] = [3, 1, 1, 2] * [3, 2, 1, 2] 365s x0: [3, 1, 1, 2] 365s 0.522 -0.943 -0.613 365s 365s x1: [3, 2, 1, 2] 365s -0.013 0.081 0.247 365s -0.366 0.057 0.298 365s 365s y: [1, 2, 1, 2] 365s -0.234 365s -0.428 365s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 365s x0: [3, 1, 1, 1] 365s 0.605 -0.412 -0.539 365s 365s x1: [3, 1, 1, 1] 365s -0.141 -0.161 0.368 365s 365s y: [1, 1, 1, 1] 365s -0.218 365s testing: mul_mat, [4, 3, 1, 1] = [1, 4, 1, 1] * [1, 3, 1, 1] 365s x0: [1, 4, 1, 1] 365s 0.630 365s 0.603 365s -0.204 365s -0.123 365s 365s x1: [1, 3, 1, 1] 365s 0.805 365s 0.108 365s -0.384 365s 365s y: [4, 3, 1, 1] 365s 0.507 0.485 -0.165 -0.099 365s 0.068 0.065 -0.022 -0.013 365s -0.242 -0.232 0.079 0.047 365s testing: mul_mat, [2, 1, 1, 2] = [4, 2, 1, 2] * [4, 1, 1, 2] 365s x0: [4, 2, 1, 2] 365s 0.306 -0.317 0.810 -0.437 365s -0.907 0.692 0.843 -0.331 365s 365s x1: [4, 1, 1, 2] 365s -0.764 0.853 -0.825 0.203 365s 365s y: [2, 1, 1, 2] 365s -1.261 0.520 365s test-mul-mat0: iter:30/500 365s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 365s x0: [2, 4, 1, 1] 365s 0.293 0.158 365s -0.126 0.386 365s 0.841 -0.434 365s 0.196 0.684 365s 365s x1: [2, 4, 1, 1] 365s -0.397 -0.866 365s -0.202 -0.544 365s -0.691 -0.999 365s -0.278 -0.468 365s 365s y: [4, 4, 1, 1] 365s -0.253 -0.284 0.042 -0.670 365s -0.145 -0.184 0.066 -0.412 365s -0.360 -0.298 -0.148 -0.818 365s -0.155 -0.145 -0.031 -0.375 365s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 365s x0: [2, 1, 1, 1] 365s 0.208 0.048 365s 365s x1: [2, 4, 1, 1] 365s 0.130 0.864 365s -0.634 0.061 365s 0.746 -0.347 365s 0.581 -0.911 365s 365s y: [1, 4, 1, 1] 365s 0.068 365s -0.129 365s 0.139 365s 0.077 365s testing: mul_mat, [3, 1, 1, 2] = [2, 3, 1, 2] * [2, 1, 1, 2] 365s x0: [2, 3, 1, 2] 365s -0.593 0.611 365s -0.091 -0.592 365s -0.667 0.441 365s 365s x1: [2, 1, 1, 2] 365s -0.448 -0.534 365s 365s y: [3, 1, 1, 2] 365s -0.060 0.357 0.064 365s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 365s x0: [2, 1, 1, 1] 365s 0.817 0.143 365s 365s x1: [2, 3, 1, 1] 365s 0.416 -0.794 365s -0.876 -0.455 365s -0.929 -0.510 365s 365s y: [1, 3, 1, 1] 365s 0.227 365s -0.781 365s -0.832 365s testing: mul_mat, [2, 2, 1, 1] = [1, 2, 1, 1] * [1, 2, 1, 1] 365s x0: [1, 2, 1, 1] 365s -0.604 365s -0.045 365s 365s x1: [1, 2, 1, 1] 365s 0.187 365s 0.907 365s 365s y: [2, 2, 1, 1] 365s -0.113 -0.008 365s -0.548 -0.041 365s testing: mul_mat, [4, 1, 1, 2] = [2, 4, 1, 2] * [2, 1, 1, 2] 365s x0: [2, 4, 1, 2] 365s -0.030 0.538 365s 0.205 0.812 365s -0.629 0.354 365s 0.378 0.163 365s 365s x1: [2, 1, 1, 2] 365s -0.561 -0.138 365s 365s y: [4, 1, 1, 2] 365s -0.058 -0.227 0.304 -0.234 365s test-mul-mat0: iter:31/500 365s testing: mul_mat, [3, 1, 1, 1] = [3, 3, 1, 1] * [3, 1, 1, 1] 365s x0: [3, 3, 1, 1] 365s 0.354 0.130 -0.065 365s -0.784 0.694 -0.471 365s 0.678 -0.336 0.734 365s 365s x1: [3, 1, 1, 1] 365s -0.392 -0.041 -0.825 365s 365s y: [3, 1, 1, 1] 365s -0.090 0.667 -0.858 365s testing: mul_mat, [2, 3, 3, 1] = [3, 2, 3, 1] * [3, 3, 3, 1] 365s x0: [3, 2, 3, 1] 365s 0.364 0.372 0.864 365s 0.413 0.414 0.136 365s 365s x1: [3, 3, 3, 1] 365s -0.951 -0.958 0.272 365s 0.861 0.396 -0.565 365s -0.728 0.600 -0.391 365s 365s y: [2, 3, 3, 1] 365s -0.467 -0.752 365s -0.028 0.443 365s -0.380 -0.106 365s testing: mul_mat, [3, 2, 3, 4] = [3, 3, 3, 4] * [3, 2, 3, 4] 365s x0: [3, 3, 3, 4] 365s -0.865 0.953 -0.520 365s -0.476 -0.135 0.724 365s 0.624 0.676 -0.541 365s 365s x1: [3, 2, 3, 4] 365s -0.668 -0.830 -0.681 365s -0.497 -0.715 -0.797 365s 365s y: [3, 2, 3, 4] 365s 0.141 -0.063 -0.610 365s 0.163 -0.244 -0.362 365s testing: mul_mat, [1, 3, 1, 1] = [3, 1, 1, 1] * [3, 3, 1, 1] 365s x0: [3, 1, 1, 1] 365s 0.477 -0.755 0.040 365s 365s x1: [3, 3, 1, 1] 365s -0.923 0.834 -0.745 365s -0.654 -0.899 0.852 365s 0.088 0.776 -0.175 365s 365s y: [1, 3, 1, 1] 365s -1.101 365s 0.401 365s -0.551 365s testing: mul_mat, [3, 3, 3, 1] = [1, 3, 3, 1] * [1, 3, 3, 1] 365s x0: [1, 3, 3, 1] 365s -0.435 365s 0.270 365s 0.186 365s 365s x1: [1, 3, 3, 1] 365s -0.135 365s -0.012 365s 0.077 365s 365s y: [3, 3, 3, 1] 365s 0.059 -0.036 -0.025 365s 0.005 -0.003 -0.002 365s -0.034 0.021 0.014 365s testing: mul_mat, [1, 1, 3, 4] = [3, 1, 3, 4] * [3, 1, 3, 4] 365s x0: [3, 1, 3, 4] 365s -0.108 0.010 0.077 365s 365s x1: [3, 1, 3, 4] 365s 0.117 -0.742 -0.117 365s 365s y: [1, 1, 3, 4] 365s -0.029 365s test-mul-mat0: iter:32/500 365s testing: mul_mat, [1, 2, 1, 1] = [2, 1, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 1, 1, 1] 365s -0.138 -0.727 365s 365s x1: [2, 2, 1, 1] 365s 0.448 -0.384 365s -0.117 -0.295 365s 365s y: [1, 2, 1, 1] 365s 0.217 365s 0.231 365s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 365s x0: [2, 4, 1, 1] 365s -0.434 -0.029 365s -0.640 -0.293 365s 0.584 0.876 365s 0.527 -0.837 365s 365s x1: [2, 1, 1, 1] 365s 0.987 -0.935 365s 365s y: [4, 1, 1, 1] 365s -0.401 -0.357 -0.243 1.303 365s testing: mul_mat, [3, 4, 1, 1] = [2, 3, 1, 1] * [2, 4, 1, 1] 365s x0: [2, 3, 1, 1] 365s 0.489 -0.499 365s -0.456 -0.655 365s 0.949 0.160 365s 365s x1: [2, 4, 1, 1] 365s -0.231 -0.756 365s -0.184 0.009 365s 0.854 -0.520 365s -0.674 -0.173 365s 365s y: [3, 4, 1, 1] 365s 0.264 0.600 -0.341 365s -0.094 0.078 -0.173 365s 0.677 -0.049 0.727 365s -0.243 0.421 -0.668 365s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 365s x0: [2, 1, 1, 1] 365s -0.735 0.400 365s 365s x1: [2, 3, 1, 1] 365s 0.228 -0.346 365s -0.153 -0.909 365s -0.073 -0.166 365s 365s y: [1, 3, 1, 1] 365s -0.306 365s -0.251 365s -0.013 365s testing: mul_mat, [4, 2, 1, 1] = [1, 4, 1, 1] * [1, 2, 1, 1] 365s x0: [1, 4, 1, 1] 365s -0.289 365s -0.499 365s 0.633 365s -0.127 365s 365s x1: [1, 2, 1, 1] 365s 0.126 365s -0.375 365s 365s y: [4, 2, 1, 1] 365s -0.037 -0.063 0.080 -0.016 365s 0.108 0.187 -0.237 0.047 365s testing: mul_mat, [3, 1, 1, 1] = [4, 3, 1, 1] * [4, 1, 1, 1] 365s x0: [4, 3, 1, 1] 365s -0.831 -0.498 0.046 -0.794 365s -0.395 -0.906 -0.560 0.668 365s -0.442 0.059 0.008 0.663 365s 365s x1: [4, 1, 1, 1] 365s 0.270 0.877 0.690 -0.721 365s 365s y: [3, 1, 1, 1] 365s -0.056 -1.769 -0.540 365s test-mul-mat0: iter:33/500 365s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 365s x0: [1, 1, 1, 1] 365s 0.393 365s 365s x1: [1, 4, 1, 1] 365s 0.914 365s -0.145 365s 0.286 365s 0.040 365s 365s y: [1, 4, 1, 1] 365s 0.359 365s -0.057 365s 0.112 365s 0.016 365s testing: mul_mat, [3, 1, 3, 1] = [1, 3, 3, 1] * [1, 1, 3, 1] 365s x0: [1, 3, 3, 1] 365s 0.251 365s -0.097 365s -0.686 365s 365s x1: [1, 1, 3, 1] 365s 0.751 365s 365s y: [3, 1, 3, 1] 365s 0.189 -0.073 -0.515 365s testing: mul_mat, [3, 3, 3, 1] = [1, 3, 3, 1] * [1, 3, 3ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s , 1] 365s x0: [1, 3, 3, 1] 365s 0.512 365s 0.177 365s -0.991 365s 365s x1: [1, 3, 3, 1] 365s -0.771 365s -0.749 365s -0.968 365s 365s y: [3, 3, 3, 1] 365s -0.394 -0.137 0.764 365s -0.383 -0.133 0.743 365s -0.496 -0.172 0.960 365s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 365s x0: [1, 3, 1, 1] 365s 0.785 365s 0.737 365s -0.391 365s 365s x1: [1, 3, 1, 1] 365s 0.828 365s -0.118 365s 0.423 365s 365s y: [3, 3, 1, 1] 365s 0.650 0.610 -0.323 365s -0.093 -0.087 0.046 365s 0.332 0.312 -0.165 365s testing: mul_mat, [4, 1, 3, 1] = [3, 4, 3, 1] * [3, 1, 3, 1] 365s x0: [3, 4, 3, 1] 365s 0.476 -0.926 0.922 365s 0.979 0.556 -0.593 365s -0.956 0.944 0.132 365s -0.234 -0.917 0.769 365s 365s x1: [3, 1, 3, 1] 365s -0.581 -0.780 -0.256 365s 365s y: [4, 1, 3, 1] 365s 0.210 -0.851 -0.214 0.654 365s testing: mul_mat, [4, 3, 3, 1] = [4, 4, 3, 1] * [4, 3, 3, 1] 365s x0: [4, 4, 3, 1] 365s 0.998 0.754 0.840 0.961 365s 0.998 0.549 0.135 0.470 365s -0.651 -0.660 -0.525 0.561 365s -0.170 -0.561 0.442 0.791 365s 365s x1: [4, 3, 3, 1] 365s 0.544 0.940 -0.216 0.467 365s -0.653 0.915 0.236 -0.389 365s 0.846 -0.167 0.756 0.200 365s 365s y: [4, 3, 3, 1] 365s 1.518 1.249 -0.600 -0.346 365s -0.138 -0.300 -0.522 -0.606 365s 1.545 0.948 -0.725 0.442 365s test-mul-mat0: iter:34/500 365s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 365s x0: [3, 1, 1, 1] 365s -0.696 0.363 0.754 365s 365s x1: [3, 2, 1, 1] 365s -0.448 0.345 -0.382 365s 0.727 0.444 0.296 365s 365s y: [1, 2, 1, 1] 365s 0.148 365s -0.121 365s testing: mul_mat, [2, 1, 2, 1] = [3, 2, 2, 1] * [3, 1, 2, 1] 365s x0: [3, 2, 2, 1] 365s 0.501 -0.466 -0.957 365s -0.768 0.979 0.167 365s 365s x1: [3, 1, 2, 1] 365s 0.163 0.426 -0.632 365s 365s y: [2, 1, 2, 1] 365s 0.488 0.187 365s testing: mul_mat, [1, 2, 2, 4] = [3, 1, 2, 4] * [3, 2, 2, 4] 365s x0: [3, 1, 2, 4] 365s -0.709 0.344 0.232 365s 365s x1: [3, 2, 2, 4] 365s -0.533 -0.596 0.815 365s 0.812 0.022 0.541 365s 365s y: [1, 2, 2, 4] 365s 0.362 365s -0.443 365s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 365s x0: [3, 2, 1, 1] 365s 0.637 -0.696 -0.376 365s 0.280 0.928 0.536 365s 365s x1: [3, 1, 1, 1] 365s 0.960 0.801 0.788 365s 365s y: [2, 1, 1, 1] 365s -0.241 1.434 365s testing: mul_mat, [2, 3, 2, 1] = [2, 2, 2, 1] * [2, 3, 2, 1] 365s x0: [2, 2, 2, 1] 365s 0.713 -0.022 365s 0.280 0.713 365s 365s x1: [2, 3, 2, 1] 365s -0.285 -0.947 365s -0.036 0.166 365s -1.000 0.574 365s 365s y: [2, 3, 2, 1] 365s -0.182 -0.755 365s -0.029 0.109 365s -0.725 0.129 365s testing: mul_mat, [3, 2, 2, 4] = [2, 3, 2, 4] * [2, 2, 2, 4] 365s x0: [2, 3, 2, 4] 365s -0.759 0.850 365s 0.484 0.798 365s -0.211 0.407 365s 365s x1: [2, 2, 2, 4] 365s -0.281 0.053 365s -0.671 -0.990 365s 365s y: [3, 2, 2, 4] 365s 0.258 -0.093 0.081 365s -0.333 -1.114 -0.261 365s test-mul-mat0: iter:35/500 365s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 365s x0: [4, 1, 1, 1] 365s 0.346 0.005 -0.682 -0.491 365s 365s x1: [4, 3, 1, 1] 365s 0.001 -0.027 -0.375 -0.427 365s -0.952 0.923 0.189 0.037 365s -0.305 0.904 -0.734 -0.049 365s 365s y: [1, 3, 1, 1] 365s 0.466 365s -0.472 365s 0.423 365s testing: mul_mat, [4, 1, 4, 1] = [4, 4, 4, 1] * [4, 1, 4, 1] 365s x0: [4, 4, 4, 1] 365s -0.453 0.214 0.022 0.737 365s -0.749 0.718 0.641 -0.483 365s -0.331 0.048 0.863 0.673 365s 0.366 -0.628 -0.621 0.336 365s 365s x1: [4, 1, 4, 1] 365s -0.294 0.970 -0.958 -0.131 365s 365s y: [4, 1, 4, 1] 365s 0.223 0.366 -0.771 -0.166 365s testing: mul_mat, [4, 4, 4, 3] = [4, 4, 4, 3] * [4, 4, 4, 3] 365s x0: [4, 4, 4, 3] 365s -0.176 0.174 -0.362 -0.758 365s -0.155 -0.788 -0.774 0.357 365s 0.415 0.698 0.995 0.590 365s 0.711 0.269 -0.808 0.400 365s 365s x1: [4, 4, 4, 3] 365s -0.244 0.320 -0.904 -0.473 365s -0.411 0.492 -0.033 -0.094 365s -0.249 0.080 -0.411 -0.617 365s 0.572 0.607 -0.572 0.313 365s 365s y: [4, 4, 4, 3] 365s 0.785 0.316 -1.056 0.454 365s 0.241 -0.332 0.085 -0.171 365s 0.675 0.073 -0.821 -0.070 365s -0.025 -0.013 0.277 1.158 365s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 365s x0: [4, 2, 1, 1] 365s 0.279 0.936 0.918 0.773 365s -0.475 -0.201 0.952 -0.055 365s 365s x1: [4, 4, 1, 1] 365s 0.040 -0.821 0.262 0.146 365s -0.539 -0.937 -0.201 0.310 365s -0.812 -0.619 0.888 -0.804 365s 0.949 -0.253 -0.560 -0.301 365s 365s y: [2, 4, 1, 1] 365s -0.404 0.388 365s -0.973 0.236 365s -0.612 1.400 365s -0.719 -0.916 365s testing: mul_mat, [4, 4, 4, 1] = [2, 4, 4, 1] * [2, 4, 4, 1] 365s x0: [2, 4, 4, 1] 365s -0.303 0.009 365s -0.398 0.345 365s 0.122 -0.932 365s -0.248 -0.437 365s 365s x1: [2, 4, 4, 1] 365s -0.675 0.024 365s 0.928 0.368 365s -0.955 -0.945 365s -0.593 -0.777 365s 365s y: [4, 4, 4, 1] 365s 0.205 0.277 -0.105 0.157 365s -0.278 -0.243 -0.230 -0.391 365s 0.281 0.054 0.765 0.651 365s 0.172 -0.032 0.651 0.487 365s testing: mul_mat, [2, 2, 4, 3] = [4, 2, 4, 3] * [ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s 4, 2, 4, 3] 365s x0: [4, 2, 4, 3] 365s -0.092 0.009 -0.303 -0.002 365s -0.261 -0.002 0.730 0.028 365s 365s x1: [4, 2, 4, 3] 365s 0.257 -0.006 0.661 -0.734 365s -0.661 0.729 -0.171 -0.632 365s 365s y: [2, 2, 4, 3] 365s -0.222 0.394 365s 0.121 0.029 365s test-mul-mat0: iter:36/500 365s testing: mul_mat, [3, 2, 1, 1] = [3, 3, 1, 1] * [3, 2, 1, 1] 365s x0: [3, 3, 1, 1] 365s 0.084 -0.671 0.317 365s 0.870 -0.249 0.021 365s 0.748 0.900 0.418 365s 365s x1: [3, 2, 1, 1] 365s 0.813 -0.358 -0.359 365s 0.784 0.647 -0.021 365s 365s y: [3, 2, 1, 1] 365s 0.194 0.789 0.137 365s -0.375 0.521 1.161 365s testing: mul_mat, [1, 3, 2, 1] = [3, 1, 2, 1] * [3, 3, 2, 1] 365s x0: [3, 1, 2, 1] 365s -0.301 -0.492 0.944 365s 365s x1: [3, 3, 2, 1] 365s 0.161 -0.369 0.545 365s -0.255 -0.172 -0.225 365s -0.900 0.735 0.389 365s 365s y: [1, 3, 2, 1] 365s 0.647 365s -0.052 365s 0.277 365s testing: mul_mat, [4, 1, 2, 1] = [3, 4, 2, 1] * [3, 1, 2, 1] 365s x0: [3, 4, 2, 1] 365s -0.882 0.245 0.308 365s -0.053 -0.980 0.408 365s -0.319 0.409 -0.030 365s -0.718 -0.924 0.138 365s 365s x1: [3, 1, 2, 1] 365s 0.653 0.588 -0.718 365s 365s y: [4, 1, 2, 1] 365s -0.653 -0.904 0.054 -1.110 365s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 365s x0: [3, 3, 1, 1] 365s -0.342 -0.059 0.632 365s -0.255 -0.178 0.786 365s 0.526 -0.336 0.782 365s 365s x1: [3, 4, 1, 1] 365s 0.876 0.425 -0.628 365s 0.277 -0.088 -0.928 365s 0.840 0.030 0.317 365s 0.148 0.977 0.336 365s 365s y: [3, 4, 1, 1] 365s -0.722 -0.793 -0.174 365s -0.676 -0.784 -0.551 365s -0.089 0.029 0.680 365s 0.104 0.052 0.013 365s testing: mul_mat, [2, 3, 2, 1] = [3, 2, 2, 1] * [3, 3, 2, 1] 365s x0: [3, 2, 2, 1] 365s -0.540 -0.263 0.425 365s -0.321 0.118 -0.737 365s 365s x1: [3, 3, 2, 1] 365s 0.374 -0.722 -0.334 365s 0.815 0.278 0.494 365s -0.962 0.115 0.583 365s 365s y: [2, 3, 2, 1] 365s -0.154 0.041 365s -0.304 -0.592 365s 0.737 -0.108 365s testing: mul_mat, [1, 3, 2, 1] = [2, 1, 2, 1] * [2, 3, 2, 1] 365s x0: [2, 1, 2, 1] 365s -0.304 -0.568 365s 365s x1: [2, 3, 2, 1] 365s 0.064 -0.690 365s 0.375 -0.121 365s 0.588 -0.131 365s 365s y: [1, 3, 2, 1] 365s 0.372 365s -0.045 365s -0.104 365s test-mul-mat0: iter:37/500 365s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 365s x0: [4, 1, 1, 1] 365s -0.874 0.163 -0.494 -0.843 365s 365s x1: [4, 1, 1, 1] 365s 0.974 -0.243 0.879 -0.763 365s 365s y: [1, 1, 1, 1] 365s -0.682 365s testing: mul_mat, [4, 1, 3, 1] = [4, 4, 3, 1] * [4, 1, 3, 1] 365s x0: [4, 4, 3, 1] 365s -0.777 -0.300 0.405 0.280 365s 0.879 0.181 -0.483 0.518 365s 0.636 -0.509 -0.724 0.515 365s -0.273 0.092 0.641 0.891 365s 365s x1: [4, 1, 3, 1] 365s 0.872 0.570 -0.533 0.247 365s 365s y: [4, 1, 3, 1] 365s -0.996 1.256 0.777 -0.307 365s testing: mul_mat, [2, 4, 3, 3] = [4, 2, 3, 3] * [4, 4, 3, 3] 365s x0: [4, 2, 3, 3] 365s 0.048 0.527 -0.941 -0.601 365s -0.239 -0.581 0.748 -0.731 365s 365s x1: [4, 4, 3, 3] 365s 0.842 -0.924 0.314 -0.892 365s 0.086 -0.068 -0.571 -0.787 365s 0.917 0.510 0.906 0.102 365s -0.406 -0.428 -0.491 -0.623 365s 365s y: [2, 4, 3, 3] 365s -0.206 1.223 365s 0.978 0.167 365s -0.600 0.088 365s 0.590 0.433 365s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 365s x0: [4, 1, 1, 1] 365s 0.677 -0.040 -0.264 0.564 365s 365s x1: [4, 2, 1, 1] 365s -0.661 0.033 0.004 0.410 365s 0.920 0.481 -0.707 0.026 365s 365s y: [1, 2, 1, 1] 365s -0.219 365s 0.805 365s testing: mul_mat, [2, 4, 3, 1] = [1, 2, 3, 1] * [1, 4, 3, 1] 365s x0: [1, 2, 3, 1] 365s -0.485 365s -0.712 365s 365s x1: [1, 4, 3, 1] 365s 0.014 365s 0.395 365s -0.505 365s -0.414 365s 365s y: [2, 4, 3, 1] 365s -0.007 -0.010 365s -0.191 -0.281 365s 0.245 0.360 365s 0.201 0.295 365s testing: mul_mat, [2, 1, 3, 3] = [2, 2, 3, 3] * [2, 1, 3, 3] 365s x0: [2, 2, 3, 3] 365s -0.419 0.453 365s 0.695 -0.998 365s 365s x1: [2, 1, 3, 3] 365s 0.909 -0.749 365s 365s y: [2, 1, 3, 3] 365s -0.720 1.380 365s test-mul-mat0: iter:38/500 365s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 3, 1, 1] 365s -0.789 0.010 365s -0.182 -0.008 365s -0.005 -0.977 365s 365s x1: [2, 2, 1, 1] 365s -0.421 -0.338 365s -0.373 0.881 365s 365s y: [3, 2, 1, 1] 365s 0.329 0.079 0.332 365s 0.304 0.061 -0.858 365s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 365s x0: [2, 1, 1, 1] 365s 0.459 -0.094 365s 365s x1: [2, 3, 1, 1] 365s -0.354 -0.207 365s -0.774 -0.262 365s 0.419 -0.180 365s 365s y: [1, 3, 1, 1] 365s -0.143 365s -0.331 365s 0.209 365s testing: mul_mat, [4, 1, 1, 2] = [2, 4, 1, 2] * [2, 1, 1, 2] 365s x0: [2, 4, 1, 2] 365s -0.759 0.173 365s -0.800 0.427 365s -0.126 -0.856 365s -0.994 0.536 365s 365s x1: [2, 1, 1, 2] 365s -0.374 -0.275 365s 365s y: [4, 1, 1, 2] 365s 0.236 0.181 0.282 0.224 365s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 365s x0: [2, 1, 1, 1] 365s -0.798 0.747 365s 365s x1: [2, 4, 1, 1] 365s -0.749 -0.380 365s -0.315 -0.524 365s 0.358 -0.895 365s 0.296 0.120 365s 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s y: [1, 4, 1, 1] 365s 0.314 365s -0.140 365s -0.954 365s -0.147 365s testing: mul_mat, [2, 2, 1, 1] = [1, 2, 1, 1] * [1, 2, 1, 1] 365s x0: [1, 2, 1, 1] 365s 0.733 365s -0.983 365s 365s x1: [1, 2, 1, 1] 365s -0.712 365s 0.775 365s 365s y: [2, 2, 1, 1] 365s -0.522 0.700 365s 0.568 -0.762 365s testing: mul_mat, [1, 1, 1, 2] = [2, 1, 1, 2] * [2, 1, 1, 2] 365s x0: [2, 1, 1, 2] 365s 0.450 -0.485 365s 365s x1: [2, 1, 1, 2] 365s -0.894 0.933 365s 365s y: [1, 1, 1, 2] 365s -0.854 365s test-mul-mat0: iter:39/500 365s testing: mul_mat, [3, 1, 1, 1] = [3, 3, 1, 1] * [3, 1, 1, 1] 365s x0: [3, 3, 1, 1] 365s -0.188 -0.120 0.782 365s -0.084 -0.823 -0.098 365s -0.520 -0.621 -0.351 365s 365s x1: [3, 1, 1, 1] 365s 0.804 -0.874 -0.596 365s 365s y: [3, 1, 1, 1] 365s -0.513 0.710 0.334 365s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 365s x0: [3, 2, 1, 1] 365s 0.064 0.182 0.358 365s -0.599 0.108 0.188 365s 365s x1: [3, 3, 1, 1] 365s -0.232 -0.846 0.582 365s -0.499 -0.829 0.688 365s -0.567 0.614 -0.333 365s 365s y: [2, 3, 1, 1] 365s 0.039 0.157 365s 0.063 0.339 365s -0.044 0.343 365s testing: mul_mat, [4, 2, 1, 4] = [3, 4, 1, 4] * [3, 2, 1, 4] 365s x0: [3, 4, 1, 4] 365s -0.897 0.869 -0.461 365s -0.495 -0.023 0.727 365s 0.140 -0.763 -0.281 365s 0.902 0.902 -0.864 365s 365s x1: [3, 2, 1, 4] 365s -0.365 0.261 -0.008 365s -0.238 0.664 0.416 365s 365s y: [4, 2, 1, 4] 365s 0.557 0.169 -0.248 -0.087 365s 0.599 0.405 -0.657 0.024 365s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 365s x0: [3, 2, 1, 1] 365s 0.919 -0.792 0.224 365s -0.369 0.217 -0.723 365s 365s x1: [3, 4, 1, 1] 365s -0.350 -0.994 -0.911 365s -0.164 -0.324 0.611 365s 0.354 0.583 -0.443 365s 0.891 0.161 -0.458 365s 365s y: [2, 4, 1, 1] 365s 0.262 0.572 365s 0.243 -0.452 365s -0.236 0.316 365s 0.589 0.037 365s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 365s x0: [2, 2, 1, 1] 365s 0.701 0.080 365s 0.445 -0.225 365s 365s x1: [2, 3, 1, 1] 365s -0.758 -0.043 365s -0.417 0.986 365s -0.408 -0.209 365s 365s y: [2, 3, 1, 1] 365s -0.535 -0.328 365s -0.213 -0.408 365s -0.303 -0.135 365s testing: mul_mat, [2, 2, 1, 4] = [2, 2, 1, 4] * [2, 2, 1, 4] 365s x0: [2, 2, 1, 4] 365s 0.336 0.412 365s 0.921 0.578 365s 365s x1: [2, 2, 1, 4] 365s 0.267 0.731 365s -0.219 0.356 365s 365s y: [2, 2, 1, 4] 365s 0.391 0.668 365s 0.073 0.004 365s test-mul-mat0: iter:40/500 365s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 365s x0: [1, 4, 1, 1] 365s 0.223 365s 0.347 365s -0.459 365s 0.423 365s 365s x1: [1, 1, 1, 1] 365s 0.822 365s 365s y: [4, 1, 1, 1] 365s 0.183 0.285 -0.377 0.348 365s testing: mul_mat, [4, 4, 3, 1] = [1, 4, 3, 1] * [1, 4, 3, 1] 365s x0: [1, 4, 3, 1] 365s -0.203 365s 0.451 365s 0.551 365s 0.029 365s 365s x1: [1, 4, 3, 1] 365s 0.414 365s 0.469 365s 0.543 365s -0.889 365s 365s y: [4, 4, 3, 1] 365s -0.084 0.186 0.228 0.012 365s -0.095 0.211 0.258 0.013 365s -0.110 0.245 0.299 0.016 365s 0.180 -0.401 -0.490 -0.026 365s testing: mul_mat, [4, 4, 3, 1] = [1, 4, 3, 1] * [1, 4, 3, 1] 365s x0: [1, 4, 3, 1] 365s -0.632 365s 0.933 365s 0.018 365s -0.621 365s 365s x1: [1, 4, 3, 1] 365s 0.204 365s 0.738 365s 0.630 365s -0.449 365s 365s y: [4, 4, 3, 1] 365s -0.129 0.191 0.004 -0.127 365s -0.467 0.689 0.013 -0.459 365s -0.398 0.588 0.011 -0.391 365s 0.284 -0.419 -0.008 0.279 365s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 365s x0: [1, 1, 1, 1] 365s 0.076 365s 365s x1: [1, 4, 1, 1] 365s -0.268 365s 0.568 365s -0.331 365s -0.886 365s 365s y: [1, 4, 1, 1] 365s -0.020 365s 0.043 365s -0.025 365s -0.067 365s testing: mul_mat, [2, 1, 3, 1] = [1, 2, 3, 1] * [1, 1, 3, 1] 365s x0: [1, 2, 3, 1] 365s -0.405 365s 0.758 365s 365s x1: [1, 1, 3, 1] 365s 0.318 365s 365s y: [2, 1, 3, 1] 365s -0.129 0.242 365s testing: mul_mat, [2, 1, 3, 1] = [2, 2, 3, 1] * [2, 1, 3, 1] 365s x0: [2, 2, 3, 1] 365s 0.718 -0.382 365s -0.567 0.124 365s 365s x1: [2, 1, 3, 1] 365s -0.305 0.830 365s 365s y: [2, 1, 3, 1] 365s -0.536 0.275 365s test-mul-mat0: iter:41/500 365s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 365s x0: [2, 4, 1, 1] 365s -0.779 -0.381 365s 0.645 0.985 365s 0.655 0.087 365s 0.703 -0.913 365s 365s x1: [2, 4, 1, 1] 365s -0.723 -0.143 365s 0.544 -0.964 365s -0.308 0.887 365s -0.609 -0.398 365s 365s y: [4, 4, 1, 1] 365s 0.617 -0.607 -0.485 -0.378 365s -0.057 -0.598 0.272 1.262 365s -0.098 0.675 -0.124 -1.026 365s 0.626 -0.786 -0.434 -0.065 365s testing: mul_mat, [1, 4, 4, 1] = [2, 1, 4, 1] * [2, 4, 4, 1] 365s x0: [2, 1, 4, 1] 365s -0.278 0.937 365s 365s x1: [2, 4, 4, 1] 365s 0.706 -0.173 365s 0.469 -0.471 365s 0.316 0.297 365s -0.916 0.374 365s 365s y: [1, 4, 4, 1] 365s -0.359 365s -0.572 365s 0.190 365s 0.605 365s testing: mul_mat, [4, 1, 4, 2] = [2, 4, 4, 2] * [2, 1, 4, 2] 365s x0: [2, 4, 4, 2] 365s 0.969 -0.364 365s 0.111 0.222 365s -0.956 0.916 365s 0.944 0.981 365s 365s x1: [2, 1, 4, 2] 365s 0.502 0.178 365s 365s y: [4, 1, 4, 2] 365s 0.422 0.095 -0.317 0.648 365s testing: mul_mat, [3, 4, 1, 1] = [2, 3, 1, 1] * [2, 4, 1, 1] 365s x0: [2, 3, 1, 1] 365s 0.269 0.730 365s -0.248 0.476 365s 0.697 0.476 365s 365s x1: [2, 4, 1, 1] 365s -0.916 -0.053 365s -0.214 0.944 365s -0.018 -0.504 365s 0.841 -0.020 365s 365s y: [3, 4, 1, 1] 365s -0.285 0.202 -0.664 365s 0.632 0.503 0.300 365s -0.373 -0.235 -0.252 365s 0.211 -0.218 0.577 365s testing: mul_mat, [3, 2, 4, 1] = [3, 3, 4, 1] * [3, 2, 4, 1] 365s x0: [3, 3, 4, 1] 365s -0.062 -0.332 -0.647 365s -0.775 0.701 0.674 365s -0.765 0.711 0.368 365s 365s x1: [3, 2, 4, 1] 365s 0.684 0.973 0.657 365s -0.910 -0.258 -0.358 365s 365s y: [3, 2, 4, 1] 365s -0.790 0.594 0.410 365s 0.373 0.283 0.381 365s testing: mul_mat, [4, 3, 4, 2] = [3, 4, 4, 2] * [3, 3, 4, 2] 365s x0: [3, 4, 4, 2] 365s -0.619 0.003 -0.940 365s -0.262 0.910 -0.860 365s 0.832 0.892 -0.065 365s -0.382 -0.602 -0.166 365s 365s x1: [3, 3, 4, 2] 365s 0.039 0.568 0.087 365s -0.594 -0.989 0.503 365s -0.583 -0.250 -0.729 365s 365s y: [4, 3, 4, 2] 365s -0.104 0.432 0.533 -0.371 365s -0.109 -1.177 -1.408 0.739 365s 1.046 0.552 -0.661 0.494 365s test-mul-mat0: iter:42/500 365s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 365s x0: [4, 2, 1, 1] 365s 0.110 -0.315 -0.224 0.451 365s 0.649 -0.962 0.996 0.800 365s 365s x1: [4, 4, 1, 1] 365s 0.103 -0.917 -0.438 -0.132 365s -0.603 -0.302 0.512 -0.303 365s -0.724 -0.351 0.851 -0.461 365s 0.157 -0.656 -0.052 -0.977 365s 365s y: [2, 4, 1, 1] 365s 0.338 0.407 365s -0.223 0.167 365s -0.368 0.346 365s -0.205 -0.100 365s testing: mul_mat, [2, 2, 3, 1] = [4, 2, 3, 1] * [4, 2, 3, 1] 365s x0: [4, 2, 3, 1] 365s 0.958 0.165 -0.233 0.996 365s 0.160 -0.433 0.002 0.760 365s 365s x1: [4, 2, 3, 1] 365s 0.005 -0.400 -0.355 0.120 365s 0.030 0.580 -0.777 0.113 365s 365s y: [2, 2, 3, 1] 365s 0.141 0.265 365s 0.418 -0.161 365s testing: mul_mat, [1, 2, 3, 3] = [4, 1, 3, 3] * [4, 2, 3, 3] 365s x0: [4, 1, 3, 3] 365s -0.437 0.808 0.855 0.442 365s 365s x1: [4, 2, 3, 3] 365s 0.702 -0.918 0.145 -0.865 365s 0.264 -0.142 0.253 0.221 365s 365s y: [1, 2, 3, 3] 365s -1.307 365s 0.085 365s testing: mul_mat, [3, 1, 1, 1] = [4, 3, 1, 1] * [4, 1, 1, 1] 365s x0: [4, 3, 1, 1] 365s 0.010 -0.271 -0.008 -0.474 365s 0.732 -0.173 0.037 0.515 365s 0.141 -0.286 -0.393 0.253 365s 365s x1: [4, 1, 1, 1] 365s -0.633 0.235 0.651 -0.702 365s 365s y: [3, 1, 1, 1] 365s 0.257 -0.841 -0.590 365s testing: mul_mat, [4, 4, 3, 1] = [3, 4, 3, 1] * [3, 4, 3, 1] 365s x0: [3, 4, 3, 1] 365s 0.217 0.118 0.792 365s -0.948 -0.157 -0.178 365s -0.740 0.736 0.611 365s 0.886 0.539 0.040 365s 365s x1: [3, 4, 3, 1] 365s 0.203 0.548 -0.653 365s 0.754 0.678 -0.193 365s -0.070 -0.532 -0.233 365s -0.331 0.066 -0.417 365s 365s y: [4, 4, 3, 1] 365s -0.408 -0.163 -0.146 0.450 365s 0.090 -0.787 -0.177 1.026 365s -0.262 0.191 -0.482 -0.358 365s -0.394 0.377 0.039 -0.274 365s testing: mul_mat, [4, 3, 3, 3] = [4, 4, 3, 3] * [4, 3, 3, 3] 365s x0: [4, 4, 3, 3] 365s 0.952 -0.672 -0.698 -0.558 365s -0.500 -0.564 -0.978 -0.398 365s 0.166 0.882 -0.175 -0.298 365s -0.989 0.345 0.563 0.551 365s 365s x1: [4, 3, 3, 3] 365s -0.603 -0.705 0.228 -0.811 365s 0.117 -0.161 0.229 -0.437 365s 0.531 0.248 0.775 -0.965 365s 365s y: [4, 3, 3, 3] 365s 0.193 0.799 -0.521 0.035 365s 0.304 -0.018 -0.033 -0.283 365s 0.337 -0.780 0.459 -0.536 365s test-mul-mat0: iter:43/500 365s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 365s x0: [3, 2, 1, 1] 365s 0.386 -0.427 0.803 365s -0.777 0.222 -0.052 365s 365s x1: [3, 3, 1, 1] 365s 0.238 0.024 -0.141 365s -0.895 0.908 -0.113 365s -0.162 -0.536 0.676 365s 365s y: [2, 3, 1, 1] 365s -0.032 -0.172 365s -0.823 0.903 365s 0.709 -0.028 365s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 365s x0: [3, 1, 1, 1] 365s -0.304 0.561 -0.031 365s 365s x1: [3, 2, 1, 1] 365s 0.410 -0.618 0.647 365s -0.295 -0.464 0.598 365s 365s y: [1, 2, 1, 1] 365s -0.491 365s -0.189 365s testing: mul_mat, [3, 1, 1, 2] = [3, 3, 1, 2] * [3, 1, 1, 2] 365s x0: [3, 3, 1, 2] 365s 0.728 -0.534 -0.884 365s -0.385 0.303 -0.419 365s -0.710 0.449 0.966 365s 365s x1: [3, 1, 1, 2] 365s 0.084 0.675 0.123 365s 365s y: [3, 1, 1, 2] 365s -0.408 0.121 0.362 365s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 365s x0: [3, 3, 1, 1] 365s 0.551 0.279 0.894 365s 0.496 0.962 0.265 365s -0.823 -0.707 -0.126 365s 365s x1: [3, 3, 1, 1] 365s 0.004 0.446 0.044 365s 0.700 0.007 -0.987 365s -0.216 -0.318 0.136 365s 365s y: [3, 3, 1, 1] 365s 0.166 0.442 -0.324 365s -0.495 0.092 -0.456 365s -0.086 -0.377 0.385 365s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 365s x0: [3, 2, 1, 1] 365s -0.985 -0.395 0.976 365s -0.068 0.461 -0.695 365s 365s x1: [3, 3, 1, 1] 365s -0.816 -0.286 -0.160 365s 0.048 0.967 0.029 365s 0.134 -0.833 -0.372 365s 365s y: [2, 3, 1, 1] 365s 0.760 0.035 365s -0.400 0.422 365s -0.166 -0.135 365s testing: mul_mat, [4, 3, 1, 2] = [2, 4, 1, 2] * [2, 3, 1, 2] 365s x0: [2, 4, 1, 2] 365s 0.848 0.242 365s -0.598 0.104 365s -0.943 -0.471 365s -0.437 -0.729 365s 365s x1: [2, 3, 1, 2] 365s -0.533 0.989 365s 0.089 0.149 365s 0.125 0.058 365s 365s y: [4, 3, 1, 2] 365s -0.212 0.422 0.037 -0.488 365s 0.112 -0.038 -0.154 -0.148 365s 0.120 -0.069 -0.145 -0.097 365s test-mul-mat0: iter:44/500 365s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 365s x0: [1, 2, 1, 1] 365s 0.544 365s -0.869 365s 365s x1: [1, 4, 1, 1] 365s 0.839 365s 0.789 365s -0.077 365s -0.104 365s 365s y: [2, 4, 1, 1] 365s 0.456 -0.729 365s 0.429 -0.686 365s -0.042 0.067 365s -0.056 0.090 365s testing: mul_mat, [1, 2, 2, 1] = [1, 1, 2, 1] * [1, 2, 2, 1] 365s x0: [1, 1, 2, 1] 365s -0.939 365s 365s x1: [1, 2, 2, 1] 365s -0.532 365s -0.873 365s 365s y: [1, 2, 2, 1] 365s 0.500 365s 0.820 365s testing: mul_mat, [4, 1, 2, 4] = [1, 4, 2, 4] * [1, 1, 2, 4] 365s x0: [1, 4, 2, 4] 365s -0.524 365s -0.694 365s -0.976 365s -0.352 365s 365s x1: [1, 1, 2, 4] 365s -0.462 365s 365s y: [4, 1, 2, 4] 365s 0.242 0.320 0.451 0.162 365s testing: mul_mat, [4, 4, 1, 1] = [1, 4, 1, 1] * [1, 4, 1, 1] 365s x0: [1, 4, 1, 1] 365s -0.735 365s 0.042 365s 0.001 365s 0.342 365s 365s x1: [1, 4, 1, 1] 365s 0.015 365s 0.617 365s -0.024 365s 0.130 365s 365s y: [4, 4, 1, 1] 365s -0.011 0.001 0.000 0.005 365s -0.453 0.026 0.001 0.211 365s 0.018 -0.001 -0.000 -0.008 365s -0.095 0.005 0.000 0.044 365s testing: mul_mat, [3, 1, 2, 1] = [4, 3, 2, 1] * [4, 1, 2, 1] 365s x0: [4, 3, 2, 1] 365s 0.243 -0.737 -0.649 -0.797 365s -0.196 0.465 0.700 -0.591 365s -0.907 -0.633 -0.619 0.974 365s 365s x1: [4, 1, 2, 1] 365s 0.876 -0.491 -0.091 -0.745 365s 365s y: [3, 1, 2, 1] 365s 1.227 -0.023 -1.154 365s testing: mul_mat, [2, 4, 2, 4] = [3, 2, 2, 4] * [3, 4, 2, 4] 365s x0: [3, 2, 2, 4] 365s -0.331 0.784 -0.972 365s 0.337 0.565 -0.548 365s 365s x1: [3, 4, 2, 4] 365s -0.977 0.727 0.639 365s -0.033 -0.511 0.084 365s -0.713 0.732 0.888 365s -0.620 0.995 0.352 365s 365s y: [2, 4, 2, 4] 365s 0.273 -0.269 365s -0.472 -0.346 365s -0.053 -0.313 365s 0.643 0.160 365s test-mul-mat0: iter:45/500 365s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 365s x0: [3, 2, 1, 1] 365s 0.465 -0.568 0.137 365s -0.043 0.188 0.837 365s 365s x1: [3, 3, 1, 1] 365s 0.437 0.248 0.326 365s -0.795 -0.720 -0.687 365s -0.576 0.538 0.704 365s 365s y: [2, 3, 1, 1] 365s 0.107 0.301 365s -0.055 -0.676 365s -0.477 0.715 365s testing: mul_mat, [1, 2, 2, 1] = [3, 1, 2, 1] * [3, 2, 2, 1] 365s x0: [3, 1, 2, 1] 365s 0.840 -0.012 -0.543 365s 365s x1: [3, 2, 2, 1] 365s 0.359 0.773 0.534 365s -0.189 -0.851 0.775 365s 365s y: [1, 2, 2, 1] 365s 0.002 365s -0.569 365s testing: mul_mat, [3, 1, 2, 2] = [3, 3, 2, 2] * [3, 1, 2, 2] 365s x0: [3, 3, 2, 2] 365s 0.725 -0.518 0.497 365s 0.873 -0.743 -0.705 365s 0.772 -0.529 0.524 365s 365s x1: [3, 1, 2, 2] 365s 0.368 -0.372 0.086 365s 365s y: [3, 1, 2, 2] 365s 0.502 0.537 0.526 365s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 365s x0: [3, 4, 1, 1] 365s 0.562 0.337 -0.529 365s 0.626 0.748 0.298 365s -0.001 -0.163 -0.814 365s -0.253 0.380 -0.694 365s 365s x1: [3, 3, 1, 1] 365s -0.733 -0.871 0.993 365s 0.791 0.017 0.773 365s -0.665 0.746 -0.458 365s 365s y: [4, 3, 1, 1] 365s -1.230 -0.815 -0.665 -0.834 365s 0.042 0.738 -0.633 -0.730 365s 0.119 0.005 0.252 0.770 365s testing: mul_mat, [3, 3, 2, 1] = [4, 3, 2, 1] * [4, 3, 2, 1] 365s x0: [4, 3, 2, 1] 365s 0.821 0.119 -0.794 0.767 365s -0.474 -0.288 0.656 0.083 365s -0.519 -0.213 0.875 -0.771 365s 365s x1: [4, 3, 2, 1] 365s 0.087 -0.056 0.089 0.562 365s 0.427 -0.647 0.534 0.783 365s 0.875 0.801 0.912 0.867 365s 365s y: [3, 3, 2, 1] 365s 0.425 0.080 -0.388 365s 0.451 0.399 -0.221 365s 0.754 0.026 -0.495 365s testing: mul_mat, [2, 4, 2, 2] = [3, 2, 2, 2] * [3, 4, 2, 2] 365s x0: [3, 2, 2, 2] 365s 0.282 -0.425 -0.318 365s 0.760 -0.090 -0.053 365s 365s x1: [3, 4, 2, 2] 365s 0.882 -0.378 0.603 365s -0.857 0.607 -0.252 365s -0.848 0.428 0.274 365s -0.367 -0.453 0.986 365s 365s y: [2, 4, 2, 2] 365s 0.218 0.673 365s -0.419 -0.693 365s -0.508 -0.698 365s -0.225 -0.290 365s test-mul-mat0: iter:46/500 365s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 365s x0: [4, 4, 1, 1] 365s -0.302 0.209 0.723 0.344 365s 0.790 0.051 -0.749 0.066 365s 0.733 0.568 -0.536 0.044 365s 0.931 -0.469 -0.030 -0.244 365s 365s x1: [4, 3, 1, 1] 365s -0.416 0.429 0.165 0.866 365s 0.189 0.740 -0.224 0.871 365s -0.313 0.284 -0.833 0.715 365s 365s y: [4, 3, 1, 1] 365s 0.633 -0.373 -0.111 -0.804 365s 0.236 0.413 0.718 -0.377 365s -0.202 0.439 0.410 -0.573 365s testing: mul_mat, [4, 4, 4, 1] = [4, 4, 4, 1] * [4, 4, 4, 1] 365s x0: [4, 4, 4, 1] 365s -0.787 0.908 -0.581 0.837 365s -0.776 -0.828 0.969 0.285 365s 0.191 -0.225 -0.194 -0.521 365s -0.047 0.996 0.758 0.848 365s 365s x1: [4, 4, 4, 1] 365s -0.ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s 981 0.152 0.341 0.448 365s -0.683 0.207 -0.363 -0.943 365s 0.983 -0.493 -0.256 0.267 365s -0.326 -0.540 -0.732 0.372 365s 365s y: [4, 4, 4, 1] 365s 1.087 1.092 -0.521 0.835 365s 0.147 -0.263 0.385 -0.837 365s -0.849 -0.526 0.209 -0.504 365s 0.503 0.097 0.007 -0.762 365s testing: mul_mat, [3, 4, 4, 4] = [4, 3, 4, 4] * [4, 4, 4, 4] 365s x0: [4, 3, 4, 4] 365s -0.406 -0.789 -0.729 -0.089 365s 0.370 -0.192 0.631 0.140 365s 0.397 -0.766 -0.113 0.991 365s 365s x1: [4, 4, 4, 4] 365s -0.835 -0.259 0.245 -0.539 365s 0.657 -0.892 0.947 0.348 365s 0.778 0.913 0.992 0.443 365s 0.636 -0.517 0.754 0.438 365s 365s y: [3, 4, 4, 4] 365s 0.412 -0.181 -0.696 365s -0.284 1.061 1.182 365s -1.798 0.800 -0.064 365s -0.439 0.872 0.998 365s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 365s x0: [4, 3, 1, 1] 365s 0.016 -0.898 0.063 -0.288 365s -0.768 0.826 -0.929 0.015 365s 0.556 -0.893 -0.281 -0.854 365s 365s x1: [4, 3, 1, 1] 365s -0.655 -0.401 0.877 -0.888 365s -0.011 0.539 -0.132 0.845 365s -0.070 0.426 0.762 -0.767 365s 365s y: [3, 3, 1, 1] 365s 0.660 -0.655 0.505 365s -0.735 0.588 -1.171 365s -0.116 -0.313 0.021 365s testing: mul_mat, [2, 4, 4, 1] = [3, 2, 4, 1] * [3, 4, 4, 1] 365s x0: [3, 2, 4, 1] 365s -0.877 -0.698 0.263 365s 0.256 0.864 0.730 365s 365s x1: [3, 4, 4, 1] 365s -0.632 0.556 0.811 365s -0.017 0.693 -0.208 365s 0.327 -0.708 -0.331 365s 0.439 0.281 -0.793 365s 365s y: [2, 4, 4, 1] 365s 0.379 0.911 365s -0.523 0.442 365s 0.121 -0.770 365s -0.789 -0.224 365s testing: mul_mat, [2, 3, 4, 4] = [2, 2, 4, 4] * [2, 3, 4, 4] 365s x0: [2, 2, 4, 4] 365s 0.265 0.579 365s 0.598 -0.320 365s 365s x1: [2, 3, 4, 4] 365s 0.010 0.959 365s 0.431 -0.661 365s 0.899 -0.217 365s 365s y: [2, 3, 4, 4] 365s 0.558 -0.301 365s -0.268 0.469 365s 0.112 0.607 365s test-mul-mat0: iter:47/500 365s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 365s x0: [4, 1, 1, 1] 365s -0.113 -0.815 0.221 0.828 365s 365s x1: [4, 1, 1, 1] 365s 0.925 0.158 0.018 0.911 365s 365s y: [1, 1, 1, 1] 365s 0.524 365s testing: mul_mat, [2, 1, 2, 1] = [4, 2, 2, 1] * [4, 1, 2, 1] 365s x0: [4, 2, 2, 1] 365s -0.826 0.402 -0.393 -0.119 365s -0.834 -0.355 0.208 0.904 365s 365s x1: [4, 1, 2, 1] 365s -0.290 -0.975 0.424 -0.045 365s 365s y: [2, 1, 2, 1] 365s -0.313 0.636 365s testing: mul_mat, [3, 2, 2, 2] = [4, 3, 2, 2] * [4, 2, 2, 2] 365s x0: [4, 3, 2, 2] 365s 0.273 -0.275 -0.248 0.421 365s -0.247 -0.567 -0.007 0.736 365s -0.631 -0.693 0.841 -0.347 365s 365s x1: [4, 2, 2, 2] 365s 0.132 -0.566 -0.679 0.318 365s 0.655 -0.852 -0.972 0.680 365s 365s y: [3, 2, 2, 2] 365s 0.493 0.527 -0.374 365s 0.940 0.829 -0.877 365s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 365s x0: [4, 3, 1, 1] 365s -0.301 -0.117 0.500 -0.513 365s 0.532 0.448 -0.642 0.575 365s -0.522 -0.363 -0.467 -0.730 365s 365s x1: [4, 3, 1, 1] 365s -0.831 -0.119 -0.210 -0.885 365s 0.261 0.543 0.115 -0.068 365s 0.814 -0.611 0.267 0.036 365s 365s y: [3, 3, 1, 1] 365s 0.613 -0.869 1.221 365s -0.050 0.270 -0.337 365s -0.059 0.008 -0.355 365s testing: mul_mat, [3, 4, 2, 1] = [3, 3, 2, 1] * [3, 4, 2, 1] 365s x0: [3, 3, 2, 1] 365s 0.811 -0.702 -0.576 365s 0.789 0.364 -0.824 365s -0.388 -0.118 0.011 365s 365s x1: [3, 4, 2, 1] 365s -0.874 -0.189 -0.872 365s -0.896 -0.613 0.268 365s -0.727 0.268 -0.942 365s -0.612 -0.471 0.601 365s 365s y: [3, 4, 2, 1] 365s -0.075 -0.040 0.353 365s -0.451 -1.151 0.423 365s -0.235 0.300 0.241 365s -0.512 -1.150 0.299 365s testing: mul_mat, [2, 3, 2, 2] = [3, 2, 2, 2] * [3, 3, 2, 2] 365s x0: [3, 2, 2, 2] 365s -0.834 0.066 0.941 365s 0.652 0.155 -0.716 365s 365s x1: [3, 3, 2, 2] 365s -0.307 0.822 0.352 365s -0.415 -0.449 0.803 365s 0.548 0.978 0.786 365s 365s y: [2, 3, 2, 2] 365s 0.642 -0.326 365s 1.073 -0.916 365s 0.348 -0.054 365s test-mul-mat0: iter:48/500 365s testing: mul_mat, [1, 1, 1, 1] = [2, 1, 1, 1] * [2, 1, 1, 1] 365s x0: [2, 1, 1, 1] 365s -0.079 -0.991 365s 365s x1: [2, 1, 1, 1] 365s -0.245 -0.644 365s 365s y: [1, 1, 1, 1] 365s 0.657 365s testing: mul_mat, [3, 1, 3, 1] = [2, 3, 3, 1] * [2, 1, 3, 1] 365s x0: [2, 3, 3, 1] 365s -0.802 -0.567 365s 0.252 -0.341 365s 0.695 0.460 365s 365s x1: [2, 1, 3, 1] 365s -0.984 -0.924 365s 365s y: [3, 1, 3, 1] 365s 1.313 0.067 -1.109 365s testing: mul_mat, [3, 3, 3, 4] = [2, 3, 3, 4] * [2, 3, 3, 4] 365s x0: [2, 3, 3, 4] 365s -0.528 -0.137 365s -0.385 -0.297 365s 0.013 0.471 365s 365s x1: [2, 3, 3, 4] 365s -0.494 -0.931 365s 0.634 0.456 365s -0.010 0.642 365s 365s y: [3, 3, 3, 4] 365s 0.388 0.467 -0.445 365s -0.397 -0.379 0.223 365s -0.083 -0.187 0.302 365s testing: mul_mat, [3, 3, 1, 1] = [2, 3, 1, 1] * [2, 3, 1, 1] 365s x0: [2, 3, 1, 1] 365s 0.481 -0.955 365s -0.386 -0.525 365s -0.524 0.622 365s 365s x1: [2, 3, 1, 1] 365s 0.137 -0.281 365s 0.719 0.133 365s -0.019 0.216 365s 365s y: [3, 3, 1, 1] 365s 0.335 0.095 -0.247 365s 0.219 -0.347 -0.294 365s -0.216 -0.106 0.145 365s testing: mul_mat, [3, 2, 3, 1] = [3, 3, 3, 1] * [3, 2, 3, 1] 365s x0: [3, 3, 3, 1] 365s -0.914 0.066 -0.453 365s -0.346 0.870 -0.515 365s -0.053 -0.208 0.269 365s 365s x1: [3, 2, 3, 1] 365s 0.427 0.998 -0.086 365s -0.217 -0.905 -0.389 365s 365s y: [3, 2, 3, 1] 365s -0.286 0.766 -0.253 365s 0.314 -0.512 0.095 365s testing: mul_mat, [1, 3, 3, 4] = [3, 1, 3, 4] * [3, 3, 3, 4] 365s x0: [3, 1, 3, 4] 365s 0.341 -0.576 0.745 365s 365s x1: [3, 3, 3, 4] 365s 0.448 -0.242 0.594 365s 0.262 -0.157 -0.752 365s -0.791 0.910 -0.881 365s 365s y: [1, 3, 3, 4] 365s 0.735 365s -0.381 365s -1.450 365s test-mul-mat0: iter:49/500 365s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 365s x0: [4, 4, 1, 1] 365s -0.020 -0.660 0.251 -0.026 365s 0.386 0.840 0.554 -0.951 365s -0.626 0.475 0.137 -0.346 365s -0.426 -0.741 0.567 0.887 365s 365s x1: [4, 3, 1, 1] 365s -0.618 -0.523 0.061 0.514 365s -0.242 0.144 -0.239 -0.279 365s 0.511 -0.727 -0.972 0.337 365s 365s y: [4, 3, 1, 1] 365s 0.360 -1.133 -0.031 1.142 365s -0.143 0.160 0.284 -0.387 365s 0.218 -1.272 -0.914 0.068 365s testing: mul_mat, [2, 4, 3, 1] = [4, 2, 3, 1] * [4, 4, 3, 1] 365s x0: [4, 2, 3, 1] 365s 0.291 0.755 -0.626 0.989 365s -0.341 0.168 -0.838 -0.301 365s 365s x1: [4, 4, 3, 1] 365s 0.205 -0.923 -0.731 0.682 365s 0.138 0.783 -0.560 -0.719 365s -0.456 0.161 0.792 -0.183 365s 0.189 0.129 0.329 -0.831 365s 365s y: [2, 4, 3, 1] 365s 0.495 0.182 365s 0.270 0.770 365s -0.688 -0.426 365s -0.875 -0.069 365s testing: mul_mat, [2, 2, 3, 2] = [4, 2, 3, 2] * [4, 2, 3, 2] 365s x0: [4, 2, 3, 2] 365s -0.193 0.556 -0.785 0.057 365s -0.875 -0.385 -0.741 0.824 365s 365s x1: [4, 2, 3, 2] 365s -0.592 -0.916 0.197 0.783 365s -0.768 -0.903 0.738 0.523 365s 365s y: [2, 2, 3, 2] 365s -0.505 1.370 365s -0.903 0.904 365s testing: mul_mat, [3, 2, 1, 1] = [4, 3, 1, 1] * [4, 2, 1, 1] 365s x0: [4, 3, 1, 1] 365s 0.556 -0.122 0.757 -0.240 365s 0.767 -0.061 -0.674 -0.673 365s -0.607 -0.443 0.777 0.256 365s 365s x1: [4, 2, 1, 1] 365s -0.242 0.446 -0.736 -0.376 365s 0.729 -0.041 0.665 -0.900 365s 365s y: [3, 2, 1, 1] 365s -0.656 0.537 -0.719 365s 1.129 0.719 -0.138 365s testing: mul_mat, [1, 4, 3, 1] = [3, 1, 3, 1] * [3, 4, 3, 1] 365s x0: [3, 1, 3, 1] 365s -0.040 -0.690 -0.493 365s 365s x1: [3, 4, 3, 1] 365s 0.588 -0.338 0.659 365s -0.699 0.153 0.416 365s 0.367 0.707 -0.846 365s 0.037 -0.535 0.600 365s 365s y: [1, 4, 3, 1] 365s -0.115 365s -0.282 365s -0.086 365s 0.072 365s testing: mul_mat, [2, 3, 3, 2] = [1, 2, 3, 2] * [1, 3, 3, 2] 365s x0: [1, 2, 3, 2] 365s 0.648 365s 0.498 365s 365s x1: [1, 3, 3, 2] 365s 0.555 365s -0.384 365s -0.135 365s 365s y: [2, 3, 3, 2] 365s 0.360 0.277 365s -0.248 -0.191 365s -0.088 -0.067 365s test-mul-mat0: iter:50/500 365s testing: mul_mat, [4, 2, 1, 1] = [1, 4, 1, 1] * [1, 2, 1, 1] 365s x0: [1, 4, 1, 1] 365s -0.899 365s -0.452 365s 0.631 365s 0.158 365s 365s x1: [1, 2, 1, 1] 365s 0.988 365s -0.649 365s 365s y: [4, 2, 1, 1] 365s -0.888 -0.447 0.624 0.156 365s 0.584 0.294 -0.410 -0.102 365s testing: mul_mat, [4, 4, 2, 1] = [1, 4, 2, 1] * [1, 4, 2, 1] 365s x0: [1, 4, 2, 1] 365s 0.045 365s -0.077 365s -0.771 365s -0.147 365s 365s x1: [1, 4, 2, 1] 365s 0.205 365s -0.561 365s 0.920 365s -0.349 365s 365s y: [4, 4, 2, 1] 365s 0.009 -0.016 -0.158 -0.030 365s -0.025 0.043 0.433 0.082 365s 0.041 -0.071 -0.709 -0.135 365s -0.016 0.027 0.269 0.051 365s testing: mul_mat, [3, 4, 2, 4] = [1, 3, 2, 4] * [1, 4, 2, 4] 365s x0: [1, 3, 2, 4] 365s -0.769 365s 0.692 365s -0.029 365s 365s x1: [1, 4, 2, 4] 365s -0.266 365s 0.882 365s -0.779 365s -0.810 365s 365s y: [3, 4, 2, 4] 365s 0.205 -0.184 0.008 365s -0.678 0.610 -0.025 365s 0.599 -0.539 0.022 365s 0.623 -0.560 0.023 365s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 365s x0: [1, 3, 1, 1] 365s -0.021 365s 0.228 365s -0.554 365s 365s x1: [1, 3, 1, 1] 365s -0.728 365s -0.031 365s 0.726 365s 365s y: [3, 3, 1, 1] 365s 0.015 -0.166 0.404 365s 0.001 -0.007 0.017 365s -0.015 0.165 -0.403 365s testing: mul_mat, [4, 1, 2, 1] = [3, 4, 2, 1] * [3, 1, 2, 1] 365s x0: [3, 4, 2, 1] 365s -0.358 0.379 0.086 365s -0.745 -0.806 -0.070 365s -0.433 0.071 0.933 365s 0.149 -0.386 -0.209 365s 365s x1: [3, 1, 2, 1] 365s 0.210 -0.080 0.417 365s 365s y: [4, 1, 2, 1] 365s -0.070 -0.121 0.292 -0.025 365s testing: mul_mat, [4, 3, 2, 4] = [4, 4, 2, 4] * [4, 3, 2, 4] 365s x0: [4, 4, 2, 4] 365s -0.582 0.638 0.731 -0.137 365s 0.736 -0.733 0.989 -0.025 365s -0.492 -0.738 -0.181 -0.276 365s -0.009 0.842 0.439 0.847 365s 365s x1: [4, 3, 2, 4] 365s 0.187 0.788 -0.096 0.139 365s -0.103 -0.776 -0.307 0.539 365s -0.521 0.260 -0.312 0.858 365s 365s y: [4, 3, 2, 4] 365s 0.305 -0.538 -0.695 0.738 365s -0.734 0.176 0.531 -0.331 365s 0.124 -0.905 -0.116 0.814 365s test-mul-mat0: iter:51/500 365s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s , 1, 1] 365s x0: [1, 3, 1, 1] 365s 0.654 365s 0.527 365s -0.578 365s 365s x1: [1, 3, 1, 1] 365s -0.040 365s -0.557 365s -0.412 365s 365s y: [3, 3, 1, 1] 365s -0.026 -0.021 0.023 365s -0.364 -0.294 0.322 365s -0.269 -0.217 0.238 365s testing: mul_mat, [4, 3, 2, 1] = [1, 4, 2, 1] * [1, 3, 2, 1] 365s x0: [1, 4, 2, 1] 365s 0.825 365s 0.380 365s -0.512 365s -0.302 365s 365s x1: [1, 3, 2, 1] 365s 0.132 365s -0.412 365s 0.731 365s 365s y: [4, 3, 2, 1] 365s 0.109 0.050 -0.068 -0.040 365s -0.340 -0.157 0.211 0.124 365s 0.603 0.278 -0.374 -0.220 365s testing: mul_mat, [3, 4, 2, 1] = [1, 3, 2, 1] * [1, 4, 2, 1] 365s x0: [1, 3, 2, 1] 365s -0.563 365s -0.151 365s 0.512 365s 365s x1: [1, 4, 2, 1] 365s -0.942 365s 0.280 365s 0.164 365s -0.023 365s 365s y: [3, 4, 2, 1] 365s 0.530 0.143 -0.482 365s -0.158 -0.042 0.143 365s -0.092 -0.025 0.084 365s 0.013 0.003 -0.012 365s testing: mul_mat, [1, 3, 1, 1] = [1, 1, 1, 1] * [1, 3, 1, 1] 365s x0: [1, 1, 1, 1] 365s 0.858 365s 365s x1: [1, 3, 1, 1] 365s 0.954 365s 0.635 365s 0.627 365s 365s y: [1, 3, 1, 1] 365s 0.819 365s 0.545 365s 0.538 365s testing: mul_mat, [1, 1, 2, 1] = [1, 1, 2, 1] * [1, 1, 2, 1] 365s x0: [1, 1, 2, 1] 365s 0.554 365s 365s x1: [1, 1, 2, 1] 365s 0.729 365s 365s y: [1, 1, 2, 1] 365s 0.404 365s testing: mul_mat, [3, 1, 2, 1] = [1, 3, 2, 1] * [1, 1, 2, 1] 365s x0: [1, 3, 2, 1] 365s -0.064 365s -0.044 365s -0.150 365s 365s x1: [1, 1, 2, 1] 365s 0.742 365s 365s y: [3, 1, 2, 1] 365s -0.047 -0.033 -0.111 365s test-mul-mat0: iter:52/500 365s testing: mul_mat, [3, 2, 1, 1] = [4, 3, 1, 1] * [4, 2, 1, 1] 365s x0: [4, 3, 1, 1] 365s 0.942 -0.581 0.056 0.801 365s -0.852 0.930 0.054 0.702 365s -0.815 -0.204 0.954 -0.485 365s 365s x1: [4, 2, 1, 1] 365s -0.043 -0.758 0.886 0.805 365s 0.753 0.418 -0.692 -0.208 365s 365s y: [3, 2, 1, 1] 365s 1.094 -0.055 0.644 365s 0.262 -0.437 -1.259 365s testing: mul_mat, [1, 3, 2, 1] = [4, 1, 2, 1] * [4, 3, 2, 1] 365s x0: [4, 1, 2, 1] 365s 0.565 0.752 0.991 -0.639 365s 365s x1: [4, 3, 2, 1] 365s 0.732 -0.090 0.364 -0.274 365s 0.146 0.378 -0.304 0.698 365s -0.304 -0.855 -0.346 -0.062 365s 365s y: [1, 3, 2, 1] 365s 0.882 365s -0.379 365s -1.117 365s testing: mul_mat, [4, 1, 2, 2] = [4, 4, 2, 2] * [4, 1, 2, 2] 365s x0: [4, 4, 2, 2] 365s -0.947 -0.041 0.833 -0.436 365s 0.009 0.639 -0.871 -0.239 365s 0.630 -0.511 -0.532 -0.865 365s -0.418 0.084 -0.995 0.400 365s 365s x1: [4, 1, 2, 2] 365s 0.870 -0.182 0.763 0.248 365s 365s y: [4, 1, 2, 2] 365s -0.288 -0.833 0.020 -1.039 365s testing: mul_mat, [3, 4, 1, 1] = [4, 3, 1, 1] * [4, 4, 1, 1] 365s x0: [4, 3, 1, 1] 365s -0.862 -0.293 -0.726 -0.816 365s -0.331 0.798 -0.423 -0.936 365s 0.168 -0.520 0.195 0.271 365s 365s x1: [4, 4, 1, 1] 365s 0.269 0.263 0.828 -0.898 365s -0.649 0.265 -0.435 0.928 365s 0.169 0.162 -0.914 -0.843 365s 0.764 0.646 0.994 0.619 365s 365s y: [3, 4, 1, 1] 365s -0.178 0.611 -0.173 365s 0.041 -0.258 -0.081 365s 1.158 1.249 -0.462 365s -2.074 -0.737 0.154 365s testing: mul_mat, [3, 4, 2, 1] = [3, 3, 2, 1] * [3, 4, 2, 1] 365s x0: [3, 3, 2, 1] 365s -0.251 -0.067 0.421 365s -0.204 -0.140 -0.924 365s -0.275 0.995 -0.465 365s 365s x1: [3, 4, 2, 1] 365s -0.511 0.217 -0.460 365s 0.752 0.045 -0.358 365s -0.897 -0.690 0.207 365s -0.970 0.479 -0.631 365s 365s y: [3, 4, 2, 1] 365s -0.080 0.499 0.571 365s -0.343 0.171 0.004 365s 0.359 0.089 -0.536 365s -0.055 0.714 1.037 365s testing: mul_mat, [3, 3, 2, 2] = [3, 3, 2, 2] * [3, 3, 2, 2] 365s x0: [3, 3, 2, 2] 365s -0.049 0.067 -0.288 365s 0.137 0.556 -0.276 365s 0.791 0.067 0.140 365s 365s x1: [3, 3, 2, 2] 365s 0.423 -0.454 -0.115 365s 0.185 -0.824 -0.363 365s 0.066 0.944 0.058 365s 365s y: [3, 3, 2, 2] 365s -0.018 -0.163 0.288 365s 0.041 -0.333 0.040 365s 0.043 0.518 0.124 365s test-mul-mat0: iter:53/500 365s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 365s x0: [1, 3, 1, 1] 365s -0.956 365s -0.929 365s -0.520 365s 365s x1: [1, 3, 1, 1] 365s -0.311 365s 0.697 365s -0.009 365s 365s y: [3, 3, 1, 1] 365s 0.297 0.289 0.162 365s -0.666 -0.648 -0.362 365s 0.009 0.009 0.005 365s testing: mul_mat, [4, 3, 3, 1] = [1, 4, 3, 1] * [1, 3, 3, 1] 365s x0: [1, 4, 3, 1] 365s 0.556 365s -0.938 365s -0.736 365s 0.011 365s 365s x1: [1, 3, 3, 1] 365s 0.087 365s 0.281 365s -0.098 365s 365s y: [4, 3, 3, 1] 365s 0.048 -0.082 -0.064 0.001 365s 0.156 -0.264 -0.207 0.003 365s -0.055 0.092 0.072 -0.001 365s testing: mul_mat, [4, 4, 3, 1] = [1, 4, 3, 1] * [1, 4, 3, 1] 365s x0: [1, 4, 3, 1] 365s -0.211 365s -0.415 365s 0.946 365s -0.915 365s 365s x1: [1, 4, 3, 1] 365s -0.378 365s -0.606 365s 0.285 365s -0.681 365s 365s y: [4, 4, 3, 1] 365s 0.080 0.157 -0.357 0.346 365s 0.128 0.252 -0.573 0.554 365s -0.060 -0.118 0.270 -0.261 365s 0.143 0.283 -0.644 0.623 365s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 365s x0: [1, 2, 1, 1] 365s 0.273 365s -0.373 365s 365s x1: [1, 4, 1, 1] 365s -0.746 365s 0.201 365s -0.323 365s -0.995 365s 365s y: [2, 4, 1, 1] 365s -0.204 0.278 365s 0.055 -0.075 365s -0.088 0.121 365s -0.272 0.372 365s testing: mul_mat, [3, 1, 3, 1] = [2, 3, 3, 1] * [2, 1, 3, 1] 365s x0: [2, 3, 3, 1] 365s 0.879 -0.319 365s -0.240 -0.687 365s -0.353 0.437 365s 365s x1: [2, 1, 3, 1] 365s 0.070 -0.442 365s 365s y: [3, 1, 3, 1] 365s 0.203 0.287 -0.218 365s testing: mul_mat, [2, 2, 3, 1] = [3, 2, 3, 1] * [3, 2, 3, 1] 365s x0: [3, 2, 3, 1] 365s 0.187 -0.332 0.879 365s -0.712 -0.089 0.832 365s 365s x1: [3, 2, 3, 1] 365s -0.049 0.552 -0.925 365s 0.627 -0.175 -0.299 365s 365s y: [2, 2, 3, 1] 365s -1.006 -0.784 365s -0.088 -0.679 365s test-mul-mat0: iter:54/500 365s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 365s x0: [4, 1, 1, 1] 365s 0.412 0.133 0.917 -0.677 365s 365s x1: [4, 2, 1, 1] 365s 0.064 0.098 -0.606 -0.695 365s -0.185 -0.897 -0.774 -0.155 365s 365s y: [1, 2, 1, 1] 365s -0.046 365s -0.801 365s testing: mul_mat, [1, 1, 2, 1] = [4, 1, 2, 1] * [4, 1, 2, 1] 365s x0: [4, 1, 2, 1] 365s 0.514 -0.094 0.726 0.833 365s 365s x1: [4, 1, 2, 1] 365s 0.013 0.749 0.479 -0.921 365s 365s y: [1, 1, 2, 1] 365s -0.484 365s testing: mul_mat, [4, 1, 2, 2] = [4, 4, 2, 2] * [4, 1, 2, 2] 365s x0: [4, 4, 2, 2] 365s -0.477 -0.952 -0.269 -0.457 365s 0.162 -0.027 -0.943 -0.932 365s -0.300 0.890 0.467 -0.394 365s 0.427 0.007 0.845 -0.973 365s 365s x1: [4, 1, 2, 2] 365s 0.239 -0.400 -0.362 0.633 365s 365s y: [4, 1, 2, 2] 365s 0.075 -0.199 -0.846 -0.823 365s testing: mul_mat, [3, 4, 1, 1] = [4, 3, 1, 1] * [4, 4, 1, 1] 365s x0: [4, 3, 1, 1] 365s 0.633 0.282 0.760 0.511 365s 0.949 0.514 -0.819 -0.397 365s 0.435 -0.223 0.593 -0.773 365s 365s x1: [4, 4, 1, 1] 365s 0.183 0.887 0.189 -0.294 365s -0.716 -0.778 -0.071 -0.429 365s -0.278 0.757 -0.715 -0.771 365s -0.569 -0.974 0.382 -0.417 365s 365s y: [3, 4, 1, 1] 365s 0.360 0.592 0.222 365s -0.946 -0.851 0.151 365s -0.900 1.017 -0.118 365s -0.558 -1.187 0.518 365s testing: mul_mat, [2, 4, 2, 1] = [3, 2, 2, 1] * [3, 4, 2, 1] 365s x0: [3, 2, 2, 1] 365s 0.247 -0.385 -0.935 365s 0.462 -0.242 -0.158 365s 365s x1: [3, 4, 2, 1] 365s 0.059 -0.173 0.411 365s -0.054 -0.983 -0.883 365s 0.230 -0.761 0.046 365s 0.801 -0.039 -0.197 365s 365s y: [2, 4, 2, 1] 365s -0.303 0.004 365s 1.191 0.352 365s 0.307 0.283 365s 0.397 0.410 365s testing: mul_mat, [2, 3, 2, 2] = [2, 2, 2, 2] * [2, 3, 2, 2] 365s x0: [2, 2, 2, 2] 365s 0.382 0.776 365s -0.263 -0.856 365s 365s x1: [2, 3, 2, 2] 365s 0.991 0.148 365s 0.415 0.792 365s -0.892 -0.782 365s 365s y: [2, 3, 2, 2] 365s 0.493 -0.387 365s 0.773 -0.788 365s -0.947 0.904 365s test-mul-mat0: iter:55/500 365s testing: mul_mat, [1, 1, 1, 1] = [2, 1, 1, 1] * [2, 1, 1, 1] 365s x0: [2, 1, 1, 1] 365s 0.859 0.701 365s 365s x1: [2, 1, 1, 1] 365s 0.541 0.636 365s 365s y: [1, 1, 1, 1] 365s 0.910 365s testing: mul_mat, [3, 1, 3, 1] = [2, 3, 3, 1] * [2, 1, 3, 1] 365s x0: [2, 3, 3, 1] 365s -0.780 0.681 365s 0.602 0.866 365s 0.827 0.737 365s 365s x1: [2, 1, 3, 1] 365s -0.489 0.639 365s 365s y: [3, 1, 3, 1] 365s 0.816 0.259 0.067 365s testing: mul_mat, [2, 3, 3, 3] = [2, 2, 3, 3] * [2, 3, 3, 3] 365s x0: [2, 2, 3, 3] 365s -0.548 0.526 365s 0.044 0.908 365s 365s x1: [2, 3, 3, 3] 365s 0.972 -0.554 365s 0.490 0.676 365s -0.695 0.191 365s 365s y: [2, 3, 3, 3] 365s -0.824 -0.461 365s 0.087 0.636 365s 0.481 0.143 365s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 4, 1, 1] 365s -0.994 0.891 365s 0.564 -0.710 365s -0.284 0.980 365s -0.719 0.186 365s 365s x1: [2, 2, 1, 1] 365s 0.592 -0.314 365s -0.110 -0.221 365s 365s y: [4, 2, 1, 1] 365s -0.869 0.557 -0.476 -0.484 365s -0.088 0.095 -0.186 0.038 365s testing: mul_mat, [3, 2, 3, 1] = [4, 3, 3, 1] * [4, 2, 3, 1] 365s x0: [4, 3, 3, 1] 365s 0.226 0.515 -0.580 -0.861 365s 0.658 0.638 -0.537 -0.397 365s 0.456 -0.357 0.739 -0.127 365s 365s x1: [4, 2, 3, 1] 365s -0.095 -0.175 0.097 0.719 365s -0.860 0.134 0.717 -0.012 365s 365s y: [3, 2, 3, 1] 365s -0.787 -0.512 -0.001 365s -0.531 -0.861 0.092 365s testing: mul_mat, [3, 4, 3, 3] = [3, 3, 3, 3] * [3, 4, 3, 3] 365s x0: [3, 3, 3, 3] 365s -0.731 -0.772 -0.397 365s 0.887 -0.402 -0.994 365s -0.951 0.291 -0.651 365s 365s x1: [3, 4, 3, 3] 365s 0.577 0.934 -0.093 365s 0.316 -0.927 0.511 365s -0.811 0.930 0.102 365s -0.467 0.059 -0.211 365s 365s y: [3, 4, 3, 3] 365s -1.106 0.228 -0.216 365s 0.281 0.146 -0.903 365s -0.165 -1.195 0.975 365s 0.380 -0.228 0.599 365s test-mul-mat0: iter:56/500 365s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 365s x0: [4, 2, 1, 1] 365s -0.384 -0.647 0.775 0.177 365s -0.869 0.252 -0.860 0.407 365s 365s x1: [4, 4, 1, 1] 365s 0.441 -0.627 -0.060 -0.468 365s -0.624 -0.293 0.367 0.987 365s 0.891 0.833 0.255 -0.283 365s -0.473 0.470 0.790 0.456 365s 365s y: [2, 4, 1, 1] 365s 0.106 -0.680 365s 0.889 0.554 365s -0.734 -0.898 365s 0.571 0.035 365s testing: mul_mat, [1, 2, 4, 1] = [4, 1, 4, 1] * [4, 2, 4, 1] 365s x0: [4, 1, 4, 1] 365s 0.180 0.875 0ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s .745 0.661 365s 365s x1: [4, 2, 4, 1] 365s -0.868 -0.012 -0.838 0.972 365s -0.702 -0.520 0.414 -0.329 365s 365s y: [1, 2, 4, 1] 365s -0.149 365s -0.491 365s testing: mul_mat, [1, 1, 4, 4] = [4, 1, 4, 4] * [4, 1, 4, 4] 365s x0: [4, 1, 4, 4] 365s 0.414 0.038 0.993 0.374 365s 365s x1: [4, 1, 4, 4] 365s 0.366 -0.451 -0.511 -0.506 365s 365s y: [1, 1, 4, 4] 365s -0.563 365s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 365s x0: [4, 4, 1, 1] 365s -0.067 0.535 -0.239 0.676 365s 0.471 -0.267 0.394 0.757 365s 0.098 0.782 -0.940 0.377 365s 0.460 -0.894 0.093 -0.561 365s 365s x1: [4, 1, 1, 1] 365s 0.028 -0.081 0.355 0.055 365s 365s y: [4, 1, 1, 1] 365s -0.093 0.216 -0.374 0.088 365s testing: mul_mat, [4, 4, 4, 1] = [4, 4, 4, 1] * [4, 4, 4, 1] 365s x0: [4, 4, 4, 1] 365s 0.090 -0.103 -0.061 0.182 365s -0.025 -0.465 0.277 -0.626 365s 0.980 0.986 -0.667 0.807 365s -0.625 -0.002 0.663 0.161 365s 365s x1: [4, 4, 4, 1] 365s 0.534 -0.778 0.618 -0.498 365s 0.164 -0.356 0.086 0.152 365s -0.071 -0.674 -0.820 0.848 365s 0.681 0.234 -0.679 -0.386 365s 365s y: [4, 4, 4, 1] 365s 0.000 0.831 -1.058 -0.002 365s 0.074 0.090 -0.125 -0.020 365s 0.268 -0.443 0.498 -0.362 365s 0.008 -0.072 1.039 -0.939 365s testing: mul_mat, [3, 4, 4, 4] = [4, 3, 4, 4] * [4, 4, 4, 4] 365s x0: [4, 3, 4, 4] 365s 0.052 0.331 0.096 -0.870 365s -0.512 -0.876 -0.506 -0.331 365s 0.130 -0.513 -0.122 -0.524 365s 365s x1: [4, 4, 4, 4] 365s 0.684 -0.568 -0.327 -0.990 365s -0.716 -0.119 0.274 0.412 365s 0.421 -0.797 0.723 -0.664 365s -0.062 0.194 0.906 0.776 365s 365s y: [3, 4, 4, 4] 365s 0.677 0.640 0.939 365s -0.409 0.196 -0.282 365s 0.405 0.337 0.724 365s -0.527 -0.854 -0.625 365s test-mul-mat0: iter:57/500 365s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 365s x0: [2, 1, 1, 1] 365s 0.607 -0.385 365s 365s x1: [2, 3, 1, 1] 365s -0.162 0.510 365s 0.988 0.570 365s -0.602 0.217 365s 365s y: [1, 3, 1, 1] 365s -0.295 365s 0.380 365s -0.449 365s testing: mul_mat, [1, 1, 2, 1] = [2, 1, 2, 1] * [2, 1, 2, 1] 365s x0: [2, 1, 2, 1] 365s 0.978 0.485 365s 365s x1: [2, 1, 2, 1] 365s -0.839 0.035 365s 365s y: [1, 1, 2, 1] 365s -0.803 365s testing: mul_mat, [2, 1, 2, 4] = [2, 2, 2, 4] * [2, 1, 2, 4] 365s x0: [2, 2, 2, 4] 365s 0.142 -0.761 365s 0.601 -0.251 365s 365s x1: [2, 1, 2, 4] 365s 0.213 -0.215 365s 365s y: [2, 1, 2, 4] 365s 0.194 0.182 365s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 4, 1, 1] 365s 0.836 -0.533 365s 0.401 -0.222 365s -0.803 0.798 365s -0.040 -0.327 365s 365s x1: [2, 2, 1, 1] 365s 0.458 0.340 365s -0.948 -0.687 365s 365s y: [4, 2, 1, 1] 365s 0.202 0.108 -0.097 -0.129 365s -0.426 -0.227 0.212 0.263 365s testing: mul_mat, [4, 2, 2, 1] = [4, 4, 2, 1] * [4, 2, 2, 1] 365s x0: [4, 4, 2, 1] 365s -0.231 -0.239 0.551 0.415 365s -0.552 0.353 0.617 0.302 365s -0.748 -0.342 -0.371 -0.101 365s -0.179 -0.838 -0.672 -0.164 365s 365s x1: [4, 2, 2, 1] 365s -0.429 0.421 -0.343 -0.744 365s -0.889 0.706 -0.588 0.976 365s 365s y: [4, 2, 2, 1] 365s -0.499 -0.051 0.379 0.076 365s 0.118 0.672 0.543 -0.198 365s testing: mul_mat, [4, 4, 2, 4] = [4, 4, 2, 4] * [4, 4, 2, 4] 365s x0: [4, 4, 2, 4] 365s -0.401 -0.883 -0.558 0.762 365s 0.909 -0.011 -0.175 -0.370 365s -0.266 0.944 0.924 -0.809 365s -0.666 0.076 -0.637 0.859 365s 365s x1: [4, 4, 2, 4] 365s -0.648 0.311 0.916 0.113 365s -0.335 -0.426 0.275 -0.785 365s -0.809 0.904 -0.456 0.607 365s 0.206 0.443 -0.557 0.166 365s 365s y: [4, 4, 2, 4] 365s -0.440 -0.795 1.222 -0.031 365s -0.241 -0.059 0.576 -0.658 365s 0.244 -0.889 0.156 1.419 365s -0.037 0.219 -0.285 0.394 365s test-mul-mat0: iter:58/500 365s testing: mul_mat, [4, 3, 1, 1] = [2, 4, 1, 1] * [2, 3, 1, 1] 365s x0: [2, 4, 1, 1] 365s -0.094 -0.879 365s -0.285 -0.037 365s 0.643 -0.185 365s -0.780 -0.818 365s 365s x1: [2, 3, 1, 1] 365s 0.769 -0.829 365s 0.736 0.785 365s -0.638 0.017 365s 365s y: [4, 3, 1, 1] 365s 0.657 -0.188 0.647 0.079 365s -0.759 -0.239 0.328 -1.215 365s 0.045 0.181 -0.413 0.484 365s testing: mul_mat, [3, 4, 1, 1] = [2, 3, 1, 1] * [2, 4, 1, 1] 365s x0: [2, 3, 1, 1] 365s 0.116 0.202 365s 0.708 -0.942 365s -0.024 -0.611 365s 365s x1: [2, 4, 1, 1] 365s -0.800 0.624 365s -0.115 -0.968 365s 0.939 0.228 365s -0.423 -0.248 365s 365s y: [3, 4, 1, 1] 365s 0.034 -1.155 -0.362 365s -0.209 0.830 0.595 365s 0.154 0.451 -0.162 365s -0.099 -0.066 0.162 365s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 365s x0: [2, 2, 1, 1] 365s 0.893 0.827 365s -0.104 -0.392 365s 365s x1: [2, 3, 1, 1] 365s -0.977 -0.256 365s -0.441 0.758 365s -0.471 -0.079 365s 365s y: [2, 3, 1, 1] 365s -1.084 0.202 365s 0.234 -0.252 365s -0.485 0.080 365s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 2, 1, 1] 365s 0.623 0.607 365s -0.365 0.655 365s 365s x1: [2, 2, 1, 1] 365s -0.210 -0.461 365s 0.423 0.011 365s 365s y: [2, 2, 1, 1] 365s -0.410 -0.226 365s 0.270 -0.147 365s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 2, 1, 1] 365s 0.348 -0.844 365s -0.472 0.406 365s 365s x1: [2, 2, 1, 1] 365s -0.425 -0.166 365s -0.767 0.326 365s 365s y: [2, 2, 1, 1] 365s -0.008 0.133 365s -0.543 0.494 365s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 3, 1, 1] 365s 0.187 0.962 365s -0.222 -0.329 365s -0.975 0.852 365s 365s x1: [2, 2, 1, 1] 365s 0.504 -0.456 365s 0.429 -0.751 365s 365s y: [3, 2, 1, 1] 365s -0.344 0.038 -0.880 365s -0.643 0.152 -1.059 365s test-mul-mat0: iter:59/500 365s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 365s x0: [3, 4, 1, 1] 365s -0.810 0.007 0.561 365s -0.282 0.163 -0.033 365s -0.777 0.707 -0.604 365s -0.529 -0.189 0.583 365s 365s x1: [3, 3, 1, 1] 365s -0.299 0.654 -0.881 365s -0.675 -0.711 0.726 365s 0.980 -0.136 -0.440 365s 365s y: [4, 3, 1, 1] 365s -0.248 0.220 1.227 -0.479 365s 0.949 0.050 -0.417 0.914 365s -1.042 -0.284 -0.593 -0.749 365s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 365s x0: [3, 3, 1, 1] 365s 0.274 0.645 -0.795 365s -0.863 -0.795 -0.583 365s -0.673 0.212 0.978 365s 365s x1: [3, 4, 1, 1] 365s 0.249 -0.164 0.545 365s 0.920 -0.312 -0.597 365s 0.198 0.331 0.801 365s 0.900 -0.015 0.919 365s 365s y: [3, 4, 1, 1] 365s -0.471 -0.402 0.331 365s 0.525 -0.198 -1.269 365s -0.369 -0.901 0.720 365s -0.495 -1.300 0.291 365s testing: mul_mat, [2, 3, 1, 2] = [3, 2, 1, 2] * [3, 3, 1, 2] 365s x0: [3, 2, 1, 2] 365s -0.266 -0.409 -0.770 365s -0.993 -0.764 -0.565 365s 365s x1: [3, 3, 1, 2] 365s 0.046 -0.625 -0.055 365s 0.268 -0.917 0.341 365s 0.739 -0.106 -0.077 365s 365s y: [2, 3, 1, 2] 365s 0.286 0.463 365s 0.041 0.242 365s -0.094 -0.609 365s testing: mul_mat, [3, 2, 1, 1] = [3, 3, 1, 1] * [3, 2, 1, 1] 365s x0: [3, 3, 1, 1] 365s 0.005 -0.266 0.152 365s -0.961 -0.493 0.379 365s 0.514 -0.578 -0.471 365s 365s x1: [3, 2, 1, 1] 365s 0.518 -0.971 0.776 365s -0.214 -0.889 0.116 365s 365s y: [3, 2, 1, 1] 365s 0.379 0.275 0.462 365s 0.253 0.687 0.349 365s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 365s x0: [3, 4, 1, 1] 365s 0.319 -0.117 -0.437 365s -0.947 0.777 0.112 365s -0.741 0.001 0.775 365s -0.028 -0.341 0.089 365s 365s x1: [3, 3, 1, 1] 365s -0.099 -0.950 -0.465 365s 0.635 -0.359 -0.235 365s 0.642 -0.122 0.200 365s 365s y: [4, 3, 1, 1] 365s 0.283 -0.696 -0.288 0.286 365s 0.347 -0.906 -0.653 0.084 365s 0.132 -0.681 -0.321 0.042 365s testing: mul_mat, [2, 3, 1, 2] = [4, 2, 1, 2] * [4, 3, 1, 2] 365s x0: [4, 2, 1, 2] 365s -0.437 -0.433 0.423 0.792 365s 0.648 0.675 0.656 0.238 365s 365s x1: [4, 3, 1, 2] 365s -0.883 0.814 -0.397 -0.148 365s -0.678 0.025 -0.996 0.700 365s 0.554 -0.095 0.751 -0.912 365s 365s y: [2, 3, 1, 2] 365s -0.252 -0.318 365s 0.419 -0.909 365s -0.606 0.570 365s test-mul-mat0: iter:60/500 365s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 365s x0: [3, 4, 1, 1] 365s -0.759 0.262 0.938 365s -0.532 -0.177 -0.073 365s -0.528 0.386 -0.424 365s 0.039 0.061 0.999 365s 365s x1: [3, 1, 1, 1] 365s 0.778 -0.727 -0.327 365s 365s y: [4, 1, 1, 1] 365s -1.088 -0.262 -0.552 -0.341 365s testing: mul_mat, [1, 4, 2, 1] = [3, 1, 2, 1] * [3, 4, 2, 1] 365s x0: [3, 1, 2, 1] 365s 0.242 -0.387 0.049 365s 365s x1: [3, 4, 2, 1] 365s -0.306 -0.147 0.237 365s -0.047 0.497 -0.204 365s -0.764 -0.856 -0.944 365s 0.073 0.696 0.067 365s 365s y: [1, 4, 2, 1] 365s -0.005 365s -0.214 365s 0.101 365s -0.249 365s testing: mul_mat, [2, 1, 2, 1] = [3, 2, 2, 1] * [3, 1, 2, 1] 365s x0: [3, 2, 2, 1] 365s -0.833 0.136 0.237 365s 0.863 -0.797 0.235 365s 365s x1: [3, 1, 2, 1] 365s -0.025 -0.474 -0.716 365s 365s y: [2, 1, 2, 1] 365s -0.214 0.188 365s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 365s x0: [3, 2, 1, 1] 365s -0.940 -0.693 -0.267 365s 0.686 0.734 0.282 365s 365s x1: [3, 2, 1, 1] 365s -0.986 -0.780 -0.836 365s -0.216 0.447 -0.363 365s 365s y: [2, 2, 1, 1] 365s 1.691 -1.484 365s -0.010 0.078 365s testing: mul_mat, [3, 3, 2, 1] = [2, 3, 2, 1] * [2, 3, 2, 1] 365s x0: [2, 3, 2, 1] 365s -0.892 0.924 365s -0.144 0.475 365s 0.677 0.691 365s 365s x1: [2, 3, 2, 1] 365s -0.740 0.017 365s 0.210 -0.717 365s -0.903 0.374 365s 365s y: [3, 3, 2, 1] 365s 0.675 0.114 -0.489 365s -0.850 -0.370 -0.353 365s 1.151 0.307 -0.353 365s testing: mul_mat, [2, 2, 2, 1] = [3, 2, 2, 1] * [3, 2, 2, 1] 365s x0: [3, 2, 2, 1] 365s 0.040 -0.309 0.130 365s 0.284 -0.524 -0.344 365s 365s x1: [3, 2, 2, 1] 365s 0.651 0.961 -0.193 365s 0.385 -0.306 -0.911 365s 365s y: [2, 2, 2, 1] 365s -0.296 -0.253 365s -0.008 0.583 365s test-mul-mat0: iter:61/500 365s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 365s x0: [2, 4, 1, 1] 365s 0.741 0.194 365s -0.149 0.548 365s 0.867 0.229 365s -0.412 0.151 365s 365s x1: [2, 4, 1, 1] 365s 0.375 -0.001 365s 0.894 0.336 365s 0.806 0.279 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s -0.970 0.895 365s 365s y: [4, 4, 1, 1] 365s 0.277 -0.056 0.325 -0.154 365s 0.727 0.051 0.852 -0.318 365s 0.651 0.033 0.763 -0.290 365s -0.545 0.635 -0.636 0.535 365s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 365s x0: [2, 2, 1, 1] 365s -0.066 -0.192 365s -0.174 0.433 365s 365s x1: [2, 4, 1, 1] 365s 0.921 0.064 365s -0.720 -0.423 365s 0.648 -0.734 365s -0.090 -0.844 365s 365s y: [2, 4, 1, 1] 365s -0.073 -0.133 365s 0.128 -0.058 365s 0.098 -0.431 365s 0.168 -0.350 365s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 2, 1, 1] 365s -0.761 -0.776 365s 0.566 -0.866 365s 365s x1: [2, 2, 1, 1] 365s 0.051 -0.800 365s -0.567 -0.055 365s 365s y: [2, 2, 1, 1] 365s 0.582 0.721 365s 0.474 -0.274 365s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 3, 1, 1] 365s 0.195 -0.654 365s 0.228 0.149 365s 0.443 -0.493 365s 365s x1: [2, 2, 1, 1] 365s 0.147 0.307 365s 0.328 0.999 365s 365s y: [3, 2, 1, 1] 365s -0.172 0.079 -0.086 365s -0.589 0.223 -0.347 365s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 365s x0: [3, 2, 1, 1] 365s 0.570 0.445 -0.829 365s 0.120 0.004 0.645 365s 365s x1: [3, 2, 1, 1] 365s -0.373 0.726 -0.845 365s -0.107 -0.364 -0.689 365s 365s y: [2, 2, 1, 1] 365s 0.811 -0.587 365s 0.348 -0.459 365s testing: mul_mat, [4, 3, 1, 1] = [2, 4, 1, 1] * [2, 3, 1, 1] 365s x0: [2, 4, 1, 1] 365s -0.513 -0.091 365s -0.946 -0.666 365s 0.139 0.714 365s -0.514 -0.648 365s 365s x1: [2, 3, 1, 1] 365s 0.437 0.116 365s 0.181 0.676 365s 0.340 -0.252 365s 365s y: [4, 3, 1, 1] 365s -0.234 -0.490 0.143 -0.299 365s -0.154 -0.622 0.508 -0.531 365s -0.151 -0.153 -0.133 -0.011 365s test-mul-mat0: iter:62/500 365s testing: mul_mat, [1, 4, 1, 1] = [4, 1, 1, 1] * [4, 4, 1, 1] 365s x0: [4, 1, 1, 1] 365s 0.839 -0.280 0.558 0.893 365s 365s x1: [4, 4, 1, 1] 365s -0.412 0.014 -0.800 0.224 365s 0.325 -0.181 -0.206 -0.555 365s -0.736 0.798 -0.384 0.909 365s 0.234 0.731 0.091 -0.090 365s 365s y: [1, 4, 1, 1] 365s -0.596 365s -0.287 365s -0.243 365s -0.038 365s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 365s x0: [4, 2, 1, 1] 365s 0.155 -0.126 -0.380 -0.986 365s -0.819 0.208 0.028 -0.619 365s 365s x1: [4, 1, 1, 1] 365s 0.859 -0.956 -0.198 -0.807 365s 365s y: [2, 1, 1, 1] 365s 1.124 -0.409 365s testing: mul_mat, [2, 2, 1, 2] = [4, 2, 1, 2] * [4, 2, 1, 2] 365s x0: [4, 2, 1, 2] 365s -0.804 0.748 0.818 -0.002 365s 0.941 0.576 -0.847 -0.185 365s 365s x1: [4, 2, 1, 2] 365s -0.568 -0.647 0.200 0.226 365s -0.202 0.464 0.024 0.414 365s 365s y: [2, 2, 1, 2] 365s 0.136 -1.118 365s 0.528 -0.019 365s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 365s x0: [4, 1, 1, 1] 365s 0.877 0.513 0.122 -0.820 365s 365s x1: [4, 2, 1, 1] 365s -0.424 0.063 -0.654 -0.960 365s -0.913 0.760 0.413 -0.655 365s 365s y: [1, 2, 1, 1] 365s 0.368 365s 0.177 365s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 365s x0: [1, 2, 1, 1] 365s 0.374 365s 0.090 365s 365s x1: [1, 4, 1, 1] 365s 0.402 365s -0.103 365s -0.624 365s 0.149 365s 365s y: [2, 4, 1, 1] 365s 0.150 0.036 365s -0.039 -0.009 365s -0.233 -0.056 365s 0.056 0.013 365s testing: mul_mat, [4, 1, 1, 2] = [2, 4, 1, 2] * [2, 1, 1, 2] 365s x0: [2, 4, 1, 2] 365s -0.100 0.730 365s 0.893 -0.564 365s -0.112 -0.148 365s -0.724 -0.206 365s 365s x1: [2, 1, 1, 2] 365s -0.708 0.527 365s 365s y: [4, 1, 1, 2] 365s 0.456 -0.930 0.001 0.404 365s test-mul-mat0: iter:63/500 365s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 365s x0: [3, 2, 1, 1] 365s 0.637 -0.716 -0.038 365s 0.331 0.184 -0.145 365s 365s x1: [3, 2, 1, 1] 365s 0.467 -0.263 0.290 365s -0.818 -0.889 -0.620 365s 365s y: [2, 2, 1, 1] 365s 0.475 0.064 365s 0.139 -0.344 365s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 365s x0: [3, 4, 1, 1] 365s 0.203 -0.740 0.978 365s 0.507 0.486 0.426 365s 0.688 -0.775 0.996 365s -0.951 -0.634 0.463 365s 365s x1: [3, 2, 1, 1] 365s -0.782 0.460 -0.414 365s -0.346 -0.688 0.379 365s 365s y: [4, 2, 1, 1] 365s -0.904 -0.349 -1.307 0.260 365s 0.810 -0.348 0.673 0.942 365s testing: mul_mat, [2, 4, 1, 3] = [3, 2, 1, 3] * [3, 4, 1, 3] 365s x0: [3, 2, 1, 3] 365s 0.937 -0.752 0.881 365s 0.801 0.751 0.818 365s 365s x1: [3, 4, 1, 3] 365s -0.214 0.655 0.645 365s -0.104 -0.965 -0.881 365s -0.466 -0.681 0.081 365s 0.865 0.503 0.937 365s 365s y: [2, 4, 1, 3] 365s -0.125 0.848 365s -0.147 -1.528 365s 0.147 -0.819 365s 1.256 1.837 365s testing: mul_mat, [3, 2, 1, 1] = [3, 3, 1, 1] * [3, 2, 1, 1] 365s x0: [3, 3, 1, 1] 365s -0.649 0.753 0.690 365s -0.943 0.985 -0.767 365s 0.808 -0.392 -0.512 365s 365s x1: [3, 2, 1, 1] 365s -0.480 -0.876 0.925 365s -0.908 0.844 -0.837 365s 365s y: [3, 2, 1, 1] 365s 0.289 -1.120 -0.517 365s 0.648 2.330 -0.637 365s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 365s x0: [3, 3, 1, 1] 365s 0.158 0.609 0.733 365s 0.292 0.344 0.269 365s 0.734 -0.746 -0.654 365s 365s x1: [3, 3, 1, 1] 365s 0.491 0.985 -0.694 365s 0.376 -0.300 -0.354 365s 0.997 -0.310 -0.112 365s 365s y: [3, 3, 1, 1] 365s 0.169 0.296 0.080 365s -0.383 -0.089 0.731 365s -0.114 0.154 1.036 365s testing: mul_mat, [1, 3, 1, 3] = [3, 1, 1, 3] * [3, 3, 1, 3] 365s x0: [3, 1, 1, 3] 365s 0.582 -0.877 0.359 365s 365s x1: [3, 3, 1, 3] 365s 0.577 0.432 0.474 365s 0.928 0.488 0.282 365s 0.682 0.474 0.889 365s 365s y: [1, 3, 1, 3] 365s 0.127 365s 0.212 365s 0.300 365s test-mul-mat0: iter:64/500 365s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 365s x0: [1, 3, 1, 1] 365s 0.243 365s -0.694 365s -0.606 365s 365s x1: [1, 4, 1, 1] 365s -0.777 365s -0.619 365s 0.004 365s 0.915 365s 365s y: [3, 4, 1, 1] 365s -0.189 0.539 0.471 365s -0.150 0.429 0.375 365s 0.001 -0.003 -0.002 365s 0.222 -0.634 -0.554 365s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 365s x0: [1, 3, 1, 1] 365s 0.627 365s -0.878 365s 0.262 365s 365s x1: [1, 3, 1, 1] 365s 0.478 365s 0.388 365s -0.388 365s 365s y: [3, 3, 1, 1] 365s 0.300 -0.420 0.125 365s 0.243 -0.341 0.102 365s -0.243 0.341 -0.102 365s testing: mul_mat, [1, 3, 1, 3] = [1, 1, 1, 3] * [1, 3, 1, 3] 365s x0: [1, 1, 1, 3] 365s 0.695 365s 365s x1: [1, 3, 1, 3] 365s 0.476 365s -0.024 365s -0.725 365s 365s y: [1, 3, 1, 3] 365s 0.331 365s -0.017 365s -0.504 365s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 365s x0: [1, 1, 1, 1] 365s -0.787 365s 365s x1: [1, 1, 1, 1] 365s -0.789 365s 365s y: [1, 1, 1, 1] 365s 0.621 365s testing: mul_mat, [3, 1, 1, 1] = [1, 3, 1, 1] * [1, 1, 1, 1] 365s x0: [1, 3, 1, 1] 365s 0.217 365s -0.652 365s -0.970 365s 365s x1: [1, 1, 1, 1] 365s -0.566 365s 365s y: [3, 1, 1, 1] 365s -0.123 0.369 0.549 365s testing: mul_mat, [2, 1, 1, 3] = [3, 2, 1, 3] * [3, 1, 1, 3] 365s x0: [3, 2, 1, 3] 365s 0.146 0.978 0.232 365s -0.866 -0.797 0.612 365s 365s x1: [3, 1, 1, 3] 365s -0.541 -0.345 -0.576 365s 365s y: [2, 1, 1, 3] 365s -0.550 0.391 365s test-mul-mat0: iter:65/500 365s testing: mul_mat, [4, 4, 1, 1] = [4, 4, 1, 1] * [4, 4, 1, 1] 365s x0: [4, 4, 1, 1] 365s 0.435 -0.569 0.691 -0.904 365s -0.774 0.481 0.342 -0.766 365s 0.705 0.553 0.459 -0.722 365s 0.795 0.143 0.505 -0.539 365s 365s x1: [4, 4, 1, 1] 365s 0.227 -0.334 0.622 0.270 365s 0.702 -0.938 -0.060 -0.140 365s 0.765 -0.914 -0.006 0.743 365s -0.711 -0.774 0.355 -0.291 365s 365s y: [4, 4, 1, 1] 365s 0.474 -0.331 0.066 0.301 365s 0.924 -0.907 0.050 0.469 365s 0.176 -1.602 -0.505 0.073 365s 0.640 0.522 -0.556 -0.339 365s testing: mul_mat, [2, 4, 3, 1] = [4, 2, 3, 1] * [4, 4, 3, 1] 365s x0: [4, 2, 3, 1] 365s -0.149 -0.112 0.739 0.414 365s -0.420 -0.030 0.531 0.731 365s 365s x1: [4, 4, 3, 1] 365s -0.235 -0.224 -0.838 -0.173 365s 0.716 0.022 -0.408 0.802 365s -0.984 -0.665 -0.909 -0.758 365s 0.690 -0.200 -0.647 0.125 365s 365s y: [2, 4, 3, 1] 365s -0.630 -0.465 365s -0.079 0.069 365s -0.763 -0.603 365s -0.506 -0.535 365s testing: mul_mat, [2, 2, 3, 3] = [4, 2, 3, 3] * [4, 2, 3, 3] 365s x0: [4, 2, 3, 3] 365s -0.904 0.667 0.260 0.861 365s 0.948 -0.707 0.976 -0.414 365s 365s x1: [4, 2, 3, 3] 365s 0.294 -0.231 0.694 0.463 365s -0.602 0.124 0.676 0.249 365s 365s y: [2, 2, 3, 3] 365s 0.159 0.928 365s 1.018 -0.102 365s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 365s x0: [4, 4, 1, 1] 365s -0.020 -0.325 -0.654 0.584 365s 0.955 -0.716 -0.674 0.379 365s 0.077 -0.370 -0.571 0.884 365s -0.601 -0.917 0.126 -0.737 365s 365s x1: [4, 2, 1, 1] 365s -0.359 -0.819 0.410 -0.792 365s 0.333 0.116 -0.429 -0.294 365s 365s y: [4, 2, 1, 1] 365s -0.457 -0.333 -0.659 1.601 365s 0.065 0.412 -0.032 -0.144 365s testing: mul_mat, [4, 4, 3, 1] = [4, 4, 3, 1] * [4, 4, 3, 1] 365s x0: [4, 4, 3, 1] 365s 0.853 -0.638 -0.932 -0.929 365s -0.561 0.304 -0.054 0.057 365s 0.666 -0.916 -0.555 -0.723 365s -0.195 -0.286 -0.284 0.538 365s 365s x1: [4, 4, 3, 1] 365s -0.119 -0.087 -0.659 0.522 365s -0.010 0.344 -0.837 0.171 365s -0.247 -0.629 -0.496 0.870 365s -0.058 0.210 0.931 0.922 365s 365s y: [4, 4, 3, 1] 365s 0.083 0.106 -0.012 0.516 365s 0.394 0.166 0.019 0.234 365s -0.154 0.024 0.059 0.837 365s -1.908 0.099 -1.414 0.183 365s testing: mul_mat, [2, 4, 3, 3] = [4, 2, 3, 3] * [4, 4, 3, 3] 365s x0: [4, 2, 3, 3] 365s 0.233 0.024 -0.862 0.606 365s -0.531 -0.322 -0.264 0.225 365s 365s x1: [4, 4, 3, 3] 365s 0.944 -0.058 -0.495 -0.507 365s -0.145 -0.340 0.267 -0.799 365s -0.575 0.476 -0.087 0.721 365s -0.716 0.997 -0.129 0.530 365s 365s y: [2, 4, 3, 3] 365s 0.338 -0.466 365s -0.756 -0.064 365s 0.390 0.338 365s 0.290 0.213 365s test-mul-mat0: iter:66/500 365s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 365s x0: [3, 4, 1, 1] 365s 0.222 -0.767 0.203 365s 0.093 0.139 -0.973 365s 0.963 0.206 0.983 365s -0.433 -0.958 -0.701 365s 365s x1: [3, 4, 1, 1] 365s -0.241 -0.366 -0.274 365s -0.170 -0.321 -0.790 365s -0.390 -0.400 -0.361 365s -0.925 0.325 0.811 365s 365s y: [4, 4, 1, 1] 365s 0.171 0.193 -0.577 0.646 365s 0.048 0.709 -1.007 0.934 365s 0.147 0.259 -0.813 0.805 365s -0.290 -0.830 -0.027 -0.479 365s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 365s x0: [3, 2, 1, 1] 365s -0.654 -0.804 -0.490 365s 0.671 -0.994 0.886 365s 365s x1: [3, 4, 1, 1] 365s -0.961 0.502 0.054 365s -0.765 -0.567 -0.380 365s -0.006 0.067 0.347 365s 0.825 0.746 0.557 365s 365s y: [2, 4, 1, 1] 365s 0.198 -1.096 365s 1.142 -0.285 365s -0.220 0.237 365s -1.412 0.305 365s testing: mul_mat, [2, 2, 1, 3] = [3, 2, 1, 3] * [3, 2, 1, 3] 365s x0: [3, 2, 1, 3] 365s 0.628 -0.580 0.610 365s 0.374 0.977 -0.954 365s 365s x1: [3, 2, 1, 3] 365s -0.107 -0.761 0.089 365s 0.987 0.378 0.116 365s 365s y: [2, 2, 1, 3] 365s 0.429 -0.868 365s 0.471 0.627 365s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 365s x0: [3, 1, 1, 1] 365s 0.134 0.596 -0.916 365s 365s x1: [3, 2, 1, 1] 365s -0.779 -0.620 0.745 365s -0.262 0.006 0.144 365s 365s y: [1, 2, 1, 1] 365s -1.157 365s -0.163 365s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 365s x0: [1, 3, 1, 1] 365s 0.089 365s 0.480 365s -0.303 365s 365s x1: [1, 3, 1, 1] 365s 0.695 365s -0.331 365s 0.870 365s 365s y: [3, 3, 1, 1] 365s 0.062 0.334 -0.211 365s -0.030 -0.159 0.100 365s 0.078 0.418 -0.264 365s testing: mul_mat, [3, 1, 1, 3] = [3, 3, 1, 3] * [3, 1, 1, 3] 365s x0: [3, 3, 1, 3] 365s -0.947 -0.301 -0.556 365s -0.637 -0.416 0.321 365s 0.631 -0.990 0.016 365s 365s x1: [3, 1, 1, 3] 365s 0.066 0.526 -0.584 365s 365s y: [3, 1, 1, 3] 365s 0.103 -0.449 -0.488 365s test-mul-mat0: iter:67/500 365s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 365s x0: [3, 2, 1, 1] 365s 0.762 0.151 -1.000 365s -0.669 -0.681 -0.057 365s 365s x1: [3, 4, 1, 1] 365s -0.768 -0.457 -0.870 365s -0.069 0.126 -0.859 365s 0.375 -0.552 0.156 365s -0.037 0.062 0.306 365s 365s y: [2, 4, 1, 1] 365s 0.215 0.875 365s 0.826 0.009 365s 0.046 0.116 365s -0.325 -0.035 365s testing: mul_mat, [3, 2, 4, 1] = [3, 3, 4, 1] * [3, 2, 4, 1] 365s x0: [3, 3, 4, 1] 365s 0.672 0.249 0.440 365s -0.997 0.568 -0.616 365s -0.617 0.752 0.336 365s 365s x1: [3, 2, 4, 1] 365s -0.620 -0.817 -0.048 365s 0.956 -0.536 -0.800 365s 365s y: [3, 2, 4, 1] 365s -0.641 0.183 -0.247 365s 0.156 -0.765 -1.262 365s testing: mul_mat, [4, 3, 4, 1] = [3, 4, 4, 1] * [3, 3, 4, 1] 365s x0: [3, 4, 4, 1] 365s 0.954 -0.459 -0.842 365s -0.164 -0.438 -0.008 365s 0.482 -0.651 -0.401 365s 0.386 -0.142 0.380 365s 365s x1: [3, 3, 4, 1] 365s 0.610 0.989 -0.647 365s -0.054 -0.672 -0.432 365s -0.519 -0.722 0.335 365s 365s y: [4, 3, 4, 1] 365s 0.673 -0.529 -0.091 -0.151 365s 0.621 0.307 0.584 -0.090 365s -0.446 0.399 0.085 0.029 365s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 365s x0: [3, 2, 1, 1] 365s -0.632 -0.626 -0.576 365s -0.302 0.740 -0.126 365s 365s x1: [3, 4, 1, 1] 365s 0.444 -0.615 -0.528 365s -0.300 -0.378 0.408 365s 0.901 -0.239 -0.129 365s 0.037 0.153 0.807 365s 365s y: [2, 4, 1, 1] 365s 0.408 -0.523 365s 0.191 -0.240 365s -0.346 -0.433 365s -0.584 0.000 365s testing: mul_mat, [2, 3, 4, 1] = [2, 2, 4, 1] * [2, 3, 4, 1] 365s x0: [2, 2, 4, 1] 365s 0.744 0.829 365s 0.269 0.484 365s 365s x1: [2, 3, 4, 1] 365s -0.896 0.632 365s 0.724 0.068 365s -0.773 0.432 365s 365s y: [2, 3, 4, 1] 365s -0.142 0.064 365s 0.595 0.228 365s -0.217 0.001 365s testing: mul_mat, [2, 2, 4, 1] = [2, 2, 4, 1] * [2, 2, 4, 1] 365s x0: [2, 2, 4, 1] 365s 0.765 -0.889 365s -0.856 -0.532 365s 365s x1: [2, 2, 4, 1] 365s 0.014 0.432 365s 0.391 0.854 365s 365s y: [2, 2, 4, 1] 365s -0.374 -0.242 365s -0.461 -0.789 365s test-mul-mat0: iter:68/500 365s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 365s x0: [2, 4, 1, 1] 365s 0.274 0.910 365s 0.269 -0.809 365s 0.058 0.205 365s -0.148 0.475 365s 365s x1: [2, 1, 1, 1] 365s 0.733 0.335 365s 365s y: [4, 1, 1, 1] 365s 0.506 -0.073 0.111 0.050 365s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 365s x0: [2, 4, 1, 1] 365s 0.089 0.575 365s 0.558 0.262 365s -0.960 -0.880 365s -0.132 0.896 365s 365s x1: [2, 4, 1, 1] 365s -0.030 -0.004 365s 0.585 0.438 365s -0.271 0.022 365s -0.666 0.118 365s 365s y: [4, 4, 1, 1] 365s -0.005 -0.018 0.032 0.001 365s 0.304 0.442 -0.948 0.315 365s -0.012 -0.146 0.241 0.055 365s 0.009 -0.341 0.536 0.194 365s testing: mul_mat, [1, 4, 1, 2] = [2, 1, 1, 2] * [2, 4, 1, 2] 365s x0: [2, 1, 1, 2] 365s -0.102 -0.689 365s 365s x1: [2, 4, 1, 2] 365s -0.464 -0.609 365s 0.226 -0.731 365s 0.725 -0.984 365s 0.543 0.636 365s 365s y: [1, 4, 1, 2] 365s 0.467 365s 0.480 365s 0.604 365s -0.493 365s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 365s x0: [2, 2, 1, 1] 365s 0.004 0.583 365s -0.656 0.044 365s 365s x1: [2, 1, 1, 1] 365s 0.429 -0.214 365s 365s y: [2, 1, 1, 1] 365s -0.123 -0.291 365s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 3, 1, 1] 365s -0.834 -0.818 365s 0.269 -0.188 365s -0.433 -0.797 365s 365s x1: [2, 2, 1, 1] 365s -0.536 -0.550 365s -0.061 -0.001 365s 365s y: [3, 2, 1, 1] 365s 0.897 -0.040 0.671 365s 0.051 -0.016 0.027 365s testing: mul_mat, [1, 2, 1, 2] = [3, 1, 1, 2] * [3, 2, 1, 2] 365s x0: [3, 1, 1, 2] 365s -0.546 -0.620 0.866 365s 365s x1: [3, 2, 1, 2] 365s 0.467 -0.454 -0.104 365s 0.957 0.132 0.065 365s 365s y: [1, 2, 1, 2] 365s -0.063 365s -0.548 365s test-mul-mat0: iter:69/500 365s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 365s x0: [3, 1, 1, 1] 365s 0.307 0.789 -0.733 365s 365s x1: [3, 1, 1, 1] 365s -0.813 -0.259 0.607 365s 365s y: [1, 1, 1, 1] 365s -0.899 365s testing: mul_mat, [1, 1, 2, 1] = [3, 1, 2, 1] * [3, 1, 2, 1] 365s x0: [3, 1, 2, 1] 365s -0.372 -0.341 -0.710 365s 365s x1: [3, 1, 2, 1] 365s 0.510 0.255 -0.187 365s 365s y: [1, 1, 2, 1] 365s -0.144 365s testing: mul_mat, [4, 1, 2, 3] = [3, 4, 2, 3] * [3, 1, 2, 3] 365s x0: [3, 4, 2, 3] 365s -0.809 -0.102 0.667 365s 0.403 -0.157 0.138 365s -0.969 0.503 0.428 365s -0.027 0.187 0.311 365s 365s x1: [3, 1, 2, 3] 365s -0.512 -0.861 -0.736 365s 365s y: [4, 1, 2, 3] 365s 0.011 -0.173 -0.252 -0.377 365s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 365s x0: [3, 3, 1, 1] 365s -0.260 0.957 -0.823 365s -0.735 0.011 -0.463 365s 0.954 0.578 -0.963 365s 365s x1: [3, 4, 1, 1] 365s -0.451 -0.112 -0.087 365s 0.311 -0.215 0.433 365s -0.004 0.245 0.573 365s 0.704 -0.558 0.036 365s 365s y: [3, 4, 1, 1] 365s 0.081 0.371 -0.412 365s -0.643 -0.432 -0.245 365s -0.237 -0.260 -0.414 365s -0.746 -0.541 0.315 365s testing: mul_mat, [4, 3, 2, 1] = [3, 4, 2, 1] * [3, 3, 2, 1] 365s x0: [3, 4, 2, 1] 365s 0.986 0.130 0.345 365s -0.135 0.767 -0.141 365s 0.813 -0.317 -0.322 365s -0.274 -0.859 0.382 365s 365s x1: [3, 3, 2, 1] 365s -0.702 -0.493 -0.265 365s 0.164 -0.939 0.701 365s -0.722 -0.374 0.911 365s 365s y: [4, 3, 2, 1] 365s -0.848 -0.246 -0.329 0.514 365s 0.282 -0.840 0.205 1.029 365s -0.446 -0.317 -0.761 0.866 365s testing: mul_mat, [3, 3, 2, 3] = [4, 3, 2, 3] * [4, 3, 2, 3] 365s x0: [4, 3, 2, 3] 365s 0.903 -0.677 -0.803 -0.291 365s 0.492 0.217 -0.215 -0.687 365s -0.483 0.840 0.803 -0.271 365s 365s x1: [4, 3, 2, 3] 365s 0.566 0.686 -0.191 -0.300 365s 0.229 -0.361 0.538 0.215 365s 0.504 0.351 0.941 -0.366 365s 365s y: [3, 3, 2, 3] 365s 0.287 0.674 0.230 365s -0.043 -0.229 -0.040 365s -0.432 0.373 0.906 365s test-mul-mat0: iter:70/500 365s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 365s x0: [3, 1, 1, 1] 365s -0.933 0.951 0.655 365s 365s x1: [3, 2, 1, 1] 365s -0.319 -0.850 0.881 365s -0.492 -0.237 0.392 365s 365s y: [1, 2, 1, 1] 365s 0.065 365s 0.490 365s testing: mul_mat, [1, 1, 2, 1] = [3, 1, 2, 1] * [3, 1, 2, 1] 365s x0: [3, 1, 2, 1] 365s -0.919 -0.736 0.405 365s 365s x1: [3, 1, 2, 1] 365s -0.750 -0.194 -0.648 365s 365s y: [1, 1, 2, 1] 365s 0.570 365s testing: mul_mat, [2, 1, 2, 3] = [3, 2, 2, 3] * [3, 1, 2, 3] 365s x0: [3, 2, 2, 3] 365s 0.560 0.325 0.754 365s 0.776 -0.284 -0.925 365s 365s x1: [3, 1, 2, 3] 365s -0.654 0.946 0.509 365s 365s y: [2, 1, 2, 3] 365s 0.324 -1.246 365s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 365s x0: [3, 2, 1, 1] 365s -0.835 -0.890 -0.537 365s 0.497 -0.255 -0.974 365s 365s x1: [3, 2, 1, 1] 365s -0.573 -0.344 0.219 365s 0.908 0.238 -0.614 365s 365s y: [2, 2, 1, 1] 365s 0.666 -0.411 365s -0.639 0.989 365s testing: mul_mat, [2, 3, 2, 1] = [2, 2, 2, 1] * [2, 3, 2, 1] 365s x0: [2, 2, 2, 1] 365s -0.936 -0.238 365s -0.457 -0.136 365s 365s x1: [2, 3, 2, 1] 365s 0.074 -0.041 365s 0.684 0.302 365s -0.134 0.593 365s 365s y: [2, 3, 2, 1] 365s -0.059 -0.028 365s -0.712 -0.354 365s -0.016 -0.019 365s testing: mul_mat, [1, 2, 2, 3] = [2, 1, 2, 3] * [2, 2, 2, 3] 365s x0: [2, 1, 2, 3] 365s 0.218 0.716 365s 365s x1: [2, 2, 2, 3] 365s 0.097 -0.242 365s -0.612 -0.649 365s 365s y: [1, 2, 2, 3] 365s -0.153 365s -0.598 365s test-mul-mat0: iter:71/500 365s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 365s x0: [2, 1, 1, 1] 365s 0.649 0.573 365s 365s x1: [2, 4, 1, 1] 365s 0.325 -0.138 365s -0.026 0.368 365s 0.035 -0.262 365s 0.603 -0.216 365s 365s y: [1, 4, 1, 1] 365s 0.132 365s 0.194 365s -0.127 365s 0.268 365s testing: mul_mat, [1, 1, 4, 1] = [2, 1, 4, 1] * [2, 1, 4, 1] 365s x0: [2, 1, 4, 1] 365s -0.072 0.233 365s 365s x1: [2, 1, 4, 1] 365s -0.077 -0.895 365s 365s y: [1, 1, 4, 1] 365s -0.203 365s testing: mul_mat, [3, 1, 4, 3] = [2, 3, 4, 3] * [2, 1, 4, 3] 365s x0: [2, 3, 4, 3] 365s -0.420 -0.596 365s -0.703 -0.131 365s 0.857 0.699 365s 365s x1: [2, 1, 4, 3] 365s -0.126 0.008 365s 365s y: [3, 1, 4, 3] 365s 0.048 0.088 -0.103 365s testing: mul_mat, [3, 3, 1, 1] = [2, 3, 1, 1] * [2, 3, 1, 1] 365s x0: [2, 3, 1, 1] 365s -0.747 -0.258 365s -0.457 0.100 365s -0.438 0.298 365s 365s x1: [2, 3, 1, 1] 365s 0.729 -0.676 365s 0.767 -0.747 365s -0.833 -0.593 365s 365s y: [3, 3, 1, 1] 365s -0.371 -0.401 -0.521 365s -0.381 -0.426 -0.559 365s 0.775 0.322 0.188 365s testing: mul_mat, [1, 2, 4, 1] = [3, 1, 4, 1] * [3, 2, 4, 1] 365s x0: [3, 1, 4, 1] 365s 0.754 0.104 0.488 365s 365s x1: [3, 2, 4, 1] 365s -0.133 -0.674 -0.853 365s 0.854 -0.803 -0.682 365s 365s y: [1, 2, 4, 1] 365s -0.586 365s 0.227 365s testing: mul_mat, [3, 3, 4, 3] = [1, 3, 4, 3] * [1, 3, 4, 3] 365s x0: [1, 3, 4, 3] 365s -0.571 365s -0.811 365s 0.905 365s 365s x1: [1, 3, 4, 3] 365s 0.075 365s -0.352 365s 0.640 365s 365s y: [3, 3, 4, 3] 365s -0.043 -0.061 0.068 365s 0.201 0.285 -0.318 365s -0.365 -0.519 0.579 365s test-mul-mat0: iter:72/500 365s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 365s x0: [4, 4, 1, 1] 365s -0.266 0.799 0.691 0.498 365s -0.109 -0.942 -0.655 -0.801 365s -0.121 0.622 0.695 0.606 365s -0.493 -0.635 -0.924 -0.253 365s 365s x1: [4, 3, 1, 1] 365s -0.846 0.471 -0.500 0.557 365s -0.645 -0.549 0.935 0.309 365s -0.734 -0.925 0.696 -0.527 365s 365s y: [4, 3, 1, 1] 365s 0.532 -0.470 0.385 0.438 365s 0.534 -0.273 0.574 -0.276 365s -0.324 0.917 -0.321 0.440 365s testing: mul_mat, [1, 4, 3, 1] = [4, 1, 3, 1] * [4, 4, 3, 1] 365s x0: [4, 1, 3, 1] 365s -0.517 0.904 -0.397 -0.585 365s 365s x1: [4, 4, 3, 1] 365s -0.475 -0.716 -0.099 0.996 365s -0.216 -0.542 -0.650 0.235 365s -0.607 0.659 0.501 -0.532 365s 0.355 0.974 0.379 -0.911 365s 365s y: [1, 4, 3, 1] 365s -0.944 365s -0.259 365s 1.022 365s 1.079 365s testing: mul_mat, [1, 1, 3, 1] = [4, 1, 3, 1] * [4, 1, 3, 1] 365s x0: [4, 1, 3, 1] 365s 0.191 0.591 -0.296 -0.912 365s 365s x1: [4, 1, 3, 1] 365s -0.162 0.695 0.883 -0.454 365s 365s y: [1, 1, 3, 1] 365s 0.532 365s testing: mul_mat, [3, 1, 1, 1] = [4, 3, 1, 1] * [4, 1, 1, 1] 365s x0: [4, 3, 1, 1] 365s 0.526 0.409 -0.952 -0.321 365s 0.822 -0.632 0.080 -0.649 365s -0.231 0.907 -0.161 0.223 365s 365s x1: [4, 1, 1, 1] 365s -0.046 -0.174 0.615 -0.016 365s 365s y: [3, 1, 1, 1] 365s -0.676 0.132 -0.250 365s testing: mul_mat, [4, 4, 3, 1] = [3, 4, 3, 1] * [3, 4, 3, 1] 365s x0: [3, 4, 3, 1] 365s 0.409 0.466 0.159 365s 0.151 -0.854 0.114 365s -0.913 -0.171 -0.503 365s 0.369 0.174 0.157 365s 365s x1: [3, 4, 3, 1] 365s -0.197 -0.351 -0.585 365s -0.605 0.352 -0.498 365s 0.225 0.532 0.073 365s 0.959 -0.685 -0.941 365s 365s y: [4, 4, 3, 1] 365s -0.337 0.203 0.534 -0.225 365s -0.163 -0.449 0.743 -0.240 365s 0.352 -0.412 -0.334 0.187 365s -0.077 0.623 -0.284 0.087 365s testing: mul_mat, [4, 3, 3, 1] = [4, 4, 3, 1] * [4, 3, 3, 1] 365s x0: [4, 4, 3, 1] 365s -0.547 -0.749 -0.996 0.499 365s 0.588 -0.081 0.191 -0.229 365s 0.743 -0.065 -0.642 -0.347 365s 0.112 0.550 0.986 -0.874 365s 365s x1: [4, 3, 3, 1] 365s 0.365 -0.525 0.730 -0.476 365s 0.589 -0.773 0.682 -0.937 365s 0.572 -0.342 -0.337 0.176 365s 365s y: [4, 3, 3, 1] 365s -0.771 0.506 0.001 0.888 365s -0.890 0.754 0.376 1.132 365s 0.366 0.260 0.603 -0.610 365s test-mul-mat0: iter:73/500 365s testing: mul_mat, [1, 2, 1, 1] = [2, 1, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 1, 1, 1] 365s 0.038 -0.703 365s 365s x1: [2, 2, 1, 1] 365s 0.226 0.560 365s 0.639 0.790 365s 365s y: [1, 2, 1, 1] 365s -0.385 365s -0.531 365s testing: mul_mat, [2, 1, 3, 1] = [2, 2, 3, 1] * [2, 1, 3, 1] 365s x0: [2, 2, 3, 1] 365s 0.242 -0.265 365s 0.640 0.557 365s 365s x1: [2, 1, 3, 1] 365s -0.478 0.964 365s 365s y: [2, 1, 3, 1] 365s -0.371 0.231 365s testing: mul_mat, [2, 2, 3, 1] = [2, 2, 3, 1] * [2, 2, 3, 1] 365s x0: [2, 2, 3, 1] 365s -0.621 0.605 365s 0.709 -0.144 365s 365s x1: [2, 2, 3, 1] 365s -0.581 0.564 365s 0.316 -0.869 365s 365s y: [2, 2, 3, 1] 365s 0.701 -0.492 365s -0.721 0.349 365s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 4, 1, 1] 365s -0.484 -0.890 365s 0.978 0.917 365s -0.821 -0.626 365s 0.832 0.800 365s 365s x1: [2, 2, 1, 1] 365s 0.886 0.223 365s -0.923 0.559 365s 365s y: [4, 2, 1, 1] 365s -0.628 1.071 -0.867 0.915 365s -0.051 -0.391 0.408 -0.321 365s testing: mul_mat, [3, 2, 3, 1] = [4, 3, 3, 1] * [4, 2, 3, 1] 365s x0: [4, 3, 3, 1] 365s 0.508 0.425 -0.282 -0.573 365s -0.428 -0.055 0.651 -0.014 365s -0.768 -0.968 0.178 -0.809 365s 365s x1: [4, 2, 3, 1] 365s 0.294 -0.294 0.146 0.709 365s 0.335 0.013 -0.912 -0.060 365s 365s y: [3, 2, 3, 1] 365s -0.423 -0.024 -0.489 365s 0.467 -0.737 -0.383 365s testing: mul_mat, [2, 4, 3, 1] = [3, 2, 3, 1] * [3, 4, 3, 1] 365s x0: [3, 2, 3, 1] 365s -0.622 -0.630 0.667 365s -0.462 -0.026 0.371 365s 365s x1: [3, 4, 3, 1] 365s -0.684 -0.274 -0.778 365s 0.493 0.154 0.208 365s 0.684 0.669 0.134 365s -0.404 0.128 0.001 365s 365s y: [2, 4, 3, 1] 365s 0.079 0.034 365s -0.265 -0.155 365s -0.757 -0.284 365s 0.171 0.183 365s test-mul-mat0: iter:74/500 365s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 365s x0: [4, 4, 1, 1] 365s 0.970 0.797 0.213 0.268 365s 0.711 -0.447 0.397 0.201 365s -0.864 -0.855 -0.988 -0.724 365s 0.442 0.222 -0.974 0.339 365s 365s x1: [4, 3, 1, 1] 365s -0.218 0.180 -0.208 0ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s .982 365s -0.104 -0.180 -0.324 0.765 365s -0.944 0.054 -0.697 -0.574 365s 365s y: [4, 3, 1, 1] 365s 0.150 -0.121 -0.472 0.479 365s -0.109 0.031 0.010 0.489 365s -1.175 -1.087 1.874 0.078 365s testing: mul_mat, [4, 4, 4, 1] = [4, 4, 4, 1] * [4, 4, 4, 1] 365s x0: [4, 4, 4, 1] 365s 0.030 -0.675 0.692 0.575 365s 0.830 0.814 0.375 -0.984 365s 0.768 -0.269 0.309 0.582 365s 0.616 0.507 -0.531 0.295 365s 365s x1: [4, 4, 4, 1] 365s 0.413 0.012 -0.879 -0.407 365s 0.805 -0.897 0.489 -0.375 365s -0.221 0.254 -0.319 0.832 365s 0.557 0.108 0.860 0.528 365s 365s y: [4, 4, 4, 1] 365s -0.838 0.424 -0.195 0.607 365s 0.752 0.490 0.792 -0.330 365s 0.080 -0.916 0.148 0.408 365s 0.842 0.353 0.972 0.097 365s testing: mul_mat, [4, 4, 4, 4] = [4, 4, 4, 4] * [4, 4, 4, 4] 365s x0: [4, 4, 4, 4] 365s 0.271 0.256 -0.850 0.715 365s -0.806 0.583 0.184 -0.461 365s -0.363 0.938 0.351 -0.963 365s 0.373 0.658 -0.665 0.091 365s 365s x1: [4, 4, 4, 4] 365s -0.758 -0.710 -0.508 -0.430 365s 0.165 -0.958 -0.333 -0.229 365s -0.100 -0.654 0.993 0.266 365s -0.507 -0.490 0.952 0.621 365s 365s y: [4, 4, 4, 4] 365s -0.263 0.302 -0.155 -0.452 365s -0.082 -0.648 -0.856 -0.369 365s -0.849 -0.241 -0.484 -1.104 365s -0.628 0.011 -0.540 -1.088 365s testing: mul_mat, [4, 4, 1, 1] = [4, 4, 1, 1] * [4, 4, 1, 1] 365s x0: [4, 4, 1, 1] 365s -0.298 -0.397 0.156 -0.057 365s -0.807 -0.764 -0.308 -0.646 365s -0.228 -0.206 -0.388 0.142 365s -0.217 -0.859 0.169 -0.592 365s 365s x1: [4, 4, 1, 1] 365s 0.973 0.465 0.623 -0.787 365s 0.151 -0.225 0.438 0.937 365s 0.083 -0.831 0.918 -0.472 365s 0.540 -0.968 0.234 -0.214 365s 365s y: [4, 4, 1, 1] 365s -0.332 -0.824 -0.671 -0.040 365s 0.059 -0.691 -0.025 -0.320 365s 0.475 0.590 -0.272 1.130 365s 0.271 0.369 -0.046 0.880 365s testing: mul_mat, [2, 4, 4, 1] = [4, 2, 4, 1] * [4, 4, 4, 1] 365s x0: [4, 2, 4, 1] 365s -0.699 0.414 0.714 -0.828 365s -0.883 -0.945 0.800 0.816 365s 365s x1: [4, 4, 4, 1] 365s 0.976 0.356 0.559 -0.249 365s -0.206 0.496 0.835 -0.037 365s 0.413 -0.638 -0.496 0.446 365s 0.596 0.289 -0.991 -0.702 365s 365s y: [2, 4, 4, 1] 365s 0.070 -0.954 365s 0.976 0.351 365s -1.276 0.204 365s -0.423 -2.165 365s testing: mul_mat, [3, 4, 4, 4] = [2, 3, 4, 4] * [2, 4, 4, 4] 365s x0: [2, 3, 4, 4] 365s -0.231 -0.207 365s 0.828 0.398 365s 0.740 -0.950 365s 365s x1: [2, 4, 4, 4] 365s 0.944 0.352 365s -0.804 0.658 365s 0.152 -0.633 365s 0.474 -0.216 365s 365s y: [3, 4, 4, 4] 365s -0.291 0.922 0.364 365s 0.050 -0.404 -1.221 365s 0.096 -0.126 0.714 365s -0.065 0.306 0.556 365s test-mul-mat0: iter:75/500 365s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 365s x0: [3, 4, 1, 1] 365s 0.256 -0.673 -0.023 365s -0.059 -0.159 0.299 365s 0.050 0.195 0.333 365s -0.854 0.372 -0.972 365s 365s x1: [3, 3, 1, 1] 365s 0.823 0.537 -0.410 365s -0.867 0.633 -0.574 365s 0.901 -0.761 -0.513 365s 365s y: [4, 3, 1, 1] 365s -0.142 -0.257 0.009 -0.104 365s -0.635 -0.221 -0.111 1.534 365s 0.755 -0.085 -0.274 -0.554 365s testing: mul_mat, [3, 4, 2, 1] = [3, 3, 2, 1] * [3, 4, 2, 1] 365s x0: [3, 3, 2, 1] 365s -0.420 -0.521 0.909 365s -0.274 0.851 0.938 365s 0.905 0.229 0.040 365s 365s x1: [3, 4, 2, 1] 365s 0.179 0.378 0.102 365s -0.075 -0.036 0.064 365s 0.560 0.926 -0.386 365s 0.383 0.463 0.204 365s 365s y: [3, 4, 2, 1] 365s -0.179 0.369 0.253 365s 0.109 0.050 -0.074 365s -1.068 0.273 0.704 365s -0.216 0.480 0.461 365s testing: mul_mat, [3, 3, 2, 4] = [3, 3, 2, 4] * [3, 3, 2, 4] 365s x0: [3, 3, 2, 4] 365s 0.400 -0.776 0.471 365s -0.506 0.472 0.885 365s 0.314 0.426 0.643 365s 365s x1: [3, 3, 2, 4] 365s 0.289 -0.324 0.552 365s -0.295 -0.989 0.669 365s 0.006 -0.399 -0.888 365s 365s y: [3, 3, 2, 4] 365s 0.626 0.189 0.307 365s 0.965 0.274 -0.084 365s -0.106 -0.977 -0.739 365s testing: mul_mat, [1, 3, 1, 1] = [3, 1, 1, 1] * [3, 3, 1, 1] 365s x0: [3, 1, 1, 1] 365s -0.477 0.807 -0.809 365s 365s x1: [3, 3, 1, 1] 365s 0.237 0.622 -0.772 365s -0.832 -0.126 0.329 365s 0.893 0.023 0.669 365s 365s y: [1, 3, 1, 1] 365s 1.014 365s 0.029 365s -0.948 365s testing: mul_mat, [4, 3, 2, 1] = [1, 4, 2, 1] * [1, 3, 2, 1] 365s x0: [1, 4, 2, 1] 365s -0.243 365s -0.892 365s -0.873 365s 0.609 365s 365s x1: [1, 3, 2, 1] 365s -0.836 365s 0.838 365s 0.774 365s 365s y: [4, 3, 2, 1] 365s 0.203 0.745 0.730 -0.509 365s -0.204 -0.748 -0.732 0.511 365s -0.188 -0.690 -0.676 0.471 365s testing: mul_mat, [1, 1, 2, 4] = [4, 1, 2, 4] * [4, 1, 2, 4] 365s x0: [4, 1, 2, 4] 365s 0.766 -0.619 0.544 -0.481 365s 365s x1: [4, 1, 2, 4] 365s -0.042 -0.132 -0.475 -0.806 365s 365s y: [1, 1, 2, 4] 365s 0.178 365s test-mul-mat0: iter:76/500 365s testing: mul_mat, [3, 3, 1, 1] = [2, 3, 1, 1] * [2, 3, 1, 1] 365s x0: [2, 3, 1, 1] 365s 0.884 -0.654 365s 0.065 0.191 365s 0.317 0.489 365s 365s x1: [2, 3, 1, 1] 365s -0.719 0.622 365s -0.659 0.577 365s 0.974 -0.388 365s 365s y: [3, 3, 1, 1] 365s -1.042 0.072 0.076 365s -0.959 0.067 0.073 365s 1.115 -0.011 0.119 365s testing: mul_mat, [2, 3, 3, 1] = [2, 2, 3, 1] * [2, 3, 3, 1] 365s x0: [2, 2, 3, 1] 365s -0.709 -0.625 365s 0.149 -0.735 365s 365s x1: [2, 3, 3, 1] 365s 0.885 0.472 365s 0.039 -0.277 365s -0.080 0.841 365s 365s y: [2, 3, 3, 1] 365s -0.922 -0.215 365s 0.145 0.210 365s -0.469 -0.631 365s testing: mul_mat, [1, 2, 3, 2] = [2, 1, 3, 2] * [2, 2, 3, 2] 365s x0: [2, 1, 3, 2] 365s 0.626 0.752 365s 365s x1: [2, 2, 3, 2] 365s 0.224 -0.878 365s -0.475 0.947 365s 365s y: [1, 2, 3, 2] 365s -0.519 365s 0.415 365s testing: mul_mat, [1, 1, 1, 1] = [2, 1, 1, 1] * [2, 1, 1, 1] 365s x0: [2, 1, 1, 1] 365s -0.250 0.623 365s 365s x1: [2, 1, 1, 1] 365s 0.399 0.020 365s 365s y: [1, 1, 1, 1] 365s -0.087 365s testing: mul_mat, [2, 2, 3, 1] = [1, 2, 3, 1] * [1, 2, 3, 1] 365s x0: [1, 2, 3, 1] 365s -0.284 365s -0.990 365s 365s x1: [1, 2, 3, 1] 365s 0.161 365s -0.980 365s 365s y: [2, 2, 3, 1] 365s -0.046 -0.159 365s 0.278 0.971 365s testing: mul_mat, [4, 1, 3, 2] = [2, 4, 3, 2] * [2, 1, 3, 2] 365s x0: [2, 4, 3, 2] 365s 0.805 0.976 365s -0.648 -0.072 365s 0.767 -0.425 365s -0.445 0.851 365s 365s x1: [2, 1, 3, 2] 365s -0.786 0.113 365s 365s y: [4, 1, 3, 2] 365s -0.522 0.501 -0.651 0.446 365s test-mul-mat0: iter:77/500 365s testing: mul_mat, [3, 1, 1, 1] = [4, 3, 1, 1] * [4, 1, 1, 1] 365s x0: [4, 3, 1, 1] 365s -0.685 -0.939 0.132 0.609 365s 0.979 -0.051 0.254 0.665 365s 0.321 -0.125 -0.722 -0.732 365s 365s x1: [4, 1, 1, 1] 365s -0.556 0.515 -0.646 0.079 365s 365s y: [3, 1, 1, 1] 365s -0.141 -0.682 0.166 365s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 365s x0: [4, 1, 1, 1] 365s -0.465 -0.427 0.857 -0.112 365s 365s x1: [4, 3, 1, 1] 365s -0.653 -0.999 -0.629 -0.534 365s -0.925 -0.652 0.243 0.170 365s -0.804 -0.836 -0.624 -0.871 365s 365s y: [1, 3, 1, 1] 365s 0.251 365s 0.898 365s 0.294 365s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 365s x0: [4, 2, 1, 1] 365s -0.188 0.692 0.059 -0.934 365s 0.356 -0.620 -0.059 0.634 365s 365s x1: [4, 1, 1, 1] 365s 0.652 -0.023 0.204 0.713 365s 365s y: [2, 1, 1, 1] 365s -0.792 0.686 365s testing: mul_mat, [2, 2, 1, 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 365s x0: [4, 2, 1, 1] 365s 0.562 0.257 0.504 -0.736 365s 0.162 0.691 0.792 0.361 365s 365s x1: [4, 2, 1, 1] 365s -0.353 0.288 0.635 0.019 365s 0.755 0.711 0.367 -0.002 365s 365s y: [2, 2, 1, 1] 365s 0.182 0.652 365s 0.793 0.902 365s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 365s x0: [2, 2, 1, 1] 365s -0.238 0.886 365s -0.324 0.704 365s 365s x1: [2, 4, 1, 1] 365s -0.320 0.916 365s -0.662 0.884 365s 0.628 -0.552 365s -0.304 0.320 365s 365s y: [2, 4, 1, 1] 365s 0.888 0.748 365s 0.941 0.836 365s -0.639 -0.592 365s 0.356 0.324 365s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 2, 1, 1] 365s -0.253 -0.081 365s 0.657 0.745 365s 365s x1: [2, 2, 1, 1] 365s -0.690 -0.466 365s -0.008 0.946 365s 365s y: [2, 2, 1, 1] 365s 0.212 -0.800 365s -0.074 0.699 365s test-mul-mat0: iter:78/500 365s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 365s x0: [2, 2, 1, 1] 365s -0.564 0.770 365s -0.732 -0.648 365s 365s x1: [2, 1, 1, 1] 365s -0.828 -0.590 365s 365s y: [2, 1, 1, 1] 365s 0.012 0.989 365s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 2, 1, 1] 365s 0.301 -0.937 365s 0.610 0.977 365s 365s x1: [2, 2, 1, 1] 365s -0.891 -0.848 365s 0.980 -0.444 365s 365s y: [2, 2, 1, 1] 365s 0.527 -1.373 365s 0.710 0.165 365s testing: mul_mat, [2, 2, 1, 2] = [2, 2, 1, 2] * [2, 2, 1, 2] 365s x0: [2, 2, 1, 2] 365s -0.045 -0.202 365s -0.573 0.437 365s 365s x1: [2, 2, 1, 2] 365s 0.949 0.314 365s -0.713 -0.517 365s 365s y: [2, 2, 1, 2] 365s -0.106 -0.406 365s 0.136 0.182 365s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 2, 1, 1] 365s 0.993 -0.622 365s -0.771 -0.451 365s 365s x1: [2, 2, 1, 1] 365s 0.809 -0.955 365s 0.884 -0.923 365s 365s y: [2, 2, 1, 1] 365s 1.398 -0.194 365s 1.453 -0.266 365s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 2, 1, 1] 365s -0.438 0.942 365s -0.999 0.045 365s 365s x1: [2, 2, 1, 1] 365s 0.077 0.678 365s -0.388 -0.313 365s 365s y: [2, 2, 1, 1] 365s 0.605 -0.046 365s -0.125 0.373 365s testing: mul_mat, [2, 2, 1, 2] = [2, 2, 1, 2] * [2, 2, 1, 2] 365s x0: [2, 2, 1, 2] 365s 0.862 0.082 365s -0.285 0.976 365s 365s x1: [2, 2, 1, 2] 365s -0.693 -0.825 365s 0.081 0.360 365s 365s y: [2, 2, 1, 2] 365s -0.665 -0.607 365s 0.099 0.328 365s test-mul-mat0: iter:79/500 365s testing: mul_mat, [1, 3, 1, 1] = [3, 1, 1, 1] * [3, 3, 1, 1] 365s x0: [3, 1, 1, 1] 365s -0.099 0.353 0.701 365s 365s x1: [3, 3, 1, 1] 365s -0.598 0.906 0.580 365s 0.014 -0.407 0.235 365s 0.576 -0.406 0.177 365s 365s y: [1, 3, 1, 1] 365s 0.786 365s 0.020 365s -0.076 365s testing: mul_mat, [3, 1, 1, 1] = [3, 3, 1, 1] * [3, 1, 1, 1] 365s x0: [3, 3, 1, 1] 365s 0.910 0.211 -0.71ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s 9 365s -0.375 -0.707 -0.743 365s 0.149 0.421 -0.883 365s 365s x1: [3, 1, 1, 1] 365s -0.739 0.418 0.738 365s 365s y: [3, 1, 1, 1] 365s -1.115 -0.567 -0.586 365s testing: mul_mat, [3, 3, 1, 2] = [3, 3, 1, 2] * [3, 3, 1, 2] 365s x0: [3, 3, 1, 2] 365s 0.611 0.021 0.255 365s 0.521 -0.768 0.536 365s -0.853 -0.475 0.793 365s 365s x1: [3, 3, 1, 2] 365s 0.750 -0.055 0.970 365s -0.420 -0.152 -0.628 365s -0.514 -0.572 0.386 365s 365s y: [3, 3, 1, 2] 365s 0.705 0.953 0.154 365s -0.420 -0.439 -0.068 365s -0.227 0.378 1.016 365s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 365s x0: [3, 3, 1, 1] 365s -0.137 -0.525 0.996 365s 0.463 -0.516 -0.284 365s -0.723 0.532 0.068 365s 365s x1: [3, 3, 1, 1] 365s -0.887 -0.833 0.637 365s 0.193 0.830 -0.402 365s -0.134 -0.330 -0.820 365s 365s y: [3, 3, 1, 1] 365s 1.193 -0.162 0.242 365s -0.863 -0.225 0.275 365s -0.625 0.341 -0.134 365s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 365s x0: [3, 3, 1, 1] 365s 0.819 0.986 0.816 365s -0.099 -0.462 0.136 365s -0.776 0.417 -0.717 365s 365s x1: [3, 3, 1, 1] 365s -0.857 0.242 -0.139 365s 0.438 0.188 -0.230 365s -0.515 -0.920 -0.350 365s 365s y: [3, 3, 1, 1] 365s -0.578 -0.046 0.866 365s 0.357 -0.162 -0.097 365s -1.616 0.429 0.267 365s testing: mul_mat, [2, 3, 1, 2] = [3, 2, 1, 2] * [3, 3, 1, 2] 365s x0: [3, 2, 1, 2] 365s -0.510 0.998 0.782 365s -0.119 -0.691 -0.778 365s 365s x1: [3, 3, 1, 2] 365s -0.513 0.316 0.724 365s 0.350 -0.221 -1.000 365s 0.825 0.263 0.532 365s 365s y: [2, 3, 1, 2] 365s 1.142 -0.720 365s -1.181 0.889 365s 0.257 -0.694 365s test-mul-mat0: iter:80/500 365s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 365s x0: [2, 2, 1, 1] 365s -0.986 0.237 365s -0.041 -0.576 365s 365s x1: [2, 4, 1, 1] 365s 0.550 0.645 365s 0.774 -0.187 365s 0.177 0.594 365s -0.207 0.776 365s 365s y: [2, 4, 1, 1] 365s -0.390 -0.394 365s -0.807 0.076 365s -0.034 -0.350 365s 0.388 -0.439 365s testing: mul_mat, [3, 2, 3, 1] = [2, 3, 3, 1] * [2, 2, 3, 1] 365s x0: [2, 3, 3, 1] 365s -0.375 -0.772 365s -0.325 -0.730 365s -0.823 -0.775 365s 365s x1: [2, 2, 3, 1] 365s 0.467 -0.154 365s -0.087 -0.652 365s 365s y: [3, 2, 3, 1] 365s -0.056 -0.039 -0.265 365s 0.536 0.504 0.577 365s testing: mul_mat, [2, 3, 3, 2] = [2, 2, 3, 2] * [2, 3, 3, 2] 365s x0: [2, 2, 3, 2] 365s 0.930 -0.909 365s 0.792 -0.572 365s 365s x1: [2, 3, 3, 2] 365s 0.571 -0.092 365s 0.182 0.919 365s 0.751 -0.596 365s 365s y: [2, 3, 3, 2] 365s 0.615 0.505 365s -0.666 -0.382 365s 1.240 0.936 365s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 3, 1, 1] 365s -0.598 -0.507 365s 0.357 0.149 365s -0.630 -0.202 365s 365s x1: [2, 2, 1, 1] 365s -0.980 -0.442 365s 0.127 0.486 365s 365s y: [3, 2, 1, 1] 365s 0.810 -0.416 0.707 365s -0.322 0.118 -0.178 365s testing: mul_mat, [1, 2, 3, 1] = [3, 1, 3, 1] * [3, 2, 3, 1] 365s x0: [3, 1, 3, 1] 365s -0.299 0.766 0.807 365s 365s x1: [3, 2, 3, 1] 365s -0.896 -0.508 0.628 365s 0.860 0.614 0.535 365s 365s y: [1, 2, 3, 1] 365s 0.385 365s 0.645 365s testing: mul_mat, [2, 3, 3, 2] = [1, 2, 3, 2] * [1, 3, 3, 2] 365s x0: [1, 2, 3, 2] 365s 0.439 365s 0.372 365s 365s x1: [1, 3, 3, 2] 365s -0.276 365s 0.759 365s 0.414 365s 365s y: [2, 3, 3, 2] 365s -0.121 -0.103 365s 0.333 0.282 365s 0.182 0.154 365s test-mul-mat0: iter:81/500 365s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 365s x0: [1, 2, 1, 1] 365s -0.559 365s -0.125 365s 365s x1: [1, 1, 1, 1] 365s 0.010 365s 365s y: [2, 1, 1, 1] 365s -0.006 -0.001 365s testing: mul_mat, [2, 2, 4, 1] = [1, 2, 4, 1] * [1, 2, 4, 1] 365s x0: [1, 2, 4, 1] 365s 0.856 365s -0.557 365s 365s x1: [1, 2, 4, 1] 365s 0.296 365s 0.159 365s 365s y: [2, 2, 4, 1] 365s 0.254 -0.165 365s 0.136 -0.089 365s testing: mul_mat, [3, 2, 4, 1] = [1, 3, 4, 1] * [1, 2, 4, 1] 365s x0: [1, 3, 4, 1] 365s -0.571 365s 0.527 365s 0.738 365s 365s x1: [1, 2, 4, 1] 365s 0.553 365s -0.243 365s 365s y: [3, 2, 4, 1] 365s -0.315 0.291 0.408 365s 0.138 -0.128 -0.179 365s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 365s x0: [1, 3, 1, 1] 365s -0.053 365s -0.034 365s 0.011 365s 365s x1: [1, 3, 1, 1] 365s -0.467 365s 0.504 365s 0.738 365s 365s y: [3, 3, 1, 1] 365s 0.025 0.016 -0.005 365s -0.026 -0.017 0.006 365s -0.039 -0.025 0.008 365s testing: mul_mat, [1, 1, 4, 1] = [3, 1, 4, 1] * [3, 1, 4, 1] 365s x0: [3, 1, 4, 1] 365s 0.892 0.626 0.813 365s 365s x1: [3, 1, 4, 1] 365s -0.469 0.332 -0.554 365s 365s y: [1, 1, 4, 1] 365s -0.661 365s testing: mul_mat, [2, 3, 4, 1] = [1, 2, 4, 1] * [1, 3, 4, 1] 365s x0: [1, 2, 4, 1] 365s -0.833 365s 0.640 365s 365s x1: [1, 3, 4, 1] 365s 0.131 365s 0.932 365s 0.788 365s 365s y: [2, 3, 4, 1] 365s -0.109 0.084 365s -0.776 0.596 365s -0.656 0.504 365s test-mul-mat0: iter:82/500 365s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 365s x0: [2, 1, 1, 1] 365s -0.749 -0.479 365s 365s x1: [2, 3, 1, 1] 365s -0.154 -0.875 365s 0.922 -0.527 365s -0.681 -0.267 365s 365s y: [1, 3, 1, 1] 365s 0.535 365s -0.438 365s 0.638 365s testing: mul_mat, [3, 1,ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s 4, 1] = [2, 3, 4, 1] * [2, 1, 4, 1] 365s x0: [2, 3, 4, 1] 365s 0.516 0.864 365s -0.026 -0.440 365s -0.604 -0.498 365s 365s x1: [2, 1, 4, 1] 365s 0.682 0.148 365s 365s y: [3, 1, 4, 1] 365s 0.480 -0.083 -0.486 365s testing: mul_mat, [1, 3, 4, 2] = [2, 1, 4, 2] * [2, 3, 4, 2] 365s x0: [2, 1, 4, 2] 365s 0.737 0.914 365s 365s x1: [2, 3, 4, 2] 365s -0.515 -0.568 365s -0.309 0.049 365s 0.417 -0.611 365s 365s y: [1, 3, 4, 2] 365s -0.898 365s -0.183 365s -0.251 365s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 365s x0: [2, 2, 1, 1] 365s 0.461 -0.551 365s 0.152 -0.165 365s 365s x1: [2, 1, 1, 1] 365s 0.445 0.532 365s 365s y: [2, 1, 1, 1] 365s -0.088 -0.020 365s testing: mul_mat, [2, 2, 4, 1] = [2, 2, 4, 1] * [2, 2, 4, 1] 365s x0: [2, 2, 4, 1] 365s 0.152 -0.602 365s 0.435 -0.411 365s 365s x1: [2, 2, 4, 1] 365s -0.734 -0.600 365s -0.930 0.418 365s 365s y: [2, 2, 4, 1] 365s 0.250 -0.072 365s -0.393 -0.576 365s testing: mul_mat, [1, 2, 4, 2] = [2, 1, 4, 2] * [2, 2, 4, 2] 365s x0: [2, 1, 4, 2] 365s -0.275 -0.834 365s 365s x1: [2, 2, 4, 2] 365s 0.057 0.612 365s -0.499 0.879 365s 365s y: [1, 2, 4, 2] 365s -0.527 365s -0.596 365s test-mul-mat0: iter:83/500 365s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 365s x0: [4, 4, 1, 1] 365s -0.529 -0.607 -0.037 -0.363 365s -0.200 -0.574 0.240 -0.353 365s -0.332 0.154 0.005 0.772 365s -0.936 -0.921 -0.439 0.122 365s 365s x1: [4, 2, 1, 1] 365s -0.288 -0.701 -0.526 -0.046 365s -0.670 -0.986 -0.816 -0.445 365s 365s y: [4, 2, 1, 1] 365s 0.614 0.350 -0.050 1.141 365s 1.144 0.661 -0.277 1.839 365s testing: mul_mat, [2, 4, 4, 1] = [4, 2, 4, 1] * [4, 4, 4, 1] 365s x0: [4, 2, 4, 1] 365s -0.430 0.506 -0.782 -0.826 365s 0.741 -0.662 0.372 -0.768 365s 365s x1: [4, 4, 4, 1] 365s 0.304 0.431 -0.084 0.926 365s 0.628 -0.867 -0.362 0.927 365s -0.394 0.593 -0.743 -0.380 365s 0.777 -0.188 -0.142 -0.752 365s 365s y: [2, 4, 4, 1] 365s -0.611 -0.803 365s -1.191 0.193 365s 1.364 -0.669 365s 0.302 1.224 365s testing: mul_mat, [1, 2, 4, 3] = [4, 1, 4, 3] * [4, 2, 4, 3] 365s x0: [4, 1, 4, 3] 365s -0.755 0.194 0.565 -0.961 365s 365s x1: [4, 2, 4, 3] 365s -0.158 -0.476 0.462 -0.417 365s -0.138 -0.166 -0.185 0.942 365s 365s y: [1, 2, 4, 3] 365s 0.688 365s -0.938 365s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 365s x0: [4, 4, 1, 1] 365s 0.867 -0.938 -0.823 0.690 365s -0.320 0.672 -0.348 0.135 365s 0.522 0.652 -0.661 0.610 365s -0.102 -0.397 -0.753 -0.546 365s 365s x1: [4, 1, 1, 1] 365s 0.682 0.479 0.368 0.075 365s 365s y: [4, 1, 1, 1] 365s -0.109 -0.014 0.471 -0.577 365s testing: mul_mat, [2, 4, 4, 1] = [4, 2, 4, 1] * [4, 4, 4, 1] 365s x0: [4, 2, 4, 1] 365s 0.331 -0.689 -0.678 -0.420 365s -0.081 -0.250 -0.533 0.372 365s 365s x1: [4, 4, 4, 1] 365s 0.439 -0.086 0.246 -0.294 365s -0.324 0.467 0.731 0.966 365s -0.548 -0.589 0.648 0.931 365s 0.779 -0.277 -0.383 0.645 365s 365s y: [2, 4, 4, 1] 365s 0.160 -0.255 365s -1.330 -0.120 365s -0.605 0.192 365s 0.438 0.450 365s testing: mul_mat, [2, 4, 4, 3] = [2, 2, 4, 3] * [2, 4, 4, 3] 365s x0: [2, 2, 4, 3] 365s -0.164 0.258 365s 0.947 0.936 365s 365s x1: [2, 4, 4, 3] 365s 0.400 -0.263 365s 0.173 0.722 365s 0.204 0.754 365s 0.094 0.759 365s 365s y: [2, 4, 4, 3] 365s -0.134 0.133 365s 0.158 0.840 365s 0.161 0.899 365s 0.180 0.799 365s test-mul-mat0: iter:84/500 365s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 3, 1, 1] 365s -0.852 0.157 365s 0.691 0.340 365s 0.756 -0.172 365s 365s x1: [2, 2, 1, 1] 365s -0.047 -0.847 365s 0.268 -0.688 365s 365s y: [3, 2, 1, 1] 365s -0.093 -0.321 0.110 365s -0.336 -0.049 0.321 365s testing: mul_mat, [1, 3, 2, 1] = [2, 1, 2, 1] * [2, 3, 2, 1] 365s x0: [2, 1, 2, 1] 365s -0.453 -0.027 365s 365s x1: [2, 3, 2, 1] 365s -0.521 0.613 365s 0.901 0.019 365s 0.709 -0.646 365s 365s y: [1, 3, 2, 1] 365s 0.220 365s -0.408 365s -0.304 365s testing: mul_mat, [2, 1, 2, 4] = [2, 2, 2, 4] * [2, 1, 2, 4] 365s x0: [2, 2, 2, 4] 365s 0.255 0.428 365s 0.177 -0.125 365s 365s x1: [2, 1, 2, 4] 365s 0.929 -0.140 365s 365s y: [2, 1, 2, 4] 365s 0.177 0.182 365s testing: mul_mat, [1, 2, 1, 1] = [2, 1, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 1, 1, 1] 365s -0.214 -0.593 365s 365s x1: [2, 2, 1, 1] 365s 0.207 -0.320 365s 0.139 0.692 365s 365s y: [1, 2, 1, 1] 365s 0.145 365s -0.440 365s testing: mul_mat, [1, 2, 2, 1] = [1, 1, 2, 1] * [1, 2, 2, 1] 365s x0: [1, 1, 2, 1] 365s -0.005 365s 365s x1: [1, 2, 2, 1] 365s 0.573 365s 0.091 365s 365s y: [1, 2, 2, 1] 365s -0.003 365s -0.000 365s testing: mul_mat, [3, 1, 2, 4] = [1, 3, 2, 4] * [1, 1, 2, 4] 365s x0: [1, 3, 2, 4] 365s 0.414 365s 0.706 365s -0.284 365s 365s x1: [1, 1, 2, 4] 365s -0.272 365s 365s y: [3, 1, 2, 4] 365s -0.113 -0.192 0.077 365s test-mul-mat0: iter:85/500 365s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 365s x0: [1, 3, 1, 1] 365s -0.652 365s 0.850 365s -0.641 365s 365s x1: [1, 3, 1, 1] 365s -0.151 365s -0.359 365s 0.134 365s 365s y: [3, 3, 1, 1] 365s 0.098 -0.128 0.097 365s 0.234 -0.305 0.230 365s -0.088 0.114 -0.086 365s testing: mul_mat, [2, 3, 3, 1] = [1, 2, 3, 1] * [1, 3, 3, 1] 365s x0: [1, 2, 3, 1] 365s 0.240 365s -0.733 365s 365s x1: [1, 3, 3, 1] 365s 0.063 365s -0.656 365s 0.958 365s 365s y: [2, 3, 3, 1] 365s 0.015 -0.046 365s -0.157 0.480 365s 0.229 -0.702 365s testing: mul_mat, [1, 2, 3, 2] = [1, 1, 3, 2] * [1, 2, 3, 2] 365s x0: [1, 1, 3, 2] 365s -0.858 365s 365s x1: [1, 2, 3, 2] 365s 0.173 365s -0.092 365s 365s y: [1, 2, 3, 2] 365s -0.149 365s 0.079 365s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 365s x0: [1, 2, 1, 1] 365s -0.078 365s 0.971 365s 365s x1: [1, 1, 1, 1] 365s -0.468 365s 365s y: [2, 1, 1, 1] 365s 0.036 -0.455 365s testing: mul_mat, [4, 1, 3, 1] = [2, 4, 3, 1] * [2, 1, 3, 1] 365s x0: [2, 4, 3, 1] 365s -0.417 -0.946 365s 0.093 0.158 365s -0.522 0.255 365s 0.322 -0.279 365s 365s x1: [2, 1, 3, 1] 365s 0.064 0.987 365s 365s y: [4, 1, 3, 1] 365s -0.961 0.161 0.219 -0.254 365s testing: mul_mat, [1, 2, 3, 2] = [4, 1, 3, 2] * [4, 2, 3, 2] 365s x0: [4, 1, 3, 2] 365s 0.258 -0.918 0.189 0.977 365s 365s x1: [4, 2, 3, 2] 365s -0.486 -0.921 -0.053 0.846 365s 0.126 -0.134 0.416 0.709 365s 365s y: [1, 2, 3, 2] 365s 1.537 365s 0.927 365s test-mul-mat0: iter:86/500 365s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 365s x0: [4, 4, 1, 1] 365s -0.435 0.467 0.166 0.204 365s 0.868 -0.327 0.401 0.594 365s 0.447 -0.427 -0.645 0.448 365s -0.387 0.182 -0.312 -0.428 365s 365s x1: [4, 3, 1, 1] 365s 0.245 0.852 0.950 -0.497 365s 0.934 0.139 -0.520 0.913 365s 0.188 -0.589 0.640 0.267 365s 365s y: [4, 3, 1, 1] 365s 0.348 0.019 -1.090 -0.024 365s -0.241 1.099 1.103 -0.565 365s -0.196 0.771 0.043 -0.494 365s testing: mul_mat, [1, 4, 1, 1] = [4, 1, 1, 1] * [4, 4, 1, 1] 365s x0: [4, 1, 1, 1] 365s -0.561 0.468 -0.221 0.112 365s 365s x1: [4, 4, 1, 1] 365s -0.641 0.900 0.817 -0.789 365s 0.850 -0.680 -0.854 -0.011 365s -0.200 -0.941 -0.823 0.212 365s 0.699 0.444 0.273 -0.736 365s 365s y: [1, 4, 1, 1] 365s 0.511 365s -0.607 365s -0.123 365s -0.327 365s testing: mul_mat, [3, 1, 1, 3] = [4, 3, 1, 3] * [4, 1, 1, 3] 365s x0: [4, 3, 1, 3] 365s -0.520 -0.555 -0.236 -0.374 365s 0.434 0.564 -0.316 0.610 365s -0.224 -0.617 0.054 -0.951 365s 365s x1: [4, 1, 1, 3] 365s -0.131 -0.627 -0.441 0.442 365s 365s y: [3, 1, 1, 3] 365s 0.355 -0.002 -0.028 365s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 365s x0: [4, 4, 1, 1] 365s 0.944 0.670 0.980 0.513 365s 0.825 0.118 0.961 0.947 365s -0.049 -0.148 0.445 0.340 365s -0.629 0.423 0.437 0.829 365s 365s x1: [4, 3, 1, 1] 365s 0.880 0.812 -0.054 -0.344 365s -0.805 -1.000 -0.295 -0.157 365s 0.966 0.192 -0.041 -0.291 365s 365s y: [4, 3, 1, 1] 365s 1.145 0.445 -0.304 -0.519 365s -1.799 -1.215 0.002 -0.176 365s 0.850 0.504 -0.193 -0.786 365s testing: mul_mat, [1, 4, 1, 1] = [4, 1, 1, 1] * [4, 4, 1, 1] 365s x0: [4, 1, 1, 1] 365s -0.994 0.197 0.792 0.124 365s 365s x1: [4, 4, 1, 1] 365s 0.075 -0.324 0.710 -0.113 365s 0.622 -0.634 0.082 0.622 365s 0.070 0.925 0.588 -0.737 365s -0.117 -0.703 0.055 -0.173 365s 365s y: [1, 4, 1, 1] 365s 0.410 365s -0.602 365s 0.487 365s -0.001 365s testing: mul_mat, [4, 4, 1, 3] = [1, 4, 1, 3] * [1, 4, 1, 3] 365s x0: [1, 4, 1, 3] 365s -0.431 365s -0.822 365s 0.428 365s 0.651 365s 365s x1: [1, 4, 1, 3] 365s -0.950 365s 0.215 365s 0.104 365s -0.990 365s 365s y: [4, 4, 1, 3] 365s 0.409 0.781 -0.407 -0.619 365s -0.093 -0.177 0.092 0.140 365s -0.045 -0.085 0.044 0.067 365s 0.426 0.814 -0.424 -0.645 365s test-mul-mat0: iter:87/500 365s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 365s x0: [1, 1, 1, 1] 365s -0.457 365s 365s x1: [1, 4, 1, 1] 365s 0.599 365s -0.055 365s -0.467 365s -0.186 365s 365s y: [1, 4, 1, 1] 365s -0.273 365s 0.025 365s 0.213 365s 0.085 365s testing: mul_mat, [1, 1, 4, 1] = [1, 1, 4, 1] * [1, 1, 4, 1] 365s x0: [1, 1, 4, 1] 365s 0.419 365s 365s x1: [1, 1, 4, 1] 365s 0.475 365s 365s y: [1, 1, 4, 1] 365s 0.199 365s testing: mul_mat, [4, 1, 4, 2] = [1, 4, 4, 2] * [1, 1, 4, 2] 365s x0: [1, 4, 4, 2] 365s -0.978 365s -0.924 365s -0.015 365s 0.308 365s 365s x1: [1, 1, 4, 2] 365s -0.665 365s 365s y: [4, 1, 4, 2] 365s 0.650 0.615 0.010 -0.205 365s testing: mul_mat, [4, 4, 1, 1] = [1, 4, 1, 1] * [1, 4, 1, 1] 365s x0: [1, 4, 1, 1] 365s -0.377 365s 0.922 365s 0.865 365s -0.053 365s 365s x1: [1, 4, 1, 1] 365s -0.442 365s 0.307 365s -0.168 365s -0.559 365s 365s y: [4, 4, 1, 1] 365s 0.166 -0.407 -0.382 0.023 365s -0.116 0.283 0.266 -0.016 365s 0.063 -0.155 -0.145 0.009 365s 0.211 -0.516 -0.483 0.030 365s testing: mul_mat, [1, 1, 4, 1] = [4, 1, 4, 1] * [4, 1, 4, 1] 365s x0: [4, 1, 4, 1] 365s 0.836 -0.142 -0.541 -0.842 365s 365s x1: [4, 1, 4, 1] 365s -0.782 0.664 0.815 0.988 365s 365s y: [1, 1, 4, 1] 365s -2.021 365s testing: mul_mat, [3, 4, 4, 2] = [1, 3, 4, 2] * [1, 4, 4, 2] 365s x0: [1, 3, 4, 2] 365s 0.823 365s -0.319 365s 0.661 365s 365s x1: [1, 4, 4, 2] 365s -0.401 365s 0.587 365s 0.161 365s 0.348 365s 365s y: [3, 4, 4, 2] 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s -0.330 0.128 -0.265 365s 0.483 -0.187 0.388 365s 0.133 -0.051 0.106 365s 0.286 -0.111 0.230 365s test-mul-mat0: iter:88/500 365s testing: mul_mat, [3, 1, 1, 1] = [1, 3, 1, 1] * [1, 1, 1, 1] 365s x0: [1, 3, 1, 1] 365s 0.243 365s 0.246 365s -0.267 365s 365s x1: [1, 1, 1, 1] 365s 0.423 365s 365s y: [3, 1, 1, 1] 365s 0.103 0.104 -0.113 365s testing: mul_mat, [1, 3, 3, 1] = [1, 1, 3, 1] * [1, 3, 3, 1] 365s x0: [1, 1, 3, 1] 365s 0.682 365s 365s x1: [1, 3, 3, 1] 365s -0.096 365s 0.142 365s 0.865 365s 365s y: [1, 3, 3, 1] 365s -0.066 365s 0.097 365s 0.590 365s testing: mul_mat, [1, 1, 3, 2] = [1, 1, 3, 2] * [1, 1, 3, 2] 365s x0: [1, 1, 3, 2] 365s -0.649 365s 365s x1: [1, 1, 3, 2] 365s 0.126 365s 365s y: [1, 1, 3, 2] 365s -0.082 365s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 365s x0: [1, 1, 1, 1] 365s -0.286 365s 365s x1: [1, 1, 1, 1] 365s -0.464 365s 365s y: [1, 1, 1, 1] 365s 0.133 365s testing: mul_mat, [1, 1, 3, 1] = [1, 1, 3, 1] * [1, 1, 3, 1] 365s x0: [1, 1, 3, 1] 365s 0.148 365s 365s x1: [1, 1, 3, 1] 365s 0.782 365s 365s y: [1, 1, 3, 1] 365s 0.116 365s testing: mul_mat, [2, 1, 3, 2] = [1, 2, 3, 2] * [1, 1, 3, 2] 365s x0: [1, 2, 3, 2] 365s -0.970 365s 0.072 365s 365s x1: [1, 1, 3, 2] 365s -0.466 365s 365s y: [2, 1, 3, 2] 365s 0.452 -0.033 365s test-mul-mat0: iter:89/500 365s testing: mul_mat, [4, 3, 1, 1] = [2, 4, 1, 1] * [2, 3, 1, 1] 365s x0: [2, 4, 1, 1] 365s -0.280 0.386 365s -0.347 -0.017 365s 0.899 -0.157 365s -0.071 0.891 365s 365s x1: [2, 3, 1, 1] 365s -0.411 -0.039 365s 0.703 -0.793 365s 0.886 -0.149 365s 365s y: [4, 3, 1, 1] 365s 0.100 0.143 -0.363 -0.005 365s -0.503 -0.231 0.756 -0.756 365s -0.306 -0.305 0.820 -0.195 365s testing: mul_mat, [4, 4, 4, 1] = [2, 4, 4, 1] * [2, 4, 4, 1] 365s x0: [2, 4, 4, 1] 365s 0.570 0.181 365s 0.790 -0.323 365s 0.435 0.583 365s 0.315 0.059 365s 365s x1: [2, 4, 4, 1] 365s -0.127 -0.999 365s 0.928 -0.335 365s 0.849 0.107 365s -0.746 0.793 365s 365s y: [4, 4, 4, 1] 365s -0.253 0.222 -0.638 -0.099 365s 0.468 0.841 0.208 0.272 365s 0.503 0.636 0.432 0.274 365s -0.282 -0.845 0.138 -0.188 365s testing: mul_mat, [2, 4, 4, 2] = [2, 2, 4, 2] * [2, 4, 4, 2] 365s x0: [2, 2, 4, 2] 365s 0.551 0.384 365s -0.481 -0.595 365s 365s x1: [2, 4, 4, 2] 365s 0.668 -0.509 365s -0.959 0.103 365s -0.926 0.356 365s -0.838 0.855 365s 365s y: [2, 4, 4, 2] 365s 0.172 -0.019 365s -0.488 0.400 365s -0.373 0.233 365s -0.133 -0.105 365s testing: mul_mat, [1, 2, 1, 1] = [2, 1, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 1, 1, 1] 365s 0.436 0.025 365s 365s x1: [2, 2, 1, 1] 365s 0.209 0.631 365s 0.463 -0.626 365s 365s y: [1, 2, 1, 1] 365s 0.107 365s 0.186 365s testing: mul_mat, [3, 2, 4, 1] = [1, 3, 4, 1] * [1, 2, 4, 1] 365s x0: [1, 3, 4, 1] 365s 0.025 365s 0.973 365s 0.220 365s 365s x1: [1, 2, 4, 1] 365s -0.729 365s -0.844 365s 365s y: [3, 2, 4, 1] 365s -0.018 -0.709 -0.161 365s -0.021 -0.821 -0.186 365s testing: mul_mat, [2, 1, 4, 2] = [3, 2, 4, 2] * [3, 1, 4, 2] 365s x0: [3, 2, 4, 2] 365s 0.740 0.798 -0.714 365s -0.586 -0.602 -0.443 365s 365s x1: [3, 1, 4, 2] 365s -0.517 -0.016 0.817 365s 365s y: [2, 1, 4, 2] 365s -0.978 -0.050 365s test-mul-mat0: iter:90/500 365s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 365s x0: [3, 2, 1, 1] 365s 0.099 -0.437 -0.019 365s 0.689 -0.377 -0.702 365s 365s x1: [3, 3, 1, 1] 365s -0.631 0.449 -0.883 365s -0.159 -0.547 0.428 365s 0.257 -0.094 -0.211 365s 365s y: [2, 3, 1, 1] 365s -0.241 0.016 365s 0.215 -0.204 365s 0.071 0.361 365s testing: mul_mat, [2, 2, 3, 1] = [3, 2, 3, 1] * [3, 2, 3, 1] 365s x0: [3, 2, 3, 1] 365s -0.683 0.494 0.395 365s 0.223 -0.717 0.614 365s 365s x1: [3, 2, 3, 1] 365s 0.161 0.953 0.989 365s 0.207 0.170 0.333 365s 365s y: [2, 2, 3, 1] 365s 0.751 -0.040 365s 0.074 0.129 365s testing: mul_mat, [1, 2, 3, 3] = [3, 1, 3, 3] * [3, 2, 3, 3] 365s x0: [3, 1, 3, 3] 365s -0.383 0.019 0.420 365s 365s x1: [3, 2, 3, 3] 365s -0.729 -0.506 0.260 365s -0.423 -0.448 0.260 365s 365s y: [1, 2, 3, 3] 365s 0.380 365s 0.263 365s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 365s x0: [3, 1, 1, 1] 365s -0.609 -0.567 0.119 365s 365s x1: [3, 1, 1, 1] 365s 0.067 0.775 0.414 365s 365s y: [1, 1, 1, 1] 365s -0.431 365s testing: mul_mat, [2, 3, 3, 1] = [1, 2, 3, 1] * [1, 3, 3, 1] 365s x0: [1, 2, 3, 1] 365s 0.929 365s -0.812 365s 365s x1: [1, 3, 3, 1] 365s 0.328 365s 0.708 365s 0.919 365s 365s y: [2, 3, 3, 1] 365s 0.305 -0.266 365s 0.658 -0.574 365s 0.854 -0.746 365s testing: mul_mat, [1, 1, 3, 3] = [2, 1, 3, 3] * [2, 1, 3, 3] 365s x0: [2, 1, 3, 3] 365s -0.829 0.737 365s 365s x1: [2, 1, 3, 3] 365s -0.871 -0.001 365s 365s y: [1, 1, 3, 3] 365s 0.720 365s test-mul-mat0: iter:91/500 365s testing: mul_mat, [2, 2, 1, 1] = [1, 2, 1, 1] * [1, 2, 1, 1] 365s x0: [1, 2, 1, 1] 365s -0.059 365s -0.885 365s 365s x1: [1, 2, 1, 1] 365s 0.987 365s -0.109 365s 365s y: [2, 2, 1, 1] 365s -0.058 -0.873 365s 0.006 0.097 365s testing: mul_mat, [1, 2, 1, 1] = [1, 1, 1, 1] * [1, 2, 1, 1] 365s x0: [1, 1, 1, 1] 365s -0.252 365s 365s x1: [1, 2, 1, 1] 365s 0.768 365s 0.603 365s 365s y: [1, 2, 1, 1] 365s -0.194 365s -0.152 365s testing: mul_mat, [2, 1, 1, 2] = [1, 2, 1, 2] * [1, 1, 1, 2] 365s x0: [1, 2, 1, 2] 365s -0.640 365s -0.572 365s 365s x1: [1, 1, 1, 2] 365s 0.774 365s 365s y: [2, 1, 1, 2] 365s -0.495 -0.443 365s testing: mul_mat, [3, 2, 1, 1] = [1, 3, 1, 1] * [1, 2, 1, 1] 365s x0: [1, 3, 1, 1] 365s -0.425 365s 0.126 365s 0.744 365s 365s x1: [1, 2, 1, 1] 365s -0.952 365s 0.625 365s 365s y: [3, 2, 1, 1] 365s 0.404 -0.120 -0.708 365s -0.265 0.079 0.465 365s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 365s x0: [3, 4, 1, 1] 365s -0.459 -0.655 -0.145 365s 0.609 0.914 -0.668 365s 0.927 -0.783 0.702 365s 0.326 0.933 -0.543 365s 365s x1: [3, 1, 1, 1] 365s -0.899 -0.836 0.333 365s 365s y: [4, 1, 1, 1] 365s 0.912 -1.534 0.056 -1.255 365s testing: mul_mat, [1, 3, 1, 2] = [4, 1, 1, 2] * [4, 3, 1, 2] 365s x0: [4, 1, 1, 2] 365s -0.538 -0.324 -0.105 -0.206 365s 365s x1: [4, 3, 1, 2] 365s 0.591 -0.551 -0.769 -0.412 365s -0.759 -0.409 0.016 0.464 365s 0.467 0.064 0.088 0.712 365s 365s y: [1, 3, 1, 2] 365s 0.026 365s 0.443 365s -0.427 365s test-mul-mat0: iter:92/500 365s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 365s x0: [3, 2, 1, 1] 365s 0.620 0.353 -0.308 365s 0.835 -0.191 -0.568 365s 365s x1: [3, 1, 1, 1] 365s 0.989 -0.468 0.641 365s 365s y: [2, 1, 1, 1] 365s 0.251 0.551 365s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 365s x0: [3, 1, 1, 1] 365s -0.261 -0.201 0.839 365s 365s x1: [3, 2, 1, 1] 365s -0.788 0.599 -0.368 365s 0.131 0.997 -0.242 365s 365s y: [1, 2, 1, 1] 365s -0.224 365s -0.438 365s testing: mul_mat, [1, 1, 1, 4] = [3, 1, 1, 4] * [3, 1, 1, 4] 365s x0: [3, 1, 1, 4] 365s -0.554 0.036 0.841 365s 365s x1: [3, 1, 1, 4] 365s 0.415 0.694 -0.366 365s 365s y: [1, 1, 1, 4] 365s -0.514 365s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 365s x0: [3, 4, 1, 1] 365s -0.651 -0.131 -0.228 365s -0.765 -0.849 0.718 365s -0.642 0.089 0.881 365s -0.957 0.888 -0.540 365s 365s x1: [3, 1, 1, 1] 365s -0.797 0.550 -0.604 365s 365s y: [4, 1, 1, 1] 365s 0.584 -0.291 0.028 1.577 365s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 365s x0: [4, 4, 1, 1] 365s 0.578 0.502 0.086 -0.391 365s 0.106 -0.273 0.520 -0.506 365s 0.785 0.477 -0.394 -0.991 365s 0.128 0.737 0.237 -0.673 365s 365s x1: [4, 3, 1, 1] 365s -0.478 0.863 -0.319 -0.032 365s -0.101 -0.478 -0.233 -0.374 365s -0.802 0.375 -0.214 -0.914 365s 365s y: [4, 3, 1, 1] 365s 0.142 -0.435 0.195 0.521 365s -0.172 0.187 0.155 -0.169 365s 0.064 0.165 0.540 0.738 365s testing: mul_mat, [4, 4, 1, 4] = [4, 4, 1, 4] * [4, 4, 1, 4] 365s x0: [4, 4, 1, 4] 365s 0.528 0.742 -0.398 -0.362 365s -0.934 -0.795 0.050 0.359 365s 0.767 0.589 -0.208 0.800 365s -0.322 0.288 -0.462 0.481 365s 365s x1: [4, 4, 1, 4] 365s -0.625 -0.531 -0.151 -0.762 365s 0.150 0.817 0.136 0.673 365s -0.416 0.763 0.871 0.959 365s -0.451 0.957 0.540 -0.873 365s 365s y: [4, 4, 1, 4] 365s -0.388 0.725 -1.371 -0.248 365s 0.388 -0.542 1.107 0.448 365s -0.349 0.170 0.716 0.413 365s 0.572 -0.626 -0.592 -0.248 365s test-mul-mat0: iter:93/500 365s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 365s x0: [2, 2, 1, 1] 365s 0.034 -0.547 365s 0.353 -0.427 365s 365s x1: [2, 4, 1, 1] 365s 0.189 -0.417 365s 0.213 0.552 365s -0.863 0.689 365s -0.447 0.050 365s 365s y: [2, 4, 1, 1] 365s 0.235 0.245 365s -0.294 -0.161 365s -0.406 -0.599 365s -0.043 -0.179 365s testing: mul_mat, [3, 2, 4, 1] = [2, 3, 4, 1] * [2, 2, 4, 1] 365s x0: [2, 3, 4, 1] 365s -0.538 0.280 365s 0.972 0.676 365s -0.168 -0.891 365s 365s x1: [2, 2, 4, 1] 365s 0.779 -0.692 365s -0.005 0.366 365s 365s y: [3, 2, 4, 1] 365s -0.612 0.290 0.486 365s 0.105 0.243 -0.325 365s testing: mul_mat, [4, 3, 4, 2] = [2, 4, 4, 2] * [2, 3, 4, 2] 365s x0: [2, 4, 4, 2] 365s -0.949 0.553 365s -0.902 0.724 365s 0.902 0.071 365s -0.846 -0.104 365s 365s x1: [2, 3, 4, 2] 365s -0.529 -0.339 365s 0.990 -0.960 365s 0.847 -0.919 365s 365s y: [4, 3, 4, 2] 365s 0.314 0.231 -0.501 0.482 365s -1.470 -1.588 0.825 -0.737 365s -1.312 -1.430 0.699 -0.621 365s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 365s x0: [2, 4, 1, 1] 365s -0.309 -0.261 365s -0.080 -0.705 365s 0.993 0.408 365s -0.092 -0.330 365s 365s x1: [2, 4, 1, 1] 365s 0.067 -0.440 365s 0.778 0.466 365s 0.743 -0.530 365s -0.114 0.224 365s 365s y: [4, 4, 1, 1] 365s 0.094 0.305 -0.113 0.139 365s -0.362 -0.391 0.963 -0.226 365s -0.092 0.314 0.522 0.106 365s -0.023 -0.149 -0.022 -0.063 365s testing: mul_mat, [3, 2, 4, 1] = [4, 3, 4, 1] * [4, 2, 4, 1] 365s x0: [4, 3, 4, 1] 365s 0.527 0.978 -0.060 0.640 365s -0.096 -0.243 -0.164 -0.554 365s 0.473 0.191 -0.003 -0.216 365s 365s x1: [4, 2, 4, 1] 365s 0.395 -0.626 -0.940 -0.118 365s 0.972 -0.548 0.853 0.718 365s 365s y: [3, 2, 4, 1] 365s -0.423 0.333 0.095 365s 0.384 -0.498 0.197 365s testing: mul_mat, [4, 4, 4, 2] = [3, 4, 4, 2] * [3, 4, 4, 2] 365s x0: [3, 4, 4, 2] 365s -0.002 0.301 0.278 365s 0.816 0.071 0.830 365s -0.814 -0.864 -0.927 365s -0.449 -0.014 0.686 365s 365s x1: [3, 4, 4, 2] 365s 0.097 -0.118 0.461 365s 0.821 -0.546 0.183 365s -0.123 0.744 0.703 365s -0.439 -0.840 0.689 365s 365s y: [4, 4, 4, 2] 365s 0.092 0.452 -0.403 0.274 365s -0.115 0.783 -0.366 -0.235 365s 0.419 0.536 -1.194 0.527 365s -0.060 0.154 0.445 0.681 365s test-mul-mat0: iter:94/500 365s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 365s x0: [2, 2, 1, 1] 365s -0.655 -0.917 365s -0.411 0.628 365s 365s x1: [2, 3, 1, 1] 365s -0.104 0.125 365s -0.413 0.089 365s -0.722 0.317 365s 365s y: [2, 3, 1, 1] 365s -0.046 0.121 365s 0.188 0.226 365s 0.182 0.495 365s testing: mul_mat, [1, 2, 2, 1] = [2, 1, 2, 1] * [2, 2, 2, 1] 365s x0: [2, 1, 2, 1] 365s 0.728 0.800 365s 365s x1: [2, 2, 2, 1] 365s 0.004 -0.989 365s -0.808 0.564 365s 365s y: [1, 2, 2, 1] 365s -0.788 365s -0.137 365s testing: mul_mat, [1, 1, 2, 1] = [2, 1, 2, 1] * [2, 1, 2, 1] 365s x0: [2, 1, 2, 1] 365s 0.856 0.467 365s 365s x1: [2, 1, 2, 1] 365s 0.378 0.297 365s 365s y: [1, 1, 2, 1] 365s 0.462 365s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 365s x0: [2, 2, 1, 1] 365s 0.372 0.603 365s -0.679 0.454 365s 365s x1: [2, 1, 1, 1] 365s -0.945 0.885 365s 365s y: [2, 1, 1, 1] 365s 0.183 1.043 365s testing: mul_mat, [4, 2, 2, 1] = [2, 4, 2, 1] * [2, 2, 2, 1] 365s x0: [2, 4, 2, 1] 365s 0.812 0.484 365s -0.679 0.564 365s -0.710 -0.010 365s 0.612 -0.219 365s 365s x1: [2, 2, 2, 1] 365s -0.089 0.231 365s -0.542 -0.078 365s 365s y: [4, 2, 2, 1] 365s 0.039 0.191 0.061 -0.105 365s -0.478 0.324 0.386 -0.315 365s testing: mul_mat, [3, 2, 2, 1] = [4, 3, 2, 1] * [4, 2, 2, 1] 365s x0: [4, 3, 2, 1] 365s -0.772 0.792 -0.129 0.806 365s -0.084 0.906 -0.544 -0.528 365s -0.099 0.682 -0.637 0.839 365s 365s x1: [4, 2, 2, 1] 365s 0.362 0.174 0.443 0.683 365s -0.223 -0.103 -0.406 -0.992 365s 365s y: [3, 2, 2, 1] 365s 0.352 -0.475 0.374 365s -0.657 0.669 -0.622 365s test-mul-mat0: iter:95/500 365s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 365s x0: [3, 4, 1, 1] 365s 0.069 -0.430 -0.750 365s 0.542 -0.591 0.971 365s -0.915 0.611 -0.134 365s -0.928 -0.504 -0.463 365s 365s x1: [3, 3, 1, 1] 365s 0.553 0.791 0.624 365s 0.781 -0.293 -0.475 365s 0.573 -0.386 -0.793 365s 365s y: [4, 3, 1, 1] 365s -0.770 0.438 -0.106 -1.200 365s 0.536 0.134 -0.829 -0.357 365s 0.801 -0.231 -0.653 0.030 365s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 365s x0: [3, 3, 1, 1] 365s 0.957 0.684 0.715 365s -0.429 0.891 -0.842 365s 0.640 -0.539 -0.592 365s 365s x1: [3, 4, 1, 1] 365s -0.044 0.975 -0.783 365s 0.194 -0.029 0.328 365s -0.191 0.458 0.535 365s -0.639 0.250 0.159 365s 365s y: [3, 4, 1, 1] 365s 0.066 1.547 -0.090 365s 0.401 -0.386 -0.054 365s 0.513 0.040 -0.686 365s -0.327 0.363 -0.637 365s testing: mul_mat, [3, 3, 1, 4] = [3, 3, 1, 4] * [3, 3, 1, 4] 365s x0: [3, 3, 1, 4] 365s -0.414 -0.613 -0.512 365s -0.119 0.339 -0.513 365s -0.630 0.263 0.814 365s 365s x1: [3, 3, 1, 4] 365s 0.181 -0.130 -0.621 365s 0.267 -0.519 0.245 365s 0.339 -0.023 0.782 365s 365s y: [3, 3, 1, 4] 365s 0.322 0.253 -0.654 365s 0.082 -0.333 -0.105 365s -0.526 -0.449 0.417 365s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 365s x0: [3, 2, 1, 1] 365s 0.009 0.141 -0.808 365s 0.170 0.654 0.143 365s 365s x1: [3, 3, 1, 1] 365s 0.545 -0.117 0.388 365s 0.360 0.552 -0.968 365s -0.963 -0.121 0.307 365s 365s y: [2, 3, 1, 1] 365s -0.326 0.072 365s 0.863 0.283 365s -0.273 -0.199 365s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 365s x0: [2, 2, 1, 1] 365s -0.141 0.809 365s -0.407 -0.509 365s 365s x1: [2, 3, 1, 1] 365s -0.027 -0.988 365s 0.204 0.375 365s 0.812 -0.806 365s 365s y: [2, 3, 1, 1] 365s -0.795 0.514 365s 0.274 -0.274 365s -0.767 0.080 365s testing: mul_mat, [4, 2, 1, 4] = [2, 4, 1, 4] * [2, 2, 1, 4] 365s x0: [2, 4, 1, 4] 365s -0.019 0.607 365s 0.595 -0.966 365s -0.170 0.329 365s 0.954 0.419 365s 365s x1: [2, 2, 1, 4] 365s 0.380 -0.133 365s 0.004 0.506 365s 365s y: [4, 2, 1, 4] 365s -0.088 0.354 -0.108 0.307 365s 0.307 -0.486 0.166 0.216 365s test-mul-mat0: iter:96/500 365s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 365s x0: [2, 1, 1, 1] 365s 0.583 0.199 365s 365s x1: [2, 4, 1, 1] 365s -0.667 0.011 365s 0.784 -0.439 365s 0.579 -0.939 365s 0.382 0.956 365s 365s y: [1, 4, 1, 1] 365s -0.387 365s 0.370 365s 0.151 365s 0.413 365s testing: mul_mat, [2, 1, 2, 1] = [2, 2, 2, 1] * [2, 1, 2, 1] 365s x0: [2, 2, 2, 1] 365s -0.716 0.364 365s 0.417 0.522 365s 365s x1: [2, 1, 2, 1] 365s -0.397 0.290 365s 365s y: [2, 1, 2, 1] 365s 0.390 -0.014 365s testing: mul_mat, [2, 2, 2, 1] = [2, 2, 2, 1] * [2, 2, 2, 1] 365s x0: [2, 2, 2, 1] 365s 0.247 0.968 365s 0.532 0.308 365s 365s x1: [2, 2, 2, 1] 365s -0.661 -0.225 365s 0.259 0.098 365s 365s y: [2, 2, 2, 1] 365s -0.381 -0.421 365s 0.158 0.168 365s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 2, 1, 1] 365s -0.887 -0.631 365s 0.825 0.530 365s 365s x1: [2, 2, 1, 1] 365s -0.314 -0.259 365s -0.777 0.542 365s 365s y: [2, 2, 1, 1] 365s 0.442 -0.396 365s 0.347 -0.354 365s testing: mul_mat, [1, 2, 2, 1] = [2, 1, 2, 1] * [2, 2, 2, 1] 365s x0: [2, 1, 2, 1] 365s -0.499 -0.117 365s 365s x1: [2, 2, 2, 1] 365s 0.347 0.153 365s -0.293 -0.348 365s 365s y: [1, 2, 2, 1] 365s -0.191 365s 0.187 365s testing: mul_mat, [2, 2, 2, 1] = [1, 2, 2, 1] * [1, 2, 2, 1] 365s x0: [1, 2, 2, 1] 365s 0.225 365s 0.855 365s 365s x1: [1, 2, 2, 1] 365s -0.164 365s -0.147 365s 365s y: [2, 2, 2, 1] 365s -0.037 -0.140 365s -0.033 -0.126 365s test-mul-mat0: iter:97/500 365s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 365s x0: [1, 1, 1, 1] 365s 0.255 365s 365s x1: [1, 4, 1, 1] 365s -0.955 365s -0.505 365s 0.908 365s -0.586 365s 365s y: [1, 4, 1, 1] 365s -0.244 365s -0.129 365s 0.232 365s -0.150 365s testing: mul_mat, [3, 1, 3, 1] = [1, 3, 3, 1] * [1, 1, 3, 1] 365s x0: [1, 3, 3, 1] 365s 0.778 365s 0.335 365s -0.808 365s 365s x1: [1, 1, 3, 1] 365s 0.566 365s 365s y: [3, 1, 3, 1] 365s 0.440 0.190 -0.458 365s testing: mul_mat, [3, 3, 3, 2] = [1, 3, 3, 2] * [1, 3, 3, 2] 365s x0: [1, 3, 3, 2] 365s -0.364 365s 0.829 365s -0.845 365s 365s x1: [1, 3, 3, 2] 365s -0.619 365s 0.620 365s -0.721 365s 365s y: [3, 3, 3, 2] 365s 0.226 -0.513 0.523 365s -0.226 0.514 -0.524 365s 0.263 -0.598 0.609 365s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 365s x0: [1, 3, 1, 1] 365s 0.598 365s 0.178 365s 0.432 365s 365s x1: [1, 3, 1, 1] 365s 0.625 365s -0.787 365s -0.580 365s 365s y: [3, 3, 1, 1] 365s 0.374 0.111 0.270 365s -0.471 -0.140 -0.340 365s -0.346 -0.103 -0.250 365s testing: mul_mat, [2, 1, 3, 1] = [3, 2, 3, 1] * [3, 1, 3, 1] 365s x0: [3, 2, 3, 1] 365s -0.243 -0.234 0.601 365s -0.821 0.757 -0.372 365s 365s x1: [3, 1, 3, 1] 365s 0.477 0.843 0.772 365s 365s y: [2, 1, 3, 1] 365s 0.150 -0.041 365s testing: mul_mat, [2, 3, 3, 2] = [2, 2, 3, 2] * [2, 3, 3, 2] 365s x0: [2, 2, 3, 2] 365s 0.317 0.057 365s -0.115 -0.922 365s 365s x1: [2, 3, 3, 2] 365s 0.884 0.890 365s -0.781 0.361 365s 0.733 0.991 365s 365s y: [2, 3, 3, 2] 365s 0.331 -0.923 365s -0.227 -0.243 365s 0.289 -0.998 365s test-mul-mat0: iter:98/500 365s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 365s x0: [1, 2, 1, 1] 365s -0.263 365s 0.976 365s 365s x1: [1, 1, 1, 1] 365s -0.341 365s 365s y: [2, 1, 1, 1] 365s 0.090 -0.333 365s testing: mul_mat, [4, 2, 2, 1] = [1, 4, 2, 1] * [1, 2, 2, 1] 365s x0: [1, 4, 2, 1] 365s 0.323 365s 0.034 365s -0.411 365s -0.451 365s 365s x1: [1, 2, 2, 1] 365s 0.236 365s 0.794 365s 365s y: [4, 2, 2, 1] 365s 0.076 0.008 -0.097 -0.106 365s 0.256 0.027 -0.326 -0.358 365s testing: mul_mat, [3, 4, 2, 1] = [1, 3, 2, 1] * [1, 4, 2, 1] 365s x0: [1, 3, 2, 1] 365s 0.189 365s -0.276 365s -0.003 365s 365s x1: [1, 4, 2, 1] 365s 0.840 365s 0.627 365s -0.505 365s -0.666 365s 365s y: [3, 4, 2, 1] 365s 0.159 -0.232 -0.003 365s 0.119 -0.173 -0.002 365s -0.096 0.139 0.002 365s -0.126 0.184 0.002 365s testing: mul_mat, [4, 3, 1, 1] = [1, 4, 1, 1] * [1, 3, 1, 1] 365s x0: [1, 4, 1, 1] 365s -0.398 365s 0.685 365s -0.048 365s -0.046 365s 365s x1: [1, 3, 1, 1] 365s -0.496 365s -0.192 365s 0.631 365s 365s y: [4, 3, 1, 1] 365s 0.197 -0.339 0.024 0.023 365s 0.076 -0.131 0.009 0.009 365s -0.251 0.432 -0.031 -0.029 365s testing: mul_mat, [2, 1, 2, 1] = [4, 2, 2, 1] * [4, 1, 2, 1] 365s x0: [4, 2, 2, 1] 365s 0.543 0.824 -0.025 -0.677 365s -0.064 0.549 0.703 0.887 365s 365s x1: [4, 1, 2, 1] 365s 0.007 0.986 0.543 0.556 365s 365s y: [2, 1, 2, 1] 365s 0.426 1.415 365s testing: mul_mat, [4, 4, 2, 1] = [2, 4, 2, 1] * [2, 4, 2, 1] 365s x0: [2, 4, 2, 1] 365s 0.991 0.750 365s 0.426 -0.086 365s 0.028 -0.413 365s 0.694 -0.203 365s 365s x1: [2, 4, 2, 1] 365s -0.830 0.291 365s -0.471 0.854 365s -0.758 0.483 365s -0.138 -0.772 365s 365s y: [4, 4, 2, 1] 365s -0.604 -0.379 -0.143 -0.635 365s 0.174 -0.274 -0.366 -0.501 365s -0.389 -0.364 -0.220 -0.624 365s -0.716 0.007 0.315 0.061 365s test-mul-mat0: iter:99/500 365s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 365s x0: [1, 4, 1, 1] 365s -0.121 365s 0.640 365s 0.621 365s 0.555 365s 365s x1: [1, 1, 1, 1] 365s 0.222 365s 365s y: [4, 1, 1, 1] 365s -0.027 0.142 0.138 0.123 365s testing: mul_mat, [1, 4, 2, 1] = [1, 1, 2, 1] * [1, 4, 2, 1] 365s x0: [1, 1, 2, 1] 365s 0.027 365s 365s x1: [1, 4, 2, 1] 365s 0.974 365s -0.263 365s 0.721 365s 0.852 365s 365s y: [1, 4, 2, 1] 365s 0.027 365s -0.007 365s 0.020 365s 0.023 365s testing: mul_mat, [3, 1, 2, 4] = [1, 3, 2, 4] * [1, 1, 2, 4] 365s x0: [1, 3, 2, 4] 365s 0.284 365s 0.952 365s 0.730 365s 365s x1: [1, 1, 2, 4] 365s 0.993 365s 365s y: [3, 1, 2, 4] 365s 0.282 0.945 0.725 365s testing: mul_mat, [4, 3, 1, 1] = [1, 4, 1, 1] * [1, 3, 1, 1] 365s x0: [1, 4, 1, 1] 365s 0.505 365s 0.869 365s 0.405 365s -0.212 365s 365s x1: [1, 3, 1, 1] 365s 0.170 365s -0.305 365s -0.713 365s 365s y: [4, 3, 1, 1] 365s 0.086 0.148 0.069 -0.036 365s -0.154 -0.265 -0.124 0.064 365s -0.360 -0.619 -0.289 0.151 365s testing: mul_mat, [3, 1, 2, 1] = [4, 3, 2, 1] * [4, 1, 2, 1] 365s x0: [4, 3, 2, 1] 365s -0.454 -0.494 -0.823 0.253 365s 0.246 0.664 0.359 -0.918 365s 0.605 0.698 0.608 -0.750 365s 365s x1: [4, 1, 2, 1] 365s 0.820 0.136 -0.211 0.625 365s 365s y: [3, 1, 2, 1] 365s -0.107 -0.357 -0.006 365s testing: mul_mat, [1, 4, 2, 4] = [3, 1, 2, 4] * [3, 4, 2, 4] 365s x0: [3, 1, 2, 4] 365s 0.003 -0.138 0.991 365s 365s x1: [3, 4, 2, 4] 365s -0.900 -0.769 -0.915 365s 0.835 -0.460 -0.235 365s -0.863 0.086 -0.989 365s 0.742 0.591 0.675 365s 365s y: [1, 4, 2, 4] 365s -0.804 365s -0.167 365s -0.995 365s 0.589 365s test-mul-mat0: iter:100/500 365s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 365s x0: [3, 1, 1, 1] 365s -0.241 -0.852 0.192 365s 365s x1: [3, 1, 1, 1] 365s -0.470 0.897 -0.843 365s 365s y: [1, 1, 1, 1] 365s -0.814 365s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 365s x0: [3, 2, 1, 1] 365s -0.048 -0.746 -0.447 365s 0.714 0.220 0.641 365s 365s x1: [3, 1, 1, 1] 365s 0.272 0.929 0.008 365s 365s y: [2, 1, 1, 1] 365s -0.710 0.405 365s testing: mul_mat, [4, 2, 1, 3] = [3, 4, 1, 3] * [3, 2, 1, 3] 365s x0: [3, 4, 1, 3] 365s -0.356 -0.589 0.746 365s -0.427 0.419 -0.252 365s 0.525 0.673 0.301 365s 0.240 -0.107 -0.058 365s 365s x1: [3, 2, 1, 3] 365s -0.986 0.545 0.732 365s -0.635 0.226 -0.046 365s 365s y: [4, 2, 1, 3] 365s 0.576 0.465 0.069 -0.337 365s 0.058 0.377 -0.195 -0.174 365s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 365s x0: [3, 4, 1, 1] 365s 0.973 0.088 -0.327 365s 0.882 0.321 0.628 365s 0.655 0.995 -0.324 365s -0.924 -0.366 0.244 365s 365s x1: [3, 4, 1, 1] 365s -0.399 -0.216 -0.926 365s 0.902 -0.976 -0.033 365s -0.156 0.277 0.405 365s 0.517 -0.105 0.069 365s 365s y: [4, 4, 1, 1] 365s -0.104 -1.003 -0.176 0.222 365s 0.803 0.462 -0.369 -0.485 365s -0.260 0.206 0.042 0.142 365s 0.471 0.466 0.212 -0.423 365s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 365s x0: [4, 2, 1, 1] 365s -0.294 -0.547 -0.857 0.575 365s 0.073 0.602 0.597 -0.975 365s 365s x1: [4, 3, 1, 1] 365s 0.735 0.362 -0.774 0.588 365s -0.379 -0.373 -0.810 0.405 365s -0.299 -0.908 0.429 0.668 365s 365s y: [2, 3, 1, 1] 365s 0.589 -0.764 365s 1.243 -1.131 365s 0.601 -0.963 365s testing: mul_mat, [2, 4, 1, 3] = [2, 2, 1, 3] * [2, 4, 1, 3] 365s x0: [2, 2, 1, 3] 365s -0.732 -0.074 365s 0.549 0.865 365s 365s x1: [2, 4, 1, 3] 365s 0.252 0.651 365s 0.113 -0.427 365s 0.646 0.747 365s 0.246 0.274 365s 365s y: [2, 4, 1, 3] 365s -0.232 0.701 365s -0.051 -0.308 365s -0.528 1.001 365s -0.200 0.372 365s test-mul-mat0: iter:101/500 365s testing: mul_mat, [3, 3, 1, 1] = [2, 3, 1, 1] * [2, 3, 1, 1] 365s x0: [2, 3, 1, 1] 365s -0.763 0.525 365s -0.033 0.478 365s 0.235 -0.217 365s 365s x1: [2, 3, 1, 1] 365s -0.296 -0.980 365s -0.860 0.301 365s -0.574 0.482 365s 365s y: [3, 3, 1, 1] 365s -0.288 -0.459 0.143 365s 0.814 0.172 -0.268 365s 0.691 0.249 -0.240 365s testing: mul_mat, [4, 3, 1, 1] = [2, 4, 1, 1] * [2, 3, 1, 1] 365s x0: [2, 4, 1, 1] 365s 0.896 -0.967 365s -0.127 0.960 365s 0.723 0.294 365s 0.771 -0.341 365s 365s x1: [2, 3, 1, 1] 365s 0.425 -0.981 365s 0.051 -0.026 365s -0.055 -0.084 365s 365s y: [4, 3, 1, 1] 365s 1.329 -0.996 0.019 0.663 365s 0.071 -0.032 0.029 0.048 365s 0.032 -0.073 -0.064 -0.014 365s testing: mul_mat, [2, 4, 1, 2] = [2, 2, 1, 2] * [2, 4, 1, 2] 365s x0: [2, 2, 1, 2] 365s -0.514 0.882 365s -0.434 0.431 365s 365s x1: [2, 4, 1, 2] 365s -0.012 -0.994 365s -0.916 0.692 365s -0.974 -0.776 365s -0.007 -0.548 365s 365s y: [2, 4, 1, 2] 365s -0.871 -0.423 365s 1.082 0.695 365s -0.183 0.088 365s -0.480 -0.233 365s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 2, 1, 1] 365s 0.214 0.064 365s 0.820 0.298 365s 365s x1: [2, 2, 1, 1] 365s 0.538 0.287 365s -0.445 -0.169 365s 365s y: [2, 2, 1, 1] 365s 0.133 0.526 365s -0.106 -0.415 365s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 2, 1, 1] 365s 0.228 -0.769 365s 0.586 -0.540 365s 365s x1: [2, 2, 1, 1] 365s 0.512 0.090 365s 0.522 -0.495 365s 365s y: [2, 2, 1, 1] 365s 0.048 0.251 365s 0.500 0.573 365s testing: mul_mat, [1, 2, 1, 2] = [2, 1, 1, 2] * [2, 2, 1, 2] 365s x0: [2, 1, 1, 2] 365s 0.146 0.221 365s 365s x1: [2, 2, 1, 2] 365s 0.634 -0.602 365s -0.074 -0.535 365s 365s y: [1, 2, 1, 2] 365s -0.040 365s -0.129 365s test-mul-mat0: iter:102/500 365s testing: mul_mat, [1, 4, 1, 1] = [4, 1, 1, 1] * [4, 4, 1, 1] 365s x0: [4, 1, 1, 1] 365s 0.112 -0.171 -0.408 0.459 365s 365s x1: [4, 4, 1, 1] 365s -0.485 0.747 0.410 0.579 365s 0.044 -0.079 -0.331 -0.434 365s 0.426 -0.790 0.794 0.012 365s -0.559 -0.746 -0.354 -0.161 365s 365s y: [1, 4, 1, 1] 365s -0.083 365s -0.046 365s -0.136 365s 0.135 365s testing: mul_mat, [1, 1, 4, 1] = [4, 1, 4, 1] * [4, 1, 4, 1] 365s x0: [4, 1, 4, 1] 365s -0.527 -0.287 -0.965 -0.733 365s 365s x1: [4, 1, 4, 1] 365s 0.386 0.435 -0.395 -0.393 365s 365s y: [1, 1, 4, 1] 365s 0.340 365s testing: mul_mat, [1, 1, 4, 3] = [4, 1, 4, 3] * [4, 1, 4, 3] 365s x0: [4, 1, 4, 3] 365s 0.639 -0.130 0.211 -0.085 365s 365s x1: [4, 1, 4, 3] 365s 0.547 -0.862 0.332 -0.870 365s 365s y: [1, 1, 4, 3] 365s 0.605 365s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 365s x0: [4, 4, 1, 1] 365s 0.931 0.731 0.128 -0.208 365s -0.741 -0.094 -0.149 -0.363 365s -0.566 0.824 -0.817 0.981 365s -0.564 0.522 -0.439 -0.375 365s 365s x1: [4, 1, 1, 1] 365s 0.681 -0.167 0.821 -0.720 365s 365s y: [4, 1, 1, 1] 365s 0.767 -0.351 -1.900 -0.562 365s testing: mul_mat, [1, 4, 4, 1] = [4, 1, 4, 1] * [4, 4, 4, 1] 365s x0: [4, 1, 4, 1] 365s 0.481 0.055 0.944 -0.716 365s 365s x1: [4, 4, 4, 1] 365s 0.287 -0.691 0.618 0.480 365s 0.996 -0.234 -0.890 0.633 365s -0.603 -0.799 0.315 0.230 365s -0.978 0.595 0.956 0.953 365s 365s y: [1, 4, 4, 1] 365s 0.339 365s -0.827 365s -0.202 365s -0.217 365s testing: mul_mat, [4, 4, 4, 3] = [1, 4, 4, 3] * [1, 4, 4, 3] 365s x0: [1, 4, 4, 3] 365s 0.027 365s 0.814 365s -0.135 365s 0.357 365s 365s x1: [1, 4, 4, 3] 365s -0.090 365s 0.122 365s -0.761 365s 0.268 365s 365s y: [4, 4, 4, 3] 365s -0.002 -0.073 0.012 -0.032 365s 0.003 0.099 -0.017 0.044 365s -0.020 -0.620 0.103 -0.272 365s 0.007 0.218 -0.036 0.096 365s test-mul-mat0: iter:103/500 365s testing: mul_mat, [4, 3, 1, 1] = [1, 4, 1, 1] * [1, 3, 1, 1] 365s x0: [1, 4, 1, 1] 365s 0.561 365s 0.479 365s 0.673 365s 0.586 365s 365s x1: [1, 3, 1, 1] 365s 0.505 365s 0.520 365s -0.633 365s 365s y: [4, 3, 1, 1] 365s 0.283 0.242 0.340 0.296 365s 0.292 0.249 0.350 0.305 365s -0.355 -0.303 -0.426 -0.371 365s testing: mul_mat, [4, 4, 2, 1] = [1, 4, 2, 1] * [1, 4, 2, 1] 365s x0: [1, 4, 2, 1] 365s -0.223 365s 0.389 365s 0.803 365s -0.664 365s 365s x1: [1, 4, 2, 1] 365s 0.871 365s -0.518 365s -0.362 365s -0.922 365s 365s y: [4, 4, 2, 1] 365s -0.195 0.339 0.700 -0.578 365s 0.116 -0.201 -0.416 0.344 365s 0.081 -0.141 -0.291 0.240 365s 0.206 -0.359 -0.741 0.612 365s testing: mul_mat, [1, 4, 2, 2] = [1, 1, 2, 2] * [1, 4, 2, 2] 365s x0: [1, 1, 2, 2] 365s -0.741 365s 365s x1: [1, 4, 2, 2] 365s -0.076 365s -0.398 365s -0.495 365s -0.750 365s 365s y: [1, 4, 2, 2] 365s 0.056 365s 0.295 365s 0.367 365s 0.556 365s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 365s x0: [1, 1, 1, 1] 365s 0.847 365s 365s x1: [1, 1, 1, 1] 365s -0.813 365s 365s y: [1, 1, 1, 1] 365s -0.689 365s testing: mul_mat, [1, 1, 2, 1] = [1, 1, 2, 1] * [1, 1, 2, 1] 365s x0: [1, 1, 2, 1] 365s -0.620 365s 365s x1: [1, 1, 2, 1] 365s 0.576 365s 365s y: [1, 1, 2, 1] 365s -0.357 365s testing: mul_mat, [4, 1, 2, 2] = [1, 4, 2, 2] * [1, 1, 2, 2] 365s x0: [1, 4, 2, 2] 365s -0.226 365s -0.143 365s 0.117 365s -0.471 365s 365s x1: [1, 1, 2, 2] 365s -0.052 365s 365s y: [4, 1, 2, 2] 365s 0.012 0.007 -0.006 0.025 365s test-mul-mat0: iter:104/500 365s testing: mul_mat, [1, 4, 1, 1] = [4, 1, 1, 1] * [4, 4, 1, 1] 365s x0: [4, 1, 1, 1] 365s 0.209 -0.085 -0.478 0.204 365s 365s x1: [4, 4, 1, 1] 365s 0.618 -0.982 -0.643 0.802 365s -0.602 0.958 -0.250 -0.454 365s -0.518 -0.700 0.198 0.256 365s 0.157 -0.685 0.784 -0.159 365s 365s y: [1, 4, 1, 1] 365s 0.684 365s -0.181 365s -0.091 365s -0.316 365s testing: mul_mat, [3, 1, 1, 1] = [4, 3, 1, 1] * [4, 1, 1, 1] 365s x0: [4, 3, 1, 1] 365s -0.494 0.945 0.434 -0.052 365s 0.382 0.053 -0.034 0.740 365s -0.145 0.364 0.698 0.605 365s 365s x1: [4, 1, 1, 1] 365s -0.037 -0.037 -0.749 0.624 365s 365s y: [3, 1, 1, 1] 365s -0.374 0.471 -0.153 365s testing: mul_mat, [3, 3, 1, 4] = [4, 3, 1, 4] * [4, 3, 1, 4] 365s x0: [4, 3, 1, 4] 365s -0.989 0.342 -0.999 -0.721 365s -0.320 -0.268 0.600 -0.015 365s 0.203 0.776 -0.347 0.371 365s 365s x1: [4, 3, 1, 4] 365s 0.910 -0.820 0.905 0.108 365s 0.436 0.062 0.423 0.220 365s 0.903 0.686 -0.571 -0.182 365s 365s y: [3, 3, 1, 4] 365s -2.163 0.470 -0.726 365s -0.991 0.095 0.071 365s 0.043 -0.813 0.846 365s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 365s x0: [4, 3, 1, 1] 365s -0.439 -0.433 0.304 0.244 365s 0.480 0.777 0.673 0.346 365s 0.248 0.228 -0.807 0.811 365s 365s x1: [4, 3, 1, 1] 365s -0.466 -0.541 0.799 0.042 365s 0.098 -0.375 -0.018 0.203 365s 0.456 0.989 -0.444 0.463 365s 365s y: [3, 3, 1, 1] 365s 0.692 -0.091 -0.850 365s 0.164 -0.186 0.118 365s -0.650 0.849 1.072 365s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 365s x0: [3, 4, 1, 1] 365s 0.807 0.775 0.329 365s -0.688 -0.918 0.492 365s -0.866 -0.075 0.031 365s -0.637 0.255 -0.443 365s 365s x1: [3, 4, 1, 1] 365s -0.444 -0.424 0.661 365s 0.587 0.047 -0.569 365s -0.879 0.506 -0.769 365s 0.163 -0.396 -0.144 365s 365s y: [4, 4, 1, 1] 365s -0.469 1.020 0.437 -0.118 365s 0.322 -0.727 -0.529 -0.110 365s -0.570 -0.239 0.698 1.029 365s -0.223 0.181 -0.116 -0.141 365s testing: mul_mat, [1, 3, 1, 4] = [4, 1, 1, 4] * [4, 3, 1, 4] 365s x0: [4, 1, 1, 4] 365s 0.372 0.726 -0.424 0.118 365s 365s x1: [4, 3, 1, 4] 365s -0.204 -0.295 -0.250 -0.959 365s -0.949 -0.439 -0.404 -0.373 365s -0.778 -0.817 0.674 -0.347 365s 365s y: [1, 3, 1, 4] 365s -0.298 365s -0.545 365s -1.209 365s test-mul-mat0: iter:105/500 365s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 3, 1, 1] 365s 0.392 -0.703 365s 0.246 0.151 365s -0.569 0.771 365s 365s x1: [2, 2, 1, 1] 365s -0.223 -0.441 365s 0.099 0.947 365s 365s y: [3, 2, 1, 1] 365s 0.223 -0.121 -0.213 365s -0.627 0.167 0.673 365s testing: mul_mat, [2, 3, 3, 1] = [2, 2, 3, 1] * [2, 3, 3, 1] 365s x0: [2, 2, 3, 1] 365s 0.847 -0.699 365s -0.329 -0.054 365s 365s x1: [2, 3, 3, 1] 365s -0.477 -0.843 365s -0.653 0.641 365s 0.838 -0.250 365s 365s y: [2, 3, 3, 1] 365s 0.185 0.203 365s -1.001 0.181 365s 0.884 -0.263 365s testing: mul_mat, [3, 2, 3, 2] = [2, 3, 3, 2] * [2, 2, 3, 2] 365s x0: [2, 3, 3, 2] 365s 0.992 -0.833 365s -0.463 0.480 365s 0.142 -0.315 365s 365s x1: [2, 2, 3, 2] 365s -0.989 0.133 365s -0.505 0.652 365s 365s y: [3, 2, 3, 2] 365s -1.091 0.521 -0.183 365s -1.045 0.547 -0.278 365s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 365s x0: [2, 2, 1, 1] 365s -0.850 -0.712 365s -0.730 -0.532 365s 365s x1: [2, 3, 1, 1] 365s -0.216 0.622 365s -0.588 0.963 365s -0.234 0.657 365s 365s y: [2, 3, 1, 1] 365s -0.259 -0.173 365s -0.186 -0.083 365s -0.269 -0.178 365s testing: mul_mat, [1, 2, 3, 1] = [2, 1, 3, 1] * [2, 2, 3, 1] 365s x0: [2, 1, 3, 1] 365s -0.885 0.584 365s 365s x1: [2, 2, 3, 1] 365s 0.581 0.496 365s -0.766 0.018 365s 365s y: [1, 2, 3, 1] 365s -0.225 365s 0.688 365s testing: mul_mat, [2, 2, 3, 2] = [1, 2, 3, 2] * [1, 2, 3, 2] 365s x0: [1, 2, 3, 2] 365s -0.399 365s -0.001 365s 365s x1: [1, 2, 3, 2] 365s 0.246 365s -0.532 365s 365s y: [2, 2, 3, 2] 365s -0.098 -0.000 365s 0.213 0.001 365s test-mul-mat0: iter:106/500 365s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 365s x0: [4, 1, 1, 1] 365s -0.126 -0.990 0.250 0.979 365s 365s x1: [4, 2, 1, 1] 365s 0.158 -0.814 -0.306 0.626 365s 0.567 -0.649 -0.944 0.714 365s 365s y: [1, 2, 1, 1] 365s 1.322 365s 1.034 365s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 365s x0: [4, 1, 1, 1] 365s 0.433 0.021 -0.902 -0.630 365s 365s x1: [4, 1, 1, 1] 365s -0.225 -0.585 -0.420 0.774 365s 365s y: [1, 1, 1, 1] 365s -0.219 365s testing: mul_mat, [4, 1, 1, 3] = [4, 4, 1, 3] * [4, 1, 1, 3] 365s x0: [4, 4, 1, 3] 365s -0.104 0.426 0.094 -0.390 365s 0.091 0.968 -0.380 -0.659 365s 0.947 0.395 -0.243 -0.474 365s 0.169 0.356 0.960 -0.809 365s 365s x1: [4, 1, 1, 3] 365s -0.111 0.918 -0.406 -0.221 365s 365s y: [4, 1, 1, 3] 365s 0.451 1.178 0.461 0.097 365s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 365s x0: [4, 2, 1, 1] 365s -0.497 0.853 0.451 0.265 365s 0.280 0.186 0.422 -0.622 365s 365s x1: [4, 4, 1, 1] 365s -0.615 -0.482 0.109 0.308 365s -0.623 0.196 0.164 -0.511 365s -0.574 0.401 -0.987 -0.015 365s -0.287 -0.425 -0.561 0.451 365s 365s y: [2, 4, 1, 1] 365s 0.026 -0.408 365s 0.416 0.248 365s 0.178 -0.493 365s -0.353 -0.677 365s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 365s x0: [2, 1, 1, 1] 365s 0.265 -0.154 365s 365s x1: [2, 4, 1, 1] 365s 0.517 -0.487 365s -0.339 0.573 365s 0.951 -0.602 365s 0.957 -0.531 365s 365s y: [1, 4, 1, 1] 365s 0.212 365s -0.178 365s 0.344 365s 0.335 365s testing: mul_mat, [3, 2, 1, 3] = [1, 3, 1, 3] * [1, 2, 1, 3] 365s x0: [1, 3, 1, 3] 365s -0.459 365s 0.923 365s -0.447 365s 365s x1: [1, 2, 1, 3] 365s -0.297 365s -0.572 365s 365s y: [3, 2, 1, 3] 365s 0.136 -0.274 0.133 365s 0.262 -0.528 0.256 365s test-mul-mat0: iter:107/500 365s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 365s x0: [2, 3, 1, 1] 365s -0.483 0.972 365s -0.839 -0.526 365s -0.559 0.668 365s 365s x1: [2, 1, 1, 1] 365s -0.055 -0.980 365s 365s y: [3, 1, 1, 1] 365s -0.925 0.562 -0.624 365s testing: mul_mat, [4, 3, 4, 1] = [2, 4, 4, 1] * [2, 3, 4, 1] 365s x0: [2, 4, 4, 1] 365s -0.058 -0.035 365s 0.551 0.416 365s 0.406 0.220 365s 0.155 -0.307 365s 365s x1: [2, 3, 4, 1] 365s 0.739 0.287 365s -0.628 -0.833 365s -0.377 0.501 365s 365s y: [4, 3, 4, 1] 365s -0.053 0.527 0.363 0.026 365s 0.066 -0.693 -0.438 0.159 365s 0.004 0.000 -0.043 -0.212 365s testing: mul_mat, [3, 4, 4, 1] = [2, 3, 4, 1] * [2, 4, 4, 1] 365s x0: [2, 3, 4, 1] 365s -0.495 -0.089 365s -0.683 0.202 365s 0.508 -0.592 365s 365s x1: [2, 4, 4, 1] 365s -0.502 0.280 365s 0.542 0.904 365s -0.501 -0.303 365s -0.403 -0.909 365s 365s y: [3, 4, 4, 1] 365s 0.224 0.399 -0.421 365s -0.349 -0.188 -0.259 365s 0.275 0.281 -0.075 365s 0.280 0.092 0.333 365s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 365s x0: [2, 1, 1, 1] 365s 0.452 -0.037 365s 365s x1: [2, 3, 1, 1] 365s -0.637 0.985 365s 0.044 -0.099 365s -0.053 -0.948 365s 365s y: [1, 3, 1, 1] 365s -0.324 365s 0.024 365s 0.011 365s testing: mul_mat, [4, 2, 4, 1] = [1, 4, 4, 1] * [1, 2, 4, 1] 365s x0: [1, 4, 4, 1] 365s -0.481 365s -0.823 365s 0.513 365s -0.229 365s 365s x1: [1, 2, 4, 1] 365s 0.035 365s -0.347 365s 365s y: [4, 2, 4, 1] 365s -0.017 -0.029 0.018 -0.008 365s 0.167 0.285 -0.178 0.079 365s testing: mul_mat, [3, 1, 4, 1] = [4, 3, 4, 1] * [4, 1, 4, 1] 365s x0: [4, 3, 4, 1] 365s -0.566 -0.444 -0.823 -0.053 365s -0.081 -0.188 -0.736 -0.750 365s 0.785 -0.635 -0.502 -0.006 365s 365s x1: [4, 1, 4, 1] 365s 0.397 0.701 0.082 -0.885 365s 365s y: [3, 1, 4, 1] 365s -0.557 0.439 -0.170 365s test-mul-mat0: iter:108/500 365s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 365s x0: [3, 1, 1, 1] 365s -0.420 -0.234 -0.415 365s 365s x1: [3, 1, 1, 1] 365s -0.610 0.247 -0.113 365s 365s y: [1, 1, 1, 1] 365s 0.246 365s testing: mul_mat, [2, 1, 2, 1] = [3, 2, 2, 1] * [3, 1, 2, 1] 365s x0: [3, 2, 2, 1] 365s 0.498 0.837 -0.813 365s 0.520 -0.076 -0.445 365s 365s x1: [3, 1, 2, 1] 365s -0.166 -0.207 0.024 365s 365s y: [2, 1, 2, 1] 365s -0.275 -0.082 365s testing: mul_mat, [3, 2, 2, 1] = [3, 3, 2, 1] * [3, 2, 2, 1] 365s x0: [3, 3, 2, 1] 365s 0.540 -0.444 0.763 365s 0.280 0.804 -0.471 365s -0.223 0.641 -0.284 365s 365s x1: [3, 2, 2, 1] 365s 0.259 -0.542 -0.876 365s 0.582 0.094 -0.486 365s 365s y: [3, 2, 2, 1] 365s -0.288 0.049 -0.156 365s -0.098 0.467 0.069 365s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 365s x0: [3, 2, 1, 1] 365s 0.330 -0.248 0.663 365s 0.107 -0.130 -0.485 365s 365s x1: [3, 3, 1, 1] 365s -0.308 0.508 -0.886 365s -0.726 -0.398 -0.371 365s 0.102 -0.417 0.575 365s 365s y: [2, 3, 1, 1] 365s -0.815 0.330 365s -0.387 0.154 365s 0.518 -0.214 365s testing: mul_mat, [2, 3, 2, 1] = [2, 2, 2, 1] * [2, 3, 2, 1] 365s x0: [2, 2, 2, 1] 365s -0.706 0.438 365s -0.084 0.802 365s 365s x1: [2, 3, 2, 1] 365s -0.850 0.467 365s 0.045 -0.072 365s 0.108 0.761 365s 365s y: [2, 3, 2, 1] 365s 0.804 0.446 365s -0.063 -0.062 365s 0.257 0.601 365s testing: mul_mat, [2, 2, 2, 1] = [2, 2, 2, 1] * [2, 2, 2, 1] 365s x0: [2, 2, 2, 1] 365s -0.409 0.471 365s 0.338 0.636 365s 365s x1: [2, 2, 2, 1] 365s -0.186 -0.013 365s 0.235 0.324 365s 365s y: [2, 2, 2, 1] 365s 0.070 -0.071 365s 0.057 0.285 365s test-mul-mat0: iter:109/500 365s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 365s x0: [1, 3, 1, 1] 365s -0.012 365s 0.593 365s 0.300 365s 365s x1: [1, 3, 1, 1] 365s 0.353 365s 0.550 365s 0.791 365s 365s y: [3, 3, 1, 1] 365s -0.004 0.210 0.106 365s -0.006 0.326 0.165 365s -0.009 0.469 0.238 365s testing: mul_mat, [2, 3, 2, 1] = [1, 2, 2, 1] * [1, 3, 2, 1] 365s x0: [1, 2, 2, 1] 365s 0.301 365s -0.464 365s 365s x1: [1, 3, 2, 1] 365s -0.300 365s 0.997 365s -0.040 365s 365s y: [2, 3, 2, 1] 365s -0.090 0.139 365s 0.300 -0.463 365s -0.012 0.019 365s testing: mul_mat, [3, 2, 2, 4] = [1, 3, 2, 4] * [1, 2, 2, 4] 365s x0: [1, 3, 2, 4] 365s -0.077 365s 0.723 365s 0.888 365s 365s x1: [1, 2, 2, 4] 365s -0.460 365s 0.505 365s 365s y: [3, 2, 2, 4] 365s 0.035 -0.333 -0.408 365s -0.039 0.365 0.448 365s testing: mul_mat, [4, 3, 1, 1] = [1, 4, 1, 1] * [1, 3, 1, 1] 365s x0: [1, 4, 1, 1] 365s -0.685 365s -0.968 365s 0.942 365s 0.238 365s 365s x1: [1, 3, 1, 1] 365s 0.036 365s -0.620 365s 0.902 365s 365s y: [4, 3, 1, 1] 365s -0.025 -0.035 0.034 0.009 365s 0.425 0.600 -0.584 -0.148 365s -0.618 -0.873 0.849 0.215 365s testing: mul_mat, [4, 1, 2, 1] = [4, 4, 2, 1] * [4, 1, 2, 1] 365s x0: [4, 4, 2, 1] 365s -0.814 0.416 -0.206 0.408 365s 0.298 0.259 0.414 0.829 365s -0.135 0.002 -0.734 0.902 365s -0.319 0.546 -0.134 0.788 365s 365s x1: [4, 1, 2, 1] 365s 0.756 0.830 -0.138 0.476 365s 365s y: [4, 1, 2, 1] 365s -0.047 0.777 0.430 0.605 365s testing: mul_mat, [2, 4, 2, 4] = [4, 2, 2, 4] * [4, 4, 2, 4] 365s x0: [4, 2, 2, 4] 365s -0.556 -0.396 -0.664 0.449 365s 0.754 -0.437 -0.287 -0.574 365s 365s x1: [4, 4, 2, 4] 365s -0.278 0.042 -0.957 -0.862 365s -0.699 0.044 0.684 0.095 365s -0.542 0.950 0.961 0.867 365s 0.778 0.863 0.655 0.509 365s 365s y: [2, 4, 2, 4] 365s 0.387 0.542 365s -0.041 -0.797 365s -0.325 -1.596 365s -0.982 -0.270 365s test-mul-mat0: iter:110/500 365s testing: mul_mat, [3, 3, 1, 1] = [2, 3, 1, 1] * [2, 3, 1, 1] 365s x0: [2, 3, 1, 1] 365s -0.792 0.563 365s -0.563 -0.376 365s -0.371 0.355 365s 365s x1: [2, 3, 1, 1] 365s 0.092 0.179 365s -0.886 -0.255 365s 0.979 0.153 365s 365s y: [3, 3, 1, 1] 365s 0.028 -0.119 0.029 365s 0.558 0.595 0.238 365s -0.690 -0.609 -0.309 365s testing: mul_mat, [4, 3, 4, 1] = [2, 4, 4, 1] * [2, 3, 4, 1] 365s x0: [2, 4, 4, 1] 365s -0.856 -0.153 365s 0.765 -0.233 365s 0.476 0.120 365s -0.530 -0.322 365s 365s x1: [2, 3, 4, 1] 365s 0.703 0.202 365s 0.118 -0.296 365s 0.937 0.744 365s 365s y: [4, 3, 4, 1] 365s -0.633 0.491 0.359 -0.437 365s -0.056 0.159 0.021 0.032 365s -0.916 0.544 0.535 -0.736 365s testing: mul_mat, [4, 4, 4, 3] = [2, 4, 4, 3] * [2, 4, 4, 3] 365s x0: [2, 4, 4, 3] 365s 0.847 0.948 365s -0.692 0.272 365s -0.006 -0.222 365s 0.419 -0.605 365s 365s x1: [2, 4, 4, 3] 365s -0.258 0.496 365s -0.389 -0.782 365s -0.384 0.081 365s -0.104 -0.146 365s 365s y: [4, 4, 4, 3] 365s 0.252 0.314 -0.109 -0.408 365s -1.070 0.056 0.176 0.310 365s -0.248 0.288 -0.016 -0.210 365s -0.226 0.032 0.033 0.045 365s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 365s x0: [2, 1, 1, 1] 365s 0.434 -0.633 365s 365s x1: [2, 4, 1, 1] 365s 0.185 0.121 365s -0.935 -0.439 365s -0.244 0.800 365s -0.654 0.065 365s 365s y: [1, 4, 1, 1] 365s 0.004 365s -0.129 365s -0.612 365s -0.326 365s testing: mul_mat, [3, 2, 4, 1] = [1, 3, 4, 1] * [1, 2, 4, 1] 365s x0: [1, 3, 4, 1] 365s 0.975 365s 0.824 365s -0.654 365s 365s x1: [1, 2, 4, 1] 365s -0.342 365s -0.877 365s 365s y: [3, 2, 4, 1] 365s -0.334 -0.282 0.224 365s -0.855 -0.722 0.573 365s testing: mul_mat, [4, 1, 4, 3] = [3, 4, 4, 3] * [3, 1, 4, 3] 365s x0: [3, 4, 4, 3] 365s 0.420 -0.280 0.936 365s 0.106 -0.787 0.556 365s -0.464 0.090 -0.707 365s -0.501 -0.433 0.124 365s 365s x1: [3, 1, 4, 3] 365s -0.110 -0.761 0.774 365s 365s y: [4, 1, 4, 3] 365s 0.892 1.018 -0.565 0.480 365s test-mul-mat0: iter:111/500 365s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 365s x0: [4, 2, 1, 1] 365s -0.093 -0.529 -0.654 -0.424 365s -0.914 -0.711 -0.504 -0.587 365s 365s x1: [4, 3, 1, 1] 365s -0.855 0.586 0.746 0.154 365s 0.296 -0.482 -0.519 0.243 365s -0.906 -0.513 -0.095 -0.734 365s 365s y: [2, 3, 1, 1] 365s -0.782 -0.101 365s 0.463 0.191 365s 0.729 1.672 365s testing: mul_mat, [4, 2, 3, 1] = [4, 4, 3, 1] * [4, 2, 3, 1] 365s x0: [4, 4, 3, 1] 365s -0.520 -0.547 -0.533 -0.231 365s -0.051 -0.119 0.116 0.343 365s 0.364 0.236 -0.779 -0.390 365s 0.462 0.269 -0.449 -0.252 365s 365s x1: [4, 2, 3, 1] 365s -0.653 -0.606 -0.517 -0.880 365s -0.122 0.246 -0.774 0.049 365s 365s y: [4, 2, 3, 1] 365s 1.149 -0.256 0.365 -0.012 365s 0.330 -0.096 0.598 0.346 365s testing: mul_mat, [1, 4, 3, 3] = [4, 1, 3, 3] * [4, 4, 3, 3] 365s x0: [4, 1, 3, 3] 365s -0.207 0.234 0.732 -0.926 365s 365s x1: [4, 4, 3, 3] 365s 0.218 -0.319 0.322 -0.884 365s -0.987 -0.038 0.080 -0.012 365s 0.917 0.015 0.874 0.006 365s -0.172 0.142 -0.425 0.093 365s 365s y: [1, 4, 3, 3] 365s 0.934 365s 0.265 365s 0.448 365s -0.328 365s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 365s x0: [4, 1, 1, 1] 365s 0.711 -0.365 0.712 0.168 365s 365s x1: [4, 1, 1, 1] 365s 0.526 0.518 -0.056 -0.502 365s 365s y: [1, 1, 1, 1] 365s 0.061 365s testing: mul_mat, [2, 4, 3, 1] = [1, 2, 3, 1] * [1, 4, 3, 1] 365s x0: [1, 2, 3, 1] 365s -0.376 365s -0.215 365s 365s x1: [1, 4, 3, 1] 365s -0.648 365s -0.860 365s -0.248 365s 0.099 365s 365s y: [2, 4, 3, 1] 365s 0.244 0.139 365s 0.324 0.185 365s 0.093 0.053 365s -0.037 -0.021 365s testing: mul_mat, [4, 1, 3, 3] = [2, 4, 3, 3] * [2, 1, 3, 3] 365s x0: [2, 4, 3, 3] 365s 0.672 -0.140 365s 0.172 0.196 365s -0.083 0.684 365s 0.060 0.645 365s 365s x1: [2, 1, 3, 3] 365s 0.978 -0.361 365s 365s y: [4, 1, 3, 3] 365s 0.708 0.097 -0.329 -0.174 365s test-mul-mat0: iter:112/500 365s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 365s x0: [3, 2, 1, 1] 365s 0.587 -0.562 -0.181 365s -0.623 -0.932 -0.604 365s 365s x1: [3, 4, 1, 1] 365s -0.792 -0.179 -0.749 365s -0.607 -0.613 -0.606 365s -0.598 0.194 0.705 365s -0.624 -0.769 -0.793 365s 365s y: [2, 4, 1, 1] 365s -0.228 1.112 365s 0.098 1.316 365s -0.587 -0.234 365s 0.210 1.584 365s testing: mul_mat, [3, 2, 3, 1] = [3, 3, 3, 1] * [3, 2, 3, 1] 365s x0: [3, 3, 3, 1] 365s -0.849 0.052 0.612 365s -0.618 0.259 -0.186 365s 0.968 0.697 0.634 365s 365s x1: [3, 2, 3, 1] 365s 0.520 -0.842 -0.205 365s -0.790 -0.608 0.659 365s 365s y: [3, 2, 3, 1] 365s -0.611 -0.502 -0.214 365s 1.043 0.208 -0.771 365s testing: mul_mat, [4, 3, 3, 2] = [3, 4, 3, 2] * [3, 3, 3, 2] 365s x0: [3, 4, 3, 2] 365s 0.470 -0.598 0.706 365s 0.331 0.861 0.189 365s -0.360 -0.743 -0.850 365s -0.253 -0.175 -0.261 365s 365s x1: [3, 3, 3, 2] 365s -0.063 0.478 -0.203 365s -0.827 0.628 0.848 365s 0.785 -0.990 0.108 365s 365s y: [4, 3, 3, 2] 365s -0.459 0.352 -0.160 -0.014 365s -0.166 0.428 -0.891 -0.122 365s 1.037 -0.573 0.362 -0.054 365s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 365s x0: [3, 4, 1, 1] 365s 0.980 0.324 -0.659 365s 0.832 0.721 -0.234 365s 0.977 0.266 -0.560 365s -0.529 -0.709 0.168 365s 365s x1: [3, 4, 1, 1] 365s -0.463 -0.698 0.561 365s 0.364 -0.099 0.621 365s -0.678 0.415 0.312 365s -0.402 -0.255 0.073 365s 365s y: [4, 4, 1, 1] 365s -1.049 -1.020 -0.952 0.834 365s -0.084 0.087 -0.018 -0.018 365s -0.735 -0.337 -0.726 0.116 365s -0.525 -0.536 -0.502 0.406 365s testing: mul_mat, [4, 3, 3, 1] = [4, 4, 3, 1] * [4, 3, 3, 1] 365s x0: [4, 4, 3, 1] 365s -0.336 -0.963 0.889 -0.045 365s -0.170 0.289 -0.226 0.373 365s -0.999 -0.768 -0.974 0.068 365s -0.713 -0.341 0.562 0.112 365s 365s x1: [4, 3, 3, 1] 365s -0.167 -0.039 -0.160 0.722 365s -0.997 -0.894 -0.741 -0.695 365s 0.667 0.623 0.207 0.288 365s 365s y: [4, 3, 3, 1] 365s -0.081 0.323 0.402 0.123 365s 0.569 -0.181 2.357 0.522 365s -0.653 0.127 -1.326 -0.540 365s testing: mul_mat, [2, 4, 3, 2] = [4, 2, 3, 2] * [4, 4, 3, 2] 365s x0: [4, 2, 3, 2] 365s 0.283 0.653 0.980 -0.385 365s -0.706 -0.906 0.784 0.715 365s 365s x1: [4, 4, 3, 2] 365s -0.134 -0.772 -0.982 0.884 365s 0.438 -0.855 -0.537 -0.965 365s -0.045 0.563 -0.140 -0.830 365s 0.188 -0.981 -0.950 0.644 365s 365s y: [2, 4, 3, 2] 365s -1.845 0.656 365s -0.590 -0.646 365s 0.537 -1.181 365s -1.767 0.470 365s test-mul-mat0: iter:113/500 365s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 365s x0: [4, 3, 1, 1] 365s -0.862 0.819 0.567 -0.428 365s 0.359 0.714 0.871 -0.555 365s 0.941 0.192 0.223 -0.807 365s 365s x1: [4, 3, 1, 1] 365s 0.401 0.031 0.532 0.618 365s -0.848 -0.402 -0.514 -0.406 365s -0.851 -0.517 -0.681 0.994 365s 365s y: [3, 3, 1, 1] 365s -0.283 0.287 0.003 365s 0.285 -0.814 -0.662 365s -0.502 -1.820 -1.854 365s testing: mul_mat, [1, 3, 4, 1] = [4, 1, 4, 1] * [4, 3, 4, 1] 365s x0: [4, 1, 4, 1] 365s 0.960 0.621 0.365 -0.187 365s 365s x1: [4, 3, 4, 1] 365s -0.775 -0.237 -0.719 0.164 365s -0.154 -0.655 -0.434 0.877 365s 0.878 -0.816 -0.971 -0.524 365s 365s y: [1, 3, 4, 1] 365s -1.184 365s -0.877 365s 0.079 365s testing: mul_mat, [1, 1, 4, 3] = [4, 1, 4, 3] * [4, 1, 4, 3] 365s x0: [4, 1, 4, 3] 365s -0.165 0.299 -0.764 0.251 365s 365s x1: [4, 1, 4, 3] 365s -0.453 -0.825 -0.106 0.216 365s 365s y: [1, 1, 4, 3] 365s -0.037 365s testing: mul_mat, [3, 1, 1, 1] = [4, 3, 1, 1] * [4, 1, 1, 1] 365s x0: [4, 3, 1, 1] 365s 0.515 -0.224 0.835 0.730 365s -0.109 -0.264 -0.264 -0.965 365s 0.182 0.754 0.024 -0.469 365s 365s x1: [4, 1, 1, 1] 365s 0.794 -0.999 0.827 0.327 365s 365s y: [3, 1, 1, 1] 365s 1.563 -0.357 -0.741 365s testing: mul_mat, [3, 4, 4, 1] = [3, 3, 4, 1] * [3, 4, 4, 1] 365s x0: [3, 3, 4, 1] 365s 0.535 -0.289 0.641 365s 0.517 0.454 0.226 365s -0.732 -0.918 -0.369 365s 365s x1: [3, 4, 4, 1] 365s -0.125 -0.428 0.217 365s 0.574 0.789 -0.104 365s -0.284 0.576 -0.518 365s -0.240 -0.631 0.065 365s 365s y: [3, 4, 4, 1] 365s 0.196 -0.210 0.405 365s 0.012 0.631 -1.106 365s -0.650 -0.002 -0.130 365s 0.096 -0.396 0.731 365s testing: mul_mat, [4, 3, 4, 3] = [3, 4, 4, 3] * [3, 3, 4, 3] 365s x0: [3, 4, 4, 3] 365s -0.496 -0.966 -0.399 365s -0.531 -0.822 0.109 365s -0.814 0.187 -0.615 365s 0.112 -0.083 -0.732 365s 365s x1: [3, 3, 4, 3] 365s -0.849 0.060 0.994 365s -0.217 -0.344 -0.277 365s 0.999 -0.263 0.828 365s 365s y: [4, 3, 4, 3] 365s -0.034 0.509 0.090 -0.827 365s 0.551 0.368 0.283 0.207 365s -0.571 -0.223 -1.371 -0.472 365s test-mul-mat0: iter:114/500 365s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 365s x0: [1, 2, 1, 1] 365s 0.938 365s 0.822 365s 365s x1: [1, 4, 1, 1] 365s 0.886 365s 0.706 365s -0.432 365s 0.615 365s 365s y: [2, 4, 1, 1] 365s 0.831 0.728 365s 0.662 0.581 365s -0.405 -0.355 365s 0.577 0.506 365s testing: mul_mat, [3, 2, 2, 1] = [1, 3, 2, 1] * [1, 2, 2, 1] 365s x0: [1, 3, 2, 1] 365s -0.560 365s -0.906 365s 0.587 365s 365s x1: [1, 2, 2, 1] 365s 0.842 365s 0.036 365s 365s y: [3, 2, 2, 1] 365s -0.472 -0.763 0.494 365s -0.020 -0.032 0.021 365s testing: mul_mat, [1, 3, 2, 1] = [1, 1, 2, 1] * [1, 3, 2, 1] 365s x0: [1, 1, 2, 1] 365s -0.395 365s 365s x1: [1, 3, 2, 1] 365s 0.784 365s 0.809 365s -0.160 365s 365s y: [1, 3, 2, 1] 365s -0.310 365s -0.320 365s 0.063 365s testing: mul_mat, [3, 1, 1, 1] = [1, 3, 1, 1] * [1, 1, 1, 1] 365s x0: [1, 3, 1, 1] 365s -0.316 365s 0.406 365s -0.593 365s 365s x1: [1, 1, 1, 1] 365s -0.966 365s 365s y: [3, 1, 1, 1] 365s 0.305 -0.392 0.572 365s testing: mul_mat, [4, 1, 2, 1] = [3, 4, 2, 1] * [3, 1, 2, 1] 365s x0: [3, 4, 2, 1] 365s -0.053 0.453 -0.824 365s 0.013 0.179 0.274 365s 0.085 0.957 -0.619 365s -0.629 -0.403 0.985 365s 365s x1: [3, 1, 2, 1] 365s -0.609 0.974 -0.977 365s 365s y: [4, 1, 2, 1] 365s 1.279 -0.101 1.486 -0.972 365s testing: mul_mat, [3, 3, 2, 1] = [4, 3, 2, 1] * [4, 3, 2, 1] 365s x0: [4, 3, 2, 1] 365s -0.176 -0.749 -0.931 0.042 365s -0.762 0.941 0.617 0.205 365s 0.653 0.029 -0.572 0.671 365s 365s x1: [4, 3, 2, 1] 365s 0.818 0.676 0.398 -0.027 365s 0.588 0.243 0.970 -0.465 365s -0.743 0.055 -0.094 0.710 365s 365s y: [3, 3, 2, 1] 365s -1.023 0.254 0.308 365s -1.208 0.284 -0.476 365s 0.207 0.706 0.046 365s test-mul-mat0: iter:115/500 365s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 365s x0: [4, 3, 1, 1] 365s 0.252 -0.188 0.178 -0.007 365s -0.260 0.246 0.639 -0.623 365s 0.548 0.148 -0.679 -0.739 365s 365s x1: [4, 3, 1, 1] 365s 0.979 -0.749 -0.969 -0.197 365s -0.511 0.684 0.054 -0.570 365s -0.287 0.123 -0.953 0.141 365s 365s y: [3, 3, 1, 1] 365s 0.217 -0.935 1.230 365s -0.244 0.691 0.206 365s -0.266 -0.592 0.404 365s testing: mul_mat, [1, 3, 2, 1] = [4, 1, 2, 1] * [4, 3, 2, 1] 365s x0: [4, 1, 2, 1] 365s -0.639 -0.787 0.789 0.509 365s 365s x1: [4, 3, 2, 1] 365s -0.205 -0.048 0.369 -0.307 365s 0.262 -0.285 -0.328 0.512 365s -0.254 0.475 -0.999 -0.570 365s 365s y: [1, 3, 2, 1] 365s 0.304 365s 0.059 365s -1.290 365s testing: mul_mat, [3, 1, 2, 4] = [4, 3, 2, 4] * [4, 1, 2, 4] 365s x0: [4, 3, 2, 4] 365s -0.346 0.772 -0.535 -0.085 365s -0.347 0.610 0.176 0.963 365s 0.284 0.869 -0.582 -0.390 365s 365s x1: [4, 1, 2, 4] 365s 0.419 -0.002 -0.252 -0.865 365s 365s y: [3, 1, 2, 4] 365s 0.062 -1.025 0.602 365s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 365s x0: [4, 1, 1, 1] 365s -0.801 0.356 0.146 0.077 365s 365s x1: [4, 3, 1, 1] 365s 0.180 0.682 0.148 0.075 365s -0.953 0.013 -0.541 0.512 365s -0.522 -0.632 0.231 0.295 365s 365s y: [1, 3, 1, 1] 365s 0.125 365s 0.729 365s 0.250 365s testing: mul_mat, [1, 4, 2, 1] = [1, 1, 2, 1] * [1, 4, 2, 1] 365s x0: [1, 1, 2, 1] 365s 0.616 365s 365s x1: [1, 4, 2, 1] 365s -0.071 365s 0.120 365s -0.583 365s -0.547 365s 365s y: [1, 4, 2, 1] 365s -0.044 365s 0.074 365s -0.360 365s -0.337 365s testing: mul_mat, [4, 1, 2, 4] = [1, 4, 2, 4] * [1, 1, 2, 4] 365s x0: [1, 4, 2, 4] 365s 0.877 365s -0.145 365s 0.745 365s -0.755 365s 365s x1: [1, 1, 2, 4] 365s 0.163 365s 365s y: [4, 1, 2, 4] 365s 0.143 -0.024 0.121 -0.123 365s test-mul-mat0: iter:116/500 365s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 365s x0: [4, 2, 1, 1] 365s -0.702 -0.215 -0.222 0.822 365s 0.169 -0.628 -0.548 0.897 365s 365s x1: [4, 3, 1, 1] 365s -0.064 0.258 0.457 0.332 365s 0.393 -0.180 -0.343 -0.351 365s -0.402 -0.565 0.608 0.563 365s 365s y: [2, 3, 1, 1] 365s 0.161 -0.126 365s -0.450 0.053 365s 0.731 0.458 365s testing: mul_mat, [2, 2, 1, 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 365s x0: [4, 2, 1, 1] 365s 0.753 -0.681 0.592 0.210 365s 0.651 -0.015 -0.970 -0.692 365s 365s x1: [4, 2, 1, 1] 365s 0.261 0.336 -0.957 0.493 365s 0.487 -0.934 -0.420 0.382 365s 365s y: [2, 2, 1, 1] 365s -0.495 0.752 365s 0.834 0.473 365s testing: mul_mat, [4, 2, 1, 2] = [4, 4, 1, 2] * [4, 2, 1, 2] 365s x0: [4, 4, 1, 2] 365s 0.446 -0.365 -0.849 -0.974 365s -0.982 -0.515 0.778 -0.663 365s -0.923 -0.011 0.987 0.063 365s 0.018 -0.705 -0.303 0.646 365s 365s x1: [4, 2, 1, 2] 365s 0.634 -0.372 -0.257 0.242 365s -0.809 0.542 0.540 -0.024 365s 365s y: [4, 2, 1, 2] 365s 0.401 -0.792 -0.819 0.508 365s -0.993 0.952 1.272 -0.576 365s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 365s x0: [4, 2, 1, 1] 365s -0.717 -0.026 0.590 -0.945 365s -0.309 0.097 0.814 0.181 365s 365s x1: [4, 4, 1, 1] 365s -0.563 0.236 0.014 -0.545 365s 0.721 -0.207 -0.209 0.799 365s 0.781 -0.221 -0.139 -0.201 365s 0.074 0.558 -0.554 -0.888 365s 365s y: [2, 4, 1, 1] 365s 0.921 0.110 365s -1.389 -0.268 365s -0.446 -0.412 365s 0.446 -0.580 365s testing: mul_mat, [3, 4, 1, 1] = [2, 3, 1, 1] * [2, 4, 1, 1] 365s x0: [2, 3, 1, 1] 365s 0.160 0.952 365s 0.097 -0.104 365s 0.519 0.300 365s 365s x1: [2, 4, 1, 1] 365s -0.533 0.055 365s -0.276 -0.732 365s -0.860 0.712 365s -0.294 0.376 365s 365s y: [3, 4, 1, 1] 365s -0.033 -0.058 -0.260 365s -0.741 0.049 -0.363 365s 0.540 -0.158 -0.233 365s 0.311 -0.068 -0.040 365s testing: mul_mat, [2, 2, 1, 2] = [3, 2, 1, 2] * [3, 2, 1, 2] 365s x0: [3, 2, 1, 2] 365s 0.600 0.378 0.035 365s 0.980 -0.934 -0.899 365s 365s x1: [3, 2, 1, 2] 365s -0.270 0.757 -0.901 365s 0.804 0.315 -0.455 365s 365s y: [2, 2, 1, 2] 365s 0.093 -0.161 365s 0.585 0.903 365s test-mul-mat0: iter:117/500 365s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 365s x0: [3, 1, 1, 1] 365s 0.456 -0.313 0.617 365s 365s x1: [3, 1, 1, 1] 365s -0.456 0.357 -0.117 365s 365s y: [1, 1, 1, 1] 365s -0.392 365s testing: mul_mat, [2, 1, 3, 1] = [3, 2, 3, 1] * [3, 1, 3, 1] 365s x0: [3, 2, 3, 1] 365s -0.217 0.879 0.769 365s -0.116 -0.786 -0.056 365s 365s x1: [3, 1, 3, 1] 365s -0.999 -0.398 -0.572 365s 365s y: [2, 1, 3, 1] 365s -0.573 0.460 365s testing: mul_mat, [3, 2, 3, 2] = [3, 3, 3, 2] * [3, 2, 3, 2] 365s x0: [3, 3, 3, 2] 365s 0.097 -0.093 0.342 365s -0.210 0.302 0.402 365s 0.218 -0.520 0.341 365s 365s x1: [3, 2, 3, 2] 365s -0.892 0.177 0.606 365s -0.891 0.780 -0.966 365s 365s y: [3, 2, 3, 2] 365s 0.104 0.484 -0.080 365s -0.489 0.034 -0.929 365s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 365s x0: [3, 4, 1, 1] 365s -0.337 0.816 -0.671 365s 0.974 0.849 0.197 365s 0.911 0.576 -0.118 365s 0.449 0.072 -0.888 365s 365s x1: [3, 3, 1, 1] 365s 0.713 -0.714 -0.222 365s 0.045 0.235 0.515 365s -0.705 0.278 0.349 365s 365s y: [4, 3, 1, 1] 365s -0.674 0.045 0.264 0.466 365s -0.170 0.345 0.115 -0.421 365s 0.230 -0.381 -0.523 -0.607 365s testing: mul_mat, [4, 3, 3, 1] = [4, 4, 3, 1] * [4, 3, 3, 1] 365s x0: [4, 4, 3, 1] 365s -0.823 -0.052 -0.461 0.307 365s -0.582 0.184 0.276 0.462 365s 0.345 0.698 0.756 0.919 365s -0.586 -0.287 0.242 0.248 365s 365s x1: [4, 3, 3, 1] 365s 0.719 0.881 0.192 -0.995 365s 0.846 -0.138 -0.476 -0.755 365s 0.061 -0.762 -0.469 0.839 365s 365s y: [4, 3, 3, 1] 365s -1.032 -0.663 0.094 -0.874 365s -0.702 -0.998 -0.858 -0.758 365s 0.463 0.082 -0.095 0.277 365s testing: mul_mat, [1, 4, 3, 2] = [4, 1, 3, 2] * [4, 4, 3, 2] 365s x0: [4, 1, 3, 2] 365s 0.581 0.967 0.284 0.576 365s 365s x1: [4, 4, 3, 2] 365s 0.738 0.732 -0.819 0.714 365s -0.110 0.456 -0.864 -0.722 365s -0.622 -0.115 0.090 -0.048 365s 0.806 -0.814 0.703 0.145 365s 365s y: [1, 4, 3, 2] 365s 1.315 365s -0.285 365s -0.475 365s -0.036 365s test-mul-mat0: iter:118/500 365s testing: mul_mat, [4, 3, 1, 1] = [2, 4, 1, 1] * [2, 3, 1, 1] 365s x0: [2, 4, 1, 1] 365s 0.319 -0.367 365s 0.044 -0.829 365s 0.957 -0.206 365s -0.018 0.589 365s 365s x1: [2, 3, 1, 1] 365s -0.683 0.548 365s 0.758 0.898 365s 0.515 0.042 365s 365s y: [4, 3, 1, 1] 365s -0.419 -0.484 -0.767 0.335 365s -0.088 -0.711 0.541 0.515 365s 0.149 -0.012 0.484 0.015 365s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 365s x0: [2, 4, 1, 1] 365s 0.881 0.552 365s 0.764 -0.614 365s 0.372 -0.074 365s 0.543 0.689 365s 365s x1: [2, 4, 1, 1] 365s 0.538 0.533 365s 0.531 -0.660 365s 0.337 -0.777 365s -0.315 0.777 365s 365s y: [4, 4, 1, 1] 365s 0.768 0.083 0.160 0.659 365s 0.104 0.812 0.247 -0.166 365s -0.132 0.735 0.183 -0.352 365s 0.152 -0.718 -0.175 0.364 365s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 365s x0: [2, 2, 1, 1] 365s 0.478 -0.067 365s -0.102 -0.541 365s 365s x1: [2, 4, 1, 1] 365s -0.527 0.301 365s 0.586 0.988 365s -0.657 0.060 365s 0.307 -0.024 365s 365s y: [2, 4, 1, 1] 365s -0.272 -0.109 365s 0.214 -0.594 365s -0.318 0.035 365s 0.148 -0.018 365s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 2, 1, 1] 365s 0.330 0.461 365s -0.724 0.107 365s 365s x1: [2, 2, 1, 1] 365s -0.478 -0.564 365s 0.992 -0.947 365s 365s y: [2, 2, 1, 1] 365s -0.418 0.286 365s -0.109 -0.820 365s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 4, 1, 1] 365s -0.968 -0.272 365s -0.745 -0.428 365s 0.271 0.307 365s 0.720 -0.284 365s 365s x1: [2, 2, 1, 1] 365s 0.565 0.342 365s -0.340 0.329 365s 365s y: [4, 2, 1, 1] 365s -0.640 -0.567 0.258 0.309 365s 0.240 0.113 0.009 -0.339 365s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 365s x0: [4, 1, 1, 1] 365s 0.346 0.804 0.952 0.399 365s 365s x1: [4, 2, 1, 1] 365s -0.084 -0.634 -0.977 0.892 365s -0.530 0.500 -0.175 0.368 365s 365s y: [1, 2, 1, 1] 365s -1.113 365s 0.199 365s test-mul-mat0: iter:119/500 365s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 365s x0: [3, 2, 1, 1] 365s -0.177 -0.617 -0.200 365s 0.551 -0.045 -0.893 365s 365s x1: [3, 2, 1, 1] 365s -0.611 0.240 0.383 365s 0.049 -0.431 0.111 365s 365s y: [2, 2, 1, 1] 365s -0.117 -0.690 365s 0.235 -0.053 365s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 365s x0: [3, 4, 1, 1] 365s -0.412 -0.688 0.749 365s 0.934 -0.884 0.701 365s 0.333 -0.304 -0.018 365s 0.008 -0.263 0.371 365s 365s x1: [3, 2, 1, 1] 365s -0.733 0.871 -0.527 365s -0.711 0.763 -0.056 365s 365s y: [4, 2, 1, 1] 365s -0.691 -1.823 -0.500 -0.431 365s -0.274 -1.377 -0.468 -0.227 365s testing: mul_mat, [4, 4, 1, 2] = [3, 4, 1, 2] * [3, 4, 1, 2] 365s x0: [3, 4, 1, 2] 365s -0.942 -0.050 0.969 365s 0.067 0.687 0.340 365s 0.315 0.807 0.760 365s -0.868 0.039 0.260 365s 365s x1: [3, 4, 1, 2] 365s -0.752 -0.879 -0.580 365s -0.183 0.232 0.500 365s 0.640 0.615 -0.699 365s 0.191 -0.430 -0.592 365s 365s y: [4, 4, 1, 2] 365s 0.190 -0.851 -1.388 0.467 365s 0.645 0.317 0.510 0.298 365s -1.311 0.227 0.166 -0.714 365s -0.732 -0.484 -0.737 -0.337 365s testing: mul_mat, [1, 4, 1, 1] = [3, 1, 1, 1] * [3, 4, 1, 1] 365s x0: [3, 1, 1, 1] 365s 0.826 0.509 0.047 365s 365s x1: [3, 4, 1, 1] 365s 0.706 -0.340 -0.188 365s -0.008 0.386 0.065 365s 0.978 0.444 -0.985 365s 0.947 -0.489 0.702 365s 365s y: [1, 4, 1, 1] 365s 0.401 365s 0.193 365s 0.987 365s 0.566 365s testing: mul_mat, [4, 3, 1, 1] = [1, 4, 1, 1] * [1, 3, 1, 1] 365s x0: [1, 4, 1, 1] 365s 0.202 365s 0.869 365s -0.308 365s -0.573 365s 365s x1: [1, 3, 1, 1] 365s 0.957 365s -0.452 365s -0.692 365s 365s y: [4, 3, 1, 1] 365s 0.194 0.832 -0.295 -0.549 365s -0.091 -0.393 0.139 0.259 365s -0.140 -0.601 0.213 0.397 365s testing: mul_mat, [4, 1, 1, 2] = [4, 4, 1, 2] * [4, 1, 1, 2] 365s x0: [4, 4, 1, 2] 365s -0.825 0.958 0.962 -0.534 365s 0.980 0.451 -0.043 -0.760 365s -0.993 0.255 0.192 0.746 365s 0.240 0.905 0.788 0.014 365s 365s x1: [4, 1, 1, 2] 365s 0.838 0.114 -0.908 -0.312 365s 365s y: [4, 1, 1, 2] 365s -1.290 1.149 -1.210 -0.416 365s test-mul-mat0: iter:120/500 365s testing: mul_mat, [1, 3, 1, 1] = [1, 1, 1, 1] * [1, 3, 1, 1] 365s x0: [1, 1, 1, 1] 365s 0.934 365s 365s x1: [1, 3, 1, 1] 365s -0.223 365s -0.023 365s 0.395 365s 365s y: [1, 3, 1, 1] 365s -0.208 365s -0.022 365s 0.369 365s testing: mul_mat, [1, 1, 4, 1] = [1, 1, 4, 1] * [1, 1, 4, 1] 365s x0: [1, 1, 4, 1] 365s 0.414 365s 365s x1: [1, 1, 4, 1] 365s -0.413 365s 365s y: [1, 1, 4, 1] 365s -0.171 365s testing: mul_mat, [2, 1, 4, 1] = [1, 2, 4, 1] * [1, 1, 4, 1] 365s x0: [1, 2, 4, 1] 365s 0.010 365s -0.011 365s 365s x1: [1, 1, 4, 1] 365s 0.792 365s 365s y: [2, 1, 4, 1] 365s 0.008 -0.009 365s testing: mul_mat, [3, 2, 1, 1] = [1, 3, 1, 1] * [1, 2, 1, 1] 365s x0: [1, 3, 1, 1] 365s -0.668 365s 0.127 365s -0.527 365s 365s x1: [1, 2, 1, 1] 365s -0.157 365s -0.650 365s 365s y: [3, 2, 1, 1] 365s 0.105 -0.020 0.083 365s 0.434 -0.082 0.342 365s testing: mul_mat, [4, 1, 4, 1] = [3, 4, 4, 1] * [3, 1, 4, 1] 365s x0: [3, 4, 4, 1] 365s -0.671 0.135 -0.361 365s -0.875 0.344 -0.405 365s -0.645 0.284 0.866 365s 0.525 -0.159 -0.586 365s 365s x1: [3, 1, 4, 1] 365s 0.728 0.866 -0.593 365s 365s y: [4, 1, 4, 1] 365s -0.157 -0.098 -0.737 0.592 365s testing: mul_mat, [2, 3, 4, 1] = [4, 2, 4, 1] * [4, 3, 4, 1] 365s x0: [4, 2, 4, 1] 365s 0.165 -0.678 0.570 0.197 365s -0.301 -0.637 0.067 -0.028 365s 365s x1: [4, 3, 4, 1] 365s -0.173 0.122 -0.597 0.618 365s -0.065 -0.995 0.216 0.889 365s -0.727 0.034 0.360 -0.043 365s 365s y: [2, 3, 4, 1] 365s -0.330 -0.083 365s 0.962 0.643 365s 0.054 0.222 365s test-mul-mat0: iter:121/500 365s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 365s x0: [4, 1, 1, 1] 365s -0.924 0.210 0.155 0.596 365s 365s x1: [4, 2, 1, 1] 365s 0.279 0.519 0.079 -0.054 365s 0.402 0.895 -0.162 0.165 365s 365s y: [1, 2, 1, 1] 365s -0.168 365s -0.110 365s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 365s x0: [4, 4, 1, 1] 365s -0.494 -0.152 0.306 -0.601 365s -0.255 -0.900 -0.763 -0.364 365s -0.940 -0.226 -0.152 -0.618 365s -0.118 -0.873 0.902 0.961 365s 365s x1: [4, 1, 1, 1] 365s 0.543 -0.567 0.286 -0.563 365s 365s y: [4, 1, 1, 1] 365s 0.244 0.359 -0.079 0.147 365s testing: mul_mat, [2, 4, 1, 2] = [4, 2, 1, 2] * [4, 4, 1, 2] 365s x0: [4, 2, 1, 2] 365s 0.482 0.846 -0.403 -0.282 365s 0.832 -0.724 0.110 0.414 365s 365s x1: [4, 4, 1, 2] 365s 0.073 0.304 0.856 0.910 365s -0.531 0.236 0.986 0.679 365s -0.609 0.582 0.223 -0.175 365s -0.133 0.659 0.074 0.374 365s 365s y: [2, 4, 1, 2] 365s -0.310 0.311 365s -0.646 -0.224 365s 0.158 -0.976 365s 0.359 -0.425 365s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 365s x0: [4, 1, 1, 1] 365s 0.885 -0.595 -0.847 0.512 365s 365s x1: [4, 2, 1, 1] 365s 0.427 -0.082 0.868 0.436 365s -0.194 -0.592 -0.780 0.542 365s 365s y: [1, 2, 1, 1] 365s -0.085 365s 1.120 365s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 365s x0: [1, 3, 1, 1] 365s 0.712 365s -0.982 365s 0.153 365s 365s x1: [1, 4, 1, 1] 365s -0.659 365s -0.720 365s -0.006 365s -0.814 365s 365s y: [3, 4, 1, 1] 365s -0.470 0.648 -0.101 365s -0.513 0.707 -0.110 365s -0.004 0.006 -0.001 365s -0.580 0.799 -0.124 365s testing: mul_mat, [4, 1, 1, 2] = [3, 4, 1, 2] * [3, 1, 1, 2] 365s x0: [3, 4, 1, 2] 365s 0.367 -0.607 0.428 365s 0.475 0.021 -0.550 365s 0.153 -0.770 -0.258 365s -0.206 0.200 -0.242 365s 365s x1: [3, 1, 1, 2] 365s -0.178 0.431 -0.348 365s 365s y: [4, 1, 1, 2] 365s -0.476 0.116 -0.270 0.207 365s test-mul-mat0: iter:122/500 365s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 365s x0: [3, 4, 1, 1] 365s 0.550 -0.930 0.226 365s -0.250 0.498 0.676 365s 0.492 -0.744 -0.989 365s -0.361 0.167 0.858 365s 365s x1: [3, 2, 1, 1] 365s -0.164 0.682 0.896 365s -0.798 0.157 0.049 365s 365s y: [4, 2, 1, 1] 365s -0.522 0.987 -1.475 0.942 365s -0.574 0.311 -0.558 0.356 365s testing: mul_mat, [3, 4, 2, 1] = [3, 3, 2, 1] * [3, 4, 2, 1] 365s x0: [3, 3, 2, 1] 365s -0.850 0.528 0.563 365s -0.211 -0.305 0.421 365s -0.084 0.053 -0.880 365s 365s x1: [3, 4, 2, 1] 365s -0.873 -0.642 -0.301 365s 0.801 -0.574 0.252 365s 0.492 0.647 -0.373 365s -0.384 0.519 -0.242 365s 365s y: [3, 4, 2, 1] 365s 0.233 0.253 0.304 365s -0.842 0.112 -0.319 365s -0.287 -0.458 0.321 365s 0.465 -0.179 0.272 365s testing: mul_mat, [2, 3, 2, 2] = [3, 2, 2, 2] * [3, 3, 2, 2] 365s x0: [3, 2, 2, 2] 365s 0.166 -0.146 -0.340 365s 0.623 0.627 -0.178 365s 365s x1: [3, 3, 2, 2] 365s -0.724 0.847 0.412 365s 0.930 -0.795 0.116 365s -0.420 -0.886 -0.110 365s 365s y: [2, 3, 2, 2] 365s -0.384 0.006 365s 0.231 0.061 365s 0.097 -0.797 365s testing: mul_mat, [3, 2, 1, 1] = [3, 3, 1, 1] * [3, 2, 1, 1] 365s x0: [3, 3, 1, 1] 365s -0.424 -0.801 0.831 365s -0.884 0.743 -0.708 365s -0.391 0.431 0.944 365s 365s x1: [3, 2, 1, 1] 365s 0.524 -0.176 -0.478 365s -0.822 0.409 0.262 365s 365s y: [3, 2, 1, 1] 365s -0.478 -0.255 -0.732 365s 0.238 0.845 0.746 365s testing: mul_mat, [4, 3, 2, 1] = [3, 4, 2, 1] * [3, 3, 2, 1] 365s x0: [3, 4, 2, 1] 365s 0.867 0.619 -0.687 365s 0.503 -0.118 -0.528 365s -0.727 -0.359 0.654 365s -0.558 0.362 -0.744 365s 365s x1: [3, 3, 2, 1] 365s 0.035 -0.447 -0.158 365s 0.233 0.443 0.179 365s -0.455 0.677 -0.566 365s 365s y: [4, 3, 2, 1] 365s -0.137 0.154 0.031 -0.064 365s 0.353 -0.030 -0.211 -0.103 365s 0.414 -0.010 -0.282 0.919 365s testing: mul_mat, [2, 3, 2, 2] = [4, 2, 2, 2] * [4, 3, 2, 2] 365s x0: [4, 2, 2, 2] 365s -0.646 0.158 -0.711 0.724 365s 0.734 0.900 0.557 -0.870 365s 365s x1: [4, 3, 2, 2] 365s -0.749 -0.573 -0.837 -0.602 365s -0.115 -0.596 -0.277 -0.249 365s 0.907 -0.004 0.194 0.526 365s 365s y: [2, 3, 2, 2] 365s 0.552 -1.008 365s -0.003 -0.559 365s -0.344 0.313 365s test-mul-mat0: iter:123/500 365s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 365s x0: [1, 3, 1, 1] 365s 0.975 365s -0.775 365s -0.548 365s 365s x1: [1, 3, 1, 1] 365s 0.365 365s 0.714 365s -0.374 365s 365s y: [3, 3, 1, 1] 365s 0.355 -0.283 -0.200 365s 0.696 -0.554 -0.392 365s -0.365 0.290 0.205 365s testing: mul_mat, [2, 3, 2, 1] = [1, 2, 2, 1] * [1, 3, 2, 1] 365s x0: [1, 2, 2, 1] 365s 0.258 365s -0.051 365s 365s x1: [1, 3, 2, 1] 365s 0.217 365s 0.544 365s 0.202 365s 365s y: [2, 3, 2, 1] 365s 0.056 -0.011 365s 0.140 -0.028 365s 0.052 -0.010 365s testing: mul_mat, [4, 2, 2, 1] = [1, 4, 2, 1] * [1, 2, 2, 1] 365s x0: [1, 4, 2, 1] 365s -0.060 365s -0.082 365s -0.813 365s 0.031 365s 365s x1: [1, 2, 2, 1] 365s 0.891 365s 0.299 365s 365s y: [4, 2, 2, 1] 365s -0.053 -0.073 -0.724 0.027 365s -0.018 -0.025 -0.243 0.009 365s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 365s x0: [1, 2, 1, 1] 365s -0.529 365s -0.541 365s 365s x1: [1, 4, 1, 1] 365s 0.359 365s -0.265 365s -0.754 365s -0.993 365s 365s y: [2, 4, 1, 1] 365s -0.190 -0.194 365s 0.140 0.143 365s 0.399 0.408 365s 0.526 0.537 365s testing: mul_mat, [4, 1, 2, 1] = [2, 4, 2, 1] * [2, 1, 2, 1] 365s x0: [2, 4, 2, 1] 365s 0.445 0.995 365s 0.067 0.626 365s -0.747 -0.714 365s -0.606 0.293 365s 365s x1: [2, 1, 2, 1] 365s 0.926 -0.986 365s 365s y: [4, 1, 2, 1] 365s -0.568 -0.555 0.012 -0.850 365s testing: mul_mat, [4, 2, 2, 1] = [4, 4, 2, 1] * [4, 2, 2, 1] 365s x0: [4, 4, 2, 1] 365s -0.630 -0.375 0.908 0.396 365s -0.661 0.077 -0.404 0.775 365s 0.783 -0.174 0.003 0.940 365s 0.996 0.355 -0.073 0.217 365s 365s x1: [4, 2, 2, 1] 365s 0.013 0.821 -0.608 -0.627 365s -0.444 -0.363 -0.620 0.266 365s 365s y: [4, 2, 2, 1] 365s -1.117 -0.186 -0.725 0.213 365s -0.042 0.723 -0.036 -0.468 365s test-mul-mat0: iter:124/500 365s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 365s x0: [3, 4, 1, 1] 365s -0.061 0.298 -0.564 365s -0.665 0.074 -0.623 365s 0.552 0.241 -0.310 365s 0.325 -0.955 0.382 365s 365s x1: [3, 1, 1, 1] 365s 0.601 -0.656 -0.705 365s 365s y: [4, 1, 1, 1] 365s 0.165 -0.009 0.392 0.552 365s testing: mul_mat, [4, 4, 3, 1] = [3, 4, 3, 1] * [3, 4, 3, 1] 365s x0: [3, 4, 3, 1] 365s -0.819 0.221 0.587 365s -0.531 -0.115 -0.177 365s 0.282 -0.933 -0.001 365s 0.695 0.593 0.469 365s 365s x1: [3, 4, 3, 1] 365s -0.637 0.957 -0.086 365s -0.406 -0.008 -0.055 365s 0.547 -0.857 -0.079 365s -0.712 0.664 0.236 365s 365s y: [4, 4, 3, 1] 365s 0.683 0.243 -1.072 0.084 365s 0.298 0.226 -0.106 -0.313 365s -0.684 -0.178 0.954 -0.165 365s 0.868 0.260 -0.820 0.010 365s testing: mul_mat, [3, 4, 3, 2] = [3, 3, 3, 2] * [3, 4, 3, 2] 365s x0: [3, 3, 3, 2] 365s -0.115 -0.832 0.964 365s -0.402 0.035 -0.980 365s -0.451 0.893 -0.347 365s 365s x1: [3, 4, 3, 2] 365s 0.743 0.490 0.633 365s -0.259 0.186 0.226 365s -0.789 -0.196 0.284 365s 0.449 -0.219 -0.858 365s 365s y: [3, 4, 3, 2] 365s 0.117 -0.901 -0.117 365s 0.093 -0.111 0.204 365s 0.528 0.032 0.083 365s -0.698 0.653 -0.100 365s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 365s x0: [3, 2, 1, 1] 365s 0.337 0.913 0.878 365s -0.754 0.795 -0.348 365s 365s x1: [3, 3, 1, 1] 365s -0.035 0.494 0.079 365s 0.611 -0.573 -0.321 365s 0.255 -0.100 0.163 365s 365s y: [2, 3, 1, 1] 365s 0.508 0.392 365s -0.599 -0.805 365s 0.138 -0.328 365s testing: mul_mat, [4, 3, 3, 1] = [2, 4, 3, 1] * [2, 3, 3, 1] 365s x0: [2, 4, 3, 1] 365s -0.073 -0.482 365s 0.355 0.300 365s -0.767 0.164 365s 0.826 -0.236 365s 365s x1: [2, 3, 3, 1] 365s -0.094 0.170 365s -0.485 -0.453 365s 0.955 -0.436 365s 365s y: [4, 3, 3, 1] 365s -0.075 0.018 0.100 -0.118 365s 0.254 -0.308 0.298 -0.294 365s 0.140 0.208 -0.804 0.892 365s testing: mul_mat, [2, 2, 3, 2] = [4, 2, 3, 2] * [4, 2, 3, 2] 365s x0: [4, 2, 3, 2] 365s 0.873 0.088 0.376 -0.086 365s -0.408 0.922 0.133 0.741 365s 365s x1: [4, 2, 3, 2] 365s -0.289 0.131 -0.489 -0.921 365s 0.631 0.187 0.057 -0.442 365s 365s y: [2, 2, 3, 2] 365s -0.345 -0.509 365s 0.627 -0.406 365s test-mul-mat0: iter:125/500 365s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 365s x0: [3, 2, 1, 1] 365s 0.484 0.581 -0.497 365s 0.485 -0.829 -0.036 365s 365s x1: [3, 3, 1, 1] 365s -0.574 0.199 0.340 365s 0.509 0.199 0.743 365s -0.063 0.405 -0.634 365s 365s y: [2, 3, 1, 1] 365s -0.331 -0.455 365s -0.007 0.055 365s 0.520 -0.343 365s testing: mul_mat, [4, 2, 4, 1] = [3, 4, 4, 1] * [3, 2, 4, 1] 365s x0: [3, 4, 4, 1] 365s -0.500 0.935 -0.698 365s 0.985 -0.894 0.266 365s -0.616 -0.391 -0.841 365s -0.078 0.860 -0.090 365s 365s x1: [3, 2, 4, 1] 365s -0.600 -0.497 -0.107 365s -0.462 0.251 -0.248 365s 365s y: [4, 2, 4, 1] 365s -0.089 -0.175 0.654 -0.371 365s 0.639 -0.745 0.396 0.274 365s testing: mul_mat, [1, 4, 4, 3] = [3, 1, 4, 3] * [3, 4, 4, 3] 365s x0: [3, 1, 4, 3] 365s -0.802 -0.071 0.527 365s 365s x1: [3, 4, 4, 3] 365s -0.060 -0.035 -0.282 365s 0.144 -0.179 0.614 365s 0.595 -0.638 -0.880 365s -0.966 -0.474 0.642 365s 365s y: [1, 4, 4, 3] 365s -0.098 365s 0.221 365s -0.896 365s 1.146 365s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 365s x0: [3, 2, 1, 1] 365s 0.167 -0.846 0.559 365s 0.003 0.919 -0.384 365s 365s x1: [3, 1, 1, 1] 365s -0.322 0.181 -0.458 365s 365s y: [2, 1, 1, 1] 365s -0.463 0.341 365s testing: mul_mat, [3, 3, 4, 1] = [2, 3, 4, 1] * [2, 3, 4, 1] 365s x0: [2, 3, 4, 1] 365s 0.676 0.595 365s 0.172 -0.269 365s -0.283 0.333 365s 365s x1: [2, 3, 4, 1] 365s -0.345 0.899 365s 0.012 0.239 365s 0.970 0.202 365s 365s y: [3, 3, 4, 1] 365s 0.302 -0.301 0.397 365s 0.150 -0.062 0.076 365s 0.777 0.113 -0.207 365s testing: mul_mat, [4, 2, 4, 3] = [3, 4, 4, 3] * [3, 2, 4, 3] 365s x0: [3, 4, 4, 3] 365s 0.359 0.206 -0.696 365s -0.763 0.697 0.702 365s -0.932 -0.614 -0.195 365s 0.382 0.621 -0.136 365s 365s x1: [3, 2, 4, 3] 365s 0.697 0.360 0.677 365s -0.059 0.869 0.846 365s 365s y: [4, 2, 4, 3] 365s -0.146 0.194 -1.002 0.398 365s -0.431 1.245 -0.643 0.402 365s test-mul-mat0: iter:126/500 365s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 3, 1, 1] 365s 0.906 0.888 365s 0.061 -0.208 365s 0.238 0.671 365s 365s x1: [2, 2, 1, 1] 365s 0.483 0.611 365s 0.662 -0.582 365s 365s y: [3, 2, 1, 1] 365s 0.980 -0.098 0.525 365s 0.083 0.161 -0.233 365s testing: mul_mat, [2, 3, 2, 1] = [2, 2, 2, 1] * [2, 3, 2, 1] 365s x0: [2, 2, 2, 1] 365s 0.101 -0.329 365s -0.054 0.025 365s 365s x1: [2, 3, 2, 1] 365s -0.728 0.925 365s 0.572 -0.710 365s 0.823 -0.339 365s 365s y: [2, 3, 2, 1] 365s -0.378 0.063 365s 0.291 -0.049 365s 0.195 -0.053 365s testing: mul_mat, [3, 2, 2, 2] = [2, 3, 2, 2] * [2, 2, 2, 2] 365s x0: [2, 3, 2, 2] 365s 0.685 0.698 365s 0.747 -0.727 365s -0.481 0.585 365s 365s x1: [2, 2, 2, 2] 365s 0.242 0.162 365s -0.274 -0.078 365s 365s y: [3, 2, 2, 2] 365s 0.279 0.063 -0.021 365s -0.242 -0.148 0.086 365s testing: mul_mat, [4, 3, 1, 1] = [2, 4, 1, 1] * [2, 3, 1, 1] 365s x0: [2, 4, 1, 1] 365s -0.420 0.293 365s -0.226 0.882 365s -0.703 -0.773 365s -0.147 -0.517 365s 365s x1: [2, 3, 1, 1] 365s -0.878 0.294 365s 0.189 -0.456 365s -0.105 0.076 365s 365s y: [4, 3, 1, 1] 365s 0.456 0.458 0.390 -0.023 365s -0.213 -0.445 0.219 0.208 365s 0.067 0.091 0.015 -0.024 365s testing: mul_mat, [3, 2, 2, 1] = [4, 3, 2, 1] * [4, 2, 2, 1] 365s x0: [4, 3, 2, 1] 365s -0.650 0.244 0.824 -0.323 365s 0.766 -0.158 0.616 -0.090 365s -0.627 -0.077 0.219 0.101 365s 365s x1: [4, 2, 2, 1] 365s -0.587 -0.373 -0.501 -0.425 365s -0.474 -0.177 0.978 0.588 365s 365s y: [3, 2, 2, 1] 365s 0.016 -0.661 0.244 365s 0.881 0.215 0.584 365s testing: mul_mat, [2, 4, 2, 2] = [3, 2, 2, 2] * [3, 4, 2, 2] 365s x0: [3, 2, 2, 2] 365s -0.855 0.159 -0.340 365s -0.050 -0.826 0.138 365s 365s x1: [3, 4, 2, 2] 365s 0.363 -0.874 -0.016 365s 0.070 0.181 0.561 365s -0.100 0.531 0.326 365s 0.273 -0.225 -0.832 365s 365s y: [2, 4, 2, 2] 365s -0.444 0.702 365s -0.222 -0.076 365s 0.059 -0.389 365s 0.013 0.058 365s test-mul-mat0: iter:127/500 365s testing: mul_mat, [3, 3, 1, 1] = [2, 3, 1, 1] * [2, 3, 1, 1] 365s x0: [2, 3, 1, 1] 365s 0.999 -0.571 365s 0.678 -0.573 365s -0.740 -0.848 365s 365s x1: [2, 3, 1, 1] 365s 0.070 -0.611 365s -0.868 0.215 365s 0.339 0.291 365s 365s y: [3, 3, 1, 1] 365s 0.419 0.397 0.466 365s -0.990 -0.712 0.460 365s 0.172 0.063 -0.498 365s testing: mul_mat, [2, 3, 3, 1] = [2, 2, 3, 1] * [2, 3, 3, 1] 365s x0: [2, 2, 3, 1] 365s -0.632 0.374 365s 0.507 0.707 365s 365s x1: [2, 3, 3, 1] 365s -0.732 0.373 365s 0.179 -0.115 365s 0.893 0.745 365s 365s y: [2, 3, 3, 1] 365s 0.602 -0.108 365s -0.156 0.010 365s -0.286 0.980 365s testing: mul_mat, [1, 2, 3, 1] = [2, 1, 3, 1] * [2, 2, 3, 1] 365s x0: [2, 1, 3, 1] 365s -0.785 -0.262 365s 365s x1: [2, 2, 3, 1] 365s -0.598 -0.274 365s 0.905 0.287 365s 365s y: [1, 2, 3, 1] 365s 0.542 365s -0.786 365s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 365s x0: [2, 3, 1, 1] 365s -0.732 0.974 365s -0.982 0.112 365s -0.110 0.464 365s 365s x1: [2, 1, 1, 1] 365s -0.439 0.353 365s 365s y: [3, 1, 1, 1] 365s 0.665 0.471 0.212 365s testing: mul_mat, [4, 2, 3, 1] = [3, 4, 3, 1] * [3, 2, 3, 1] 365s x0: [3, 4, 3, 1] 365s 0.697 0.432 -0.845 365s -0.922 0.265 -0.500 365s 0.271 -0.474 -0.439 365s -0.742 0.450 0.618 365s 365s x1: [3, 2, 3, 1] 365s -0.892 -0.535 -0.810 365s -0.490 0.192 -0.905 365s 365s y: [4, 2, 3, 1] 365s -0.168 1.086 0.367 -0.080 365s 0.506 0.955 0.173 -0.110 365s testing: mul_mat, [2, 3, 3, 1] = [4, 2, 3, 1] * [4, 3, 3, 1] 365s x0: [4, 2, 3, 1] 365s -0.777 -0.648 -0.854 0.032 365s 0.180 -0.324 0.030 -0.975 365s 365s x1: [4, 3, 3, 1] 365s -0.128 -0.260 -0.129 0.026 365s 0.240 0.432 -0.355 0.848 365s -0.541 -0.547 -0.034 -0.322 365s 365s y: [2, 3, 3, 1] 365s 0.379 0.032 365s -0.136 -0.934 365s 0.793 0.392 365s test-mul-mat0: iter:128/500 365s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 365s x0: [4, 4, 1, 1] 365s -0.013 0.289 0.171 -0.977 365s -0.463 0.082 0.514 0.971 365s 0.919 0.272 0.142 0.940 365s 0.209 -0.614 0.100 0.315 365s 365s x1: [4, 2, 1, 1] 365s 0.056 0.937 -0.950 0.279 365s 0.117 -0.598 0.955 0.263 365s 365s y: [4, 2, 1, 1] 365s -0.166 -0.166 0.434 -0.570 365s -0.268 0.643 0.328 0.570 365s testing: mul_mat, [3, 4, 2, 1] = [4, 3, 2, 1] * [4, 4, 2, 1] 365s x0: [4, 3, 2, 1] 365s -0.590 0.960 -0.906 0.577 365s 0.804 -0.752 -0.206 -0.072 365s 0.520 -0.587 -0.994 -0.945 365s 365s x1: [4, 4, 2, 1] 365s 0.311 0.613 0.011 -0.211 365s -0.783 0.167 0.844 -0.846 365s 0.217 0.123 0.272 0.619 365s 0.079 -0.465 0.052 -0.934 365s 365s y: [3, 4, 2, 1] 365s 0.273 -0.197 -0.009 365s -0.630 -0.868 -0.546 365s 0.101 -0.019 -0.815 365s -1.078 0.470 1.145 365s testing: mul_mat, [3, 3, 2, 3] = [4, 3, 2, 3] * [4, 3, 2, 3] 365s x0: [4, 3, 2, 3] 365s -0.844 0.991 0.467 -0.045 365s 0.072 0.576 0.528 0.592 365s -0.537 0.947 0.430 0.816 365s 365s x1: [4, 3, 2, 3] 365s 0.380 -0.277 0.905 0.644 365s -0.878 0.076 0.521 -0.469 365s 0.036 0.615 -0.892 -0.160 365s 365s y: [3, 3, 2, 3] 365s -0.202 0.727 0.448 365s 1.081 -0.022 0.385 365s 0.169 -0.209 0.049 365s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 365s x0: [4, 1, 1, 1] 365s 0.036 0.279 -0.974 -0.658 365s 365s x1: [4, 3, 1, 1] 365s 0.297 -0.727 -0.674 0.847 365s 0.584 0.743 -0.251 -0.995 365s -0.661 0.700 0.077 0.853 365s 365s y: [1, 3, 1, 1] 365s -0.093 365s 1.128 365s -0.464 365s testing: mul_mat, [2, 4, 2, 1] = [1, 2, 2, 1] * [1, 4, 2, 1] 365s x0: [1, 2, 2, 1] 365s 0.905 365s -0.580 365s 365s x1: [1, 4, 2, 1] 365s 0.396 365s 0.460 365s 0.287 365s -0.124 365s 365s y: [2, 4, 2, 1] 365s 0.358 -0.230 365s 0.416 -0.267 365s 0.260 -0.167 365s -0.112 0.072 365s testing: mul_mat, [1, 1, 2, 3] = [2, 1, 2, 3] * [2, 1, 2, 3] 365s x0: [2, 1, 2, 3] 365s -0.211 0.467 365s 365s x1: [2, 1, 2, 3] 365s 0.898 -0.989 365s 365s y: [1, 1, 2, 3] 365s -0.652 365s test-mul-mat0: iter:129/500 365s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 365s x0: [3, 4, 1, 1] 365s -0.234 0.444 0.597 365s 0.197 0.604 -0.614 365s -0.335 0.042 0.201 365s -0.526 0.876 0.476 365s 365s x1: [3, 3, 1, 1] 365s -0.186 -0.704 -0.054 365s -0.176 -0.938 0.170 365s -0.318 0.412 0.138 365s 365s y: [4, 3, 1, 1] 365s -0.301 -0.428 0.022 -0.544 365s -0.274 -0.705 0.054 -0.648 365s 0.339 0.101 0.151 0.593 365s testing: mul_mat, [3, 4, 2, 1] = [3, 3, 2, 1] * [3, 4, 2, 1] 365s x0: [3, 3, 2, 1] 365s 0.686 0.709 0.423 365s -0.840 0.585 -0.101 365s 0.730 0.172 0.589 365s 365s x1: [3, 4, 2, 1] 365s 0.570 0.587 -0.311 365s -0.470 -0.515 0.318 365s -0.370 0.288 -0.635 365s 0.443 0.584 0.312 365s 365s y: [3, 4, 2, 1] 365s 0.676 -0.104 0.334 365s -0.554 0.061 -0.245 365s -0.318 0.543 -0.594 365s 0.851 -0.062 0.608 365s testing: mul_mat, [2, 3, 2, 1] = [3, 2, 2, 1] * [3, 3, 2, 1] 365s x0: [3, 2, 2, 1] 365s 0.656 -0.118 0.759 365s 0.916 -0.462 -0.335 365s 365s x1: [3, 3, 2, 1] 365s -0.399 -0.825 0.065 365s 0.551 0.233 -0.315 365s 0.791 0.057 0.749 365s 365s y: [2, 3, 2, 1] 365s -0.115 -0.006 365s 0.095 0.503 365s 1.080 0.447 365s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 365s x0: [3, 4, 1, 1] 365s -0.524 -0.566 0.828 365s -0.981 -0.443 -0.753 365s -0.291 -0.679 -0.731 365s -0.444 -0.441 0.998 365s 365s x1: [3, 2, 1, 1] 365s 0.740 -0.814 0.585 365s 0.291 0.419 -0.730 365s 365s y: [4, 2, 1, 1] 365s 0.557 -0.806 -0.090 0.614 365s -0.994 0.078 0.164 -1.043 365s testing: mul_mat, [3, 3, 2, 1] = [4, 3, 2, 1] * [4, 3, 2, 1] 365s x0: [4, 3, 2, 1] 365s -0.859 -0.613 -0.710 -0.891 365s 0.745 -0.985 0.166 -0.144 365s -0.029 -0.031 -0.274 0.148 365s 365s x1: [4, 3, 2, 1] 365s 0.903 0.151 0.757 0.819 365s 0.689 -0.578 -0.375 0.172 365s -0.642 0.764 0.183 -0.122 365s 365s y: [3, 3, 2, 1] 365s -2.136 0.531 -0.117 365s -0.125 0.996 0.126 365s 0.063 -1.183 -0.073 365s testing: mul_mat, [4, 4, 2, 1] = [3, 4, 2, 1] * [3, 4, 2, 1] 365s x0: [3, 4, 2, 1] 365s -0.393 -0.520 -0.135 365s 0.114 -0.466 0.337 365s -0.046 -0.922 0.559 365s 0.404 0.110 -0.618 365s 365s x1: [3, 4, 2, 1] 365s -0.326 -0.370 -0.669 365s 0.260 -0.551 0.517 365s -0.598 -0.410 0.262 365s 0.373 -0.023 0.277 365s 365s y: [4, 4, 2, 1] 365s 0.411 -0.090 -0.018 0.241 365s 0.115 0.461 0.785 -0.275 365s 0.412 0.212 0.552 -0.449 365s -0.172 0.147 0.159 -0.023 365s test-mul-mat0: iter:130/500 365s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 365s x0: [3, 2, 1, 1] 365s 0.229 -0.074 0.910 365s -0.212 0.308 0.650 365s 365s x1: [3, 3, 1, 1] 365s -0.516 -0.475 -0.427 365s 0.091 0.639 0.527 365s -0.505 -0.881 -0.939 365s 365s y: [2, 3, 1, 1] 365s -0.471 -0.314 365s 0.454 0.520 365s -0.905 -0.774 365s testing: mul_mat, [4, 2, 4, 1] = [3, 4, 4, 1] * [3, 2, 4, 1] 365s x0: [3, 4, 4, 1] 365s 0.402 0.040 -0.347 365s -0.810 -0.652 -0.697 365s 0.126 0.301 0.066 365s 0.189 0.174 0.284 365s 365s x1: [3, 2, 4, 1] 365s -0.063 -0.048 -0.237 365s -0.937 0.874 -0.781 365s 365s y: [4, 2, 4, 1] 365s 0.055 0.248 -0.038 -0.088 365s -0.071 0.735 0.093 -0.247 365s testing: mul_mat, [4, 4, 4, 2] = [3, 4, 4, 2] * [3, 4, 4, 2] 365s x0: [3, 4, 4, 2] 365s -0.205 0.704 -0.579 365s -0.496 0.865 0.213 365s -0.569 -0.653 -0.391 365s 0.139 -0.507 0.461 365s 365s x1: [3, 4, 4, 2] 365s 0.992 -0.765 -0.046 365s 0.459 -0.212 0.573 365s 0.451 0.018 -0.753 365s 0.496 0.418 0.093 365s 365s y: [4, 4, 4, 2] 365s -0.715 -1.164 -0.047 0.504 365s -0.574 -0.288 -0.347 0.435 365s 0.356 -0.369 0.027 -0.294 365s 0.139 0.135 -0.591 -0.100 365s testing: mul_mat, [1, 4, 1, 1] = [3, 1, 1, 1] * [3, 4, 1, 1] 365s x0: [3, 1, 1, 1] 365s -0.980 0.723 0.856 365s 365s x1: [3, 4, 1, 1] 365s 0.847 0.154 0.091 365s 0.702 0.996 0.141 365s 0.403 0.306 -0.487 365s 0.933 0.163 0.377 365s 365s y: [1, 4, 1, 1] 365s -0.641 365s 0.153 365s -0.591 365s -0.473 365s testing: mul_mat, [1, 3, 4, 1] = [1, 1, 4, 1] * [1, 3, 4, 1] 365s x0: [1, 1, 4, 1] 365s -0.219 365s 365s x1: [1, 3, 4, 1] 365s 0.582 365s 0.736 365s 0.388 365s 365s y: [1, 3, 4, 1] 365s -0.128 365s -0.161 365s -0.085 365s testing: mul_mat, [1, 1, 4, 2] = [1, 1, 4, 2] * [1, 1, 4, 2] 365s x0: [1, 1, 4, 2] 365s 0.213 365s 365s x1: [1, 1, 4, 2] 365s -0.818 365s 365s y: [1, 1, 4, 2] 365s -0.174 365s test-mul-mat0: iter:131/500 365s testing: mul_mat, [4, 3, 1, 1] = [1, 4, 1, 1] * [1, 3, 1, 1] 365s x0: [1, 4, 1, 1] 365s -0.139 365s 0.240 365s 0.279 365s 0.952 365s 365s x1: [1, 3, 1, 1] 365s -0.996 365s -0.330 365s 0.813 365s 365s y: [4, 3, 1, 1] 365s 0.139 -0.239 -0.278 -0.948 365s 0.046 -0.079 -0.092 -0.314 365s -0.113 0.195 0.227 0.774 365s testing: mul_mat, [1, 4, 2, 1] = [1, 1, 2, 1] * [1, 4, 2, 1] 365s x0: [1, 1, 2, 1] 365s -0.819 365s 365s x1: [1, 4, 2, 1] 365s -0.486 365s 0.214 365s -0.866 365s 0.490 365s 365s y: [1, 4, 2, 1] 365s 0.399 365s -0.175 365s 0.710 365s -0.401 365s testing: mul_mat, [4, 1, 2, 3] = [1, 4, 2, 3] * [1, 1, 2, 3] 365s x0: [1, 4, 2, 3] 365s 0.377 365s 0.387 365s -0.331 365s 0.820 365s 365s x1: [1, 1, 2, 3] 365s -0.494 365s 365s y: [4, 1, 2, 3] 365s -0.186 -0.191 0.164 -0.405 365s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 365s x0: [1, 2, 1, 1] 365s -0.620 365s -0.753 365s 365s x1: [1, 4, 1, 1] 365s -0.309 365s 0.791 365s 0.175 365s -0.867 365s 365s y: [2, 4, 1, 1] 365s 0.192 0.233 365s -0.491 -0.596 365s -0.109 -0.132 365s 0.538 0.653 365s testing: mul_mat, [2, 1, 2, 1] = [2, 2, 2, 1] * [2, 1, 2, 1] 365s x0: [2, 2, 2, 1] 365s 0.588 -0.476 365s 0.578 -0.241 365s 365s x1: [2, 1, 2, 1] 365s 0.268 0.766 365s 365s y: [2, 1, 2, 1] 365s -0.207 -0.030 365s testing: mul_mat, [1, 2, 2, 3] = [2, 1, 2, 3] * [2, 2, 2, 3] 365s x0: [2, 1, 2, 3] 365s -0.072 0.077 365s 365s x1: [2, 2, 2, 3] 365s -0.913 -0.128 365s 0.667 0.616 365s 365s y: [1, 2, 2, 3] 365s 0.056 365s -0.000 365s test-mul-mat0: iter:132/500 365s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 365s x0: [1, 3, 1, 1] 365s -0.060 365s -0.024 365s 0.158 365s 365s x1: [1, 4, 1, 1] 365s -0.443 365s 0.774 365s -0.333 365s -0.423 365s 365s y: [3, 4, 1, 1] 365s 0.027 0.010 -0.070 365s -0.046 -0.018 0.123 365s 0.020 0.008 -0.053 365s 0.025 0.010 -0.067 365s testing: mul_mat, [2, 3, 3, 1] = [1, 2, 3, 1] * [1, 3, 3, 1] 365s x0: [1, 2, 3, 1] 365s 0.452 365s 0.571 365s 365s x1: [1, 3, 3, 1] 365s -0.518 365s 0.142 365s 0.889 365s 365s y: [2, 3, 3, 1] 365s -0.234 -0.296 365s 0.064 0.081 365s 0.402 0.508 365s testing: mul_mat, [1, 2, 3, 4] = [1, 1, 3, 4] * [1, 2, 3, 4] 365s x0: [1, 1, 3, 4] 365s -0.992 365s 365s x1: [1, 2, 3, 4] 365s 0.197 365s 0.569 365s 365s y: [1, 2, 3, 4] 365s -0.195 365s -0.565 365s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 365s x0: [1, 2, 1, 1] 365s -0.371 365s 0.629 365s 365s x1: [1, 1, 1, 1] 365s 0.787 365s 365s y: [2, 1, 1, 1] 365s -0.292 0.495 365s testing: mul_mat, [2, 1, 3, 1] = [2, 2, 3, 1] * [2, 1, 3, 1] 365s x0: [2, 2, 3, 1] 365s -0.449 0.856 365s -0.951 0.711 365s 365s x1: [2, 1, 3, 1] 365s 0.850 -0.466 365s 365s y: [2, 1, 3, 1] 365s -0.781 -1.140 365s testing: mul_mat, [4, 2, 3, 4] = [2, 4, 3, 4] * [2, 2, 3, 4] 365s x0: [2, 4, 3, 4] 365s 0.250 -0.915 365s 0.027 0.748 365s 0.178 0.640 365s -0.761 0.300 365s 365s x1: [2, 2, 3, 4] 365s -0.080 -0.309 365s -0.334 0.236 365s 365s y: [4, 2, 3, 4] 365s 0.262 -0.233 -0.212 -0.032 365s -0.299 0.167 0.091 0.325 365s test-mul-mat0: iter:133/500 365s testing: mul_mat, [1, 4, 1, 1] = [3, 1, 1, 1] * [3, 4, 1, 1] 365s x0: [3, 1, 1, 1] 365s -0.439 -0.610 -0.220 365s 365s x1: [3, 4, 1, 1] 365s -0.390 0.736 0.010 365s -0.487 0.525 -0.823 365s -0.676 0.065 -0.745 365s -0.415 -0.108 0.612 365s 365s y: [1, 4, 1, 1] 365s -0.280 365s 0.074 365s 0.420 365s 0.113 365s testing: mul_mat, [3, 1, 3, 1] = [3, 3, 3, 1] * [3, 1, 3, 1] 365s x0: [3, 3, 3, 1] 365s -0.776 0.230 -0.273 365s -0.627 -0.671 0.337 365s -0.890 0.339 0.851 365s 365s x1: [3, 1, 3, 1] 365s 0.316 0.199 -0.401 365s 365s y: [3, 1, 3, 1] 365s -0.090 -0.467 -0.554 365s testing: mul_mat, [2, 3, 3, 4] = [3, 2, 3, 4] * [3, 3, 3, 4] 365s x0: [3, 2, 3, 4] 365s -0.722 -0.961 -0.617 365s 0.828 0.335 -0.483 365s 365s x1: [3, 3, 3, 4] 365s 0.022 0.189 0.485 365s -0.866 0.413 -0.284 365s -0.139 0.786 0.045 365s 365s y: [2, 3, 3, 4] 365s -0.497 -0.153 365s 0.405 -0.442 365s -0.682 0.127 365s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 365s x0: [3, 1, 1, 1] 365s 0.259 -0.616 0.223 365s 365s x1: [3, 2, 1, 1] 365s 0.396 0.936 0.558 365s 0.494 0.306 -0.448 365s 365s y: [1, 2, 1, 1] 365s -0.350 365s -0.160 365s testing: mul_mat, [3, 3, 3, 1] = [1, 3, 3, 1] * [1, 3, 3, 1] 365s x0: [1, 3, 3, 1] 365s -0.592 365s -0.373 365s 0.514 365s 365s x1: [1, 3, 3, 1] 365s 0.365 365s -0.748 365s 0.473 365s 365s y: [3, 3, 3, 1] 365s -0.216 -0.136 0.188 365s 0.442 0.279 -0.384 365s -0.280 -0.177 0.243 365s testing: mul_mat, [2, 1, 3, 4] = [3, 2, 3, 4] * [3, 1, 3, 4] 365s x0: [3, 2, 3, 4] 365s -0.564 -0.962 -0.848 365s 0.802 -0.946 0.002 365s 365s x1: [3, 1, 3, 4] 365s 0.117 0.856 0.985 365s 365s y: [2, 1, 3, 4] 365s -1.724 -0.713 365s test-mul-mat0: iter:134/500 365s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 365s x0: [1, 1, 1, 1] 365s 0.743 365s 365s x1: [1, 1, 1, 1] 365s -0.467 365s 365s y: [1, 1, 1, 1] 365s -0.347 365s testing: mul_mat, [4, 1, 4, 1] = [1, 4, 4, 1] * [1, 1, 4, 1] 365s x0: [1, 4, 4, 1] 365s -0.097 365s -0.539 365s 0.561 365s -0.409 365s 365s x1: [1, 1, 4, 1] 365s -0.242 365s 365s y: [4, 1, 4, 1] 365s 0.024 0.131 -0.136 0.099 365s testing: mul_mat, [4, 4, 4, 3] = [1, 4, 4, 3] * [1, 4, 4, 3] 365s x0: [1, 4, 4, 3] 365s -0.479 365s -0.317 365s 0.554 365s -0.142 365s 365s x1: [1, 4, 4, 3] 365s -0.956 365s 0.598 365s 0.932 365s -0.303 365s 365s y: [4, 4, 4, 3] 365s 0.458 0.303 -0.529 0.136 365s -0.286 -0.189 0.331 -0.085 365s -0.446 -0.295 0.516 -0.132 365s 0.145 0.096 -0.168 0.043 365s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 365s x0: [1, 3, 1, 1] 365s 0.041 365s 0.437 365s -0.249 365s 365s x1: [1, 4, 1, 1] 365s -0.685 365s 0.644 365s -0.910 365s -0.866 365s 365s y: [3, 4, 1, 1] 365s -0.028 -0.299 0.171 365s 0.026 0.281 -0.161 365s -0.037 -0.397 0.227 365s -0.035 -0.378 0.216 365s testing: mul_mat, [3, 1, 4, 1] = [3, 3, 4, 1] * [3, 1, 4, 1] 365s x0: [3, 3, 4, 1] 365s -0.572 0.711 0.829 365s 0.431 0.698 0.773 365s -0.917 -0.514 0.228 365s 365s x1: [3, 1, 4, 1] 365s -0.366 -0.993 0.574 365s 365s y: [3, 1, 4, 1] 365s -0.020 -0.407 0.977 365s testing: mul_mat, [3, 3, 4, 3] = [3, 3, 4, 3] * [3, 3, 4, 3] 365s x0: [3, 3, 4, 3] 365s -0.934 -0.293 0.499 365s -0.652 -0.985 0.987 365s -0.029 -0.985 0.226 365s 365s x1: [3, 3, 4, 3] 365s 0.318 0.166 -0.610 365s -0.454 -0.961 -0.523 365s 0.419 0.001 -0.301 365s 365s y: [3, 3, 4, 3] 365s -0.651 -0.973 -0.311 365s 0.445 0.727 0.842 365s -0.542 -0.572 -0.081 365s test-mul-mat0: iter:135/500 365s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 365s x0: [2, 2, 1, 1] 365s 0.267 0.658 365s 0.564 0.346 365s 365s x1: [2, 1, 1, 1] 365s 0.819 -0.818 365s 365s y: [2, 1, 1, 1] 365s -0.319 0.179 365s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 4, 1, 1] 365s -0.737 -0.136 365s 0.022 0.341 365s -0.516 -0.764 365s -0.039 -0.966 365s 365s x1: [2, 2, 1, 1] 365s 0.846 0.475 365s 0.756 0.786 365s 365s y: [4, 2, 1, 1] 365s -0.688 0.181 -0.800 -0.492 365s -0.664 0.285 -0.991 -0.789 365s testing: mul_mat, [4, 4, 1, 2] = [2, 4, 1, 2] * [2, 4, 1, 2] 365s x0: [2, 4, 1, 2] 365s 0.561 -0.865 365s -0.487 0.317 365s 0.921 -0.786 365s 0.580 -0.215 365s 365s x1: [2, 4, 1, 2] 365s 0.363 0.393 365s -0.629 0.033 365s -0.313 0.352 365s -0.335 -0.140 365s 365s y: [4, 4, 1, 2] 365s -0.136 -0.052 0.025 0.126 365s -0.382 0.317 -0.606 -0.372 365s -0.480 0.264 -0.565 -0.257 365s -0.067 0.119 -0.198 -0.164 365s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 365s x0: [2, 2, 1, 1] 365s 0.102 0.610 365s 0.611 0.316 365s 365s x1: [2, 4, 1, 1] 365s 0.674 0.868 365s 0.962 0.026 365s -0.467 -0.178 365s -0.846 -0.019 365s 365s y: [2, 4, 1, 1] 365s 0.598 0.687 365s 0.114 0.596 365s -0.156 -0.342 365s -0.097 -0.523 365s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 4, 1, 1] 365s -0.806 -0.259 365s 0.956 -0.389 365s 0.375 -0.306 365s 0.773 -0.989 365s 365s x1: [2, 2, 1, 1] 365s 0.900 -0.352 365s -0.123 -0.965 365s 365s y: [4, 2, 1, 1] 365s -0.635 0.997 0.445 1.044 365s 0.349 0.258 0.249 0.860 365s testing: mul_mat, [4, 2, 1, 2] = [4, 4, 1, 2] * [4, 2, 1, 2] 365s x0: [4, 4, 1, 2] 365s -0.414 0.844 0.566 -0.876 365s 0.809 -0.809 0.430 -0.448 365s 0.030 0.657 -0.459 -0.741 365s 0.463 0.800 -0.823 -0.299 365s 365s x1: [4, 2, 1, 2] 365s -0.917 -0.424 0.314 -0.083 365s 0.851 -0.012 -0.216 0.812 365s 365s y: [4, 2, 1, 2] 365s 0.272 -0.227 -0.388 -0.997 365s -1.196 0.241 -0.485 0.319 365s test-mul-mat0: iter:136/500 365s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 365s x0: [1, 1, 1, 1] 365s 0.479 365s 365s x1: [1, 4, 1, 1] 365s 0.372 365s -0.988 365s -0.062 365s -0.062 365s 365s y: [1, 4, 1, 1] 365s 0.178 365s -0.474 365s -0.030 365s -0.030 365s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 365s x0: [1, 4, 1, 1] 365s -0.969 365s 0.374 365s -0.733 365s 0.897 365s 365s x1: [1, 1, 1, 1] 365s 0.115 365s 365s y: [4, 1, 1, 1] 365s -0.111 0.043 -0.084 0.103 365s testing: mul_mat, [4, 4, 1, 4] = [1, 4, 1, 4] * [1, 4, 1, 4] 365s x0: [1, 4, 1, 4] 365s -0.793 365s 0.279 365s -0.345 365s -0.781 365s 365s x1: [1, 4, 1, 4] 365s -0.450 365s -0.181 365s -0.250 365s -0.490 365s 365s y: [4, 4, 1, 4] 365s 0.357 -0.126 0.155 0.352 365s 0.144 -0.051 0.063 0.142 365s 0.198 -0.070 0.086 0.195 365s 0.388 -0.137 0.169 0.383 365s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 365s x0: [1, 3, 1, 1] 365s 0.676 365s -0.354 365s -0.709 365s 365s x1: [1, 4, 1, 1] 365s -0.626 365s 0.141 365s 0.496 365s 0.992 365s 365s y: [3, 4, 1, 1] 365s -0.423 0.221 0.444 365s 0.096 -0.050 -0.100 365s 0.335 -0.175 -0.351 365s 0.671 -0.351 -0.703 365s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 365s x0: [3, 1, 1, 1] 365s 0.458 0.279 0.010 365s 365s x1: [3, 1, 1, 1] 365s 0.627 0.606 0.138 365s 365s y: [1, 1, 1, 1] 365s 0.458 365s testing: mul_mat, [2, 3, 1, 4] = [1, 2, 1, 4] * [1, 3, 1, 4] 365s x0: [1, 2, 1, 4] 365s 0.823 365s -0.322 365s 365s x1: [1, 3, 1, 4] 365s 0.665 365s -0.441 365s 0.665 365s 365s y: [2, 3, 1, 4] 365s 0.548 -0.214 365s -0.364 0.142 365s 0.547 -0.214 365s test-mul-mat0: iter:137/500 365s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 365s x0: [3, 2, 1, 1] 365s -0.985 0.783 0.451 365s -0.209 0.040 -0.004 365s 365s x1: [3, 2, 1, 1] 365s -0.557 -0.823 0.108 365s 0.901 0.457 -0.882 365s 365s y: [2, 2, 1, 1] 365s -0.046 0.083 365s -0.927 -0.167 365s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 365s x0: [3, 2, 1, 1] 365s 0.730 -0.381 0.483 365s 0.549 -0.304 0.673 365s 365s x1: [3, 2, 1, 1] 365s 0.263 0.005 0.770 365s 0.462 0.052 -0.962 365s 365s y: [2, 2, 1, 1] 365s 0.562 0.661 365s -0.148 -0.410 365s testing: mul_mat, [4, 2, 1, 3] = [3, 4, 1, 3] * [3, 2, 1, 3] 365s x0: [3, 4, 1, 3] 365s -0.117 -0.364 -0.499 365s 0.935 -0.327 0.787 365s 0.665 0.292 0.270 365s 0.213 0.988 -0.057 365s 365s x1: [3, 2, 1, 3] 365s -0.703 -0.551 -0.491 365s 0.336 0.829 -0.048 365s 365s y: [4, 2, 1, 3] 365s 0.527 -0.864 -0.761 -0.667 365s -0.317 0.005 0.452 0.894 365s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 365s x0: [3, 3, 1, 1] 365s 0.385 -0.287 0.027 365s -0.090 0.168 -0.736 365s -0.255 -0.579 0.968 365s 365s x1: [3, 4, 1, 1] 365s 0.417 0.457 0.014 365s -0.313 -0.329 0.002 365s 0.630 -0.819 0.439 365s 0.083 0.026 0.706 365s 365s y: [3, 4, 1, 1] 365s 0.029 0.029 -0.357 365s -0.026 -0.028 0.272 365s 0.489 -0.518 0.739 365s 0.043 -0.523 0.647 365s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 365s x0: [3, 4, 1, 1] 365s -0.654 0.166 -0.309 365s -0.836 0.608 -0.777 365s 0.978 0.197 -0.689 365s 0.016 -0.395 -0.822 365s 365s x1: [3, 3, 1, 1] 365s 0.116 0.238 -0.620 365s -0.755 0.096 0.428 365s 0.875 -0.112 0.150 365s 365s y: [4, 3, 1, 1] 365s 0.155 0.529 0.588 0.417 365s 0.378 0.357 -1.015 -0.402 365s -0.637 -0.916 0.730 -0.065 365s testing: mul_mat, [1, 3, 1, 3] = [4, 1, 1, 3] * [4, 3, 1, 3] 365s x0: [4, 1, 1, 3] 365s 0.933 -0.222 0.908 -0.603 365s 365s x1: [4, 3, 1, 3] 365s 0.608 0.221 -0.077 -0.679 365s -0.611 0.344 0.347 -0.347 365s 0.312 -0.536 0.891 0.693 365s 365s y: [1, 3, 1, 3] 365s 0.858 365s -0.122 365s 0.802 365s test-mul-mat0: iter:138/500 365s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 365s x0: [2, 4, 1, 1] 365s -0.654 0.485 365s -0.397 -0.068 365s 0.610 0.199 365s -0.135 -0.612 365s 365s x1: [2, 4, 1, 1] 365s -0.010 0.027 365s -0.127 0.424 365s 0.232 0.024 365s -0.919 0.748 365s 365s y: [4, 4, 1, 1] 365s 0.020 0.002 -0.001 -0.015 365s 0.288 0.021 0.007 -0.242 365s -0.140 -0.094 0.146 -0.046 365s 0.964 0.314 -0.412 -0.334 365s testing: mul_mat, [1, 4, 3, 1] = [2, 1, 3, 1] * [2, 4, 3, 1] 365s x0: [2, 1, 3, 1] 365s -0.117 0.770 365s 365s x1: [2, 4, 3, 1] 365s 0.107 0.262 365s 0.057 -0.292 365s -0.633 -0.287 365s 0.429 -0.945 365s 365s y: [1, 4, 3, 1] 365s 0.189 365s -0.232 365s -0.147 365s -0.777 365s testing: mul_mat, [1, 1, 3, 4] = [2, 1, 3, 4] * [2, 1, 3, 4] 365s x0: [2, 1, 3, 4] 365s -0.036 0.850 365s 365s x1: [2, 1, 3, 4] 365s 0.189 0.897 365s 365s y: [1, 1, 3, 4] 365s 0.755 365s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 365s x0: [2, 4, 1, 1] 365s -0.710 0.589 365s -0.375 -0.460 365s -0.310 -0.205 365s -0.933 -0.365 365s 365s x1: [2, 1, 1, 1] 365s 0.854 -0.456 365s 365s y: [4, 1, 1, 1] 365s -0.875 -0.111 -0.171 -0.631 365s testing: mul_mat, [3, 2, 3, 1] = [4, 3, 3, 1] * [4, 2, 3, 1] 365s x0: [4, 3, 3, 1] 365s 0.390 0.930 -0.045 -0.638 365s 0.954 -0.252 0.610 -0.066 365s 0.294 0.929 -0.200 0.803 365s 365s x1: [4, 2, 3, 1] 365s 0.025 -0.139 -0.129 0.407 365s 0.324 0.003 -0.947 -0.103 365s 365s y: [3, 2, 3, 1] 365s -0.373 -0.047 0.231 365s 0.238 -0.262 0.205 365s testing: mul_mat, [2, 4, 3, 4] = [3, 2, 3, 4] * [3, 4, 3, 4] 365s x0: [3, 2, 3, 4] 365s 0.455 0.053 -0.939 365s 0.191 -0.327 -0.410 365s 365s x1: [3, 4, 3, 4] 365s 0.690 0.540 0.939 365s -0.510 0.902 -0.127 365s -0.707 -0.684 -0.296 365s -0.443 0.749 -0.642 365s 365s y: [2, 4, 3, 4] 365s -0.538 -0.430 365s -0.065 -0.340 365s -0.080 0.210 365s 0.440 -0.066 365s test-mul-mat0: iter:139/500 365s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 365s x0: [4, 3, 1, 1] 365s 0.928 -0.217 -0.205 -0.499 365s -0.431 -0.544 0.294 0.582 365s 0.977 -0.104 -0.717 -0.645 365s 365s x1: [4, 3, 1, 1] 365s -0.974 0.790 0.851 0.838 365s 0.419 0.073 0.868 -0.707 365s 0.412 0.676 -0.480 0.549 365s 365s y: [3, 3, 1, 1] 365s -1.668 0.728 -2.184 365s 0.547 -0.376 0.235 365s 0.060 -0.367 0.323 365s testing: mul_mat, [1, 3, 4, 1] = [4, 1, 4, 1] * [4, 3, 4, 1] 365s x0: [4, 1, 4, 1] 365s -0.921 0.304 -0.546 -0.621 365s 365s x1: [4, 3, 4, 1] 365s 0.129 0.314 0.429 0.165 365s 0.950 -0.288 0.191 0.740 365s -0.437 0.029 0.160 0.636 365s 365s y: [1, 3, 4, 1] 365s -0.360 365s -1.525 365s -0.072 365s testing: mul_mat, [2, 1, 4, 1] = [4, 2, 4, 1] * [4, 1, 4, 1] 365s x0: [4, 2, 4, 1] 365s 0.927 -0.179 -0.410 -0.670 365s 0.955 0.992 -0.736 0.822 365s 365s x1: [4, 1, 4, 1] 365s -0.397 0.953 -0.300 -0.320 365s 365s y: [2, 1, 4, 1] 365s -0.201 0.524 365s testing: mul_mat, [3, 2, 1, 1] = [4, 3, 1, 1] * [4, 2, 1, 1] 365s x0: [4, 3, 1, 1] 365s -0.260 -0.013 0.934 0.373 365s 0.272 0.578 -0.435 0.337 365s -0.349 0.492 0.103 0.462 365s 365s x1: [4, 2, 1, 1] 365s -0.813 0.590 0.487 -0.858 365s 0.582 0.751 0.964 0.569 365s 365s y: [3, 2, 1, 1] 365s 0.339 -0.381 0.228 365s 0.952 0.365 0.529 365s testing: mul_mat, [2, 4, 4, 1] = [3, 2, 4, 1] * [3, 4, 4, 1] 365s x0: [3, 2, 4, 1] 365s 0.705 0.794 -0.697 365s -0.903 -0.762 -0.500 365s 365s x1: [3, 4, 4, 1] 365s -0.267 0.897 0.998 365s -0.509 -0.180 -0.885 365s 0.722 -0.814 0.115 365s -0.121 -0.627 -0.295 365s 365s y: [2, 4, 4, 1] 365s -0.171 -0.941 365s 0.116 1.039 365s -0.218 -0.089 365s -0.378 0.734 365s testing: mul_mat, [1, 3, 4, 1] = [2, 1, 4, 1] * [2, 3, 4, 1] 365s x0: [2, 1, 4, 1] 365s 0.686 -0.773 365s 365s x1: [2, 3, 4, 1] 365s 0.612 -0.556 365s -0.993 -0.578 365s -0.732 -0.115 365s 365s y: [1, 3, 4, 1] 365s 0.850 365s -0.234 365s -0.413 365s test-mul-mat0: iter:140/500 365s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 365s x0: [3, 4, 1, 1] 365s -0.954 -0.582 0.275 365s -0.540 0.245 0.907 365s -0.854 0.472 -0.737 365s -0.079 -0.563 -0.773 365s 365s x1: [3, 4, 1, 1] 365s 0.898 0.650 0.836 365s -0.219 0.861 0.121 365s -0.836 -0.665 -0.152 365s 0.319 -0.800 -0.468 365s 365s y: [4, 4, 1, 1] 365s -1.005 0.434 -1.076 -1.084 365s -0.259 0.439 0.504 -0.561 365s 1.143 0.151 0.512 0.558 365s 0.032 -0.793 -0.305 0.787 365s testing: mul_mat, [3, 4, 3, 1] = [3, 3, 3, 1] * [3, 4, 3, 1] 365s x0: [3, 3, 3, 1] 365s -0.415 0.506 -0.056 365s 0.381 -0.458 -0.047 365s -0.658 0.913 0.810 365s 365s x1: [3, 4, 3, 1] 365s 0.308 0.847 -0.803 365s -0.883 0.524 0.052 365s -0.985 0.174 -0.112 365s -0.204 0.036 -0.991 365s 365s y: [3, 4, 3, 1] 365s 0.346 -0.232 -0.080 365s 0.629 -0.579 1.102 365s 0.503 -0.450 0.717 365s 0.159 -0.047 -0.636 365s testing: mul_mat, [2, 3, 3, 2] = [3, 2, 3, 2] * [3, 3, 3, 2] 365s x0: [3, 2, 3, 2] 365s 0.964 -0.510 0.577 365s -0.009 -0.694 -0.069 365s 365s x1: [3, 3, 3, 2] 365s -0.133 0.568 -0.315 365s -0.078 -0.847 -0.808 365s 0.866 0.534 -0.266 365s 365s y: [2, 3, 3, 2] 365s -0.599 -0.371 365s -0.109 0.645 365s 0.409 -0.360 365s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 365s x0: [3, 2, 1, 1] 365s -0.450 0.548 0.219 365s -0.137 -0.763 0.568 365s 365s x1: [3, 2, 1, 1] 365s 0.232 0.417 -0.797 365s 0.084 0.855 -0.202 365s 365s y: [2, 2, 1, 1] 365s -0.050 -0.802 365s 0.386 -0.778 365s testing: mul_mat, [1, 3, 3, 1] = [2, 1, 3, 1] * [2, 3, 3, 1] 365s x0: [2, 1, 3, 1] 365s -0.268 0.425 365s 365s x1: [2, 3, 3, 1] 365s 0.563 -0.506 365s -0.430 -0.343 365s -0.483 -0.198 365s 365s y: [1, 3, 3, 1] 365s -0.366 365s -0.030 365s 0.046 365s testing: mul_mat, [4, 2, 3, 2] = [1, 4, 3, 2] * [1, 2, 3, 2] 365s x0: [1, 4, 3, 2] 365s 0.440 365s -0.222 365s -0.399 365s 0.656 365s 365s x1: [1, 2, 3, 2] 365s 0.137 365s 0.085 365s 365s y: [4, 2, 3, 2] 365s 0.060 -0.030 -0.055 0.090 365s 0.038 -0.019 -0.034 0.056 365s test-mul-mat0: iter:141/500 365s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 365s x0: [3, 3, 1, 1] 365s 0.500 -0.637 -0.512 365s -0.970 0.334 0.047 365s -0.217 0.914 -0.257 365s 365s x1: [3, 4, 1, 1] 365s -0.745 0.855 0.947 365s 0.695 -0.367 -0.452 365s 0.350 -0.490 -0.872 365s 0.275 -0.606 -0.011 365s 365s y: [3, 4, 1, 1] 365s -1.402 1.053 0.700 365s 0.812 -0.817 -0.370 365s 0.933 -0.544 -0.300 365s 0.529 -0.470 -0.611 365s testing: mul_mat, [2, 3, 3, 1] = [3, 2, 3, 1] * [3, 3, 3, 1] 365s x0: [3, 2, 3, 1] 365s 0.577 -0.351 0.072 365s -0.802 -0.700 -0.114 365s 365s x1: [3, 3, 3, 1] 365s -0.379 0.651 0.814 365s -0.237 -0.886 0.822 365s 0.704 0.871 0.424 365s 365s y: [2, 3, 3, 1] 365s -0.389 -0.245 365s 0.234 0.716 365s 0.130 -1.222 365s testing: mul_mat, [1, 2, 3, 4] = [3, 1, 3, 4] * [3, 2, 3, 4] 365s x0: [3, 1, 3, 4] 365s -0.994 -0.409 0.414 365s 365s x1: [3, 2, 3, 4] 365s 0.279 0.238 0.470 365s 0.557 -0.499 -0.494 365s 365s y: [1, 2, 3, 4] 365s -0.180 365s -0.554 365s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 365s x0: [3, 4, 1, 1] 365s -0.413 -0.696 -0.891 365s 0.256 -0.380 0.442 365s 0.027 -0.629 0.296 365s 0.642 0.090 0.131 365s 365s x1: [3, 1, 1, 1] 365s -0.634 -0.090 0.684 365s 365s y: [4, 1, 1, 1] 365s -0.285 0.174 0.242 -0.325 365s testing: mul_mat, [1, 3, 3, 1] = [4, 1, 3, 1] * [4, 3, 3, 1] 365s x0: [4, 1, 3, 1] 365s -0.506 0.972 0.505 0.852 365s 365s x1: [4, 3, 3, 1] 365s -0.241 -0.925 -0.119 -0.273 365s 0.088 -0.095 0.017 0.900 365s -0.619 0.358 -0.282 -0.552 365s 365s y: [1, 3, 3, 1] 365s -1.069 365s 0.639 365s 0.048 365s testing: mul_mat, [2, 4, 3, 4] = [1, 2, 3, 4] * [1, 4, 3, 4] 365s x0: [1, 2, 3, 4] 365s -0.086 365s 0.544 365s 365s x1: [1, 4, 3, 4] 365s -0.107 365s -0.786 365s -0.590 365s -0.630 365s 365s y: [2, 4, 3, 4] 365s 0.009 -0.058 365s 0.068 -0.428 365s 0.051 -0.321 365s 0.054 -0.343 365s test-mul-mat0: iter:142/500 365s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 3, 1, 1] 365s -0.670 0.946 365s -0.992 -0.500 365s -0.468 0.525 365s 365s x1: [2, 2, 1, 1] 365s 0.259 0.415 365s 0.328 -0.827 365s 365s y: [3, 2, 1, 1] 365s 0.219 -0.464 0.097 365s -1.002 0.088 -0.588 365s testing: mul_mat, [2, 3, 3, 1] = [2, 2, 3, 1] * [2, 3, 3, 1] 365s x0: [2, 2, 3, 1] 365s 0.439 -0.016 365s -0.274 -0.233 365s 365s x1: [2, 3, 3, 1] 365s 0.569 0.161 365s 0.955 0.214 365s -0.975 -0.810 365s 365s y: [2, 3, 3, 1] 365s 0.247 -0.193 365s 0.416 -0.311 365s -0.415 0.456 365s testing: mul_mat, [2, 2, 3, 2] = [2, 2, 3, 2] * [2, 2, 3, 2] 365s x0: [2, 2, 3, 2] 365s -0.242 -0.387 365s 0.496 -0.645 365s 365s x1: [2, 2, 3, 2] 365s 0.166 -0.205 365s 0.174 -0.620 365s 365s y: [2, 2, 3, 2] 365s 0.039 0.215 365s 0.198 0.486 365s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 3, 1, 1] 365s 0.713 -0.674 365s -0.101 -0.606 365s -0.202 0.153 365s 365s x1: [2, 2, 1, 1] 365s -0.021 -0.622 365s -0.798 0.096 365s 365s y: [3, 2, 1, 1] 365s 0.405 0.379 -0.091 365s -0.633 0.023 0.176 365s testing: mul_mat, [1, 2, 3, 1] = [3, 1, 3, 1] * [3, 2, 3, 1] 365s x0: [3, 1, 3, 1] 365s -0.369 0.801 -0.216 365s 365s x1: [3, 2, 3, 1] 365s -0.427 0.109 -0.327 365s 0.615 -0.002 -0.426 365s 365s y: [1, 2, 3, 1] 365s 0.316 365s -0.136 365s testing: mul_mat, [4, 3, 3, 2] = [1, 4, 3, 2] * [1, 3, 3, 2] 365s x0: [1, 4, 3, 2] 365s 0.100 365s -0.701 365s -0.540 365s 0.731 365s 365s x1: [1, 3, 3, 2] 365s -0.128 365s -0.810 365s -0.226 365s 365s y: [4, 3, 3, 2] 365s -0.013 0.090 0.069 -0.094 365s -0.081 0.568 0.438 -0.593 365s -0.023 0.159 0.122 -0.166 365s test-mul-mat0: iter:143/500 365s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 365s x0: [3, 3, 1, 1] 365s -0.641 0.896 0.945 365s 0.340 -0.643 0.372 365s 0.619 -0.735 0.859 365s 365s x1: [3, 4, 1, 1] 365s 0.455 -0.825 -0.561 365s -0.445 -0.526 -0.101 365s -0.713 -0.427 -0.858 365s -0.149 0.568 -0.791 365s 365s y: [3, 4, 1, 1] 365s -1.561 0.476 0.407 365s -0.282 0.150 0.025 365s -0.736 -0.287 -0.864 365s -0.143 -0.710 -1.189 365s testing: mul_mat, [3, 3, 3, 1] = [3, 3, 3, 1] * [3, 3, 3, 1] 365s x0: [3, 3, 3, 1] 365s 0.452 -0.269 -0.257 365s 0.647 -0.075 0.624 365s -0.033 0.572 0.761 365s 365s x1: [3, 3, 3, 1] 365s -0.804 -0.806 -0.119 365s 0.319 -0.354 -0.863 365s -0.543 0.344 -0.478 365s 365s y: [3, 3, 3, 1] 365s -0.116 -0.535 -0.524 365s 0.461 -0.306 -0.870 365s -0.215 -0.676 -0.149 365s testing: mul_mat, [4, 3, 3, 2] = [3, 4, 3, 2] * [3, 3, 3, 2] 365s x0: [3, 4, 3, 2] 365s -0.257 0.781 -0.765 365s -0.940 -0.684 -0.135 365s -0.197 -0.906 -0.055 365s 0.440 -0.998 0.926 365s 365s x1: [3, 3, 3, 2] 365s 0.244 0.450 0.893 365s -0.871 -0.099 -0.376 365s -0.128 -0.451 0.549 365s 365s y: [4, 3, 3, 2] 365s -0.395 -0.658 -0.504 0.486 365s 0.434 0.937 0.281 -0.633 365s -0.740 0.355 0.404 0.902 365s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 365s x0: [3, 4, 1, 1] 365s 0.269 0.837 0.507 365s -0.553 -0.622 0.190 365s -0.744 -0.370 -0.949 365s 0.917 0.627 0.703 365s 365s x1: [3, 4, 1, 1] 365s 0.236 -0.260 0.385 365s -0.724 -0.433 -0.094 365s 0.371 -0.105 0.867 365s -0.947 0.709 -0.765 365s 365s y: [4, 4, 1, 1] 365s 0.042 0.104 -0.445 0.325 365s -0.606 0.652 0.789 -1.002 365s 0.451 0.025 -1.059 0.883 365s -0.050 -0.062 1.169 -0.962 365s testing: mul_mat, [3, 3, 3, 1] = [4, 3, 3, 1] * [4, 3, 3, 1] 365s x0: [4, 3, 3, 1] 365s -0.812 -0.410 -0.638 -0.813 365s -0.096 -0.444 0.102 0.356 365s 0.506 -0.060 -0.759 0.776 365s 365s x1: [4, 3, 3, 1] 365s 0.911 0.976 0.768 -0.110 365s -0.655 0.143 -0.874 0.085 365s -0.472 -0.598 0.652 0.434 365s 365s y: [3, 3, 3, 1] 365s -1.541 -0.483 -0.267 365s 0.961 -0.059 0.390 365s -0.140 0.532 -0.360 365s testing: mul_mat, [1, 4, 3, 2] = [3, 1, 3, 2] * [3, 4, 3, 2] 365s x0: [3, 1, 3, 2] 365s -0.485 0.236 -0.782 365s 365s x1: [3, 4, 3, 2] 365s 0.425 -0.971 -0.623 365s 0.666 -0.784 0.733 365s 0.442 -0.375 -0.386 365s 0.201 0.511 -0.064 365s 365s y: [1, 4, 3, 2] 365s 0.052 365s -1.081 365s -0.001 365s 0.073 365s test-mul-mat0: iter:144/500 365s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 365s x0: [4, 4, 1, 1] 365s -0.555 -0.835 -0.585 -0.370 365s -0.232 -0.915 -0.222 -0.147 365s -0.626 0.031 -0.074 0.059 365s -0.752 0.978 0.584 -0.434 365s 365s x1: [4, 3, 1, 1] 365s 0.931 0.928 -0.159 0.790 365s -0.067 -0.843 -0.491 -0.337 365s 0.298 0.025 0.899 0.516 365s 365s y: [4, 3, 1, 1] 365s -1.491 -1.146 -0.496 -0.228 365s 1.153 0.945 0.033 -0.914 365s -0.903 -0.367 -0.223 0.100 365s testing: mul_mat, [2, 4, 4, 1] = [4, 2, 4, 1] * [4, 4, 4, 1] 365s x0: [4, 2, 4, 1] 365s -0.421 -0.778 -0.650 -0.426 365s -0.670 0.477 -0.480 -0.844 365s 365s x1: [4, 4, 4, 1] 365s -0.573 0.422 -0.503 -0.645 365s -0.737 -0.713 0.288 -0.580 365s -0.204 0.951 0.718 0.821 365s 0.849 0.234 -0.513 -0.706 365s 365s y: [2, 4, 4, 1] 365s 0.514 1.371 365s 0.925 0.505 365s -1.471 -0.448 365s 0.094 0.385 365s testing: mul_mat, [4, 2, 4, 4] = [4, 4, 4, 4] * [4, 2, 4, 4] 365s x0: [4, 4, 4, 4] 365s -0.200 0.394 0.239 -0.144 365s 0.304 -0.730 -0.901 0.207 365s -0.671 -0.871 -0.520 -0.406 365s 0.425 0.315 -0.715 -0.078 365s 365s x1: [4, 2, 4, 4] 365s 0.177 -0.645 -0.907 0.506 365s 0.832 -0.387 0.662 -0.742 365s 365s y: [4, 2, 4, 4] 365s -0.579 1.447 0.709 0.481 365s -0.053 -0.215 -0.264 -0.184 365s testing: mul_mat, [4, 4, 1, 1] = [4, 4, 1, 1] * [4, 4, 1, 1] 365s x0: [4, 4, 1, 1] 365s 0.408 -0.425 -0.870 -0.159 365s -0.308 0.425 0.383 0.833 365s -0.521 0.941 0.641 -0.434 365s -0.195 0.842 0.430 0.206 365s 365s x1: [4, 4, 1, 1] 365s 0.135 -0.635 -0.031 0.230 365s 0.844 0.909 -0.646 -0.807 365s 0.885 0.005 -0.632 -0.042 365s 0.066 -0.129 0.419 0.575 365s 365s y: [4, 4, 1, 1] 365s 0.316 -0.132 -0.788 -0.527 365s 0.648 -0.793 0.352 0.156 365s 0.915 -0.548 -0.844 -0.450 365s -0.373 0.564 -0.137 0.177 365s testing: mul_mat, [2, 4, 4, 1] = [4, 2, 4, 1] * [4, 4, 4, 1] 365s x0: [4, 2, 4, 1] 365s 0.367 -0.425 0.724 0.893 365s -0.177 0.853 0.036 -0.153 365s 365s x1: [4, 4, 4, 1] 365s -0.199 0.796 0.050 -0.289 365s -0.850 0.243 -0.404 0.155 365s 0.611 0.554 -0.779 -0.518 365s -0.028 0.796 0.771 -0.620 365s 365s y: [2, 4, 4, 1] 365s -0.633 0.760 365s -0.570 0.320 365s -1.039 0.415 365s -0.344 0.806 365s testing: mul_mat, [3, 4, 4, 4] = [2, 3, 4, 4] * [2, 4, 4, 4] 365s x0: [2, 3, 4, 4] 365s -0.260 0.620 365s -0.918 0.848 365s -0.102 -0.214 365s 365s x1: [2, 4, 4, 4] 365s 0.788 -0.227 365s 0.923 0.512 365s 0.809 0.816 365s -0.641 0.787 365s 365s y: [3, 4, 4, 4] 365s -0.346 -0.916 -0.032 365s 0.077 -0.414 -0.204 365s 0.296 -0.051 -0.257 365s 0.655 1.256 -0.103 365s test-mul-mat0: iter:145/500 365s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 365s x0: [3, 4, 1, 1] 365s -0.420 -0.973 -0.403 365s -0.725 -0.230 0.558 365s -0.112 0.576 -0.102 365s 0.169 -0.156 0.924 365s 365s x1: [3, 1, 1, 1] 365s -0.225 -0.896 -0.931 365s 365s y: [4, 1, 1, 1] 365s 1.342 -0.151 -0.396 -0.759 365s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 365s x0: [3, 3, 1, 1] 365s 0.710 -0.488 -0.281 365s 0.814 -0.419 0.797 365s -0.606 -0.393 -0.605 365s 365s x1: [3, 4, 1, 1] 365s 0.103 -0.391 0.038 365s -0.411 -0.873 -0.681 365s -0.631 0.411 -0.414 365s 0.263 0.373 -0.723 365s 365s y: [3, 4, 1, 1] 365s 0.253 0.279 0.068 365s 0.326 -0.511 1.004 365s -0.532 -1.017 0.472 365s 0.208 -0.518 0.131 365s testing: mul_mat, [3, 3, 1, 2] = [3, 3, 1, 2] * [3, 3, 1, 2] 365s x0: [3, 3, 1, 2] 365s 0.317 -0.004 0.994 365s 0.027 0.508 -0.287 365s -0.158 -0.912 -0.490 365s 365s x1: [3, 3, 1, 2] 365s -0.331 0.377 0.953 365s 0.557 -0.047 -0.149 365s -0.273 0.797 -0.224 365s 365s y: [3, 3, 1, 2] 365s 0.841 -0.091 -0.758 365s 0.029 0.034 0.027 365s -0.313 0.462 -0.574 365s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 365s x0: [3, 4, 1, 1] 365s 0.706 -0.853 0.983 365s 0.151 0.271 -0.108 365s 0.276 -0.950 0.743 365s 0.023 0.654 -0.506 365s 365s x1: [3, 3, 1, 1] 365s -0.811 -0.177 -0.515 365s -0.982 0.230 -0.701 365s -0.563 0.762 -0.568 365s 365s y: [4, 3, 1, 1] 365s -0.928 -0.114 -0.439 0.127 365s -1.578 -0.010 -1.010 0.483 365s -1.606 0.183 -1.301 0.773 365s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 365s x0: [4, 2, 1, 1] 365s 0.590 -0.862 0.184 0.058 365s -0.249 0.352 0.364 -0.666 365s 365s x1: [4, 3, 1, 1] 365s -0.873 -0.562 0.399 -0.968 365s -0.489 -0.743 0.494 0.537 365s 0.967 0.683 -0.640 -0.548 365s 365s y: [2, 3, 1, 1] 365s -0.014 0.809 365s 0.473 -0.318 365s -0.167 0.132 365s testing: mul_mat, [4, 4, 1, 2] = [2, 4, 1, 2] * [2, 4, 1, 2] 365s x0: [2, 4, 1, 2] 365s -0.636 -0.538 365s -0.990 -0.941 365s -0.642 -0.686 365s -0.276 0.213 365s 365s x1: [2, 4, 1, 2] 365s -0.360 -0.919 365s -0.519 0.910 365s -0.869 -0.865 365s 0.894 0.023 365s 365s y: [4, 4, 1, 2] 365s 0.724 1.221 0.861 -0.097 365s -0.160 -0.342 -0.291 0.337 365s 1.018 1.674 1.151 0.055 365s -0.580 -0.906 -0.589 -0.241 365s test-mul-mat0: iter:146/500 365s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 365s x0: [3, 2, 1, 1] 365s -0.710 -0.631 -0.777 365s -0.034 -0.267 -0.767 365s 365s x1: [3, 2, 1, 1] 365s -0.995 0.959 0.225 365s -0.607 0.108 0.542 365s 365s y: [2, 2, 1, 1] 365s -0.074 -0.395 365s -0.058 -0.423 365s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 365s x0: [3, 2, 1, 1] 365s 0.438 -0.695 0.143 365s -0.837 0.036 0.344 365s 365s x1: [3, 2, 1, 1] 365s 0.324 0.457 -0.305 365s 0.384 0.772 0.908 365s 365s y: [2, 2, 1, 1] 365s -0.219 -0.360 365s -0.238 0.020 365s testing: mul_mat, [1, 2, 1, 3] = [3, 1, 1, 3] * [3, 2, 1, 3] 365s x0: [3, 1, 1, 3] 365s -0.079 0.186 -0.065 365s 365s x1: [3, 2, 1, 3] 365s -0.032 -0.752 0.456 365s 0.904 0.596 0.461 365s 365s y: [1, 2, 1, 3] 365s -0.167 365s 0.009 365s testing: mul_mat, [3, 1, 1, 1] = [3, 3, 1, 1] * [3, 1, 1, 1] 365s x0: [3, 3, 1, 1] 365s -0.316 -0.720 0.101 365s -0.540 0.922 0.776 365s 0.166 0.029 -1.000 365s 365s x1: [3, 1, 1, 1] 365s -0.706 -0.564 0.004 365s 365s y: [3, 1, 1, 1] 365s 0.629 -0.136 -0.138 365s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 365s x0: [3, 2, 1, 1] 365s 0.744 0.273 -0.509 365s 0.975 -0.162 0.404 365s 365s x1: [3, 3, 1, 1] 365s 0.464 0.810 0.261 365s -0.805 -0.932 0.488 365s 0.659 -0.442 0.040 365s 365s y: [2, 3, 1, 1] 365s 0.433 0.426 365s -1.102 -0.436 365s 0.349 0.730 365s testing: mul_mat, [4, 3, 1, 3] = [2, 4, 1, 3] * [2, 3, 1, 3] 365s x0: [2, 4, 1, 3] 365s 0.191 -0.054 365s 0.906 -0.365 365s -0.945 0.492 365s 0.748 -0.080 365s 365s x1: [2, 3, 1, 3] 365s -0.763 -0.849 365s 0.699 -0.327 365s 0.155 -0.039 365s 365s y: [4, 3, 1, 3] 365s -0.099 -0.382 0.303 -0.503 365s 0.151 0.752 -0.821 0.549 365s 0.032 0.155 -0.166 0.119 365s test-mul-mat0: iter:147/500 365s testing: mul_mat, [3, 2, 1, 1] = [3, 3, 1, 1] * [3, 2, 1, 1] 365s x0: [3, 3, 1, 1] 365s 0.682 -0.740 -0.265 365s -0.398 -0.832 0.064 365s -0.986 0.481 0.631 365s 365s x1: [3, 2, 1, 1] 365s -0.696 -0.170 0.570 365s 0.494 -0.264 0.625 365s 365s y: [3, 2, 1, 1] 365s -0.500 0.455 0.965 365s 0.367 0.063 -0.220 365s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 365s x0: [3, 2, 1, 1] 365s 0.818 -0.327 0.673 365s -0.770 0.846 0.977 365s 365s x1: [3, 3, 1, 1] 365s 0.576 -0.508 -0.129 365s -0.707 -0.536 0.467 365s -0.376 -0.437 0.701 365s 365s y: [2, 3, 1, 1] 365s 0.550 -1.000 365s -0.089 0.547 365s 0.308 0.604 365s testing: mul_mat, [4, 2, 1, 3] = [3, 4, 1, 3] * [3, 2, 1, 3] 365s x0: [3, 4, 1, 3] 365s 0.906 -0.518 -0.414 365s -0.531 -0.817 0.078 365s -0.712 -0.144 -0.248 365s -0.482 -0.297 -0.271 365s 365s x1: [3, 2, 1, 3] 365s 0.060 0.416 0.471 365s 0.796 0.041 -0.286 365s 365s y: [4, 2, 1, 3] 365s -0.357 -0.335 -0.219 -0.280 365s 0.818 -0.478 -0.502 -0.319 365s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 365s x0: [3, 3, 1, 1] 365s 0.305 -0.765 0.257 365s -0.998 0.933 -0.186 365s 0.275 -0.848 0.173 365s 365s x1: [3, 4, 1, 1] 365s -0.683 -0.013 0.558 365s 0.069 0.505 -0.739 365s 0.797 0.082 -0.620 365s 0.998 -0.545 0.540 365s 365s y: [3, 4, 1, 1] 365s -0.055 0.566 -0.080 365s -0.556 0.539 -0.537 365s 0.021 -0.604 0.042 365s 0.861 -1.605 0.831 365s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 365s x0: [3, 3, 1, 1] 365s 0.056 -0.440 0.642 365s 0.989 -0.750 -0.370 365s 0.226 0.023 0.021 365s 365s x1: [3, 3, 1, 1] 365s 0.949 0.138 0.210 365s -0.398 0.369 0.729 365s 0.840 -0.932 -0.569 365s 365s y: [3, 3, 1, 1] 365s 0.127 0.757 0.222 365s 0.284 -0.940 -0.066 365s 0.091 1.740 0.156 365s testing: mul_mat, [4, 3, 1, 3] = [3, 4, 1, 3] * [3, 3, 1, 3] 365s x0: [3, 4, 1, 3] 365s -0.247 -0.056 0.854 365s -0.227 -0.354 0.890 365s -0.287 -0.747 0.221 365s -0.149 0.116 0.215 365s 365s x1: [3, 3, 1, 3] 365s -0.903 -0.830 0.933 365s 0.402 -0.828 0.209 365s 0.637 -0.895 0.361 365s 365s y: [4, 3, 1, 3] 365s 1.066 1.329 1.085 0.239 365s 0.125 0.388 0.549 -0.111 365s 0.201 0.494 0.565 -0.121 365s test-mul-mat0: iter:148/500 365s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 365s x0: [3, 4, 1, 1] 365s 0.101 -0.663 -0.058 365s 0.299 0.831 -0.985 365s -0.075 -0.203 -0.628 365s -0.955 0.666 -0.827 365s 365s x1: [3, 1, 1, 1] 365s -0.471 -0.040 -0.412 365s 365s y: [4, 1, 1, 1] 365s 0.003 0.232 0.302 0.763 365s testing: mul_mat, [4, 4, 2, 1] = [3, 4, 2, 1] * [3, 4, 2, 1] 365s x0: [3, 4, 2, 1] 365s 0.658 -0.881 -0.768 365s 0.704 0.785 -0.596 365s 0.420 -0.786 0.046 365s 0.918 0.547 0.847 365s 365s x1: [3, 4, 2, 1] 365s 0.716 -0.572 -0.359 365s -0.501 0.334 -0.198 365s -0.966 0.013 0.675 365s -0.580 0.635 0.193 365s 365s y: [4, 4, 2, 1] 365s 1.251 0.269 0.734 0.041 365s -0.472 0.028 -0.482 -0.446 365s -1.165 -1.072 -0.385 -0.309 365s -1.090 -0.024 -0.734 -0.021 365s testing: mul_mat, [4, 4, 2, 2] = [3, 4, 2, 2] * [3, 4, 2, 2] 365s x0: [3, 4, 2, 2] 365s 0.459 -0.933 -0.412 365s -0.863 0.164 -0.738 365s 0.530 0.693 0.504 365s -0.795 0.242 -0.349 365s 365s x1: [3, 4, 2, 2] 365s 0.924 0.338 0.929 365s 0.768 -0.929 0.252 365s 0.628 0.730 0.371 365s 0.859 0.433 0.157 365s 365s y: [4, 4, 2, 2] 365s -0.274 -1.428 1.192 -0.977 365s 1.114 -1.002 -0.109 -0.924 365s -0.546 -0.696 1.026 -0.452 365s -0.075 -0.787 0.835 -0.633 365s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 365s x0: [3, 2, 1, 1] 365s 0.714 0.484 -0.027 365s -0.471 -0.262 -0.675 365s 365s x1: [3, 4, 1, 1] 365s -0.735 -0.497 0.272 365s -0.760 -0.183 -0.173 365s 0.934 0.952 0.373 365s -0.801 0.278 -0.778 365s 365s y: [2, 4, 1, 1] 365s -0.773 0.292 365s -0.626 0.523 365s 1.118 -0.941 365s -0.416 0.829 365s testing: mul_mat, [4, 3, 2, 1] = [2, 4, 2, 1] * [2, 3, 2, 1] 365s x0: [2, 4, 2, 1] 365s -0.021 0.669 365s -0.158 0.267 365s 0.333 0.748 365s 0.796 0.042 365s 365s x1: [2, 3, 2, 1] 365s -0.977 -0.053 365s -0.142 -0.187 365s 0.127 -0.367 365s 365s y: [4, 3, 2, 1] 365s -0.015 0.140 -0.365 -0.780 365s -0.122 -0.028 -0.187 -0.121 365s -0.248 -0.118 -0.232 0.085 365s testing: mul_mat, [2, 2, 2, 2] = [4, 2, 2, 2] * [4, 2, 2, 2] 365s x0: [4, 2, 2, 2] 365s 0.207 0.614 0.199 -0.912 365s -0.642 0.521 -0.812 -0.338 365s 365s x1: [4, 2, 2, 2] 365s -0.233 0.084 0.501 0.714 365s 0.942 -0.686 -0.159 -0.425 365s 365s y: [2, 2, 2, 2] 365s -0.549 -0.454 365s 0.130 -0.689 365s test-mul-mat0: iter:149/500 365s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 365s x0: [3, 2, 1, 1] 365s 0.070 0.738 -0.465 365s -0.493 0.924 -0.520 365s 365s x1: [3, 4, 1, 1] 365s 0.329 -0.061 -0.315 365s 0.216 -0.578 0.286 365s -0.630 -0.606 -0.194 365s 0.633 -0.813 0.040 365s 365s y: [2, 4, 1, 1] 365s 0.124 -0.055 365s -0.544 -0.789 365s -0.401 -0.148 365s -0.574 -1.084 365s testing: mul_mat, [4, 2, 3, 1] = [3, 4, 3, 1] * [3, 2, 3, 1] 365s x0: [3, 4, 3, 1] 365s -0.605 -0.610 -0.155 365s -0.417 0.430 0.519 365s 0.652 0.168 -0.946 365s -0.841 0.092 -0.466 365s 365s x1: [3, 2, 3, 1] 365s 0.774 -0.661 -0.698 365s 0.629 -0.539 -0.232 365s 365s y: [4, 2, 3, 1] 365s 0.043 -0.970 1.054 -0.386 365s -0.016 -0.615 0.539 -0.470 365s testing: mul_mat, [2, 4, 3, 2] = [3, 2, 3, 2] * [3, 4, 3, 2] 365s x0: [3, 2, 3, 2] 365s 0.122 -0.171 0.402 365s 0.142 -0.201 0.370 365s 365s x1: [3, 4, 3, 2] 365s 0.061 -0.465 0.681 365s 0.115 -0.306 -0.227 365s 0.649 -0.373 -0.796 365s -0.515 -0.811 -0.904 365s 365s y: [2, 4, 3, 2] 365s 0.361 0.354 365s -0.025 -0.006 365s -0.178 -0.128 365s -0.288 -0.244 365s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 365s x0: [3, 2, 1, 1] 365s 0.779 0.779 -0.162 365s 0.421 0.439 0.213 365s 365s x1: [3, 2, 1, 1] 365s 0.577 -0.028 -0.989 365s 0.479 -0.641 -0.863 365s 365s y: [2, 2, 1, 1] 365s 0.588 0.020 365s 0.013 -0.264 365s testing: mul_mat, [2, 3, 3, 1] = [2, 2, 3, 1] * [2, 3, 3, 1] 365s x0: [2, 2, 3, 1] 365s -0.027 0.115 365s 0.858 0.332 365s 365s x1: [2, 3, 3, 1] 365s -0.569 0.878 365s 0.614 -0.621 365s 0.709 0.448 365s 365s y: [2, 3, 3, 1] 365s 0.116 -0.197 365s -0.088 0.321 365s 0.032 0.757 365s testing: mul_mat, [1, 2, 3, 2] = [2, 1, 3, 2] * [2, 2, 3, 2] 365s x0: [2, 1, 3, 2] 365s 0.545 -0.881 365s 365s x1: [2, 2, 3, 2] 365s -0.019 -0.868 365s 0.686 0.360 365s 365s y: [1, 2, 3, 2] 365s 0.755 365s 0.057 365s test-mul-mat0: iter:150/500 365s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 3, 1, 1] 365s 0.193 -0.441 365s 0.620 -0.250 365s 0.661 -0.671 365s 365s x1: [2, 2, 1, 1] 365s -0.699 0.765 365s 0.271 -0.570 365s 365s y: [3, 2, 1, 1] 365s -0.472 -0.625 -0.976 365s 0.304 0.311 0.562 365s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 365s x0: [2, 2, 1, 1] 365s 0.737 0.524 365s -0.537 -0.186 365s 365s x1: [2, 3, 1, 1] 365s 0.422 -0.905 365s 0.032 0.173 365s 0.641 0.151 365s 365s y: [2, 3, 1, 1] 365s -0.163 -0.058 365s 0.114 -0.049 365s 0.551 -0.372 365s testing: mul_mat, [2, 2, 1, 2] = [2, 2, 1, 2] * [2, 2, 1, 2] 365s x0: [2, 2, 1, 2] 365s 0.136 0.553 365s 0.330 0.565 365s 365s x1: [2, 2, 1, 2] 365s -0.972 -0.171 365s 0.251 -0.627 365s 365s y: [2, 2, 1, 2] 365s -0.226 -0.418 365s -0.312 -0.271 365s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 4, 1, 1] 365s 0.303 0.651 365s 0.868 0.258 365s 0.522 0.977 365s -0.546 -0.886 365s 365s x1: [2, 2, 1, 1] 365s 0.273 0.785 365s -0.729 -0.305 365s 365s y: [4, 2, 1, 1] 365s 0.594 0.440 0.910 -0.845 365s -0.419 -0.711 -0.678 0.668 365s testing: mul_mat, [2, 2, 1, 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 365s x0: [4, 2, 1, 1] 365s -0.968 -0.388 0.914 0.885 365s -0.651 0.362 -0.066 -0.962 365s 365s x1: [4, 2, 1, 1] 365s -0.928 -0.995 -0.057 0.323 365s -0.622 0.272 0.973 0.213 365s 365s y: [2, 2, 1, 1] 365s 1.518 -0.063 365s 1.575 0.234 365s testing: mul_mat, [4, 4, 1, 2] = [2, 4, 1, 2] * [2, 4, 1, 2] 365s x0: [2, 4, 1, 2] 365s 0.879 -0.521 365s 0.266 -0.240 365s 0.701 0.884 365s 0.852 -0.171 365s 365s x1: [2, 4, 1, 2] 365s -0.123 0.158 365s 0.824 0.148 365s 0.853 0.704 365s -0.549 0.721 365s 365s y: [4, 4, 1, 2] 365s -0.191 -0.071 0.053 -0.132 365s 0.647 0.184 0.708 0.676 365s 0.383 0.058 1.221 0.606 365s -0.858 -0.319 0.253 -0.591 365s test-mul-mat0: iter:151/500 365s testing: mul_mat, [1, 3, 1, 1] = [1, 1, 1, 1] * [1, 3, 1, 1] 365s x0: [1, 1, 1, 1] 365s 0.236 365s 365s x1: [1, 3, 1, 1] 365s -0.218 365s 0.864 365s 0.535 365s 365s y: [1, 3, 1, 1] 365s -0.051 365s 0.204 365s 0.126 365s testing: mul_mat, [1, 1, 4, 1] = [1, 1, 4, 1] * [1, 1, 4, 1] 365s x0: [1, 1, 4, 1] 365s 0.151 365s 365s x1: [1, 1, 4, 1] 365s -0.981 365s 365s y: [1, 1, 4, 1] 365s -0.148 365s testing: mul_mat, [3, 1, 4, 1] = [1, 3, 4, 1] * [1, 1, 4, 1] 365s x0: [1, 3, 4, 1] 365s -0.016 365s 0.437 365s -0.226 365s 365s x1: [1, 1, 4, 1] 365s -0.694 365s 365s y: [3, 1, 4, 1] 365s 0.011 -0.303 0.157 365s testing: mul_mat, [4, 3, 1, 1] = [1, 4, 1, 1] * [1, 3, 1, 1] 365s x0: [1, 4, 1, 1] 365s -0.900 365s -0.978 365s 0.843 365s -0.178 365s 365s x1: [1, 3, 1, 1] 365s 0.586 365s -0.136 365s -0.997 365s 365s y: [4, 3, 1, 1] 365s -0.527 -0.573 0.494 -0.104 365s 0.123 0.133 -0.115 0.024 365s 0.897 0.974 -0.840 0.178 365s testing: mul_mat, [1, 1, 4, 1] = [4, 1, 4, 1] * [4, 1, 4, 1] 365s x0: [4, 1, 4, 1] 365s 0.182 -0.589 0.748 0.986 365s 365s x1: [4, 1, 4, 1] 365s -0.403 0.301 0.973 0.052 365s 365s y: [1, 1, 4, 1] 365s 0.528 365s testing: mul_mat, [1, 4, 4, 1] = [1, 1, 4, 1] * [1, 4, 4, 1] 365s x0: [1, 1, 4, 1] 365s 0.870 365s 365s x1: [1, 4, 4, 1] 365s 0.169 365s 0.892 365s -0.987 365s 0.994 365s 365s y: [1, 4, 4, 1] 365s 0.147 365s 0.776 365s -0.858 365s 0.864 365s test-mul-mat0: iter:152/500 365s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 365s x0: [3, 3, 1, 1] 365s -0.469 -0.924 -0.401 365s 0.330 -0.424 -0.552 365s 0.114 0.109 0.007 365s 365s x1: [3, 3, 1, 1] 365s 0.810 -0.450 -0.168 365s 0.007 0.389 -0.179 365s -0.824 0.281 -0.165 365s 365s y: [3, 3, 1, 1] 365s 0.104 0.551 0.042 365s -0.291 -0.065 0.042 365s 0.193 -0.300 -0.065 365s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 365s x0: [3, 3, 1, 1] 365s 0.495 0.918 -0.393 365s -0.956 -0.250 0.613 365s 0.434 0.572 0.789 365s 365s x1: [3, 3, 1, 1] 365s -0.548 0.231 -0.252 365s -0.799 0.101 -0.525 365s 0.361 0.188 0.774 365s 365s y: [3, 3, 1, 1] 365s 0.040 0.312 -0.304 365s -0.096 0.417 -0.703 365s 0.046 0.082 0.874 365s testing: mul_mat, [3, 3, 1, 3] = [3, 3, 1, 3] * [3, 3, 1, 3] 365s x0: [3, 3, 1, 3] 365s -0.342 0.250 -0.491 365s 0.373 0.656 -0.532 365s -0.381 -0.861 0.026 365s 365s x1: [3, 3, 1, 3] 365s -0.285 -0.594 0.959 365s 0.246 -0.518 -0.442 365s -0.424 0.058 0.006 365s 365s y: [3, 3, 1, 3] 365s -0.521 -1.006 0.645 365s 0.004 -0.013 0.341 365s 0.157 -0.124 0.112 365s testing: mul_mat, [1, 3, 1, 1] = [3, 1, 1, 1] * [3, 3, 1, 1] 365s x0: [3, 1, 1, 1] 365s 0.999 -0.294 0.401 365s 365s x1: [3, 3, 1, 1] 365s -0.321 0.349 0.661 365s 0.399 -0.993 -0.089 365s 0.908 0.380 -0.432 365s 365s y: [1, 3, 1, 1] 365s -0.158 365s 0.654 365s 0.622 365s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 365s x0: [1, 3, 1, 1] 365s 0.266 365s 0.479 365s 0.102 365s 365s x1: [1, 3, 1, 1] 365s 0.193 365s 0.903 365s 0.433 365s 365s y: [3, 3, 1, 1] 365s 0.051 0.092 0.020 365s 0.240 0.432 0.092 365s 0.115 0.207 0.044 365s testing: mul_mat, [1, 1, 1, 3] = [3, 1, 1, 3] * [3, 1, 1, 3] 365s x0: [3, 1, 1, 3] 365s 0.367 0.793 0.704 365s 365s x1: [3, 1, 1, 3] 365s -0.973 -0.714 -0.530 365s 365s y: [1, 1, 1, 3] 365s -1.297 365s test-mul-mat0: iter:153/500 365s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 365s x0: [4, 3, 1, 1] 365s 0.475 0.925 0.788 -0.411 365s -0.720 0.156 -0.618 0.984 365s -0.816 0.574 0.695 -0.876 365s 365s x1: [4, 3, 1, 1] 365s -0.123 -0.717 0.117 -0.690 365s -0.640 -0.617 0.789 0.462 365s -0.591 -0.925 0.933 0.695 365s 365s y: [3, 3, 1, 1] 365s -0.346 -0.774 0.374 365s -0.443 0.332 0.311 365s -0.687 0.389 -0.009 365s testing: mul_mat, [1, 3, 4, 1] = [4, 1, 4, 1] * [4, 3, 4, 1] 365s x0: [4, 1, 4, 1] 365s 0.054 0.969 -0.780 -0.553 365s 365s x1: [4, 3, 4, 1] 365s -0.326 -0.213 -0.368 -0.850 365s 0.878 0.846 0.027 -0.839 365s -0.038 0.337 -0.479 0.345 365s 365s y: [1, 3, 4, 1] 365s 0.533 365s 1.310 365s 0.508 365s testing: mul_mat, [2, 1, 4, 2] = [4, 2, 4, 2] * [4, 1, 4, 2] 365s x0: [4, 2, 4, 2] 365s -0.473 0.394 0.102 -0.764 365s 0.919 -0.095 0.589 -0.306 365s 365s x1: [4, 1, 4, 2] 365s 0.616 -0.036 0.412 0.402 365s 365s y: [2, 1, 4, 2] 365s -0.570 0.689 365s testing: mul_mat, [2, 2, 1, 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 365s x0: [4, 2, 1, 1] 365s -0.200 0.800 -0.458 0.009 365s -0.081 0.708 -0.602 0.604 365s 365s x1: [4, 2, 1, 1] 365s -0.431 0.507 0.085 -0.740 365s 0.107 -0.354 0.322 0.046 365s 365s y: [2, 2, 1, 1] 365s 0.446 -0.104 365s -0.452 -0.426 365s testing: mul_mat, [3, 4, 4, 1] = [2, 3, 4, 1] * [2, 4, 4, 1] 365s x0: [2, 3, 4, 1] 365s 0.700 0.405 365s -0.994 -0.544 365s 0.366 0.213 365s 365s x1: [2, 4, 4, 1] 365s 0.640 0.977 365s 0.309 0.978 365s -0.869 0.705 365s -0.550 0.847 365s 365s y: [3, 4, 4, 1] 365s 0.844 -1.168 0.442 365s 0.613 -0.840 0.321 365s -0.323 0.480 -0.168 365s -0.042 0.086 -0.021 365s testing: mul_mat, [2, 2, 4, 2] = [3, 2, 4, 2] * [3, 2, 4, 2] 365s x0: [3, 2, 4, 2] 365s 0.196 -0.182 0.989 365s -0.948 -0.749 0.143 365s 365s x1: [3, 2, 4, 2] 365s 0.905 -0.901 0.715 365s 0.163 -0.510 0.743 365s 365s y: [2, 2, 4, 2] 365s 1.049 -0.081 365s 0.860 0.334 365s test-mul-mat0: iter:154/500 365s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 365s x0: [3, 2, 1, 1] 365s 0.774 -0.590 -0.865 365s -0.628 0.742 -0.560 365s 365s x1: [3, 2, 1, 1] 365s 0.255 0.196 0.341 365s 0.395 -0.628 0.131 365s 365s y: [2, 2, 1, 1] 365s -0.213 -0.206 365s 0.562 -0.787 365s testing: mul_mat, [4, 2, 3, 1] = [3, 4, 3, 1] * [3, 2, 3, 1] 365s x0: [3, 4, 3, 1] 365s -0.233 0.763 0.661 365s 0.140 -0.106 -0.691 365s -0.086 0.304 -0.557 365s 0.285 0.046 -0.117 365s 365s x1: [3, 2, 3, 1] 365s 0.221 0.062 0.888 365s -0.475 0.424 0.873 365s 365s y: [4, 2, 3, 1] 365s 0.583 -0.590 -0.495 -0.038 365s 1.011 -0.715 -0.316 -0.218 365s testing: mul_mat, [1, 4, 3, 4] = [3, 1, 3, 4] * [3, 4, 3, 4] 365s x0: [3, 1, 3, 4] 365s 0.953 -0.168 0.108 365s 365s x1: [3, 4, 3, 4] 365s 0.713 -0.893 0.485 365s -0.844 0.392 -0.469 365s 0.039 0.899 -0.361 365s 0.811 -0.069 -0.829 365s 365s y: [1, 4, 3, 4] 365s 0.882 365s -0.921 365s -0.153 365s 0.695 365s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 365s x0: [3, 2, 1, 1] 365s -0.315 -0.363 0.623 365s -0.004 0.235 -0.341 365s 365s x1: [3, 1, 1, 1] 365s 0.193 0.916 0.471 365s 365s y: [2, 1, 1, 1] 365s -0.100 0.054 365s testing: mul_mat, [4, 3, 3, 1] = [2, 4, 3, 1] * [2, 3, 3, 1] 365s x0: [2, 4, 3, 1] 365s -0.830 -0.145 365s 0.384 0.577 365s 0.247 -0.229 365s -0.914 0.851 365s 365s x1: [2, 3, 3, 1] 365s -0.721 -0.644 365s 0.762 -0.678 365s 0.405 0.024 365s 365s y: [4, 3, 3, 1] 365s 0.692 -0.649 -0.031 0.112 365s -0.534 -0.099 0.344 -1.273 365s -0.340 0.170 0.095 -0.350 365s testing: mul_mat, [4, 2, 3, 4] = [4, 4, 3, 4] * [4, 2, 3, 4] 365s x0: [4, 4, 3, 4] 365s -0.842 -0.832 0.976 -0.243 365s -0.320 0.972 0.469 0.596 365s 0.165 -0.327 -0.805 0.566 365s 0.768 0.442 0.456 0.422 365s 365s x1: [4, 2, 3, 4] 365s -0.339 -0.649 0.020 0.187 365s 0.569 0.603 0.241 0.739 365s 365s y: [4, 2, 3, 4] 365s 0.799 -0.402 0.245 -0.459 365s -0.925 0.958 0.121 1.126 365s test-mul-mat0: iter:155/500 365s testing: mul_mat, [4, 4, 1, 1] = [4, 4, 1, 1] * [4, 4, 1, 1] 365s x0: [4, 4, 1, 1] 365s -0.733 -0.551 0.879 -0.974 365s -0.778 -0.782 -0.168 0.193 365s -0.185 -0.989 0.999 0.716 365s 0.589 -0.464 0.784 -0.242 365s 365s x1: [4, 4, 1, 1] 365s -0.933 0.169 0.566 -0.884 365s -0.160 -0.755 -0.531 -0.460 365s 0.630 0.713 0.115 0.601 365s -0.710 0.947 -0.856 0.732 365s 365s y: [4, 4, 1, 1] 365s 1.949 0.329 -0.063 0.030 365s 0.515 0.716 -0.083 -0.048 365s -1.339 -0.951 -0.275 -0.015 365s -1.467 0.098 -1.135 -1.705 365s testing: mul_mat, [1, 4, 1, 1] = [4, 1, 1, 1] * [4, 4, 1, 1] 365s x0: [4, 1, 1, 1] 365s 1.000 -0.985 -0.226 -0.782 365s 365s x1: [4, 4, 1, 1] 365s -0.898 0.900 0.598 -0.653 365s -0.631 -0.862 0.977 -0.918 365s 0.253 0.578 -0.629 0.200 365s 0.722 -0.896 -0.879 0.989 365s 365s y: [1, 4, 1, 1] 365s -1.408 365s 0.716 365s -0.330 365s 1.030 365s testing: mul_mat, [2, 1, 1, 2] = [4, 2, 1, 2] * [4, 1, 1, 2] 365s x0: [4, 2, 1, 2] 365s 0.205 -0.397 -0.567 -0.197 365s -0.050 -0.198 -0.058 -0.073 365s 365s x1: [4, 1, 1, 2] 365s -0.153 0.967 0.033 -0.142 365s 365s y: [2, 1, 1, 2] 365s -0.406 -0.175 365s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 365s x0: [4, 1, 1, 1] 365s 0.936 -0.383 0.517 -0.618 365s 365s x1: [4, 2, 1, 1] 365s -0.784 0.911 0.516 -0.769 365s -0.315 0.733 0.077 -0.348 365s 365s y: [1, 2, 1, 1] 365s -0.341 365s -0.320 365s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 365s x0: [1, 1, 1, 1] 365s -0.609 365s 365s x1: [1, 4, 1, 1] 365s 0.118 365s -0.950 365s 0.587 365s 0.722 365s 365s y: [1, 4, 1, 1] 365s -0.072 365s 0.579 365s -0.358 365s -0.440 365s testing: mul_mat, [1, 1, 1, 2] = [1, 1, 1, 2] * [1, 1, 1, 2] 365s x0: [1, 1, 1, 2] 365s 0.599 365s 365s x1: [1, 1, 1, 2] 365s -0.328 365s 365s y: [1, 1, 1, 2] 365s -0.196 365s test-mul-mat0: iter:156/500 365s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 365s x0: [4, 4, 1, 1] 365s 0.954 -0.223 0.380 0.687 365s 0.854 -0.969 -0.546 0.790 365s -0.352 0.970 -0.828 0.767 365s -0.980 0.760 0.488 -0.497 365s 365s x1: [4, 1, 1, 1] 365s -0.670 -0.217 -0.562 -0.454 365s 365s y: [4, 1, 1, 1] 365s -1.116 -0.414 0.142 0.443 365s testing: mul_mat, [1, 4, 3, 1] = [4, 1, 3, 1] * [4, 4, 3, 1] 365s x0: [4, 1, 3, 1] 365s -0.629 0.770 0.623 -0.164 365s 365s x1: [4, 4, 3, 1] 365s -0.850 -0.840 0.789 0.483 365s 0.759 0.958 0.010 0.862 365s 0.382 0.932 -0.808 -0.835 365s -0.630 -0.262 -0.140 -0.676 365s 365s y: [1, 4, 3, 1] 365s 0.300 365s 0.126 365s 0.111 365s 0.218 365s testing: mul_mat, [4, 1, 3, 2] = [4, 4, 3, 2] * [4, 1, 3, 2] 365s x0: [4, 4, 3, 2] 365s 0.590 -0.821 -0.659 0.318 365s 0.530 0.744 -0.995 -0.879 365s 0.480 0.440 0.437 0.310 365s 0.582 -0.514 0.095 0.811 365s 365s x1: [4, 1, 3, 2] 365s -0.313 -0.408 0.737 0.435 365s 365s y: [4, 1, 3, 2] 365s -0.196 -1.585 0.127 0.451 365s testing: mul_mat, [3, 4, 1, 1] = [4, 3, 1, 1] * [4, 4, 1, 1] 365s x0: [4, 3, 1, 1] 365s -0.552 -0.213 0.621 -0.074 365s 0.822 0.113 0.444 0.336 365s -0.385 0.952 -0.363 0.123 365s 365s x1: [4, 4, 1, 1] 365s -0.942 -0.753 0.354 0.404 365s -0.632 -0.524 0.966 0.735 365s -0.003 0.325 -0.169 -0.990 365s 0.292 -0.143 -0.455 -0.839 365s 365s y: [3, 4, 1, 1] 365s 0.870 -0.567 -0.433 365s 1.006 0.097 -0.515 365s -0.099 -0.374 0.250 365s -0.352 -0.260 -0.187 365s testing: mul_mat, [3, 4, 3, 1] = [3, 3, 3, 1] * [3, 4, 3, 1] 365s x0: [3, 3, 3, 1] 365s -0.743 0.465 0.504 365s 0.198 0.681 -0.826 365s 0.306 0.183 -0.647 365s 365s x1: [3, 4, 3, 1] 365s -0.141 0.656 0.181 365s 0.105 0.010 -0.414 365s 0.473 0.486 -0.448 365s 0.208 -0.517 0.877 365s 365s y: [3, 4, 3, 1] 365s 0.502 0.268 -0.041 365s -0.283 0.370 0.302 365s -0.351 0.795 0.524 365s 0.047 -1.036 -0.599 365s testing: mul_mat, [1, 3, 3, 2] = [3, 1, 3, 2] * [3, 3, 3, 2] 365s x0: [3, 1, 3, 2] 365s -0.872 -0.403 0.066 365s 365s x1: [3, 3, 3, 2] 365s -0.417 -0.852 -0.726 365s 0.337 -0.595 0.472 365s -0.357 0.871 0.153 365s 365s y: [1, 3, 3, 2] 365s 0.659 365s -0.023 365s -0.030 365s test-mul-mat0: iter:157/500 365s testing: mul_mat, [1, 2, 1, 1] = [2, 1, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 1, 1, 1] 365s 0.028 -0.684 365s 365s x1: [2, 2, 1, 1] 365s 0.096 -0.202 365s 0.786 -0.196 365s 365s y: [1, 2, 1, 1] 365s 0.141 365s 0.156 365s testing: mul_mat, [1, 1, 3, 1] = [2, 1, 3, 1] * [2, 1, 3, 1] 365s x0: [2, 1, 3, 1] 365s 0.590 -0.308 365s 365s x1: [2, 1, 3, 1] 365s -0.068 0.339 365s 365s y: [1, 1, 3, 1] 365s -0.145 365s testing: mul_mat, [2, 1, 3, 3] = [2, 2, 3, 3] * [2, 1, 3, 3] 365s x0: [2, 2, 3, 3] 365s -0.077 0.530 365s -0.565 0.329 365s 365s x1: [2, 1, 3, 3] 365s 0.163 0.394 365s 365s y: [2, 1, 3, 3] 365s 0.196 0.037 365s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 3, 1, 1] 365s 0.663 0.976 365s -0.815 -0.927 365s -0.422 0.528 365s 365s x1: [2, 2, 1, 1] 365s 0.964 0.494 365s -0.257 -0.860 365s 365s y: [3, 2, 1, 1] 365s 1.121 -1.243 -0.147 365s -1.010 1.007 -0.345 365s testing: mul_mat, [2, 2, 3, 1] = [3, 2, 3, 1] * [3, 2, 3, 1] 365s x0: [3, 2, 3, 1] 365s 0.809 0.331 0.235 365s 0.095 -0.448 -0.214 365s 365s x1: [3, 2, 3, 1] 365s 0.365 -0.255 -0.116 365s -0.224 -0.714 -0.081 365s 365s y: [2, 2, 3, 1] 365s 0.184 0.174 365s -0.437 0.316 365s testing: mul_mat, [3, 3, 3, 3] = [2, 3, 3, 3] * [2, 3, 3, 3] 365s x0: [2, 3, 3, 3] 365s -0.807 -0.556 365s 0.875 -0.301 365s 0.207 0.688 365s 365s x1: [2, 3, 3, 3] 365s 0.290 -0.681 365s 0.124 -0.014 365s -0.607 0.039 365s 365s y: [3, 3, 3, 3] 365s 0.144 0.459 -0.408 365s -0.092 0.113 0.016 365s 0.468 -0.543 -0.099 365s test-mul-mat0: iter:158/500 365s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 365s x0: [3, 4, 1, 1] 365s -0.707 0.479 -0.662 365s -0.032 0.403 -0.359 365s -0.337 0.712 -0.152 365s -0.692 -0.118 -0.869 365s 365s x1: [3, 4, 1, 1] 365s 0.127 0.465 0.902 365s 0.660 -0.721 -0.612 365s 0.399 0.537 0.573 365s -0.173 -0.815 0.570 365s 365s y: [4, 4, 1, 1] 365s -0.464 -0.140 0.151 -0.927 365s -0.406 -0.092 -0.642 0.160 365s -0.404 -0.002 0.161 -0.837 365s -0.645 -0.527 -0.609 -0.279 365s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 365s x0: [3, 4, 1, 1] 365s -0.050 0.346 0.255 365s 0.136 -0.084 0.997 365s 0.428 -0.606 -0.666 365s -0.604 0.797 -0.025 365s 365s x1: [3, 4, 1, 1] 365s -0.601 -0.177 -0.806 365s -0.757 0.668 0.484 365s 0.370 0.133 0.386 365s 0.029 0.413 0.774 365s 365s y: [4, 4, 1, 1] 365s -0.237 -0.870 0.386 0.242 365s 0.393 0.324 -1.051 0.978 365s 0.126 0.424 -0.179 -0.126 365s 0.339 0.741 -0.753 0.292 365s testing: mul_mat, [2, 4, 1, 4] = [3, 2, 1, 4] * [3, 4, 1, 4] 365s x0: [3, 2, 1, 4] 365s -0.837 0.071 -0.102 365s -0.745 -0.414 0.952 365s 365s x1: [3, 4, 1, 4] 365s 0.059 0.510 0.823 365s 0.367 -0.608 0.954 365s 0.766 0.215 -0.853 365s -0.991 -0.117 0.631 365s 365s y: [2, 4, 1, 4] 365s -0.097 0.528 365s -0.447 0.886 365s -0.539 -1.472 365s 0.757 1.388 365s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 365s x0: [3, 4, 1, 1] 365s -0.482 0.761 0.812 365s 0.175 -0.587 0.880 365s -0.539 0.115 -0.840 365s -0.227 0.291 -0.543 365s 365s x1: [3, 2, 1, 1] 365s -0.168 0.027 0.413 365s -0.198 -0.645 -0.896 365s 365s y: [4, 2, 1, 1] 365s 0.437 0.318 -0.253 -0.178 365s -1.123 -0.444 0.785 0.344 365s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 365s x0: [4, 1, 1, 1] 365s -0.836 -0.127 0.497 -0.482 365s 365s x1: [4, 3, 1, 1] 365s -0.957 0.428 0.856 0.427 365s -0.538 -0.599 0.984 0.029 365s 0.308 0.864 0.136 0.459 365s 365s y: [1, 3, 1, 1] 365s 0.966 365s 1.001 365s -0.521 365s testing: mul_mat, [4, 4, 1, 4] = [1, 4, 1, 4] * [1, 4, 1, 4] 365s x0: [1, 4, 1, 4] 365s -0.726 365s 0.256 365s -0.541 365s -0.741 365s 365s x1: [1, 4, 1, 4] 365s -0.023 365s 0.060 365s 0.036 365s -0.847 365s 365s y: [4, 4, 1, 4] 365s 0.017 -0.006 0.012 0.017 365s -0.043 0.015 -0.032 -0.044 365s -0.026 0.009 -0.020 -0.027 365s 0.615 -0.217 0.459 0.628 365s test-mul-mat0: iter:159/500 365s testing: mul_mat, [2, 2, 1, 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 365s x0: [4, 2, 1, 1] 365s -0.494 -0.165 0.800 0.761 365s 0.293 -0.941 -0.954 0.060 365s 365s x1: [4, 2, 1, 1] 365s 0.142 -0.171 0.873 -0.653 365s -0.425 -0.322 0.712 -0.223 365s 365s y: [2, 2, 1, 1] 365s 0.160 -0.669 365s 0.664 -0.514 365s testing: mul_mat, [2, 2, 3, 1] = [4, 2, 3, 1] * [4, 2, 3, 1] 365s x0: [4, 2, 3, 1] 365s 0.121 -0.845 0.893 0.180 365s -0.799 -0.047 0.362 0.669 365s 365s x1: [4, 2, 3, 1] 365s -0.819 0.467 -0.715 -0.001 365s 0.052 -0.616 0.314 0.155 365s 365s y: [2, 2, 3, 1] 365s -1.132 0.372 365s 0.835 0.205 365s testing: mul_mat, [2, 2, 3, 1] = [4, 2, 3, 1] * [4, 2, 3, 1] 365s x0: [4, 2, 3, 1] 365s 0.657 -0.736 0.863 -0.982 365s -0.205 0.637 0.366 0.122 365s 365s x1: [4, 2, 3, 1] 365s 0.348 -0.673 0.662 0.113 365s 0.648 -0.944 0.712 -0.231 365s 365s y: [2, 2, 3, 1] 365s 1.185 -0.244 365s 1.963 -0.501 365s testing: mul_mat, [3, 2, 1, 1] = [4, 3, 1, 1] * [4, 2, 1, 1] 365s x0: [4, 3, 1, 1] 365s -0.806 0.831 0.129 -0.517 365s 0.203 -0.431 0.047 0.238 365s 0.447 -0.431 0.339 -0.123 365s 365s x1: [4, 2, 1, 1] 365s 0.567 0.818 0.538 -0.935 365s 0.528 -0.612 -0.669 0.185 365s 365s y: [3, 2, 1, 1] 365s 0.775 -0.435 0.198 365s -1.116 0.384 0.250 365s testing: mul_mat, [2, 4, 3, 1] = [3, 2, 3, 1] * [3, 4, 3, 1] 365s x0: [3, 2, 3, 1] 365s 0.299 -0.082 0.290 365s -0.816 -0.757 -0.629 365s 365s x1: [3, 4, 3, 1] 365s -0.952 0.034 -0.635 365s -0.974 0.389 0.164 365s -0.916 0.539 0.635 365s -0.056 0.106 0.452 365s 365s y: [2, 4, 3, 1] 365s -0.471 1.150 365s -0.275 0.398 365s -0.134 -0.060 365s 0.106 -0.319 365s testing: mul_mat, [1, 3, 3, 1] = [2, 1, 3, 1] * [2, 3, 3, 1] 365s x0: [2, 1, 3, 1] 365s 0.657 0.867 365s 365s x1: [2, 3, 3, 1] 365s 0.627 0.845 365s 0.806 -0.496 365s -0.015 -0.972 365s 365s y: [1, 3, 3, 1] 365s 1.144 365s 0.100 365s -0.852 365s test-mul-mat0: iter:160/500 365s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 365s x0: [3, 4, 1, 1] 365s 0.330 -0.407 0.507 365s 0.358 -0.148 0.681 365s -0.356 0.230 0.070 365s -0.425 -0.108 0.758 365s 365s x1: [3, 1, 1, 1] 365s -0.323 0.524 -0.911 365s 365s y: [4, 1, 1, 1] 365s -0.782 -0.814 0.172 -0.610 365s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 365s x0: [3, 2, 1, 1] 365s 0.246 -0.565 0.959 365s -0.230 -0.477 -0.519 365s 365s x1: [3, 4, 1, 1] 365s -0.310 0.473 -0.311 365s -0.904 -0.576 -0.654 365s 0.962 -0.274 0.600 365s 0.792 -0.456 0.973 365s 365s y: [2, 4, 1, 1] 365s -0.642 0.008 365s -0.524 0.822 365s 0.967 -0.402 365s 1.386 -0.469 365s testing: mul_mat, [3, 2, 1, 1] = [3, 3, 1, 1] * [3, 2, 1, 1] 365s x0: [3, 3, 1, 1] 365s 0.198 -0.260 -0.324 365s -0.910 -0.503 0.366 365s 0.563 0.186 0.462 365s 365s x1: [3, 2, 1, 1] 365s -0.900 0.116 0.988 365s 0.458 0.968 0.670 365s 365s y: [3, 2, 1, 1] 365s -0.529 1.122 -0.029 365s -0.378 -ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png0.658 0.747 365s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 365s x0: [3, 3, 1, 1] 365s 0.539 -0.937 0.179 365s -0.544 -0.062 -0.074 365s 0.272 0.372 -0.170 365s 365s x1: [3, 3, 1, 1] 365s 0.987 0.532 0.424 365s -0.288 0.131 0.216 365s 0.256 0.105 -0.503 365s 365s y: [3, 3, 1, 1] 365s 0.110 -0.601 0.394 365s -0.240 0.132 -0.066 365s -0.050 -0.109 0.194 365s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 365s x0: [3, 4, 1, 1] 365s 0.019 0.859 0.954 365s -0.565 -0.302 -0.213 365s 0.986 0.682 0.571 365s -0.449 0.074 -0.544 365s 365s x1: [3, 3, 1, 1] 365s 0.147 -0.404 -0.069 365s -0.655 0.336 0.607 365s -0.564 0.833 -0.027 365s 365s y: [4, 3, 1, 1] 365s -0.410 0.054 -0.170 -0.059 365s 0.855 0.139 -0.069 -0.011 365s 0.679 0.073 -0.003 0.330 365s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 365s x0: [4, 2, 1, 1] 365s 0.823 0.496 0.442 -0.325 365s -0.531 0.656 0.495 0.877 365s 365s x1: [4, 3, 1, 1] 365s -0.674 -0.972 0.728 -0.611 365s -0.035 0.100 0.568 0.891 365s 0.930 -0.285 -0.513 -0.139 365s 365s y: [2, 3, 1, 1] 365s -0.516 -0.454 365s -0.018 1.147 365s 0.442 -1.057 365s test-mul-mat0: iter:161/500 365s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 365s x0: [4, 1, 1, 1] 365s 0.265 0.348 -0.308 0.752 365s 365s x1: [4, 3, 1, 1] 365s -0.092 -0.191 -0.867 0.694 365s -0.619 -0.411 -0.980 -0.592 365s -0.682 -0.591 0.373 0.418 365s 365s y: [1, 3, 1, 1] 365s 0.698 365s -0.450 365s -0.187 365s testing: mul_mat, [3, 1, 3, 1] = [4, 3, 3, 1] * [4, 1, 3, 1] 365s x0: [4, 3, 3, 1] 365s -0.595 -0.963 -0.458 -0.462 365s 0.731 -0.078 0.127 0.752 365s 0.331 0.445 -0.839 -0.296 365s 365s x1: [4, 1, 3, 1] 365s -0.790 0.753 0.574 -0.322 365s 365s y: [3, 1, 3, 1] 365s -0.369 -0.806 -0.313 365s testing: mul_mat, [3, 3, 3, 4] = [4, 3, 3, 4] * [4, 3, 3, 4] 365s x0: [4, 3, 3, 4] 365s 0.430 -0.032 -0.746 -0.241 365s -0.069 0.852 0.598 0.186 365s 0.675 -0.002 -0.072 0.134 365s 365s x1: [4, 3, 3, 4] 365s -0.588 -0.128 0.078 0.743 365s -0.683 0.240 -0.553 0.180 365s 0.378 -0.584 0.391 -0.791 365s 365s y: [3, 3, 3, 4] 365s -0.486 0.117 -0.302 365s 0.068 -0.046 -0.398 365s 0.080 -0.437 0.122 365s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 365s x0: [4, 3, 1, 1] 365s 0.254 -0.853 -0.606 0.930 365s 0.590 0.965 0.190 -0.589 365s -0.096 -0.450 0.484 0.551 365s 365s x1: [4, 3, 1, 1] 365s -0.908 0.708 -0.333 -0.574 365s -0.814 0.428 -0.861 -0.283 365s 0.546 0.564 0.393 -0.344 365s 365s y: [3, 3, 1, 1] 365s -1.167 0.422 -0.709 365s -0.313 -0.064 -0.687 365s -0.900 1.143 -0.305 365s testing: mul_mat, [4, 4, 3, 1] = [3, 4, 3, 1] * [3, 4, 3, 1] 365s x0: [3, 4, 3, 1] 365s 0.221 -0.696 0.480 365s 0.377 -0.167 0.031 365s 0.731 -0.328 -0.014 365s 0.877 0.949 -0.215 365s 365s x1: [3, 4, 3, 1] 365s -0.910 0.142 -0.051 365s -0.861 0.033 0.007 365s -0.770 -0.259 0.674 365s -0.343 -0.073 0.102 365s 365s y: [4, 4, 3, 1] 365s -0.324 -0.368 -0.711 -0.652 365s -0.210 -0.330 -0.640 -0.726 365s 0.334 -0.226 -0.487 -1.066 365s 0.024 -0.114 -0.229 -0.392 365s testing: mul_mat, [2, 3, 3, 4] = [4, 2, 3, 4] * [4, 3, 3, 4] 365s x0: [4, 2, 3, 4] 365s -0.445 -0.299 -0.502 -0.821 365s 0.585 -0.677 0.689 -0.491 365s 365s x1: [4, 3, 3, 4] 365s -0.453 -0.080 -0.085 -0.972 365s 0.952 0.900 -0.188 -0.406 365s 0.178 -0.509 0.753 -0.107 365s 365s y: [2, 3, 3, 4] 365s 1.066 0.207 365s -0.266 0.017 365s -0.217 1.020 365s test-mul-mat0: iter:162/500 365s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 2, 1, 1] 365s 0.232 0.880 365s 0.533 0.677 365s 365s x1: [2, 2, 1, 1] 365s -0.259 -0.373 365s 0.096 0.831 365s 365s y: [2, 2, 1, 1] 365s -0.388 -0.390 365s 0.754 0.614 365s testing: mul_mat, [1, 2, 3, 1] = [2, 1, 3, 1] * [2, 2, 3, 1] 365s x0: [2, 1, 3, 1] 365s 0.082 -0.876 365s 365s x1: [2, 2, 3, 1] 365s -0.922 0.036 365s 0.340 -0.842 365s 365s y: [1, 2, 3, 1] 365s -0.107 365s 0.765 365s testing: mul_mat, [4, 1, 3, 2] = [2, 4, 3, 2] * [2, 1, 3, 2] 365s x0: [2, 4, 3, 2] 365s -0.617 -0.273 365s -0.403 -0.940 365s 0.041 0.800 365s 0.514 0.288 365s 365s x1: [2, 1, 3, 2] 365s 0.908 -0.898 365s 365s y: [4, 1, 3, 2] 365s -0.316 0.478 -0.681 0.208 365s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 365s x0: [2, 4, 1, 1] 365s 0.764 0.664 365s -0.609 0.634 365s 0.495 0.567 365s 0.302 0.534 365s 365s x1: [2, 4, 1, 1] 365s 0.214 0.090 365s -0.266 0.175 365s 0.083 0.329 365s 0.614 0.557 365s 365s y: [4, 4, 1, 1] 365s 0.223 -0.073 0.157 0.113 365s -0.087 0.273 -0.033 0.013 365s 0.283 0.158 0.228 0.201 365s 0.840 -0.021 0.620 0.483 365s testing: mul_mat, [1, 2, 3, 1] = [4, 1, 3, 1] * [4, 2, 3, 1] 365s x0: [4, 1, 3, 1] 365s -0.476 0.164 -0.142 -0.842 365s 365s x1: [4, 2, 3, 1] 365s 0.833 -0.127 0.204 0.546 365s 0.468 0.090 0.016 0.255 365s 365s y: [1, 2, 3, 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s 1] 365s -0.906 365s -0.425 365s testing: mul_mat, [1, 4, 3, 2] = [1, 1, 3, 2] * [1, 4, 3, 2] 365s x0: [1, 1, 3, 2] 365s -0.134 365s 365s x1: [1, 4, 3, 2] 365s -0.448 365s -0.673 365s -0.854 365s 0.878 365s 365s y: [1, 4, 3, 2] 365s 0.060 365s 0.090 365s 0.115 365s -0.118 365s test-mul-mat0: iter:163/500 365s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 365s x0: [3, 2, 1, 1] 365s 0.822 -0.107 0.367 365s 0.856 0.248 -0.470 365s 365s x1: [3, 1, 1, 1] 365s 0.580 -0.728 0.782 365s 365s y: [2, 1, 1, 1] 365s 0.841 -0.052 365s testing: mul_mat, [2, 2, 2, 1] = [3, 2, 2, 1] * [3, 2, 2, 1] 365s x0: [3, 2, 2, 1] 365s -0.238 0.583 0.090 365s -0.354 -0.462 0.984 365s 365s x1: [3, 2, 2, 1] 365s 0.987 -0.594 -0.277 365s 0.981 -0.244 0.440 365s 365s y: [2, 2, 2, 1] 365s -0.606 -0.347 365s -0.336 0.198 365s testing: mul_mat, [4, 2, 2, 3] = [3, 4, 2, 3] * [3, 2, 2, 3] 365s x0: [3, 4, 2, 3] 365s 0.025 -0.701 -0.968 365s 0.618 -0.245 0.149 365s 0.350 0.176 -0.419 365s -0.889 0.161 0.194 365s 365s x1: [3, 2, 2, 3] 365s -0.230 -0.954 -0.577 365s -0.375 0.294 -0.046 365s 365s y: [4, 2, 2, 3] 365s 1.221 0.005 -0.007 -0.061 365s -0.171 -0.311 -0.060 0.372 365s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 365s x0: [3, 3, 1, 1] 365s -0.867 0.898 0.055 365s 0.655 0.129 0.382 365s 0.454 0.982 -0.876 365s 365s x1: [3, 4, 1, 1] 365s 0.701 0.301 -0.259 365s -0.607 -0.573 -0.403 365s -0.611 0.679 -0.421 365s 0.758 0.161 -0.395 365s 365s y: [3, 4, 1, 1] 365s -0.352 0.398 0.841 365s -0.010 -0.625 -0.485 365s 1.116 -0.474 0.758 365s -0.535 0.366 0.848 365s testing: mul_mat, [1, 3, 2, 1] = [3, 1, 2, 1] * [3, 3, 2, 1] 365s x0: [3, 1, 2, 1] 365s 0.604 -0.369 0.874 365s 365s x1: [3, 3, 2, 1] 365s 0.904 -0.554 0.499 365s 0.199 0.471 0.457 365s -0.884 0.036 -0.891 365s 365s y: [1, 3, 2, 1] 365s 1.187 365s 0.346 365s -1.327 365s testing: mul_mat, [3, 3, 2, 3] = [1, 3, 2, 3] * [1, 3, 2, 3] 365s x0: [1, 3, 2, 3] 365s -0.148 365s -0.204 365s -0.466 365s 365s x1: [1, 3, 2, 3] 365s 0.635 365s 0.414 365s 0.310 365s 365s y: [3, 3, 2, 3] 365s -0.094 -0.129 -0.296 365s -0.061 -0.084 -0.193 365s -0.046 -0.063 -0.144 365s test-mul-mat0: iter:164/500 365s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 365s x0: [3, 4, 1, 1] 365s -0.339 -0.019 0.741 365s -0.990 -0.084 -0.408 365s -0.194 0.450 0.887 365s 0.628 -0.658 -0.213 365s 365s x1: [3, 1, 1, 1] 365s 0.904 -0.838 0.256 365s 365s y: [4, 1, 1, 1] 365s -0.101 -0.929 -0.326 1.065 365s testing: mul_mat, [3, 4, 4, 1] = [3, 3, 4, 1] * [3, 4, 4, 1] 365s x0: [3, 3, 4, 1] 365s 0.067 0.488 0.823 365s 0.475 0.535 -0.466 365s -0.524 -0.612 -0.305 365s 365s x1: [3, 4, 4, 1] 365s 0.081 0.107 0.423 365s -0.181 0.201 0.694 365s 0.972 -0.395 -0.345 365s 0.920 0.443 -0.790 365s 365s y: [3, 4, 4, 1] 365s 0.406 -0.101 -0.237 365s 0.658 -0.302 -0.240 365s -0.412 0.411 -0.162 365s -0.373 1.043 -0.512 365s testing: mul_mat, [2, 3, 4, 2] = [3, 2, 4, 2] * [3, 3, 4, 2] 365s x0: [3, 2, 4, 2] 365s 0.548 -0.843 0.354 365s -0.103 -0.289 -0.948 365s 365s x1: [3, 3, 4, 2] 365s 0.638 -0.762 -0.169 365s -0.667 0.603 0.701 365s 0.531 -0.358 -0.868 365s 365s y: [2, 3, 4, 2] 365s 0.932 0.314 365s -0.625 -0.769 365s 0.286 0.871 365s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 365s x0: [3, 4, 1, 1] 365s 0.511 0.874 0.417 365s 0.500 0.380 -0.162 365s 0.501 0.912 -0.386 365s -0.660 0.796 -0.835 365s 365s x1: [3, 2, 1, 1] 365s 0.389 0.261 0.386 365s 0.935 -0.475 0.436 365s 365s y: [4, 2, 1, 1] 365s 0.588 0.231 0.284 -0.372 365s 0.244 0.217 -0.133 -1.359 365s testing: mul_mat, [3, 3, 4, 1] = [4, 3, 4, 1] * [4, 3, 4, 1] 365s x0: [4, 3, 4, 1] 365s 0.237 0.727 0.597 -0.116 365s -0.498 -0.130 0.151 -0.730 365s -0.677 -0.472 -0.471 0.144 365s 365s x1: [4, 3, 4, 1] 365s 0.816 0.226 0.067 0.669 365s -0.100 -0.576 -0.406 -0.509 365s -0.184 0.446 -0.268 -0.992 365s 365s y: [3, 3, 4, 1] 365s 0.320 -0.914 -0.595 365s -0.625 0.435 0.458 365s 0.235 0.718 -0.102 365s testing: mul_mat, [4, 4, 4, 2] = [3, 4, 4, 2] * [3, 4, 4, 2] 365s x0: [3, 4, 4, 2] 365s -0.877 -0.829 -0.579 365s 0.660 -0.430 0.702 365s 0.748 0.111 -0.357 365s -0.520 -0.142 -0.006 365s 365s x1: [3, 4, 4, 2] 365s 0.722 0.813 -0.836 365s -0.319 0.375 0.812 365s 0.604 -0.294 -0.062 365s -0.516 0.199 -0.687 365s 365s y: [4, 4, 4, 2] 365s -0.823 -0.460 0.929 -0.486 365s -0.501 0.198 -0.487 0.108 365s -0.250 0.482 0.441 -0.272 365s 0.686 -0.909 -0.119 0.244 365s test-mul-mat0: iter:165/500 365s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 365s x0: [2, 2, 1, 1] 365s 0.446 0.410 365s -0.186 -0.554 365s 365s x1: [2, 4, 1, 1] 365s 0.980 -0.613 365s -0.678 -0.662 365s 0.590 0.255 365s -0.336 0.315 365s 365s y: [2, 4, 1, 1] 365s 0.185 0.158 365s -0.574 0.492 365s 0.368 -0.251 365s -0.021 -0.112 365s testing: mul_mat, [2, 2, 3, 1] = [2, 2, 3, 1] * [2, 2, 3, 1] 365s x0: [2, 2, 3,ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s 1] 365s -0.499 -0.400 365s -0.271 0.500 365s 365s x1: [2, 2, 3, 1] 365s -0.257 -0.801 365s 0.435 -0.143 365s 365s y: [2, 2, 3, 1] 365s 0.449 -0.331 365s -0.159 -0.190 365s testing: mul_mat, [3, 2, 3, 1] = [2, 3, 3, 1] * [2, 2, 3, 1] 365s x0: [2, 3, 3, 1] 365s 0.915 -0.846 365s -0.840 0.984 365s -0.056 -0.912 365s 365s x1: [2, 2, 3, 1] 365s -0.208 0.224 365s 0.156 -0.798 365s 365s y: [3, 2, 3, 1] 365s -0.380 0.396 -0.193 365s 0.818 -0.917 0.719 365s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 365s x0: [2, 1, 1, 1] 365s 0.009 -0.921 365s 365s x1: [2, 3, 1, 1] 365s -0.858 -0.267 365s 0.439 -0.656 365s -0.228 0.042 365s 365s y: [1, 3, 1, 1] 365s 0.238 365s 0.608 365s -0.040 365s testing: mul_mat, [3, 2, 3, 1] = [1, 3, 3, 1] * [1, 2, 3, 1] 365s x0: [1, 3, 3, 1] 365s 0.019 365s -0.676 365s 0.725 365s 365s x1: [1, 2, 3, 1] 365s 0.091 365s 0.120 365s 365s y: [3, 2, 3, 1] 365s 0.002 -0.061 0.066 365s 0.002 -0.081 0.087 365s testing: mul_mat, [2, 1, 3, 1] = [3, 2, 3, 1] * [3, 1, 3, 1] 365s x0: [3, 2, 3, 1] 365s -0.642 -0.511 0.481 365s 0.402 0.368 0.579 365s 365s x1: [3, 1, 3, 1] 365s 0.364 0.344 -0.361 365s 365s y: [2, 1, 3, 1] 365s -0.583 0.063 365s test-mul-mat0: iter:166/500 365s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 365s x0: [4, 4, 1, 1] 365s -0.479 -0.654 -0.840 0.888 365s 0.827 0.739 -0.624 -0.467 365s -0.205 0.900 -0.381 0.487 365s -0.557 0.029 0.904 -0.510 365s 365s x1: [4, 2, 1, 1] 365s 0.722 0.664 0.616 -0.424 365s -0.415 -0.529 -0.838 -0.055 365s 365s y: [4, 2, 1, 1] 365s -1.674 0.902 0.008 0.389 365s 1.200 -0.186 -0.098 -0.513 365s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 365s x0: [4, 2, 1, 1] 365s -0.152 -0.447 0.253 -0.413 365s -0.071 0.786 0.382 -0.171 365s 365s x1: [4, 4, 1, 1] 365s 0.306 -0.138 -0.971 0.684 365s 0.887 -0.069 0.406 0.550 365s -0.453 0.982 -0.865 0.018 365s -0.856 0.080 -0.311 -0.335 365s 365s y: [2, 4, 1, 1] 365s -0.513 -0.618 365s -0.228 -0.056 365s -0.596 0.471 365s 0.154 0.062 365s testing: mul_mat, [4, 2, 1, 4] = [4, 4, 1, 4] * [4, 2, 1, 4] 365s x0: [4, 4, 1, 4] 365s -0.795 -0.409 0.434 0.967 365s 0.331 0.069 0.758 0.755 365s -0.410 -0.520 0.585 0.661 365s 0.749 0.398 -0.132 -0.965 365s 365s x1: [4, 2, 1, 4] 365s -0.595 -0.131 0.272 0.434 365s -0.227 0.762 -0.260 0.635 365s 365s y: [4, 2, 1, 4] 365s 1.064 0.328 0.758 -0.952 365s 0.369 0.259 -0.035 -0.445 365s testing: mul_mat, [4, 4, 1, 1] = [4, 4, 1, 1] * [4, 4, 1, 1] 365s x0: [4, 4, 1, 1] 365s 0.773 0.097 -0.915 0.795 365s 0.926 0.577 0.091 -0.833 365s -0.892 -0.223 -0.656 0.632 365s -0.351 -0.148 -0.708 -0.145 365s 365s x1: [4, 4, 1, 1] 365s 0.341 0.060 0.482 0.543 365s 0.175 -0.103 -0.665 0.770 365s 0.897 -0.212 -0.886 -0.706 365s -0.890 -0.507 -0.954 -0.415 365s 365s y: [4, 4, 1, 1] 365s 0.259 -0.058 -0.290 -0.549 365s 1.346 -0.600 0.790 0.313 365s 0.922 1.215 -0.619 0.446 365s -0.194 -0.857 1.270 1.124 365s testing: mul_mat, [4, 4, 1, 1] = [4, 4, 1, 1] * [4, 4, 1, 1] 365s x0: [4, 4, 1, 1] 365s -0.602 0.981 0.212 0.984 365s 0.204 -0.133 -0.648 0.723 365s -0.984 -0.940 -0.133 0.027 365s 0.975 0.073 0.379 0.159 365s 365s x1: [4, 4, 1, 1] 365s 0.649 0.175 -0.970 -0.454 365s 0.510 0.800 -0.558 -0.701 365s 0.914 -0.263 -0.592 -0.593 365s -0.217 -0.007 -0.710 -0.445 365s 365s y: [4, 4, 1, 1] 365s -0.871 0.410 -0.687 0.206 365s -0.330 -0.147 -1.199 0.233 365s -1.517 0.177 -0.590 0.554 365s -0.464 0.095 0.302 -0.552 365s testing: mul_mat, [4, 4, 1, 4] = [4, 4, 1, 4] * [4, 4, 1, 4] 365s x0: [4, 4, 1, 4] 365s 0.759 0.043 -0.505 0.110 365s -0.087 0.018 -0.200 0.511 365s 0.911 0.053 -0.538 0.329 365s 0.469 -0.293 0.260 0.956 365s 365s x1: [4, 4, 1, 4] 365s 0.753 0.573 -0.330 0.553 365s -0.985 -0.032 0.467 -0.248 365s 0.377 0.874 0.535 -0.630 365s -0.836 -0.910 0.288 -0.438 365s 365s y: [4, 4, 1, 4] 365s 0.824 0.293 1.075 0.628 365s -1.012 -0.135 -1.231 -0.568 365s -0.016 -0.446 -0.106 -0.543 365s -0.867 -0.225 -1.108 -0.470 365s test-mul-mat0: iter:167/500 365s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 365s x0: [2, 1, 1, 1] 365s 0.827 0.005 365s 365s x1: [2, 3, 1, 1] 365s -0.908 0.494 365s -0.129 0.909 365s 0.905 -0.726 365s 365s y: [1, 3, 1, 1] 365s -0.748 365s -0.102 365s 0.745 365s testing: mul_mat, [3, 1, 3, 1] = [2, 3, 3, 1] * [2, 1, 3, 1] 365s x0: [2, 3, 3, 1] 365s -0.497 0.863 365s 0.277 -0.619 365s -0.320 0.011 365s 365s x1: [2, 1, 3, 1] 365s 0.720 0.583 365s 365s y: [3, 1, 3, 1] 365s 0.145 -0.162 -0.224 365s testing: mul_mat, [3, 3, 3, 2] = [2, 3, 3, 2] * [2, 3, 3, 2] 365s x0: [2, 3, 3, 2] 365s -0.351 -0.983 365s -0.901 -0.611 365s 0.302 0.621 365s 365s x1: [2, 3, 3, 2] 365s 0.180 -0.970 365s -0.582 -0.984 365s 0.857 0.423 365s 365s y: [3, 3, 3, 2] 365s 0.891 0.431 -0.548 365s 1.171 1.125 -0.787 365s -0.716 -1.030 0.521 365s testing: mggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ul_mat, [3, 3, 1, 1] = [2, 3, 1, 1] * [2, 3, 1, 1] 365s x0: [2, 3, 1, 1] 365s 0.640 -0.373 365s -0.496 -0.802 365s 0.055 -0.093 365s 365s x1: [2, 3, 1, 1] 365s 0.063 -0.084 365s -0.181 -0.932 365s -0.776 -0.921 365s 365s y: [3, 3, 1, 1] 365s 0.072 0.036 0.011 365s 0.232 0.838 0.076 365s -0.153 1.123 0.043 365s testing: mul_mat, [4, 2, 3, 1] = [3, 4, 3, 1] * [3, 2, 3, 1] 365s x0: [3, 4, 3, 1] 365s -0.444 0.725 -0.682 365s 0.646 0.264 0.046 365s -0.874 -0.581 -0.573 365s -0.221 -0.771 0.226 365s 365s x1: [3, 2, 3, 1] 365s 0.729 -0.662 0.326 365s -0.782 -0.804 -0.564 365s 365s y: [4, 2, 3, 1] 365s -1.026 0.311 -0.439 0.423 365s 0.149 -0.743 1.473 0.665 365s testing: mul_mat, [1, 3, 3, 2] = [4, 1, 3, 2] * [4, 3, 3, 2] 365s x0: [4, 1, 3, 2] 365s 0.633 0.223 0.371 -0.937 365s 365s x1: [4, 3, 3, 2] 365s 0.440 -0.142 0.455 0.758 365s 0.904 0.882 -0.016 -0.322 365s -0.352 0.536 0.670 0.610 365s 365s y: [1, 3, 3, 2] 365s -0.295 365s 1.064 365s -0.426 365s test-mul-mat0: iter:168/500 365s testing: mul_mat, [3, 1, 1, 1] = [4, 3, 1, 1] * [4, 1, 1, 1] 365s x0: [4, 3, 1, 1] 365s -0.672 -0.999 -0.930 -0.230 365s -0.454 -0.855 0.621 0.086 365s -0.768 -0.788 -0.759 -0.080 365s 365s x1: [4, 1, 1, 1] 365s -0.696 -0.948 -0.043 0.937 365s 365s y: [3, 1, 1, 1] 365s 1.240 1.181 1.239 365s testing: mul_mat, [4, 3, 2, 1] = [4, 4, 2, 1] * [4, 3, 2, 1] 365s x0: [4, 4, 2, 1] 365s -0.731 0.289 0.215 -0.519 365s 0.530 -0.866 -0.599 0.818 365s -0.568 -0.297 0.761 -0.889 365s 0.052 -0.458 0.945 0.870 365s 365s x1: [4, 3, 2, 1] 365s 0.920 -0.712 -0.703 -0.698 365s 0.943 0.679 -0.650 -0.219 365s 0.834 -0.182 -0.721 0.415 365s 365s y: [4, 3, 2, 1] 365s -0.667 0.954 -0.225 -0.897 365s -0.519 0.122 -1.038 -1.067 365s -1.033 1.371 -1.338 -0.194 365s testing: mul_mat, [3, 4, 2, 3] = [4, 3, 2, 3] * [4, 4, 2, 3] 365s x0: [4, 3, 2, 3] 365s 0.037 -0.748 0.672 -0.492 365s -0.248 -0.907 0.546 -0.326 365s -0.047 0.383 -0.719 -0.395 365s 365s x1: [4, 4, 2, 3] 365s 0.883 -0.391 -0.763 0.413 365s -0.257 -0.362 0.230 0.174 365s 0.341 -0.009 0.285 -0.607 365s 0.533 0.230 -0.737 -0.647 365s 365s y: [3, 4, 2, 3] 365s -0.391 -0.415 0.194 365s 0.330 0.461 -0.361 365s 0.510 0.277 0.015 365s -0.329 -0.532 0.848 365s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 365s x0: [4, 4, 1, 1] 365s 0.927 0.332 -0.503 0.810 365s 0.561 0.931 0.369 0.677 365s 0.839 -0.340 0.507 -0.882 365s -0.070 -0.910 -0.956 -0.344 365s 365s x1: [4, 3, 1, 1] 365s 0.532 -0.813 -0.498 0.368 365s 0.314 0.909 -0.138 -0.244 365s 0.950 -0.886 -0.130 -0.381 365s 365s y: [4, 3, 1, 1] 365s 0.773 -0.393 0.146 1.052 365s 0.466 0.807 0.099 -0.634 365s 0.343 -0.597 1.368 0.994 365s testing: mul_mat, [3, 4, 2, 1] = [4, 3, 2, 1] * [4, 4, 2, 1] 365s x0: [4, 3, 2, 1] 365s -0.780 0.777 -0.804 -0.271 365s -0.458 0.595 0.405 -0.915 365s -0.150 0.363 0.826 -0.091 365s 365s x1: [4, 4, 2, 1] 365s 0.553 -0.898 -0.811 0.739 365s -0.396 0.557 0.054 -0.487 365s -0.581 0.810 -0.537 -0.467 365s -0.320 0.082 0.716 -0.393 365s 365s y: [3, 4, 2, 1] 365s -0.677 -1.793 -1.147 365s 0.830 0.980 0.350 365s 1.641 0.958 -0.020 365s -0.156 0.845 0.705 365s testing: mul_mat, [3, 4, 2, 3] = [3, 3, 2, 3] * [3, 4, 2, 3] 365s x0: [3, 3, 2, 3] 365s 0.649 0.831 -0.141 365s 0.162 -0.366 -0.064 365s 0.321 0.202 0.849 365s 365s x1: [3, 4, 2, 3] 365s 0.492 0.367 -0.584 365s 0.636 0.399 0.219 365s 0.882 0.619 0.760 365s -0.269 0.396 0.355 365s 365s y: [3, 4, 2, 3] 365s 0.707 -0.017 -0.263 365s 0.713 -0.057 0.471 365s 0.980 -0.132 1.054 365s 0.105 -0.211 0.296 365s test-mul-mat0: iter:169/500 365s testing: mul_mat, [1, 1, 1, 1] = [2, 1, 1, 1] * [2, 1, 1, 1] 365s x0: [2, 1, 1, 1] 365s -0.748 -0.605 365s 365s x1: [2, 1, 1, 1] 365s -0.796 0.416 365s 365s y: [1, 1, 1, 1] 365s 0.343 365s testing: mul_mat, [4, 1, 4, 1] = [2, 4, 4, 1] * [2, 1, 4, 1] 365s x0: [2, 4, 4, 1] 365s 0.010 -0.703 365s -0.177 0.657 365s -0.098 0.968 365s -0.383 0.807 365s 365s x1: [2, 1, 4, 1] 365s 0.893 0.621 365s 365s y: [4, 1, 4, 1] 365s -0.428 0.251 0.514 0.160 365s testing: mul_mat, [3, 4, 4, 2] = [2, 3, 4, 2] * [2, 4, 4, 2] 365s x0: [2, 3, 4, 2] 365s 0.910 -0.919 365s -0.161 -0.895 365s -0.509 -0.162 365s 365s x1: [2, 4, 4, 2] 365s -0.362 -0.589 365s 0.834 0.540 365s -0.621 -0.549 365s 0.347 -0.504 365s 365s y: [3, 4, 4, 2] 365s 0.212 0.586 0.279 365s 0.262 -0.618 -0.512 365s -0.061 0.591 0.405 365s 0.780 0.395 -0.095 365s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 365s x0: [2, 1, 1, 1] 365s 0.634 0.054 365s 365s x1: [2, 3, 1, 1] 365s -1.000 -0.340 365s 0.377 0.732 365s -0.310 0.338 365s 365s y: [1, 3, 1, 1] 365s -0.652 365s 0.278 365s -0.179 365s testing: mul_mat, [4, 2, 4, 1] = [1, 4, 4, 1] * [1, 2, 4, 1] 365s x0: [1, 4, 4, 1] 365s 0.584 365s 0.847 365s 0.345 365s 0.125 365s 365s x1: [1, 2, 4, 1] 365s -0.165ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s 365s -0.905 365s 365s y: [4, 2, 4, 1] 365s -0.097 -0.140 -0.057 -0.021 365s -0.529 -0.767 -0.313 -0.113 365s testing: mul_mat, [2, 1, 4, 2] = [4, 2, 4, 2] * [4, 1, 4, 2] 365s x0: [4, 2, 4, 2] 365s 0.191 0.231 -0.469 0.399 365s -0.292 0.380 -0.340 0.222 365s 365s x1: [4, 1, 4, 2] 365s -0.980 0.700 0.135 -0.011 365s 365s y: [2, 1, 4, 2] 365s -0.093 0.503 365s test-mul-mat0: iter:170/500 365s testing: mul_mat, [2, 2, 1, 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 365s x0: [4, 2, 1, 1] 365s -0.631 -0.975 -0.982 -0.788 365s 0.351 -0.762 0.471 -0.036 365s 365s x1: [4, 2, 1, 1] 365s -0.695 -0.408 0.049 -0.187 365s -0.204 -0.480 0.104 0.147 365s 365s y: [2, 2, 1, 1] 365s 0.936 0.097 365s 0.379 0.338 365s testing: mul_mat, [1, 2, 4, 1] = [4, 1, 4, 1] * [4, 2, 4, 1] 365s x0: [4, 1, 4, 1] 365s -0.230 -0.304 0.775 -0.342 365s 365s x1: [4, 2, 4, 1] 365s 0.889 0.332 0.124 -0.329 365s 0.991 0.888 -0.697 0.344 365s 365s y: [1, 2, 4, 1] 365s -0.096 365s -1.155 365s testing: mul_mat, [4, 1, 4, 2] = [4, 4, 4, 2] * [4, 1, 4, 2] 365s x0: [4, 4, 4, 2] 365s 0.845 -0.257 0.655 -0.929 365s -0.830 0.275 0.277 -0.722 365s 0.031 -0.064 -0.967 -0.187 365s -0.621 -0.443 0.291 0.596 365s 365s x1: [4, 1, 4, 2] 365s -0.212 0.839 0.457 -0.905 365s 365s y: [4, 1, 4, 2] 365s 0.746 1.187 -0.334 -0.647 365s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 365s x0: [4, 2, 1, 1] 365s -0.064 0.503 -0.474 -0.497 365s -0.024 0.746 -1.000 -0.447 365s 365s x1: [4, 4, 1, 1] 365s -0.988 -0.177 0.712 0.347 365s 0.643 -0.633 -0.139 0.707 365s 0.026 -0.187 0.439 0.105 365s 0.200 -0.965 0.954 -0.952 365s 365s y: [2, 4, 1, 1] 365s -0.536 -0.975 365s -0.645 -0.665 365s -0.356 -0.626 365s -0.479 -1.254 365s testing: mul_mat, [3, 4, 4, 1] = [2, 3, 4, 1] * [2, 4, 4, 1] 365s x0: [2, 3, 4, 1] 365s -0.323 0.987 365s 0.765 -0.949 365s -0.849 -0.225 365s 365s x1: [2, 4, 4, 1] 365s 0.116 0.690 365s -0.932 -0.726 365s 0.801 0.310 365s -0.714 -0.376 365s 365s y: [3, 4, 4, 1] 365s 0.643 -0.565 -0.253 365s -0.416 -0.024 0.954 365s 0.048 0.319 -0.750 365s -0.141 -0.190 0.690 365s testing: mul_mat, [1, 2, 4, 2] = [3, 1, 4, 2] * [3, 2, 4, 2] 365s x0: [3, 1, 4, 2] 365s -0.612 0.341 0.945 365s 365s x1: [3, 2, 4, 2] 365s -0.622 -0.866 0.502 365s 0.038 -0.257 -0.007 365s 365s y: [1, 2, 4, 2] 365s 0.560 365s -0.118 365s test-mul-mat0: iter:171/500 365s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 4, 1, 1] 365s -0.699 -0.744 365s -0.103 -0.607 365s -0.863 0.980 365s 0.178 0.455 365s 365s x1: [2, 2, 1, 1] 365s -0.327 -0.751 365s -0.644 0.061 365s 365s y: [4, 2, 1, 1] 365s 0.787 0.489 -0.454 -0.400 365s 0.404 0.029 0.616 -0.086 365s testing: mul_mat, [2, 4, 2, 1] = [2, 2, 2, 1] * [2, 4, 2, 1] 365s x0: [2, 2, 2, 1] 365s 0.124 -0.612 365s -0.103 0.480 365s 365s x1: [2, 4, 2, 1] 365s 0.059 -0.268 365s 0.337 -0.487 365s 0.189 0.284 365s 0.380 0.293 365s 365s y: [2, 4, 2, 1] 365s 0.172 -0.135 365s 0.340 -0.268 365s -0.150 0.117 365s -0.132 0.102 365s testing: mul_mat, [2, 2, 2, 2] = [2, 2, 2, 2] * [2, 2, 2, 2] 365s x0: [2, 2, 2, 2] 365s 0.319 -0.584 365s 0.139 0.878 365s 365s x1: [2, 2, 2, 2] 365s 0.384 -0.825 365s 0.842 0.364 365s 365s y: [2, 2, 2, 2] 365s 0.604 -0.671 365s 0.057 0.436 365s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 2, 1, 1] 365s 0.875 0.280 365s 0.448 0.148 365s 365s x1: [2, 2, 1, 1] 365s 0.516 0.920 365s 0.791 -0.186 365s 365s y: [2, 2, 1, 1] 365s 0.709 0.367 365s 0.640 0.327 365s testing: mul_mat, [4, 2, 2, 1] = [2, 4, 2, 1] * [2, 2, 2, 1] 365s x0: [2, 4, 2, 1] 365s -0.228 -0.974 365s 0.507 -0.647 365s 0.668 -0.744 365s 0.774 0.994 365s 365s x1: [2, 2, 2, 1] 365s 0.367 0.597 365s 0.713 0.575 365s 365s y: [4, 2, 2, 1] 365s -0.665 -0.200 -0.199 0.877 365s -0.723 -0.011 0.049 1.124 365s testing: mul_mat, [1, 2, 2, 2] = [4, 1, 2, 2] * [4, 2, 2, 2] 365s x0: [4, 1, 2, 2] 365s -0.064 0.379 0.702 -0.292 365s 365s x1: [4, 2, 2, 2] 365s 0.998 -0.656 -0.376 0.446 365s 0.624 0.772 -0.187 0.221 365s 365s y: [1, 2, 2, 2] 365s -0.707 365s 0.057 365s test-mul-mat0: iter:172/500 365s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 365s x0: [3, 4, 1, 1] 365s 0.821 0.997 -0.416 365s 0.556 -0.691 -0.482 365s 0.633 0.245 0.897 365s 0.335 0.952 -0.911 365s 365s x1: [3, 1, 1, 1] 365s 0.896 -0.912 -0.565 365s 365s y: [4, 1, 1, 1] 365s 0.062 1.401 -0.163 -0.054 365s testing: mul_mat, [4, 4, 2, 1] = [3, 4, 2, 1] * [3, 4, 2, 1] 365s x0: [3, 4, 2, 1] 365s -0.071 -0.992 -0.902 365s -0.736 0.960 -0.813 365s -0.478 -0.895 -0.145 365s 0.882 -0.330 0.385 365s 365s x1: [3, 4, 2, 1] 365s -0.743 -0.855 -0.332 365s 0.360 -0.435 -0.470 365s -0.980 0.093 -0.553 365s 0.742 0.398 -0.808 365s 365s y: [4, 4, 2, 1] 365s 1.200 -0.004 1.169 -0.500 365s 0.830 -0.300 0.285 0.280 365s 0.476 1.261 0.466 -1.108 365s 0.282 0.493 -0.594 0.212 365s testing: mul_mat, [3, 4, 2, 1] = [3, 3ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s , 2, 1] * [3, 4, 2, 1] 365s x0: [3, 3, 2, 1] 365s 0.289 -0.622 -0.940 365s 0.938 0.290 -0.437 365s 0.687 0.942 0.521 365s 365s x1: [3, 4, 2, 1] 365s 0.749 -0.348 -0.043 365s 0.432 0.949 0.720 365s 0.633 -0.122 0.728 365s 0.731 0.143 0.688 365s 365s y: [3, 4, 2, 1] 365s 0.472 0.621 0.165 365s -1.143 0.366 1.566 365s -0.426 0.241 0.699 365s -0.524 0.427 0.995 365s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 365s x0: [3, 2, 1, 1] 365s 0.658 0.334 -0.099 365s -0.477 -0.053 0.394 365s 365s x1: [3, 3, 1, 1] 365s -0.599 0.576 -0.550 365s 0.174 0.121 -0.088 365s 0.332 -0.432 0.137 365s 365s y: [2, 3, 1, 1] 365s -0.147 0.038 365s 0.164 -0.124 365s 0.060 -0.082 365s testing: mul_mat, [3, 3, 2, 1] = [2, 3, 2, 1] * [2, 3, 2, 1] 365s x0: [2, 3, 2, 1] 365s -0.295 -0.719 365s -0.655 -0.206 365s 0.330 -0.496 365s 365s x1: [2, 3, 2, 1] 365s -0.114 -0.809 365s 0.957 -0.427 365s -0.867 0.478 365s 365s y: [3, 3, 2, 1] 365s 0.616 0.242 0.364 365s 0.025 -0.539 0.528 365s -0.088 0.470 -0.523 365s testing: mul_mat, [3, 2, 2, 1] = [3, 3, 2, 1] * [3, 2, 2, 1] 365s x0: [3, 3, 2, 1] 365s -0.387 0.406 -0.889 365s -0.962 0.726 -0.929 365s -0.829 0.099 -0.571 365s 365s x1: [3, 2, 2, 1] 365s 0.629 0.366 -0.509 365s -0.424 -0.734 0.885 365s 365s y: [3, 2, 2, 1] 365s 0.358 0.134 -0.194 365s -0.921 -0.947 -0.226 365s test-mul-mat0: iter:173/500 365s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 365s x0: [2, 3, 1, 1] 365s 0.502 -0.857 365s 0.464 -0.908 365s 0.463 0.190 365s 365s x1: [2, 1, 1, 1] 365s 0.045 -0.699 365s 365s y: [3, 1, 1, 1] 365s 0.622 0.656 -0.111 365s testing: mul_mat, [3, 3, 2, 1] = [2, 3, 2, 1] * [2, 3, 2, 1] 365s x0: [2, 3, 2, 1] 365s -0.266 -0.410 365s -0.828 -0.853 365s -0.485 0.564 365s 365s x1: [2, 3, 2, 1] 365s -0.295 0.501 365s 0.360 -0.888 365s 0.228 -0.540 365s 365s y: [3, 3, 2, 1] 365s -0.127 -0.184 0.426 365s 0.268 0.459 -0.676 365s 0.161 0.272 -0.415 365s testing: mul_mat, [1, 3, 2, 4] = [2, 1, 2, 4] * [2, 3, 2, 4] 365s x0: [2, 1, 2, 4] 365s 0.962 0.197 365s 365s x1: [2, 3, 2, 4] 365s 0.474 0.376 365s -0.575 0.566 365s -0.161 0.615 365s 365s y: [1, 3, 2, 4] 365s 0.531 365s -0.442 365s -0.034 365s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 365s x0: [2, 2, 1, 1] 365s 0.006 -0.054 365s -0.245 0.510 365s 365s x1: [2, 1, 1, 1] 365s -0.116 -0.215 365s 365s y: [2, 1, 1, 1] 365s 0.011 -0.082 365s testing: mul_mat, [4, 2, 2, 1] = [2, 4, 2, 1] * [2, 2, 2, 1] 365s x0: [2, 4, 2, 1] 365s 0.331 -0.053 365s -0.818 0.810 365s -0.319 -0.618 365s -0.562 0.895 365s 365s x1: [2, 2, 2, 1] 365s -0.706 0.255 365s -0.378 -0.301 365s 365s y: [4, 2, 2, 1] 365s -0.247 0.784 0.067 0.625 365s -0.109 0.066 0.307 -0.057 365s testing: mul_mat, [4, 2, 2, 4] = [4, 4, 2, 4] * [4, 2, 2, 4] 365s x0: [4, 4, 2, 4] 365s 0.945 0.364 0.624 -0.391 365s -0.694 -0.928 0.359 0.055 365s 0.667 0.520 -0.686 0.600 365s 0.759 -0.523 0.240 0.706 365s 365s x1: [4, 2, 2, 4] 365s -0.803 0.966 -0.941 -0.047 365s -0.088 0.569 0.247 -0.833 365s 365s y: [4, 2, 2, 4] 365s -0.976 -0.680 0.584 -1.374 365s 0.604 -0.424 -0.432 -0.894 365s test-mul-mat0: iter:174/500 365s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 365s x0: [2, 4, 1, 1] 365s 0.185 0.171 365s -0.014 -0.749 365s 0.621 0.891 365s -0.130 0.051 365s 365s x1: [2, 1, 1, 1] 365s -0.252 -0.710 365s 365s y: [4, 1, 1, 1] 365s -0.168 0.535 -0.789 -0.004 365s testing: mul_mat, [1, 4, 4, 1] = [2, 1, 4, 1] * [2, 4, 4, 1] 365s x0: [2, 1, 4, 1] 365s 0.832 0.908 365s 365s x1: [2, 4, 4, 1] 365s -0.300 0.788 365s -0.410 0.002 365s -0.003 0.315 365s -0.517 0.177 365s 365s y: [1, 4, 4, 1] 365s 0.465 365s -0.340 365s 0.283 365s -0.269 365s testing: mul_mat, [2, 1, 4, 2] = [2, 2, 4, 2] * [2, 1, 4, 2] 365s x0: [2, 2, 4, 2] 365s 0.534 0.387 365s -0.922 -0.224 365s 365s x1: [2, 1, 4, 2] 365s -0.769 -0.961 365s 365s y: [2, 1, 4, 2] 365s -0.782 0.924 365s testing: mul_mat, [1, 2, 1, 1] = [2, 1, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 1, 1, 1] 365s 0.720 -0.058 365s 365s x1: [2, 2, 1, 1] 365s -0.553 0.524 365s 0.112 -0.260 365s 365s y: [1, 2, 1, 1] 365s -0.429 365s 0.096 365s testing: mul_mat, [2, 2, 4, 1] = [1, 2, 4, 1] * [1, 2, 4, 1] 365s x0: [1, 2, 4, 1] 365s -0.498 365s 0.034 365s 365s x1: [1, 2, 4, 1] 365s 0.530 365s 0.476 365s 365s y: [2, 2, 4, 1] 365s -0.264 0.018 365s -0.237 0.016 365s testing: mul_mat, [3, 1, 4, 2] = [2, 3, 4, 2] * [2, 1, 4, 2] 365s x0: [2, 3, 4, 2] 365s 0.494 0.458 365s -0.785 0.641 365s -0.582 -0.707 365s 365s x1: [2, 1, 4, 2] 365s 0.337 -0.129 365s 365s y: [3, 1, 4, 2] 365s 0.107 -0.348 -0.105 365s test-mul-mat0: iter:175/500 365s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 365s x0: [2, 2, 1, 1] 365s 0.384 0.373 365s 0.167 0.231 365s 365s x1: [2, 3, 1, 1] 365s 0.776 -0.459 365s -0.824 0.003 365s 0.032 -0.617 365s 365s y: [2, 3, 1, 1] 365s 0.127 0.024 365s -0.316 -0.137 365s -0.218 -0.137 365s testing: mul_mat, [4, 2, 2, 1] = [2, 4, 2, 1] * [2, 2, 2, 1] 365s x0:ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s [2, 4, 2, 1] 365s 0.508 0.560 365s 0.136 0.387 365s -0.310 0.071 365s 0.391 -0.345 365s 365s x1: [2, 2, 2, 1] 365s -0.423 -0.536 365s -0.465 0.423 365s 365s y: [4, 2, 2, 1] 365s -0.515 -0.265 0.093 0.019 365s 0.001 0.101 0.174 -0.328 365s testing: mul_mat, [1, 4, 2, 2] = [2, 1, 2, 2] * [2, 4, 2, 2] 365s x0: [2, 1, 2, 2] 365s 0.261 0.920 365s 365s x1: [2, 4, 2, 2] 365s -0.509 0.805 365s -0.269 0.864 365s -0.028 0.962 365s -0.559 0.437 365s 365s y: [1, 4, 2, 2] 365s 0.608 365s 0.725 365s 0.878 365s 0.256 365s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 365s x0: [2, 3, 1, 1] 365s -0.402 -0.223 365s 0.311 0.745 365s 0.352 0.056 365s 365s x1: [2, 1, 1, 1] 365s 0.979 -0.986 365s 365s y: [3, 1, 1, 1] 365s -0.174 -0.429 0.289 365s testing: mul_mat, [3, 2, 2, 1] = [3, 3, 2, 1] * [3, 2, 2, 1] 365s x0: [3, 3, 2, 1] 365s -0.399 0.652 -0.860 365s 0.992 0.215 -0.067 365s -0.828 -0.467 -0.242 365s 365s x1: [3, 2, 2, 1] 365s 0.647 0.249 -0.850 365s 0.171 0.492 0.018 365s 365s y: [3, 2, 2, 1] 365s 0.636 0.752 -0.447 365s 0.237 0.274 -0.376 365s testing: mul_mat, [1, 3, 2, 2] = [3, 1, 2, 2] * [3, 3, 2, 2] 365s x0: [3, 1, 2, 2] 365s -0.370 -0.357 0.275 365s 365s x1: [3, 3, 2, 2] 365s 0.140 0.253 -0.650 365s -0.688 -0.255 0.369 365s -0.059 -0.006 0.570 365s 365s y: [1, 3, 2, 2] 365s -0.321 365s 0.447 365s 0.181 365s test-mul-mat0: iter:176/500 365s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 365s x0: [2, 1, 1, 1] 365s -0.659 0.412 365s 365s x1: [2, 4, 1, 1] 365s 0.982 0.209 365s -0.710 -0.274 365s -0.143 -0.924 365s -0.037 -0.492 365s 365s y: [1, 4, 1, 1] 365s -0.561 365s 0.355 365s -0.286 365s -0.179 365s testing: mul_mat, [1, 1, 3, 1] = [2, 1, 3, 1] * [2, 1, 3, 1] 365s x0: [2, 1, 3, 1] 365s -0.575 0.864 365s 365s x1: [2, 1, 3, 1] 365s -0.601 -0.970 365s 365s y: [1, 1, 3, 1] 365s -0.493 365s testing: mul_mat, [3, 1, 3, 4] = [2, 3, 3, 4] * [2, 1, 3, 4] 365s x0: [2, 3, 3, 4] 365s 0.271 -0.361 365s 0.999 -0.153 365s -0.220 -0.176 365s 365s x1: [2, 1, 3, 4] 365s -0.715 -0.813 365s 365s y: [3, 1, 3, 4] 365s 0.100 -0.590 0.300 365s testing: mul_mat, [3, 3, 1, 1] = [2, 3, 1, 1] * [2, 3, 1, 1] 365s x0: [2, 3, 1, 1] 365s 0.868 -0.116 365s -0.125 0.658 365s -0.241 -0.657 365s 365s x1: [2, 3, 1, 1] 365s -0.948 -0.868 365s -0.148 0.598 365s 0.816 -0.673 365s 365s y: [3, 3, 1, 1] 365s -0.722 -0.452 0.798 365s -0.198 0.412 -0.357 365s 0.787 -0.544 0.245 365s testing: mul_mat, [2, 2, 3, 1] = [3, 2, 3, 1] * [3, 2, 3, 1] 365s x0: [3, 2, 3, 1] 365s 0.102 0.971 -0.868 365s 0.805 -0.082 0.200 365s 365s x1: [3, 2, 3, 1] 365s -0.129 -0.764 -0.834 365s -0.100 -0.725 0.073 365s 365s y: [2, 2, 3, 1] 365s -0.031 -0.208 365s -0.777 -0.007 365s testing: mul_mat, [1, 3, 3, 4] = [2, 1, 3, 4] * [2, 3, 3, 4] 365s x0: [2, 1, 3, 4] 365s 0.601 -0.051 365s 365s x1: [2, 3, 3, 4] 365s 0.185 0.995 365s 0.878 0.230 365s 0.861 -0.321 365s 365s y: [1, 3, 3, 4] 365s 0.060 365s 0.515 365s 0.534 365s test-mul-mat0: iter:177/500 365s testing: mul_mat, [1, 4, 1, 1] = [4, 1, 1, 1] * [4, 4, 1, 1] 365s x0: [4, 1, 1, 1] 365s 0.395 -0.244 0.073 0.843 365s 365s x1: [4, 4, 1, 1] 365s -0.669 0.549 -0.353 0.932 365s -0.502 0.172 -0.413 -0.684 365s -0.091 0.465 -0.319 0.737 365s -0.258 0.225 0.203 0.259 365s 365s y: [1, 4, 1, 1] 365s 0.362 365s -0.847 365s 0.448 365s 0.076 365s testing: mul_mat, [1, 1, 4, 1] = [4, 1, 4, 1] * [4, 1, 4, 1] 365s x0: [4, 1, 4, 1] 365s -0.058 0.112 0.740 0.623 365s 365s x1: [4, 1, 4, 1] 365s -0.849 -0.272 0.259 -0.645 365s 365s y: [1, 1, 4, 1] 365s -0.192 365s testing: mul_mat, [3, 1, 4, 3] = [4, 3, 4, 3] * [4, 1, 4, 3] 365s x0: [4, 3, 4, 3] 365s 0.660 0.068 0.327 -0.459 365s 0.844 0.646 -0.842 -0.933 365s -0.537 0.583 0.580 0.927 365s 365s x1: [4, 1, 4, 3] 365s -0.451 0.004 0.214 -0.866 365s 365s y: [3, 1, 4, 3] 365s 0.169 0.248 -0.434 365s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 365s x0: [4, 4, 1, 1] 365s -0.676 -0.229 0.120 0.516 365s 0.014 0.722 -0.949 0.342 365s -0.233 -0.317 0.782 0.894 365s -0.614 -0.098 -0.802 -0.690 365s 365s x1: [4, 3, 1, 1] 365s -0.296 0.807 -0.956 0.403 365s -0.922 0.384 -0.583 -0.368 365s 0.727 -0.063 0.461 -0.914 365s 365s y: [4, 3, 1, 1] 365s 0.108 1.624 -0.574 0.591 365s 0.275 0.691 -0.692 1.250 365s -0.894 -0.786 -0.606 -0.179 365s testing: mul_mat, [4, 4, 4, 1] = [4, 4, 4, 1] * [4, 4, 4, 1] 365s x0: [4, 4, 4, 1] 365s -0.936 0.368 -0.455 -0.964 365s -0.251 0.663 0.298 0.007 365s -0.449 -0.538 -0.053 0.047 365s 0.204 -0.261 -0.916 0.063 365s 365s x1: [4, 4, 4, 1] 365s 0.600 0.597 0.014 0.407 365s 0.640 -0.584 0.485 0.024 365s -0.167 -0.883 -0.249 0.770 365s 0.578 -0.163 -0.343 0.902 365s 365s y: [4, 4, 4, 1] 365s -0.741 0.251 -0.572 -0.020 365s -1.059 -0.403 0.002 -0.160 365s -0.798 -0.612 0.599 0.473 365s -1.315 -0.350 -0.112 0.532 365s testing: mul_mat, [4, 4, 4, 3] = [4, 4, 4, 3] * [4, 4, 4, 3] 365s x0: [4, 4, 4, 3] 365s -0.139 -0.9ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s 44 0.920 0.210 365s 0.581 0.069 -0.056 0.876 365s 0.618 -0.045 -0.482 0.093 365s 0.233 -0.310 -0.180 -0.497 365s 365s x1: [4, 4, 4, 3] 365s -0.848 0.335 0.863 -0.308 365s 0.039 0.438 0.777 -0.960 365s 0.961 -0.054 0.077 0.491 365s 0.162 0.178 -0.291 -0.467 365s 365s y: [4, 4, 4, 3] 365s 0.531 -0.787 -0.983 -0.304 365s 0.095 -0.832 -0.460 0.210 365s 0.091 0.980 0.604 -0.018 365s -0.557 -0.286 0.189 0.267 365s test-mul-mat0: iter:178/500 365s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 365s x0: [4, 2, 1, 1] 365s -0.597 -0.982 0.508 0.585 365s -0.756 0.107 0.028 0.764 365s 365s x1: [4, 1, 1, 1] 365s -0.827 0.929 -0.990 -0.336 365s 365s y: [2, 1, 1, 1] 365s -1.118 0.440 365s testing: mul_mat, [3, 2, 4, 1] = [4, 3, 4, 1] * [4, 2, 4, 1] 365s x0: [4, 3, 4, 1] 365s -0.646 0.712 0.052 0.782 365s 0.108 -0.905 0.607 -0.847 365s -0.017 0.100 0.842 -0.144 365s 365s x1: [4, 2, 4, 1] 365s 0.894 -0.683 0.530 0.800 365s -0.923 0.428 0.396 0.043 365s 365s y: [3, 2, 4, 1] 365s -0.411 0.358 0.248 365s 0.955 -0.284 0.385 365s testing: mul_mat, [1, 3, 4, 2] = [4, 1, 4, 2] * [4, 3, 4, 2] 365s x0: [4, 1, 4, 2] 365s -0.357 0.278 -0.645 -0.446 365s 365s x1: [4, 3, 4, 2] 365s -0.102 0.118 -0.200 0.562 365s -0.588 -0.647 0.138 0.336 365s -0.033 -0.268 -0.083 -0.595 365s 365s y: [1, 3, 4, 2] 365s -0.053 365s -0.209 365s 0.256 365s testing: mul_mat, [3, 1, 1, 1] = [4, 3, 1, 1] * [4, 1, 1, 1] 365s x0: [4, 3, 1, 1] 365s -0.770 0.512 -0.832 -0.318 365s -0.617 0.252 0.347 -0.112 365s 0.682 0.639 0.440 0.803 365s 365s x1: [4, 1, 1, 1] 365s 0.027 0.260 -0.378 0.473 365s 365s y: [3, 1, 1, 1] 365s 0.277 -0.135 0.398 365s testing: mul_mat, [2, 4, 4, 1] = [3, 2, 4, 1] * [3, 4, 4, 1] 365s x0: [3, 2, 4, 1] 365s -0.302 -0.555 0.781 365s 0.825 -0.645 0.198 365s 365s x1: [3, 4, 4, 1] 365s 0.009 0.333 -0.972 365s -0.882 0.632 -0.876 365s 0.627 -0.902 -0.130 365s -0.303 0.524 -0.381 365s 365s y: [2, 4, 4, 1] 365s -0.947 -0.400 365s -0.769 -1.309 365s 0.210 1.073 365s -0.497 -0.663 365s testing: mul_mat, [3, 3, 4, 2] = [2, 3, 4, 2] * [2, 3, 4, 2] 365s x0: [2, 3, 4, 2] 365s 0.707 -0.761 365s 0.069 0.436 365s 0.627 0.575 365s 365s x1: [2, 3, 4, 2] 365s 0.926 -0.866 365s 0.361 0.752 365s -0.144 -0.258 365s 365s y: [3, 3, 4, 2] 365s 1.314 -0.314 0.083 365s -0.317 0.353 0.658 365s 0.094 -0.123 -0.239 365s test-mul-mat0: iter:179/500 365s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 365s x0: [4, 1, 1, 1] 365s -0.270 -0.303 0.926 -0.082 365s 365s x1: [4, 3, 1, 1] 365s 0.413 0.229 -0.892 0.128 365s -0.647 0.215 -0.983 0.261 365s -0.483 0.044 0.876 -0.111 365s 365s y: [1, 3, 1, 1] 365s -1.017 365s -0.823 365s 0.938 365s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 365s x0: [4, 2, 1, 1] 365s -0.664 0.180 -0.054 -0.266 365s -0.730 -0.968 0.353 -0.476 365s 365s x1: [4, 1, 1, 1] 365s -0.505 0.695 -0.848 -0.420 365s 365s y: [2, 1, 1, 1] 365s 0.618 -0.403 365s testing: mul_mat, [4, 2, 1, 2] = [4, 4, 1, 2] * [4, 2, 1, 2] 365s x0: [4, 4, 1, 2] 365s -0.107 0.205 -0.727 0.045 365s 0.785 0.516 0.381 -0.035 365s -0.538 -0.885 0.235 -0.506 365s 0.467 0.760 0.778 -0.767 365s 365s x1: [4, 2, 1, 2] 365s 0.284 -0.235 0.754 0.392 365s 0.893 -0.894 -0.393 0.910 365s 365s y: [4, 2, 1, 2] 365s -0.609 0.375 0.034 0.240 365s 0.048 0.058 -0.242 -1.266 365s testing: mul_mat, [1, 4, 1, 1] = [4, 1, 1, 1] * [4, 4, 1, 1] 365s x0: [4, 1, 1, 1] 365s 0.464 0.707 0.687 0.499 365s 365s x1: [4, 4, 1, 1] 365s -0.171 -0.509 -0.857 -0.386 365s -0.993 0.524 0.579 -0.531 365s 0.638 -0.186 -0.037 0.106 365s -0.426 -0.259 0.338 -0.913 365s 365s y: [1, 4, 1, 1] 365s -1.221 365s 0.043 365s 0.192 365s -0.604 365s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 365s x0: [1, 2, 1, 1] 365s -0.872 365s 0.131 365s 365s x1: [1, 4, 1, 1] 365s -0.305 365s -0.413 365s -0.512 365s -0.074 365s 365s y: [2, 4, 1, 1] 365s 0.266 -0.040 365s 0.360 -0.054 365s 0.447 -0.067 365s 0.065 -0.010 365s testing: mul_mat, [4, 1, 1, 2] = [2, 4, 1, 2] * [2, 1, 1, 2] 365s x0: [2, 4, 1, 2] 365s -0.628 -0.854 365s -0.433 0.312 365s -0.157 0.534 365s 0.896 -0.040 365s 365s x1: [2, 1, 1, 2] 365s 0.124 -0.654 365s 365s y: [4, 1, 1, 2] 365s 0.481 -0.258 -0.369 0.137 365s test-mul-mat0: iter:180/500 365s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 365s x0: [2, 2, 1, 1] 365s 0.902 -0.690 365s 0.779 -0.261 365s 365s x1: [2, 3, 1, 1] 365s 0.201 0.429 365s -0.050 -0.298 365s 0.557 -0.919 365s 365s y: [2, 3, 1, 1] 365s -0.114 0.045 365s 0.160 0.039 365s 1.136 0.673 365s testing: mul_mat, [2, 2, 2, 1] = [2, 2, 2, 1] * [2, 2, 2, 1] 365s x0: [2, 2, 2, 1] 365s -0.247 -0.014 365s -0.814 -0.269 365s 365s x1: [2, 2, 2, 1] 365s -0.318 -0.654 365s 0.582 -0.421 365s 365s y: [2, 2, 2, 1] 365s 0.088 0.434 365s -0.138 -0.360 365s testing: mul_mat, [4, 2, 2, 2] = [2, 4, 2, 2] * [2, 2, 2, 2] 365s x0: [2, 4, 2, 2] 365s -0.438 -0.928 365s 0.216 -0.325 365s 0.524 -0.614 365s 0.428 -0.490 365s 365s x1: [2, 2, 2, 2] 365s 0.728 0.114 365s 0.960 -0.071 365s 365s y: [4, 2, 2, 2] 365s -0.425 0.120 0.311 0.255 365s -0.355 0.231 0.547 0.446 365s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 365s x0: [2, 4, 1, 1] 365s 0.468 -0.574 365s 0.013 -0.199 365s 0.355 0.536 365s -0.478 -0.231 365s 365s x1: [2, 4, 1, 1] 365s -0.683 -0.632 365s 0.435 0.841 365s -0.246 -0.137 365s -0.649 0.327 365s 365s y: [4, 4, 1, 1] 365s 0.043 0.118 -0.581 0.473 365s -0.279 -0.162 0.605 -0.403 365s -0.037 0.024 -0.160 0.149 365s -0.491 -0.073 -0.055 0.235 365s testing: mul_mat, [3, 2, 2, 1] = [4, 3, 2, 1] * [4, 2, 2, 1] 365s x0: [4, 3, 2, 1] 365s 0.240 0.590 0.058 0.580 365s -0.932 0.394 -0.593 -0.167 365s 0.849 -0.128 -0.774 0.027 365s 365s x1: [4, 2, 2, 1] 365s -0.211 0.460 -0.258 -0.123 365s -0.625 -0.829 -0.035 -0.409 365s 365s y: [3, 2, 2, 1] 365s 0.135 0.552 -0.042 365s -0.878 0.344 -0.408 365s testing: mul_mat, [3, 4, 2, 2] = [3, 3, 2, 2] * [3, 4, 2, 2] 365s x0: [3, 3, 2, 2] 365s 0.536 -0.806 0.544 365s 0.650 -0.359 -0.416 365s 0.536 0.896 -0.230 365s 365s x1: [3, 4, 2, 2] 365s -0.668 -0.568 -0.602 365s 0.883 0.831 0.712 365s -0.131 0.070 0.780 365s -0.282 -0.339 0.174 365s 365s y: [3, 4, 2, 2] 365s -0.227 0.020 -0.729 365s 0.190 -0.020 1.055 365s 0.297 -0.435 -0.186 365s 0.218 -0.134 -0.495 365s test-mul-mat0: iter:181/500 365s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 365s x0: [3, 1, 1, 1] 365s 0.512 0.934 -0.838 365s 365s x1: [3, 2, 1, 1] 365s -0.160 -0.962 -0.861 365s 0.852 0.925 0.793 365s 365s y: [1, 2, 1, 1] 365s -0.259 365s 0.636 365s testing: mul_mat, [4, 1, 3, 1] = [3, 4, 3, 1] * [3, 1, 3, 1] 365s x0: [3, 4, 3, 1] 365s -0.492 0.627 0.000 365s 0.252 -0.896 0.745 365s -0.362 -0.523 -0.967 365s -0.393 0.620 -0.126 365s 365s x1: [3, 1, 3, 1] 365s 0.668 0.635 -0.048 365s 365s y: [4, 1, 3, 1] 365s 0.069 -0.436 -0.528 0.138 365s testing: mul_mat, [4, 4, 3, 3] = [3, 4, 3, 3] * [3, 4, 3, 3] 365s x0: [3, 4, 3, 3] 365s -0.691 0.150 0.395 365s 0.864 0.751 -0.827 365s -0.611 0.013 0.450 365s 0.937 0.565 -0.550 365s 365s x1: [3, 4, 3, 3] 365s 0.493 -0.511 0.722 365s 0.526 0.096 0.342 365s -0.600 0.742 0.102 365s 0.126 -0.688 -0.346 365s 365s y: [4, 4, 3, 3] 365s -0.132 -0.555 0.017 -0.224 365s -0.214 0.244 -0.166 0.359 365s 0.566 -0.046 0.422 -0.199 365s -0.327 -0.122 -0.241 -0.080 365s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 365s x0: [3, 3, 1, 1] 365s 0.745 0.964 0.921 365s -0.698 -0.621 -0.945 365s -0.699 -0.367 0.020 365s 365s x1: [3, 4, 1, 1] 365s -0.493 0.845 0.958 365s 0.865 -0.068 -0.839 365s 0.722 -0.754 0.633 365s 0.974 -0.770 0.059 365s 365s y: [3, 4, 1, 1] 365s 1.329 -1.086 0.054 365s -0.194 0.231 -0.597 365s 0.395 -0.634 -0.216 365s 0.038 -0.257 -0.398 365s testing: mul_mat, [1, 3, 3, 1] = [3, 1, 3, 1] * [3, 3, 3, 1] 365s x0: [3, 1, 3, 1] 365s 0.493 1.000 0.577 365s 365s x1: [3, 3, 3, 1] 365s 0.042 0.110 0.912 365s -0.312 -0.115 -0.791 365s -0.470 -0.390 -0.547 365s 365s y: [1, 3, 3, 1] 365s 0.656 365s -0.725 365s -0.937 365s testing: mul_mat, [3, 3, 3, 3] = [1, 3, 3, 3] * [1, 3, 3, 3] 365s x0: [1, 3, 3, 3] 365s -0.626 365s 0.214 365s 0.782 365s 365s x1: [1, 3, 3, 3] 365s 0.753 365s -0.395 365s -0.850 365s 365s y: [3, 3, 3, 3] 365s -0.471 0.161 0.589 365s 0.247 -0.084 -0.309 365s 0.532 -0.181 -0.665 365s test-mul-mat0: iter:182/500 365s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 365s x0: [4, 2, 1, 1] 365s -0.738 0.214 0.079 -0.244 365s 0.823 -0.595 -0.618 -0.249 365s 365s x1: [4, 1, 1, 1] 365s -0.830 -0.618 -0.671 -0.703 365s 365s y: [2, 1, 1, 1] 365s 0.600 0.274 365s testing: mul_mat, [2, 2, 4, 1] = [4, 2, 4, 1] * [4, 2, 4, 1] 365s x0: [4, 2, 4, 1] 365s 0.380 -0.359 -0.316 -0.641 365s -0.294 0.324 -0.815 -0.203 365s 365s x1: [4, 2, 4, 1] 365s 0.812 0.346 -0.405 0.163 365s -0.090 -0.021 -0.935 -0.360 365s 365s y: [2, 2, 4, 1] 365s 0.208 0.171 365s 0.500 0.855 365s testing: mul_mat, [1, 2, 4, 2] = [4, 1, 4, 2] * [4, 2, 4, 2] 365s x0: [4, 1, 4, 2] 365s -0.264 -0.697 -0.970 0.587 365s 365s x1: [4, 2, 4, 2] 365s 0.036 0.983 -0.464 0.741 365s 0.307 -0.280 -0.461 0.110 365s 365s y: [1, 2, 4, 2] 365s 0.191 365s 0.626 365s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 365s x0: [4, 4, 1, 1] 365s 0.136 -0.748 -0.304 0.816 365s -0.355 0.058 -0.608 -0.914 365s 0.992 -0.764 -0.626 0.442 365s -0.295 0.837 0.070 0.158 365s 365s x1: [4, 1, 1, 1] 365s 0.250 -0.122 -0.374 -0.965 365s 365s y: [4, 1, 1, 1] 365s -0.548 1.014 0.149 -0.354 365s testing: mul_mat, [1, 4, 4, 1] = [4, 1, 4, 1] * [4, 4, 4, 1] 365s x0: [4, 1, 4, 1] 365s 0.444 -0.865 -0.140 0.885 365s 365s x1: [4, 4, 4, 1] 365s -0.887 -0.406 0.866 0.993 365s -0.389 0.565 -0.704 0.559 365s -0.586 0.335 -0.192 0.292 365s 0.961 -0.156 0.677 0.097 365s 365s y: [1, 4, 4, 1] 365s 0.714 365s -0.068 365s -0.265 365s 0.553 365s testing: mul_mat, [4, 4, 4, 2] = [1, 4, 4, 2] * [1, 4, 4, 2] 365s x0: [1, 4, 4, 2] 365s 0.859 365s 0.328 365s -0.210 365s -0.496 365s 365s x1: [1, 4, 4, 2] 365s -0.884 365s 0.369 365s 0.540 365s 0.276 365s 365s y: [4, 4, 4, 2] 365s -0.760 -0.290 0.185 0.439 365s 0.318 0.121 -0.078 -0.183 365s 0.464 0.177 -0.113 -0.268 365s 0.237 0.091 -0.058 -0.137 365s test-mul-mat0: iter:183/500 365s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 365s x0: [2, 2, 1, 1] 365s 0.082 0.468 365s -0.095 0.604 365s 365s x1: [2, 1, 1, 1] 365s 0.955 -0.778 365s 365s y: [2, 1, 1, 1] 365s -0.286 -0.560 365s testing: mul_mat, [2, 2, 2, 1] = [2, 2, 2, 1] * [2, 2, 2, 1] 365s x0: [2, 2, 2, 1] 365s -0.694 0.772 365s 0.729 0.346 365s 365s x1: [2, 2, 2, 1] 365s 0.486 0.278 365s 0.657 0.136 365s 365s y: [2, 2, 2, 1] 365s -0.123 0.450 365s -0.351 0.526 365s testing: mul_mat, [2, 2, 2, 3] = [2, 2, 2, 3] * [2, 2, 2, 3] 365s x0: [2, 2, 2, 3] 365s 0.369 0.610 365s 0.698 0.561 365s 365s x1: [2, 2, 2, 3] 365s -0.292 -0.162 365s 0.396 -0.447 365s 365s y: [2, 2, 2, 3] 365s -0.207 -0.295 365s -0.127 0.025 365s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 3, 1, 1] 365s -0.505 -0.895 365s 0.734 0.432 365s -0.445 -0.883 365s 365s x1: [2, 2, 1, 1] 365s 0.738 0.545 365s -0.770 -0.150 365s 365s y: [3, 2, 1, 1] 365s -0.861 0.777 -0.810 365s 0.524 -0.630 0.475 365s testing: mul_mat, [2, 2, 2, 1] = [3, 2, 2, 1] * [3, 2, 2, 1] 365s x0: [3, 2, 2, 1] 365s 0.232 0.585 -0.484 365s 0.707 -0.929 -0.309 365s 365s x1: [3, 2, 2, 1] 365s -0.886 0.304 -0.614 365s -0.038 -0.428 0.209 365s 365s y: [2, 2, 2, 1] 365s 0.269 -0.719 365s -0.360 0.306 365s testing: mul_mat, [2, 3, 2, 3] = [2, 2, 2, 3] * [2, 3, 2, 3] 365s x0: [2, 2, 2, 3] 365s 0.537 -0.737 365s -0.892 0.784 365s 365s x1: [2, 3, 2, 3] 365s 0.066 -0.162 365s 0.978 0.171 365s -0.730 -0.905 365s 365s y: [2, 3, 2, 3] 365s 0.155 -0.186 365s 0.400 -0.739 365s 0.275 -0.058 365s test-mul-mat0: iter:184/500 365s testing: mul_mat, [3, 2, 1, 1] = [4, 3, 1, 1] * [4, 2, 1, 1] 365s x0: [4, 3, 1, 1] 365s 0.306 0.516 0.480 -0.859 365s -0.326 0.112 -0.252 -0.137 365s -0.706 -0.287 -0.848 0.544 365s 365s x1: [4, 2, 1, 1] 365s -0.921 0.793 0.476 0.617 365s 0.901 0.739 0.401 -0.845 365s 365s y: [3, 2, 1, 1] 365s -0.174 0.184 0.353 365s 1.575 -0.196 -1.648 365s testing: mul_mat, [4, 3, 4, 1] = [4, 4, 4, 1] * [4, 3, 4, 1] 365s x0: [4, 4, 4, 1] 365s -0.244 0.501 -0.160 0.871 365s 0.502 -0.751 -0.902 -0.749 365s -0.479 -0.381 0.713 -0.056 365s -0.316 0.901 -0.749 0.863 365s 365s x1: [4, 3, 4, 1] 365s -0.274 -0.116 0.114 0.025 365s 0.630 -0.890 0.033 0.458 365s 0.370 -0.273 0.537 0.163 365s 365s y: [4, 3, 4, 1] 365s 0.013 -0.172 0.255 -0.082 365s -0.206 0.612 0.034 -0.631 365s -0.170 -0.216 0.300 -0.623 365s testing: mul_mat, [3, 4, 4, 3] = [4, 3, 4, 3] * [4, 4, 4, 3] 365s x0: [4, 3, 4, 3] 365s 0.956 -0.864 0.744 0.831 365s 0.311 -0.373 0.859 0.893 365s -0.424 -0.951 0.105 -0.428 365s 365s x1: [4, 4, 4, 3] 365s 0.727 -0.283 0.556 0.441 365s -0.924 0.053 0.676 -0.946 365s -0.738 -0.146 -0.284 0.372 365s -0.716 0.241 0.597 -0.097 365s 365s y: [3, 4, 4, 3] 365s 1.721 1.204 -0.169 365s -1.213 -0.572 0.818 365s -0.481 -0.087 0.263 365s -0.530 0.113 0.179 365s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 365s x0: [4, 4, 1, 1] 365s 0.231 0.225 0.086 -0.387 365s 0.041 0.479 -0.092 -0.202 365s -0.722 -0.278 0.913 0.625 365s 0.206 0.850 0.213 -0.561 365s 365s x1: [4, 3, 1, 1] 365s 0.072 -0.770 0.507 -0.669 365s 0.603 0.408 -0.797 0.406 365s 0.274 0.608 -0.015 -0.695 365s 365s y: [4, 3, 1, 1] 365s 0.145 -0.277 0.207 -0.157 365s 0.006 0.211 -1.022 0.073 365s 0.468 0.445 -0.815 0.961 365s testing: mul_mat, [1, 4, 4, 1] = [4, 1, 4, 1] * [4, 4, 4, 1] 365s x0: [4, 1, 4, 1] 365s 0.729 0.541 -0.062 -0.762 365s 365s x1: [4, 4, 4, 1] 365s -0.462 0.511 0.511 -0.232 365s 0.018 0.842 -0.629 -0.574 365s -0.954 0.777 0.700 0.654 365s -0.238 -0.995 0.574 0.993 365s 365s y: [1, 4, 4, 1] 365s 0.085 365s 0.945 365s -0.817 365s -1.504 365s testing: mul_mat, [2, 4, 4, 3] = [1, 2, 4, 3] * [1, 4, 4, 3] 365s x0: [1, 2, 4, 3] 365s -0.851 365s 0.828 365s 365s x1: [1, 4, 4, 3] 365s 0.767 365s 0.712 365s 0.014 365s 0.121 365s 365s y: [2, 4, 4, 3] 365s -0.653 0.635 365s -0.606 0.590 365s -0.012 0.011 365s -0.103 0.100 365s test-mul-mat0: iter:185/500 365s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 365s x0: [2, 1, 1, 1] 365s 0.050 -0.703 365s 365s x1: [2, 4, 1, 1] 365s -0.403 0.467 365s 0.660 -0.254 365s 0.296 0.158 365s 0.707 -0.619 365s 365s y: [1, 4, 1, 1] 365s -0.349 365s 0.211 365s -0.096 365s 0.470 365s testing: mul_mat, [3, 1, 2, 1] = [2, 3, 2, 1] * [2, 1, 2, 1] 365s x0: [2, 3, 2, 1] 365s 0.993 -0.512 365s -0.367 -0.340 365s 0.733 -ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s 0.446 365s 365s x1: [2, 1, 2, 1] 365s -0.905 0.390 365s 365s y: [3, 1, 2, 1] 365s -1.099 0.199 -0.838 365s testing: mul_mat, [4, 3, 2, 1] = [2, 4, 2, 1] * [2, 3, 2, 1] 365s x0: [2, 4, 2, 1] 365s -0.962 0.143 365s 0.239 0.617 365s -0.642 -0.181 365s 0.610 -0.154 365s 365s x1: [2, 3, 2, 1] 365s 0.748 -0.404 365s -0.185 -0.654 365s -0.936 -0.525 365s 365s y: [4, 3, 2, 1] 365s -0.778 -0.070 -0.408 0.519 365s 0.085 -0.448 0.237 -0.012 365s 0.826 -0.548 0.696 -0.490 365s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 365s x0: [2, 2, 1, 1] 365s -0.534 0.358 365s 0.748 -0.793 365s 365s x1: [2, 4, 1, 1] 365s -0.553 -0.174 365s 0.122 0.043 365s 0.641 0.468 365s 0.107 -0.885 365s 365s y: [2, 4, 1, 1] 365s 0.233 -0.276 365s -0.050 0.057 365s -0.174 0.108 365s -0.374 0.781 365s testing: mul_mat, [2, 2, 2, 1] = [2, 2, 2, 1] * [2, 2, 2, 1] 365s x0: [2, 2, 2, 1] 365s 0.142 0.742 365s -0.283 0.721 365s 365s x1: [2, 2, 2, 1] 365s -0.204 0.207 365s -0.755 0.939 365s 365s y: [2, 2, 2, 1] 365s 0.124 0.207 365s 0.589 0.890 365s testing: mul_mat, [1, 2, 2, 1] = [2, 1, 2, 1] * [2, 2, 2, 1] 365s x0: [2, 1, 2, 1] 365s -0.375 0.870 365s 365s x1: [2, 2, 2, 1] 365s 0.476 0.254 365s 0.330 -0.402 365s 365s y: [1, 2, 2, 1] 365s 0.043 365s -0.474 365s test-mul-mat0: iter:186/500 365s testing: mul_mat, [4, 4, 1, 1] = [4, 4, 1, 1] * [4, 4, 1, 1] 365s x0: [4, 4, 1, 1] 365s 0.574 0.087 0.145 0.639 365s -0.509 -0.769 -0.736 -0.639 365s 0.065 -0.753 -0.562 -0.305 365s 0.436 -0.239 -0.737 0.071 365s 365s x1: [4, 4, 1, 1] 365s 0.568 0.635 -0.815 0.865 365s -0.100 0.657 0.007 0.617 365s 0.399 -0.272 -0.660 0.501 365s 0.535 -0.155 -0.023 -0.210 365s 365s y: [4, 4, 1, 1] 365s 0.816 -0.731 -0.248 0.758 365s 0.395 -0.853 -0.694 -0.162 365s 0.429 0.171 0.449 0.761 365s 0.156 -0.002 0.229 0.273 365s testing: mul_mat, [4, 4, 3, 1] = [4, 4, 3, 1] * [4, 4, 3, 1] 365s x0: [4, 4, 3, 1] 365s -0.402 -0.902 -0.662 -0.447 365s -0.776 0.728 -0.245 -0.213 365s 0.813 -0.607 -0.888 -0.839 365s 0.102 -0.618 -0.364 0.782 365s 365s x1: [4, 4, 3, 1] 365s -0.054 -0.872 0.971 -0.398 365s 0.135 0.588 -0.999 0.863 365s 0.929 0.502 0.398 -0.226 365s -0.522 -0.812 -0.052 -0.948 365s 365s y: [4, 4, 3, 1] 365s 0.343 -0.747 -0.043 -0.131 365s -0.309 0.385 -0.085 0.689 365s -0.988 -0.405 0.287 -0.537 365s 1.400 0.027 0.910 -0.274 365s testing: mul_mat, [4, 4, 3, 1] = [4, 4, 3, 1] * [4, 4, 3, 1] 365s x0: [4, 4, 3, 1] 365s -0.145 0.545 -0.595 -0.647 365s -0.449 -0.480 0.055 -0.249 365s -0.698 0.344 -0.219 0.532 365s -0.525 -0.113 -0.738 -0.424 365s 365s x1: [4, 4, 3, 1] 365s -0.923 0.500 -0.029 0.797 365s 0.749 0.476 -0.613 -0.031 365s -0.165 -0.963 -0.891 0.100 365s -0.980 0.262 0.963 -0.899 365s 365s y: [4, 4, 3, 1] 365s -0.092 -0.025 1.247 0.113 365s 0.535 -0.591 -0.241 0.018 365s -0.035 0.462 0.033 0.811 365s 0.293 0.592 0.085 0.155 365s testing: mul_mat, [3, 4, 1, 1] = [4, 3, 1, 1] * [4, 4, 1, 1] 365s x0: [4, 3, 1, 1] 365s -0.622 -0.150 -0.334 -0.043 365s -0.288 0.693 0.454 -0.897 365s 0.770 0.404 -0.430 0.998 365s 365s x1: [4, 4, 1, 1] 365s -0.831 0.021 -0.478 -0.643 365s 0.587 0.932 -0.897 0.665 365s 0.690 -0.334 0.218 0.898 365s -0.199 0.625 -0.038 -0.519 365s 365s y: [3, 4, 1, 1] 365s 0.701 0.613 -1.068 365s -0.234 -0.528 1.878 365s -0.490 -1.137 1.199 365s 0.065 0.938 -0.401 365s testing: mul_mat, [3, 4, 3, 1] = [3, 3, 3, 1] * [3, 4, 3, 1] 365s x0: [3, 3, 3, 1] 365s -0.500 -0.404 0.911 365s -0.858 0.829 0.414 365s -0.565 0.727 0.821 365s 365s x1: [3, 4, 3, 1] 365s 0.824 0.543 -0.834 365s -0.155 -0.935 -0.476 365s -0.568 0.997 -0.374 365s -0.903 0.687 0.292 365s 365s y: [3, 4, 3, 1] 365s -1.391 -0.603 -0.755 365s 0.021 -0.839 -0.983 365s -0.459 1.159 0.739 365s 0.441 1.466 1.250 365s testing: mul_mat, [2, 3, 3, 1] = [3, 2, 3, 1] * [3, 3, 3, 1] 365s x0: [3, 2, 3, 1] 365s -0.610 -0.583 0.866 365s -0.925 0.842 -0.728 365s 365s x1: [3, 3, 3, 1] 365s 0.452 0.792 0.855 365s 0.261 -0.709 0.996 365s 0.696 -0.113 0.825 365s 365s y: [2, 3, 3, 1] 365s 0.003 -0.374 365s 1.117 -1.564 365s 0.356 -1.340 365s test-mul-mat0: iter:187/500 365s testing: mul_mat, [4, 4, 1, 1] = [4, 4, 1, 1] * [4, 4, 1, 1] 365s x0: [4, 4, 1, 1] 365s 0.152 -0.849 -0.293 0.245 365s -0.626 -0.450 -0.424 0.841 365s -0.959 0.108 -0.053 0.408 365s 0.725 0.375 -0.975 0.089 365s 365s x1: [4, 4, 1, 1] 365s -0.383 0.364 -0.590 0.793 365s 0.213 0.487 -0.983 0.563 365s -0.806 -0.593 0.638 -0.389 365s -0.751 0.504 -0.117 0.352 365s 365s y: [4, 4, 1, 1] 365s 0.000 0.993 0.761 0.504 365s 0.045 0.538 0.131 1.345 365s 0.099 0.174 0.516 -1.463 365s -0.421 0.589 0.924 -0.211 365s testing: mul_mat, [4, 4, 2, 1] = [4, 4, 2, 1] * [4, 4, 2, 1] 365s x0: [4, 4, 2, 1] 365s -0.843 0.589 0.055 -0.978 365s 0.056 0.297 0.525 0.133 365s 0.788 -0.467 -0.905 0.632 365s 0.202 -0.793 0.627 -0.422 365s 365s x1: [4, 4, 2, 1] 365s 0.785 0.819 -0.698 0.272 365s 0.835 0.865 0.466 -0.758 365s 0.503 -0.922 -0.509 0.007 365s -0.039 0.844 -0.331 -0.887 365s 365s y: [4, 4, 2, 1] 365s -0.484 -0.044 1.041 -1.043 365s 0.572 0.447 -0.647 0.095 365s -1.002 -0.512 1.293 0.510 365s 1.379 -0.044 -0.686 -0.509 365s testing: mul_mat, [3, 4, 2, 4] = [4, 3, 2, 4] * [4, 4, 2, 4] 365s x0: [4, 3, 2, 4] 365s -0.243 0.217 0.111 -0.974 365s 0.034 -0.333 -0.674 -0.636 365s 0.701 -0.263 0.621 -0.165 365s 365s x1: [4, 4, 2, 4] 365s 0.800 0.521 0.343 -0.144 365s -0.182 -0.132 0.989 -0.394 365s 0.401 -0.916 -0.762 -0.398 365s -0.709 0.865 0.180 0.856 365s 365s y: [3, 4, 2, 4] 365s 0.097 -0.285 0.660 365s 0.509 -0.379 0.586 365s 0.006 1.085 0.114 365s -0.454 -0.977 -0.754 365s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 365s x0: [4, 3, 1, 1] 365s -0.203 -0.610 -0.716 -0.842 365s 0.517 0.710 0.132 0.296 365s -0.128 -0.907 0.897 -0.543 365s 365s x1: [4, 3, 1, 1] 365s 0.923 -0.692 0.492 0.759 365s -0.378 -0.607 0.743 0.045 365s -0.949 0.896 -0.854 0.227 365s 365s y: [3, 3, 1, 1] 365s -0.757 0.275 0.539 365s -0.123 -0.515 1.241 365s 0.067 0.101 -1.581 365s testing: mul_mat, [4, 4, 2, 1] = [3, 4, 2, 1] * [3, 4, 2, 1] 365s x0: [3, 4, 2, 1] 365s -0.852 0.549 0.980 365s -0.747 0.764 -0.568 365s 0.307 -0.379 -0.821 365s -0.693 0.624 -0.290 365s 365s x1: [3, 4, 2, 1] 365s -0.686 -0.926 -0.784 365s 0.811 -0.444 -0.269 365s 0.734 -0.136 -0.777 365s 0.492 0.486 -0.385 365s 365s y: [4, 4, 2, 1] 365s -0.692 0.251 0.784 0.125 365s -1.199 -0.792 0.639 -0.761 365s -1.461 -0.210 0.915 -0.368 365s -0.529 0.222 0.282 0.073 365s testing: mul_mat, [1, 3, 2, 4] = [4, 1, 2, 4] * [4, 3, 2, 4] 365s x0: [4, 1, 2, 4] 365s 0.664 0.079 -0.537 -0.324 365s 365s x1: [4, 3, 2, 4] 365s -0.531 0.212 0.518 0.786 365s 0.076 -0.361 0.196 0.225 365s -0.109 -0.497 0.531 -0.559 365s 365s y: [1, 3, 2, 4] 365s -0.868 365s -0.156 365s -0.215 365s test-mul-mat0: iter:188/500 365s testing: mul_mat, [3, 2, 1, 1] = [1, 3, 1, 1] * [1, 2, 1, 1] 365s x0: [1, 3, 1, 1] 365s 0.851 365s -0.710 365s 0.101 365s 365s x1: [1, 2, 1, 1] 365s -0.374 365s -0.544 365s 365s y: [3, 2, 1, 1] 365s -0.318 0.265 -0.038 365s -0.463 0.386 -0.055 365s testing: mul_mat, [3, 3, 2, 1] = [1, 3, 2, 1] * [1, 3, 2, 1] 365s x0: [1, 3, 2, 1] 365s -0.923 365s -0.964 365s -0.292 365s 365s x1: [1, 3, 2, 1] 365s -0.977 365s 0.007 365s -0.926 365s 365s y: [3, 3, 2, 1] 365s 0.902 0.942 0.285 365s -0.007 -0.007 -0.002 365s 0.854 0.893 0.270 365s testing: mul_mat, [3, 3, 2, 4] = [1, 3, 2, 4] * [1, 3, 2, 4] 365s x0: [1, 3, 2, 4] 365s 0.093 365s 0.360 365s -0.995 365s 365s x1: [1, 3, 2, 4] 365s -0.790 365s 0.208 365s 0.846 365s 365s y: [3, 3, 2, 4] 365s -0.073 -0.285 0.786 365s 0.019 0.075 -0.206 365s 0.078 0.305 -0.842 365s testing: mul_mat, [4, 3, 1, 1] = [1, 4, 1, 1] * [1, 3, 1, 1] 365s x0: [1, 4, 1, 1] 365s -0.821 365s -0.116 365s 0.918 365s 0.271 365s 365s x1: [1, 3, 1, 1] 365s 0.418 365s 0.164 365s 0.559 365s 365s y: [4, 3, 1, 1] 365s -0.343 -0.049 0.384 0.113 365s -0.134 -0.019 0.150 0.044 365s -0.459 -0.065 0.514 0.152 365s testing: mul_mat, [3, 1, 2, 1] = [4, 3, 2, 1] * [4, 1, 2, 1] 365s x0: [4, 3, 2, 1] 365s -0.622 -0.455 -0.778 0.866 365s -0.355 0.068 -0.599 -0.615 365s -0.788 0.588 -0.597 -0.492 365s 365s x1: [4, 1, 2, 1] 365s -0.756 0.924 0.202 -0.175 365s 365s y: [3, 1, 2, 1] 365s -0.258 0.318 1.105 365s testing: mul_mat, [2, 4, 2, 4] = [3, 2, 2, 4] * [3, 4, 2, 4] 365s x0: [3, 2, 2, 4] 365s -0.709 0.818 0.914 365s 0.153 0.933 -0.469 365s 365s x1: [3, 4, 2, 4] 365s 0.581 0.631 0.781 365s -0.078 -0.754 -0.513 365s -0.430 -0.376 0.132 365s -0.218 0.169 -0.800 365s 365s y: [2, 4, 2, 4] 365s 0.819 0.312 365s -1.031 -0.474 365s 0.118 -0.479 365s -0.439 0.499 365s test-mul-mat0: iter:189/500 365s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 365s x0: [3, 3, 1, 1] 365s 0.780 0.429 0.476 365s 0.029 0.137 0.682 365s -0.652 0.497 0.610 365s 365s x1: [3, 4, 1, 1] 365s -0.145 -0.949 -0.437 365s 0.911 -0.623 0.926 365s -0.540 -0.994 0.686 365s 0.410 0.927 -0.400 365s 365s y: [3, 4, 1, 1] 365s -0.728 -0.433 -0.644 365s 0.884 0.572 -0.340 365s -0.520 0.316 0.276 365s 0.527 -0.134 -0.050 365s testing: mul_mat, [3, 3, 4, 1] = [3, 3, 4, 1] * [3, 3, 4, 1] 365s x0: [3, 3, 4, 1] 365s -0.326 0.014 -0.967 365s 0.134 -0.542 0.982 365s 0.504 -0.723 0.857 365s 365s x1: [3, 3, 4, 1] 365s -0.116 -0.927 0.789 365s 0.275 0.325 0.672 365s 0.605 0.393 0.386 365s 365s y: [3, 3, 4, 1] 365s -0.738 1.262 1.287 365s -0.735 0.521 0.480 365s -0.565 0.247 0.352 365s testing: mul_mat, [2, 3, 4, 3] = [3, 2, 4, 3] * [3, 3, 4, 3] 365s x0: [3, 2, 4, 3] 365s -0.519 -0.413 0.218 365s -0.204 -0.621 -0.166 365s 365s x1: [3, 3, 4, 3] 365s 0.285 0.033 0.786 365s 0.141 0.368 0.8ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s 89 365s -0.441 0.943 0.745 365s 365s y: [2, 3, 4, 3] 365s 0.010 -0.209 365s -0.032 -0.405 365s 0.002 -0.620 365s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 365s x0: [3, 1, 1, 1] 365s 0.070 -0.746 0.554 365s 365s x1: [3, 2, 1, 1] 365s 0.726 -0.059 0.967 365s -0.764 0.541 0.030 365s 365s y: [1, 2, 1, 1] 365s 0.630 365s -0.441 365s testing: mul_mat, [2, 3, 4, 1] = [1, 2, 4, 1] * [1, 3, 4, 1] 365s x0: [1, 2, 4, 1] 365s 0.274 365s -0.144 365s 365s x1: [1, 3, 4, 1] 365s 0.426 365s -0.129 365s 0.791 365s 365s y: [2, 3, 4, 1] 365s 0.117 -0.061 365s -0.035 0.019 365s 0.216 -0.114 365s testing: mul_mat, [4, 1, 4, 3] = [2, 4, 4, 3] * [2, 1, 4, 3] 365s x0: [2, 4, 4, 3] 365s -0.564 -0.898 365s 0.642 0.118 365s -0.935 0.216 365s 0.963 0.504 365s 365s x1: [2, 1, 4, 3] 365s 0.253 0.401 365s 365s y: [4, 1, 4, 3] 365s -0.503 0.210 -0.150 0.446 365s test-mul-mat0: iter:190/500 365s testing: mul_mat, [3, 3, 1, 1] = [2, 3, 1, 1] * [2, 3, 1, 1] 365s x0: [2, 3, 1, 1] 365s -0.111 -0.991 365s -0.226 0.852 365s 0.461 0.977 365s 365s x1: [2, 3, 1, 1] 365s 0.794 -0.773 365s -0.279 -0.166 365s 0.570 0.768 365s 365s y: [3, 3, 1, 1] 365s 0.678 -0.838 -0.389 365s 0.195 -0.078 -0.291 365s -0.824 0.526 1.013 365s testing: mul_mat, [4, 3, 3, 1] = [2, 4, 3, 1] * [2, 3, 3, 1] 365s x0: [2, 4, 3, 1] 365s -0.020 0.877 365s 0.636 -0.450 365s 0.645 -0.836 365s 0.439 0.654 365s 365s x1: [2, 3, 3, 1] 365s -0.151 0.599 365s -0.064 -0.077 365s 0.198 -0.966 365s 365s y: [4, 3, 3, 1] 365s 0.528 -0.365 -0.598 0.326 365s -0.066 -0.006 0.023 -0.078 365s -0.851 0.560 0.935 -0.545 365s testing: mul_mat, [2, 4, 3, 1] = [2, 2, 3, 1] * [2, 4, 3, 1] 365s x0: [2, 2, 3, 1] 365s 0.582 -0.267 365s -0.605 -0.936 365s 365s x1: [2, 4, 3, 1] 365s -0.459 0.091 365s -0.459 0.549 365s 0.843 0.698 365s -0.649 -0.177 365s 365s y: [2, 4, 3, 1] 365s -0.292 0.193 365s -0.413 -0.236 365s 0.305 -1.164 365s -0.331 0.558 365s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 365s x0: [2, 2, 1, 1] 365s 0.148 0.381 365s -0.356 -0.977 365s 365s x1: [2, 2, 1, 1] 365s -0.495 0.195 365s 0.927 -0.508 365s 365s y: [2, 2, 1, 1] 365s 0.001 -0.014 365s -0.057 0.166 365s testing: mul_mat, [3, 2, 3, 1] = [2, 3, 3, 1] * [2, 2, 3, 1] 365s x0: [2, 3, 3, 1] 365s -0.316 0.289 365s 0.507 -0.964 365s 0.845 -0.502 365s 365s x1: [2, 2, 3, 1] 365s 0.734 0.485 365s -0.987 0.739 365s 365s y: [3, 2, 3, 1] 365s -0.092 -0.096 0.377 365s 0.525 -1.212 -1.204 365s testing: mul_mat, [2, 2, 3, 1] = [3, 2, 3, 1] * [3, 2, 3, 1] 365s x0: [3, 2, 3, 1] 365s 0.841 -0.586 0.637 365s -0.966 0.788 0.191 365s 365s x1: [3, 2, 3, 1] 365s -0.184 0.038 0.961 365s -0.445 -0.232 -0.321 365s 365s y: [2, 2, 3, 1] 365s 0.435 0.391 365s -0.443 0.186 365s test-mul-mat0: iter:191/500 365s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 365s x0: [4, 3, 1, 1] 365s -0.315 0.776 -0.235 -0.506 365s -0.109 -0.926 0.857 0.953 365s -0.801 -0.742 -0.024 -0.121 365s 365s x1: [4, 3, 1, 1] 365s 0.400 0.799 0.540 -0.489 365s 0.389 -0.394 -0.091 0.014 365s -0.829 -0.250 0.048 -0.415 365s 365s y: [3, 3, 1, 1] 365s 0.614 -0.787 -0.868 365s -0.414 0.258 -0.018 365s 0.266 -0.033 0.898 365s testing: mul_mat, [2, 3, 4, 1] = [4, 2, 4, 1] * [4, 3, 4, 1] 365s x0: [4, 2, 4, 1] 365s -0.425 0.929 0.889 0.849 365s -0.250 0.486 0.197 0.427 365s 365s x1: [4, 3, 4, 1] 365s 0.426 -0.323 0.710 -0.600 365s -0.607 -0.972 0.801 -0.809 365s 0.568 -0.689 0.580 -0.825 365s 365s y: [2, 3, 4, 1] 365s -0.359 -0.380 365s -0.619 -0.508 365s -1.067 -0.716 365s testing: mul_mat, [4, 2, 4, 3] = [4, 4, 4, 3] * [4, 2, 4, 3] 365s x0: [4, 4, 4, 3] 365s 0.056 -0.500 0.142 0.142 365s -0.778 0.981 -0.298 -0.313 365s -0.553 0.022 0.817 -0.705 365s 0.755 0.011 0.327 -0.304 365s 365s x1: [4, 2, 4, 3] 365s 0.006 -0.562 0.504 0.756 365s 0.924 -0.299 0.184 -0.039 365s 365s y: [4, 2, 4, 3] 365s 0.461 -0.944 -0.137 -0.066 365s 0.222 -1.054 -0.340 0.766 365s testing: mul_mat, [3, 4, 1, 1] = [4, 3, 1, 1] * [4, 4, 1, 1] 365s x0: [4, 3, 1, 1] 365s -0.657 -0.710 -0.410 0.135 365s 0.841 -0.261 0.024 -0.895 365s -0.871 -0.116 0.592 0.711 365s 365s x1: [4, 4, 1, 1] 365s 0.147 0.865 0.181 -0.207 365s -0.441 -0.452 -0.948 0.417 365s 0.558 0.777 0.252 -0.839 365s 0.688 0.218 0.841 0.319 365s 365s y: [3, 4, 1, 1] 365s -0.813 0.087 -0.268 365s 1.055 -0.648 0.172 365s -1.135 1.022 -1.022 365s -0.908 0.256 0.101 365s testing: mul_mat, [3, 4, 4, 1] = [3, 3, 4, 1] * [3, 4, 4, 1] 365s x0: [3, 3, 4, 1] 365s 0.404 0.439 -0.642 365s -0.079 0.151 0.972 365s 0.899 -0.033 0.058 365s 365s x1: [3, 4, 4, 1] 365s 0.001 -0.692 -0.142 365s -0.133 0.489 0.651 365s 0.426 -0.963 0.703 365s -0.158 0.595 0.481 365s 365s y: [3, 4, 4, 1] 365s -0.212 -0.243 0.016 365s -0.258 0.717 -0.098 365s -0.702 0.504 0.456 365s -0.111 0.569 -0.134 365s testing: mul_mat, [3, 3, 4, 3] = [3, 3, 4, 3] * [3, 3, 4, 3] 365s x0: [3, 3, 4, 3] 365s -ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 365s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s 0.118 -0.599 -0.714 366s 0.283 0.953 -0.466 366s 0.478 -0.039 0.201 366s 366s x1: [3, 3, 4, 3] 366s -0.233 -0.067 0.816 366s 0.825 0.801 0.743 366s 0.079 -0.109 -0.373 366s 366s y: [3, 3, 4, 3] 366s -0.515 -0.511 0.056 366s -1.108 0.651 0.512 366s 0.322 0.093 -0.033 366s test-mul-mat0: iter:192/500 366s testing: mul_mat, [4, 4, 1, 1] = [1, 4, 1, 1] * [1, 4, 1, 1] 366s x0: [1, 4, 1, 1] 366s -0.263 366s -0.579 366s -0.602 366s -0.085 366s 366s x1: [1, 4, 1, 1] 366s -0.549 366s 0.386 366s 0.891 366s -0.308 366s 366s y: [4, 4, 1, 1] 366s 0.144 0.318 0.331 0.047 366s -0.102 -0.224 -0.233 -0.033 366s -0.234 -0.516 -0.536 -0.076 366s 0.081 0.178 0.185 0.026 366s testing: mul_mat, [3, 4, 3, 1] = [1, 3, 3, 1] * [1, 4, 3, 1] 366s x0: [1, 3, 3, 1] 366s -0.934 366s 0.650 366s -0.512 366s 366s x1: [1, 4, 3, 1] 366s -0.267 366s -0.546 366s -0.842 366s 0.609 366s 366s y: [3, 4, 3, 1] 366s 0.250 -0.174 0.137 366s 0.510 -0.355 0.279 366s 0.787 -0.548 0.431 366s -0.569 0.396 -0.312 366s testing: mul_mat, [4, 3, 3, 4] = [1, 4, 3, 4] * [1, 3, 3, 4] 366s x0: [1, 4, 3, 4] 366s 0.367 366s 0.459 366s -0.848 366s -0.545 366s 366s x1: [1, 3, 3, 4] 366s 0.243 366s 0.228 366s 0.447 366s 366s y: [4, 3, 3, 4] 366s 0.089 0.111 -0.206 -0.132 366s 0.083 0.105 -0.193 -0.124 366s 0.164 0.205 -0.379 -0.244 366s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 366s x0: [1, 1, 1, 1] 366s 0.971 366s 366s x1: [1, 4, 1, 1] 366s -0.969 366s 0.744 366s -0.550 366s 0.166 366s 366s y: [1, 4, 1, 1] 366s -0.941 366s 0.722 366s -0.534 366s 0.161 366s testing: mul_mat, [1, 1, 3, 1] = [1, 1, 3, 1] * [1, 1, 3, 1] 366s x0: [1, 1, 3, 1] 366s -0.495 366s 366s x1: [1, 1, 3, 1] 366s -0.307 366s 366s y: [1, 1, 3, 1] 366s 0.152 366s testing: mul_mat, [2, 1, 3, 4] = [1, 2, 3, 4] * [1, 1, 3, 4] 366s x0: [1, 2, 3, 4] 366s 0.149 366s 0.429 366s 366s x1: [1, 1, 3, 4] 366s -0.801 366s 366s y: [2, 1, 3, 4] 366s -0.120 -0.344 366s test-mul-mat0: iter:193/500 366s testing: mul_mat, [3, 1, 1, 1] = [4, 3, 1, 1] * [4, 1, 1, 1] 366s x0: [4, 3, 1, 1] 366s 0.340 -0.178 0.519 -0.764 366s -0.025 -0.839 -0.702 0.221 366s -0.225 0.054 0.333 0.955 366s 366s x1: [4, 1, 1, 1] 366s 0.264 0.898 -0.525 -0.587 366s 366s y: [3, 1, 1, 1] 366s 0.106 -0.522 -0.746 366s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 366s x0: [4, 1, 1, 1] 366s -0.157 -0.597 -0.289 0.318 366s 366s x1: [4, 3, 1, 1] 366s 0.650 0.950 0.311 0.575 366s -0.234 0.815 0.774 0.626 366s -0.103 -0.774 0.785 0.140 366s 366s y: [1, 3, 1, 1] 366s -0.576 366s -0.474 366s 0.295 366s testing: mul_mat, [2, 1, 1, 3] = [4, 2, 1, 3] * [4, 1, 1, 3] 366s x0: [4, 2, 1, 3] 366s -0.552 0.897 0.116 0.759 366s 0.472 0.882 0.574 0.246 366s 366s x1: [4, 1, 1, 3] 366s -0.184 -0.962 -0.342 0.638 366s 366s y: [2, 1, 1, 3] 366s -0.317 -0.975 366s testing: mul_mat, [3, 2, 1, 1] = [4, 3, 1, 1] * [4, 2, 1, 1] 366s x0: [4, 3, 1, 1] 366s 0.198 0.080 -0.411 0.882 366s 0.010 -0.416 0.055 -0.334 366s 0.814 0.060 -0.468 -0.184 366s 366s x1: [4, 2, 1, 1] 366s 0.081 -0.691 -0.801 -0.085 366s -0.198 -0.555 -0.918 0.251 366s 366s y: [3, 2, 1, 1] 366s 0.215 0.273 0.415 366s 0.515 0.095 0.189 366s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 366s x0: [3, 2, 1, 1] 366s 0.976 0.237 0.797 366s -0.454 -0.942 0.579 366s 366s x1: [3, 4, 1, 1] 366s -0.981 -0.320 0.812 366s 0.473 0.511 -0.155 366s 0.586 0.980 0.435 366s -0.408 0.061 0.744 366s 366s y: [2, 4, 1, 1] 366s -0.387 1.217 366s 0.459 -0.786 366s 1.150 -0.937 366s 0.209 0.558 366s testing: mul_mat, [3, 3, 1, 3] = [2, 3, 1, 3] * [2, 3, 1, 3] 366s x0: [2, 3, 1, 3] 366s 0.282 -0.657 366s 0.893 0.637 366s -0.522 0.269 366s 366s x1: [2, 3, 1, 3] 366s 0.256 -0.194 366s 0.394 -0.664 366s 0.390 -0.546 366s 366s y: [3, 3, 1, 3] 366s 0.199 0.105 -0.186 366s 0.547 -0.071 -0.384 366s 0.469 0.000 -0.351 366s test-mul-mat0: iter:194/500 366s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 366s x0: [4, 2, 1, 1] 366s -0.173 -0.174 -0.208 -0.288 366s -0.892 -0.315 0.190 -0.549 366s 366s x1: [4, 1, 1, 1] 366s 0.732 -0.280 -0.472 0.524 366s 366s y: [2, 1, 1, 1] 366s -0.131 -0.942 366s testing: mul_mat, [2, 2, 3, 1] = [4, 2, 3, 1] * [4, 2, 3, 1] 366s x0: [4, 2, 3, 1] 366s 0.223 0.414 -0.741 0.908 366s -0.397 -0.289 -0.769 0.062 366s 366s x1: [4, 2, 3, 1] 366s -0.677 -0.541 -0.229 -0.494 366s -0.036 0.148 -0.514 0.049 366s 366s y: [2, 2, 3, 1] 366s -0.654 0.571 366s 0.479 0.370 366s testing: mul_mat, [3, 2, 3, 4] = [4, 3, 3, 4] * [4, 2, 3, 4] 366s x0: [4, 3, 3, 4] 366s -0.911 -0.347 0.129 0.126 366s 0.327 0.623 -0.013 -0.441 366s 0.897 0.002 0.159 0.689 366s 366s x1: [4, 2, 3, 4] 366s 0.812 0.539 0.029 0.137 366s 0.970 0.731 0.288 0.193 366s 366s y: [3, 2, 3, 4] 366s -0.906 0.541 0.828 366s -1.076 0.684 1.050 366s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 366s x0: [4, 3, 1, 1] 366s 0.402 -0.482 -0.021 0.773 366s -0.011 0.125 -0.945 -ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s 0.652 366s -0.122 -0.940 0.784 0.865 366s 366s x1: [4, 3, 1, 1] 366s -0.718 -0.957 -0.817 0.928 366s 0.262 -0.643 0.082 -0.321 366s 0.551 -0.877 0.264 0.472 366s 366s y: [3, 3, 1, 1] 366s 0.908 0.056 1.150 366s 0.165 0.048 0.359 366s 1.003 -0.673 1.372 366s testing: mul_mat, [1, 4, 3, 1] = [3, 1, 3, 1] * [3, 4, 3, 1] 366s x0: [3, 1, 3, 1] 366s -0.520 -0.886 0.531 366s 366s x1: [3, 4, 3, 1] 366s 0.891 0.967 -0.960 366s 0.038 -0.689 0.402 366s 0.320 -0.646 0.586 366s 0.248 0.615 0.943 366s 366s y: [1, 4, 3, 1] 366s -1.829 366s 0.805 366s 0.717 366s -0.174 366s testing: mul_mat, [3, 3, 3, 4] = [1, 3, 3, 4] * [1, 3, 3, 4] 366s x0: [1, 3, 3, 4] 366s -0.579 366s -0.183 366s -0.532 366s 366s x1: [1, 3, 3, 4] 366s -0.489 366s 0.584 366s 0.934 366s 366s y: [3, 3, 3, 4] 366s 0.283 0.090 0.260 366s -0.338 -0.107 -0.311 366s -0.541 -0.171 -0.497 366s test-mul-mat0: iter:195/500 366s testing: mul_mat, [3, 4, 1, 1] = [2, 3, 1, 1] * [2, 4, 1, 1] 366s x0: [2, 3, 1, 1] 366s 0.492 0.907 366s 0.412 0.317 366s 0.694 0.611 366s 366s x1: [2, 4, 1, 1] 366s -0.689 -0.784 366s -0.006 -0.508 366s 0.061 -0.019 366s -0.625 -0.618 366s 366s y: [3, 4, 1, 1] 366s -1.050 -0.533 -0.958 366s -0.464 -0.164 -0.315 366s 0.013 0.019 0.031 366s -0.868 -0.453 -0.811 366s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 366s x0: [2, 2, 1, 1] 366s -0.804 0.351 366s 0.610 -0.126 366s 366s x1: [2, 3, 1, 1] 366s -0.183 -0.746 366s -0.942 0.926 366s -0.986 0.296 366s 366s y: [2, 3, 1, 1] 366s -0.115 -0.018 366s 1.083 -0.691 366s 0.896 -0.639 366s testing: mul_mat, [4, 2, 1, 3] = [2, 4, 1, 3] * [2, 2, 1, 3] 366s x0: [2, 4, 1, 3] 366s -0.089 0.431 366s 0.640 -0.677 366s -0.252 0.334 366s 0.935 0.565 366s 366s x1: [2, 2, 1, 3] 366s -0.295 0.257 366s -0.239 -0.927 366s 366s y: [4, 2, 1, 3] 366s 0.137 -0.362 0.160 -0.131 366s -0.379 0.475 -0.249 -0.747 366s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 366s x0: [2, 4, 1, 1] 366s -0.572 -0.793 366s -0.728 0.753 366s -0.811 0.350 366s 0.762 -0.800 366s 366s x1: [2, 4, 1, 1] 366s 0.706 -0.996 366s -0.930 0.056 366s 0.038 -0.991 366s -0.211 0.949 366s 366s y: [4, 4, 1, 1] 366s 0.386 -1.264 -0.921 1.335 366s 0.487 0.719 0.775 -0.754 366s 0.764 -0.775 -0.378 0.822 366s -0.632 0.869 0.503 -0.921 366s testing: mul_mat, [2, 2, 1, 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 366s x0: [4, 2, 1, 1] 366s 0.798 -0.110 0.363 0.754 366s -0.143 0.749 -0.404 0.433 366s 366s x1: [4, 2, 1, 1] 366s 0.244 -0.048 0.488 0.270 366s -0.250 -0.872 -0.095 0.423 366s 366s y: [2, 2, 1, 1] 366s 0.581 -0.151 366s 0.181 -0.397 366s testing: mul_mat, [2, 4, 1, 3] = [2, 2, 1, 3] * [2, 4, 1, 3] 366s x0: [2, 2, 1, 3] 366s 0.399 0.326 366s 0.791 0.995 366s 366s x1: [2, 4, 1, 3] 366s 0.653 -0.208 366s 0.442 -0.559 366s -0.259 -0.118 366s -0.130 0.014 366s 366s y: [2, 4, 1, 3] 366s 0.192 0.309 366s -0.006 -0.206 366s -0.142 -0.322 366s -0.047 -0.089 366s test-mul-mat0: iter:196/500 366s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 366s x0: [2, 2, 1, 1] 366s -0.323 -0.371 366s 0.294 0.186 366s 366s x1: [2, 4, 1, 1] 366s 0.312 -0.022 366s 0.572 -0.268 366s 0.046 -0.495 366s 0.641 -0.615 366s 366s y: [2, 4, 1, 1] 366s -0.093 0.088 366s -0.085 0.119 366s 0.169 -0.078 366s 0.021 0.074 366s testing: mul_mat, [4, 2, 2, 1] = [2, 4, 2, 1] * [2, 2, 2, 1] 366s x0: [2, 4, 2, 1] 366s -0.658 0.544 366s -0.683 0.419 366s -0.373 0.276 366s 0.510 -0.219 366s 366s x1: [2, 2, 2, 1] 366s -0.330 0.257 366s -0.415 -0.539 366s 366s y: [4, 2, 2, 1] 366s 0.357 0.333 0.194 -0.225 366s -0.020 0.058 0.006 -0.094 366s testing: mul_mat, [1, 4, 2, 2] = [2, 1, 2, 2] * [2, 4, 2, 2] 366s x0: [2, 1, 2, 2] 366s 0.666 -0.866 366s 366s x1: [2, 4, 2, 2] 366s 0.190 0.133 366s 0.871 0.818 366s -0.573 0.057 366s -0.511 0.685 366s 366s y: [1, 4, 2, 2] 366s 0.011 366s -0.129 366s -0.430 366s -0.933 366s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 366s x0: [2, 4, 1, 1] 366s 0.415 0.798 366s 0.033 -0.461 366s 0.824 -0.649 366s -0.430 0.839 366s 366s x1: [2, 1, 1, 1] 366s 0.804 -0.415 366s 366s y: [4, 1, 1, 1] 366s 0.002 0.218 0.931 -0.694 366s testing: mul_mat, [2, 2, 2, 1] = [4, 2, 2, 1] * [4, 2, 2, 1] 366s x0: [4, 2, 2, 1] 366s 0.192 0.005 0.544 -0.590 366s -0.041 -0.005 0.532 -0.424 366s 366s x1: [4, 2, 2, 1] 366s -0.235 -0.086 -0.254 0.812 366s 0.235 -0.908 -0.588 -0.639 366s 366s y: [2, 2, 2, 1] 366s -0.663 -0.469 366s 0.098 -0.047 366s testing: mul_mat, [1, 4, 2, 2] = [2, 1, 2, 2] * [2, 4, 2, 2] 366s x0: [2, 1, 2, 2] 366s 0.526 0.234 366s 366s x1: [2, 4, 2, 2] 366s -0.041 -0.528 366s 0.479 0.051 366s -0.116 0.840 366s 0.433 0.091 366s 366s y: [1, 4, 2, 2] 366s -0.146 366s 0.264 366s 0.136 366s 0.249 366s test-mul-mat0: iter:197/500 366s testing: mul_mat, [2, 2, 1, 1] = [1, 2, 1, 1] * [1, 2, 1, 1] 366s x0: [1, 2, 1, 1] 366s 0.733 366s 0.970 366s 366s x1: [1, 2, 1, 1] 366s 0.968 366s -0.112 366s 366s y: [2, 2, 1, 1] 366s 0.709 0.939 366s -0.082 -0.108 366s testing: mul_mat, [2ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s , 2, 4, 1] = [1, 2, 4, 1] * [1, 2, 4, 1] 366s x0: [1, 2, 4, 1] 366s 0.634 366s 0.439 366s 366s x1: [1, 2, 4, 1] 366s -0.242 366s 0.030 366s 366s y: [2, 2, 4, 1] 366s -0.153 -0.106 366s 0.019 0.013 366s testing: mul_mat, [4, 2, 4, 1] = [1, 4, 4, 1] * [1, 2, 4, 1] 366s x0: [1, 4, 4, 1] 366s 0.732 366s 0.316 366s -0.168 366s 0.464 366s 366s x1: [1, 2, 4, 1] 366s -0.326 366s 0.720 366s 366s y: [4, 2, 4, 1] 366s -0.239 -0.103 0.055 -0.151 366s 0.527 0.227 -0.121 0.334 366s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 366s x0: [1, 2, 1, 1] 366s -0.352 366s -0.817 366s 366s x1: [1, 4, 1, 1] 366s -0.799 366s -0.152 366s -0.035 366s 0.509 366s 366s y: [2, 4, 1, 1] 366s 0.281 0.653 366s 0.053 0.124 366s 0.012 0.028 366s -0.179 -0.416 366s testing: mul_mat, [4, 1, 4, 1] = [2, 4, 4, 1] * [2, 1, 4, 1] 366s x0: [2, 4, 4, 1] 366s -0.011 -0.500 366s -0.090 -0.808 366s 0.698 0.024 366s -0.725 0.487 366s 366s x1: [2, 1, 4, 1] 366s 0.620 0.986 366s 366s y: [4, 1, 4, 1] 366s -0.499 -0.852 0.457 0.031 366s testing: mul_mat, [1, 2, 4, 1] = [4, 1, 4, 1] * [4, 2, 4, 1] 366s x0: [4, 1, 4, 1] 366s 0.039 -0.589 0.585 -0.469 366s 366s x1: [4, 2, 4, 1] 366s 0.153 0.191 -0.532 0.653 366s 0.384 0.492 0.140 0.753 366s 366s y: [1, 2, 4, 1] 366s -0.724 366s -0.546 366s test-mul-mat0: iter:198/500 366s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 366s x0: [3, 2, 1, 1] 366s -0.327 -0.621 -0.928 366s -0.742 -0.090 0.623 366s 366s x1: [3, 3, 1, 1] 366s 0.933 0.944 -0.608 366s 0.296 0.461 -0.230 366s -0.098 -0.529 0.341 366s 366s y: [2, 3, 1, 1] 366s -0.327 -1.156 366s -0.170 -0.404 366s 0.044 0.332 366s testing: mul_mat, [3, 2, 1, 1] = [3, 3, 1, 1] * [3, 2, 1, 1] 366s x0: [3, 3, 1, 1] 366s -0.606 -0.080 -0.760 366s -0.421 -0.508 -0.025 366s -0.735 0.709 -0.091 366s 366s x1: [3, 2, 1, 1] 366s 0.596 -0.360 0.198 366s 0.463 0.693 -0.291 366s 366s y: [3, 2, 1, 1] 366s -0.482 -0.073 -0.711 366s -0.115 -0.540 0.177 366s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 366s x0: [3, 4, 1, 1] 366s -0.409 -0.160 0.074 366s 0.502 -0.537 -0.330 366s -0.858 0.660 -0.867 366s 0.834 -0.631 -0.385 366s 366s x1: [3, 3, 1, 1] 366s -0.791 -0.899 -0.795 366s 0.671 -0.129 0.107 366s -0.858 -0.788 -0.232 366s 366s y: [4, 3, 1, 1] 366s 0.409 0.349 0.775 0.214 366s -0.246 0.370 -0.754 0.599 366s 0.460 0.070 0.417 -0.129 366s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 366s x0: [3, 3, 1, 1] 366s -0.538 -0.326 0.265 366s -0.948 -0.180 0.659 366s 0.095 0.910 -0.017 366s 366s x1: [3, 4, 1, 1] 366s -0.772 0.289 -0.145 366s -0.193 0.781 0.830 366s 0.072 0.490 -0.261 366s 0.282 0.591 -0.055 366s 366s y: [3, 4, 1, 1] 366s 0.282 0.584 0.192 366s 0.069 0.590 0.678 366s -0.268 -0.329 0.457 366s -0.359 -0.410 0.565 366s testing: mul_mat, [1, 3, 1, 1] = [3, 1, 1, 1] * [3, 3, 1, 1] 366s x0: [3, 1, 1, 1] 366s 0.440 0.255 0.778 366s 366s x1: [3, 3, 1, 1] 366s -0.233 -0.878 0.653 366s -0.092 0.782 0.786 366s -0.257 -0.849 -0.600 366s 366s y: [1, 3, 1, 1] 366s 0.181 366s 0.771 366s -0.797 366s testing: mul_mat, [1, 3, 1, 1] = [1, 1, 1, 1] * [1, 3, 1, 1] 366s x0: [1, 1, 1, 1] 366s 0.825 366s 366s x1: [1, 3, 1, 1] 366s 0.221 366s 0.086 366s -0.150 366s 366s y: [1, 3, 1, 1] 366s 0.182 366s 0.071 366s -0.123 366s test-mul-mat0: iter:199/500 366s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 366s x0: [3, 4, 1, 1] 366s 0.205 -0.489 0.121 366s 0.356 -0.089 -0.908 366s -0.204 -0.834 0.870 366s -0.983 0.252 -0.280 366s 366s x1: [3, 4, 1, 1] 366s -0.236 -0.178 -0.821 366s 0.438 0.641 -0.911 366s -0.297 0.301 0.072 366s 0.470 0.423 -0.275 366s 366s y: [4, 4, 1, 1] 366s -0.060 0.678 -0.517 0.417 366s -0.334 0.927 -1.417 -0.014 366s -0.199 -0.198 -0.128 0.348 366s -0.144 0.380 -0.688 -0.278 366s testing: mul_mat, [2, 4, 2, 1] = [3, 2, 2, 1] * [3, 4, 2, 1] 366s x0: [3, 2, 2, 1] 366s -0.610 0.589 -0.135 366s -0.593 -0.159 0.585 366s 366s x1: [3, 4, 2, 1] 366s -0.273 0.078 -0.148 366s 0.029 0.847 0.937 366s 0.793 -0.331 -0.885 366s 0.232 -0.690 -0.796 366s 366s y: [2, 4, 2, 1] 366s 0.232 0.063 366s 0.354 0.396 366s -0.559 -0.935 366s -0.440 -0.493 366s testing: mul_mat, [1, 2, 2, 1] = [3, 1, 2, 1] * [3, 2, 2, 1] 366s x0: [3, 1, 2, 1] 366s 0.629 -0.519 0.213 366s 366s x1: [3, 2, 2, 1] 366s 0.957 -0.565 -0.511 366s 0.188 -0.255 -0.307 366s 366s y: [1, 2, 2, 1] 366s 0.786 366s 0.185 366s testing: mul_mat, [3, 1, 1, 1] = [3, 3, 1, 1] * [3, 1, 1, 1] 366s x0: [3, 3, 1, 1] 366s -0.666 -0.747 -0.981 366s -0.342 0.095 0.468 366s 0.155 0.318 0.275 366s 366s x1: [3, 1, 1, 1] 366s 0.614 0.493 0.073 366s 366s y: [3, 1, 1, 1] 366s -0.849 -0.129 0.272 366s testing: mul_mat, [1, 3, 2, 1] = [3, 1, 2, 1] * [3, 3, 2, 1] 366s x0: [3, 1, 2, 1] 366s -0.534 -0.082 0.433 366s 366s x1: [3, 3, 2, 1] 366s -0.546 0.957 -0.537 366s 0.199 -0.350 0.586 366s 0.556 -0.381 -0.887 366s 366s y: [1, 3, 2, 1] 366s -0.019 366s 0.176 366s -0.650 366s testing: mul_mat, [3,ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s 3, 2, 1] = [1, 3, 2, 1] * [1, 3, 2, 1] 366s x0: [1, 3, 2, 1] 366s -0.904 366s 0.644 366s -0.276 366s 366s x1: [1, 3, 2, 1] 366s 0.053 366s -0.329 366s 0.906 366s 366s y: [3, 3, 2, 1] 366s -0.047 0.034 -0.014 366s 0.297 -0.212 0.091 366s -0.819 0.583 -0.250 366s test-mul-mat0: iter:200/500 366s testing: mul_mat, [3, 2, 1, 1] = [1, 3, 1, 1] * [1, 2, 1, 1] 366s x0: [1, 3, 1, 1] 366s 0.403 366s 0.600 366s 0.998 366s 366s x1: [1, 2, 1, 1] 366s 0.766 366s 0.802 366s 366s y: [3, 2, 1, 1] 366s 0.308 0.460 0.764 366s 0.323 0.481 0.800 366s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 366s x0: [1, 3, 1, 1] 366s -0.554 366s 0.264 366s 0.743 366s 366s x1: [1, 3, 1, 1] 366s 0.248 366s 0.013 366s -0.536 366s 366s y: [3, 3, 1, 1] 366s -0.137 0.065 0.184 366s -0.007 0.003 0.010 366s 0.297 -0.141 -0.398 366s testing: mul_mat, [2, 3, 1, 4] = [1, 2, 1, 4] * [1, 3, 1, 4] 366s x0: [1, 2, 1, 4] 366s 0.917 366s -0.239 366s 366s x1: [1, 3, 1, 4] 366s 0.034 366s -0.684 366s -0.586 366s 366s y: [2, 3, 1, 4] 366s 0.031 -0.008 366s -0.627 0.163 366s -0.538 0.140 366s testing: mul_mat, [4, 2, 1, 1] = [1, 4, 1, 1] * [1, 2, 1, 1] 366s x0: [1, 4, 1, 1] 366s -0.247 366s 0.558 366s -0.798 366s -0.081 366s 366s x1: [1, 2, 1, 1] 366s 0.505 366s -0.455 366s 366s y: [4, 2, 1, 1] 366s -0.125 0.282 -0.403 -0.041 366s 0.112 -0.254 0.363 0.037 366s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 366s x0: [4, 2, 1, 1] 366s 0.075 0.170 -0.900 -0.723 366s 0.979 0.628 -0.917 0.864 366s 366s x1: [4, 1, 1, 1] 366s -0.995 0.466 -0.338 0.039 366s 366s y: [2, 1, 1, 1] 366s 0.281 -0.338 366s testing: mul_mat, [3, 4, 1, 4] = [2, 3, 1, 4] * [2, 4, 1, 4] 366s x0: [2, 3, 1, 4] 366s 0.673 -0.134 366s -0.309 0.471 366s -0.777 0.374 366s 366s x1: [2, 4, 1, 4] 366s 0.720 0.673 366s -0.142 0.637 366s -0.565 0.194 366s -0.219 0.151 366s 366s y: [3, 4, 1, 4] 366s 0.394 0.095 -0.308 366s -0.181 0.344 0.349 366s -0.406 0.266 0.512 366s -0.168 0.139 0.227 366s test-mul-mat0: iter:201/500 366s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 366s x0: [2, 1, 1, 1] 366s 0.901 0.674 366s 366s x1: [2, 3, 1, 1] 366s -0.380 0.739 366s -0.923 -0.707 366s -0.570 -0.700 366s 366s y: [1, 3, 1, 1] 366s 0.156 366s -1.309 366s -0.986 366s testing: mul_mat, [3, 1, 3, 1] = [2, 3, 3, 1] * [2, 1, 3, 1] 366s x0: [2, 3, 3, 1] 366s 0.863 -0.444 366s 0.795 -0.007 366s -0.422 0.921 366s 366s x1: [2, 1, 3, 1] 366s 0.322 0.098 366s 366s y: [3, 1, 3, 1] 366s 0.234 0.255 -0.046 366s testing: mul_mat, [3, 3, 3, 1] = [2, 3, 3, 1] * [2, 3, 3, 1] 366s x0: [2, 3, 3, 1] 366s -0.697 0.352 366s -0.888 -0.847 366s 0.384 -0.459 366s 366s x1: [2, 3, 3, 1] 366s -0.431 -0.114 366s 0.094 0.729 366s -0.213 -0.233 366s 366s y: [3, 3, 3, 1] 366s 0.260 0.479 -0.113 366s 0.191 -0.700 -0.299 366s 0.066 0.386 0.025 366s testing: mul_mat, [4, 3, 1, 1] = [2, 4, 1, 1] * [2, 3, 1, 1] 366s x0: [2, 4, 1, 1] 366s -0.120 -0.078 366s -0.750 -0.296 366s -0.152 -0.510 366s -0.813 -0.726 366s 366s x1: [2, 3, 1, 1] 366s 0.434 -0.884 366s -0.382 -0.229 366s -0.433 -0.844 366s 366s y: [4, 3, 1, 1] 366s 0.017 -0.064 0.385 0.289 366s 0.063 0.354 0.175 0.477 366s 0.118 0.575 0.496 0.965 366s testing: mul_mat, [4, 2, 3, 1] = [4, 4, 3, 1] * [4, 2, 3, 1] 366s x0: [4, 4, 3, 1] 366s 0.543 -0.769 0.601 0.035 366s 0.740 0.943 -0.580 0.867 366s 0.669 0.063 0.328 0.678 366s -0.753 -0.125 0.361 0.701 366s 366s x1: [4, 2, 3, 1] 366s -0.184 -0.356 0.658 0.357 366s -0.547 0.759 -0.385 -0.461 366s 366s y: [4, 2, 3, 1] 366s 0.583 -0.544 0.312 0.671 366s -1.129 0.135 -0.756 -0.146 366s testing: mul_mat, [2, 4, 3, 1] = [4, 2, 3, 1] * [4, 4, 3, 1] 366s x0: [4, 2, 3, 1] 366s 0.449 0.787 -0.375 0.935 366s -0.697 -0.275 -0.528 -0.656 366s 366s x1: [4, 4, 3, 1] 366s -0.488 0.651 0.500 0.940 366s -0.113 0.150 0.474 -0.535 366s -0.416 0.680 -0.460 -0.288 366s 0.440 0.362 -0.616 -0.060 366s 366s y: [2, 4, 3, 1] 366s 0.984 -0.719 366s -0.610 0.138 366s 0.252 0.535 366s 0.657 -0.041 366s test-mul-mat0: iter:202/500 366s testing: mul_mat, [2, 2, 1, 1] = [1, 2, 1, 1] * [1, 2, 1, 1] 366s x0: [1, 2, 1, 1] 366s -0.135 366s 0.113 366s 366s x1: [1, 2, 1, 1] 366s 0.416 366s -0.190 366s 366s y: [2, 2, 1, 1] 366s -0.056 0.047 366s 0.026 -0.021 366s testing: mul_mat, [4, 2, 4, 1] = [1, 4, 4, 1] * [1, 2, 4, 1] 366s x0: [1, 4, 4, 1] 366s 0.039 366s 0.286 366s 0.659 366s -0.196 366s 366s x1: [1, 2, 4, 1] 366s -0.879 366s 0.590 366s 366s y: [4, 2, 4, 1] 366s -0.035 -0.252 -0.579 0.172 366s 0.023 0.169 0.389 -0.116 366s testing: mul_mat, [2, 4, 4, 1] = [1, 2, 4, 1] * [1, 4, 4, 1] 366s x0: [1, 2, 4, 1] 366s -0.766 366s -0.469 366s 366s x1: [1, 4, 4, 1] 366s 0.454 366s -0.399 366s -0.576 366s 0.789 366s 366s y: [2, 4, 4, 1] 366s -0.348 -0.213 366s 0.306 0.187 366s 0.442 0.270 366s -0.604 -0.370 366s testing: mul_mat, [3, 2, 1, 1] = [1, 3, 1, 1] * [1, 2, 1, 1] 366s x0: [1, 3, 1, 1] 366s 0.609 366s 0.054 366s 0.526 366s 366s x1: [1, 2, 1, 1] 366s 0.985 366s 0.869 366s 366s y: [3, 2, 1, 1] 366s 0.600 0.053 0.5ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s 18 366s 0.529 0.047 0.457 366s testing: mul_mat, [3, 1, 4, 1] = [3, 3, 4, 1] * [3, 1, 4, 1] 366s x0: [3, 3, 4, 1] 366s 0.810 -0.851 0.960 366s -0.308 -0.074 -0.497 366s 0.677 -0.792 0.106 366s 366s x1: [3, 1, 4, 1] 366s 0.063 0.655 0.949 366s 366s y: [3, 1, 4, 1] 366s 0.405 -0.540 -0.376 366s testing: mul_mat, [1, 3, 4, 1] = [3, 1, 4, 1] * [3, 3, 4, 1] 366s x0: [3, 1, 4, 1] 366s -0.498 -0.896 -0.462 366s 366s x1: [3, 3, 4, 1] 366s 0.319 -0.620 -0.933 366s -0.575 -0.668 0.893 366s -0.379 -0.731 0.589 366s 366s y: [1, 3, 4, 1] 366s 0.827 366s 0.473 366s 0.572 366s test-mul-mat0: iter:203/500 366s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 366s x0: [4, 1, 1, 1] 366s -0.454 0.622 0.645 -0.573 366s 366s x1: [4, 2, 1, 1] 366s 0.928 -0.708 -0.825 -0.047 366s -0.005 0.600 0.090 -0.881 366s 366s y: [1, 2, 1, 1] 366s -1.367 366s 0.939 366s testing: mul_mat, [1, 1, 3, 1] = [4, 1, 3, 1] * [4, 1, 3, 1] 366s x0: [4, 1, 3, 1] 366s 0.517 0.834 -0.609 0.376 366s 366s x1: [4, 1, 3, 1] 366s 0.458 -0.406 -0.072 0.562 366s 366s y: [1, 1, 3, 1] 366s 0.153 366s testing: mul_mat, [1, 1, 3, 4] = [4, 1, 3, 4] * [4, 1, 3, 4] 366s x0: [4, 1, 3, 4] 366s -0.285 -0.832 -0.016 0.893 366s 366s x1: [4, 1, 3, 4] 366s -0.218 0.585 -0.091 -0.595 366s 366s y: [1, 1, 3, 4] 366s -0.955 366s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 366s x0: [4, 1, 1, 1] 366s 0.279 -0.453 -0.245 0.859 366s 366s x1: [4, 1, 1, 1] 366s 0.885 0.999 0.896 0.965 366s 366s y: [1, 1, 1, 1] 366s 0.405 366s testing: mul_mat, [3, 4, 3, 1] = [1, 3, 3, 1] * [1, 4, 3, 1] 366s x0: [1, 3, 3, 1] 366s -0.903 366s 0.884 366s -0.001 366s 366s x1: [1, 4, 3, 1] 366s -0.404 366s 0.398 366s -0.632 366s -0.296 366s 366s y: [3, 4, 3, 1] 366s 0.365 -0.357 0.000 366s -0.360 0.352 -0.000 366s 0.571 -0.559 0.001 366s 0.267 -0.262 0.000 366s testing: mul_mat, [2, 1, 3, 4] = [3, 2, 3, 4] * [3, 1, 3, 4] 366s x0: [3, 2, 3, 4] 366s 0.129 0.432 -0.358 366s 0.009 -0.884 0.217 366s 366s x1: [3, 1, 3, 4] 366s 0.878 0.133 -0.721 366s 366s y: [2, 1, 3, 4] 366s 0.429 -0.266 366s test-mul-mat0: iter:204/500 366s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 366s x0: [3, 2, 1, 1] 366s 0.768 0.526 0.258 366s 0.513 0.877 -0.650 366s 366s x1: [3, 4, 1, 1] 366s -0.410 -0.473 0.347 366s 0.603 0.658 0.814 366s -0.187 -0.924 0.428 366s -0.618 0.961 0.086 366s 366s y: [2, 4, 1, 1] 366s -0.473 -0.851 366s 1.019 0.357 366s -0.519 -1.185 366s 0.052 0.470 366s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 366s x0: [3, 4, 1, 1] 366s -0.006 -0.441 0.959 366s 0.883 0.570 -0.451 366s -0.589 -0.083 -0.848 366s -0.932 -0.269 -0.035 366s 366s x1: [3, 2, 1, 1] 366s -0.372 -0.743 0.522 366s -0.670 -0.530 -0.218 366s 366s y: [4, 2, 1, 1] 366s 0.830 -0.987 -0.161 0.528 366s 0.029 -0.796 0.623 0.775 366s testing: mul_mat, [1, 4, 1, 4] = [3, 1, 1, 4] * [3, 4, 1, 4] 366s x0: [3, 1, 1, 4] 366s -0.845 -0.890 -0.379 366s 366s x1: [3, 4, 1, 4] 366s -0.855 -0.841 0.347 366s -0.895 0.245 -0.048 366s 0.873 -0.230 -0.790 366s 0.386 -0.352 -0.440 366s 366s y: [1, 4, 1, 4] 366s 1.341 366s 0.557 366s -0.234 366s 0.154 366s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 366s x0: [3, 1, 1, 1] 366s 0.609 0.011 0.150 366s 366s x1: [3, 1, 1, 1] 366s 0.138 -0.371 -0.034 366s 366s y: [1, 1, 1, 1] 366s 0.075 366s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 366s x0: [1, 3, 1, 1] 366s 0.289 366s 0.095 366s -0.044 366s 366s x1: [1, 3, 1, 1] 366s -0.538 366s 0.826 366s -0.158 366s 366s y: [3, 3, 1, 1] 366s -0.155 -0.051 0.024 366s 0.239 0.078 -0.036 366s -0.046 -0.015 0.007 366s testing: mul_mat, [4, 1, 1, 4] = [3, 4, 1, 4] * [3, 1, 1, 4] 366s x0: [3, 4, 1, 4] 366s -0.614 -0.792 -0.118 366s 0.070 0.273 -0.472 366s -0.355 0.517 0.026 366s -0.449 0.876 0.345 366s 366s x1: [3, 1, 1, 4] 366s 0.656 0.429 0.554 366s 366s y: [4, 1, 1, 4] 366s -0.808 -0.098 0.003 0.272 366s test-mul-mat0: iter:205/500 366s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 366s x0: [1, 2, 1, 1] 366s -0.020 366s -0.692 366s 366s x1: [1, 1, 1, 1] 366s -0.896 366s 366s y: [2, 1, 1, 1] 366s 0.018 0.620 366s testing: mul_mat, [2, 2, 3, 1] = [1, 2, 3, 1] * [1, 2, 3, 1] 366s x0: [1, 2, 3, 1] 366s 0.213 366s -0.050 366s 366s x1: [1, 2, 3, 1] 366s 0.449 366s -0.376 366s 366s y: [2, 2, 3, 1] 366s 0.096 -0.022 366s -0.080 0.019 366s testing: mul_mat, [1, 2, 3, 2] = [1, 1, 3, 2] * [1, 2, 3, 2] 366s x0: [1, 1, 3, 2] 366s -0.269 366s 366s x1: [1, 2, 3, 2] 366s 0.164 366s 0.594 366s 366s y: [1, 2, 3, 2] 366s -0.044 366s -0.160 366s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 366s x0: [1, 2, 1, 1] 366s -0.947 366s 0.718 366s 366s x1: [1, 1, 1, 1] 366s -0.666 366s 366s y: [2, 1, 1, 1] 366s 0.630 -0.478 366s testing: mul_mat, [1, 1, 3, 1] = [2, 1, 3, 1] * [2, 1, 3, 1] 366s x0: [2, 1, 3, 1] 366s -0.963 0.345 366s 366s x1: [2, 1, 3, 1] 366s 0.961 -0.856 366s 366s y: [1, 1, 3, 1] 366s -1.221 366s testing: mul_mat, [4, 2, 3, 2] = [1, 4, 3, 2ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ] * [1, 2, 3, 2] 366s x0: [1, 4, 3, 2] 366s 0.392 366s 0.688 366s 0.227 366s 0.445 366s 366s x1: [1, 2, 3, 2] 366s -0.805 366s -0.640 366s 366s y: [4, 2, 3, 2] 366s -0.315 -0.554 -0.183 -0.358 366s -0.251 -0.440 -0.145 -0.285 366s test-mul-mat0: iter:206/500 366s testing: mul_mat, [1, 2, 1, 1] = [2, 1, 1, 1] * [2, 2, 1, 1] 366s x0: [2, 1, 1, 1] 366s 0.267 0.356 366s 366s x1: [2, 2, 1, 1] 366s -0.481 0.022 366s -0.961 0.911 366s 366s y: [1, 2, 1, 1] 366s -0.120 366s 0.068 366s testing: mul_mat, [3, 1, 2, 1] = [2, 3, 2, 1] * [2, 1, 2, 1] 366s x0: [2, 3, 2, 1] 366s -0.167 -0.030 366s 0.394 -0.242 366s -0.167 -0.430 366s 366s x1: [2, 1, 2, 1] 366s -0.883 -0.546 366s 366s y: [3, 1, 2, 1] 366s 0.164 -0.215 0.382 366s testing: mul_mat, [3, 3, 2, 3] = [2, 3, 2, 3] * [2, 3, 2, 3] 366s x0: [2, 3, 2, 3] 366s 0.709 -0.015 366s -0.234 0.426 366s -0.449 -0.910 366s 366s x1: [2, 3, 2, 3] 366s 0.189 -0.467 366s 0.923 -0.752 366s 0.136 0.591 366s 366s y: [3, 3, 2, 3] 366s 0.141 -0.243 0.340 366s 0.665 -0.536 0.271 366s 0.088 0.220 -0.599 366s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 366s x0: [2, 1, 1, 1] 366s 0.163 0.403 366s 366s x1: [2, 3, 1, 1] 366s -0.872 0.476 366s -0.979 0.658 366s 0.324 0.362 366s 366s y: [1, 3, 1, 1] 366s 0.050 366s 0.106 366s 0.199 366s testing: mul_mat, [3, 2, 2, 1] = [1, 3, 2, 1] * [1, 2, 2, 1] 366s x0: [1, 3, 2, 1] 366s -0.575 366s 0.183 366s 0.485 366s 366s x1: [1, 2, 2, 1] 366s 0.370 366s -0.613 366s 366s y: [3, 2, 2, 1] 366s -0.213 0.068 0.179 366s 0.353 -0.112 -0.297 366s testing: mul_mat, [2, 1, 2, 3] = [3, 2, 2, 3] * [3, 1, 2, 3] 366s x0: [3, 2, 2, 3] 366s -0.305 0.432 -0.934 366s -0.289 0.596 -0.002 366s 366s x1: [3, 1, 2, 3] 366s -0.557 -0.647 0.163 366s 366s y: [2, 1, 2, 3] 366s -0.261 -0.226 366s test-mul-mat0: iter:207/500 366s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 366s x0: [3, 2, 1, 1] 366s -0.347 0.210 -0.216 366s 0.650 0.806 0.261 366s 366s x1: [3, 2, 1, 1] 366s 0.002 -0.256 0.399 366s -0.992 0.016 0.087 366s 366s y: [2, 2, 1, 1] 366s -0.141 -0.101 366s 0.329 -0.609 366s testing: mul_mat, [4, 2, 3, 1] = [3, 4, 3, 1] * [3, 2, 3, 1] 366s x0: [3, 4, 3, 1] 366s 0.907 0.234 -0.792 366s -0.076 -0.678 0.597 366s 0.577 0.532 -0.620 366s 0.227 0.338 0.641 366s 366s x1: [3, 2, 3, 1] 366s -0.444 0.003 -0.704 366s 0.391 -0.541 0.213 366s 366s y: [4, 2, 3, 1] 366s 0.155 -0.388 0.182 -0.552 366s 0.060 0.464 -0.194 0.042 366s testing: mul_mat, [3, 4, 3, 3] = [3, 3, 3, 3] * [3, 4, 3, 3] 366s x0: [3, 3, 3, 3] 366s -0.186 0.842 0.630 366s -0.601 -0.028 -0.540 366s -0.583 -0.918 0.483 366s 366s x1: [3, 4, 3, 3] 366s -0.009 -0.269 -0.223 366s 0.372 0.958 -0.885 366s 0.013 0.741 -0.544 366s -0.050 -0.085 0.256 366s 366s y: [3, 4, 3, 3] 366s -0.365 0.133 0.144 366s 0.181 0.227 -1.523 366s 0.279 0.265 -0.950 366s 0.099 -0.105 0.231 366s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 366s x0: [3, 3, 1, 1] 366s 0.685 -0.880 0.335 366s -0.078 0.176 0.124 366s -0.979 -0.075 0.898 366s 366s x1: [3, 3, 1, 1] 366s 0.844 -0.366 -0.465 366s 0.211 -0.596 -0.518 366s -0.230 0.061 0.901 366s 366s y: [3, 3, 1, 1] 366s 0.744 -0.188 -1.216 366s 0.495 -0.186 -0.627 366s 0.091 0.140 1.030 366s testing: mul_mat, [2, 3, 3, 1] = [3, 2, 3, 1] * [3, 3, 3, 1] 366s x0: [3, 2, 3, 1] 366s 0.546 0.844 -0.960 366s 0.793 -0.039 -0.745 366s 366s x1: [3, 3, 3, 1] 366s 0.415 -0.752 -0.588 366s 0.377 0.028 -0.979 366s -0.995 0.949 -0.612 366s 366s y: [2, 3, 3, 1] 366s 0.156 0.796 366s 1.169 1.027 366s 0.845 -0.370 366s testing: mul_mat, [4, 3, 3, 3] = [2, 4, 3, 3] * [2, 3, 3, 3] 366s x0: [2, 4, 3, 3] 366s 0.379 -0.683 366s 0.068 0.194 366s 0.783 0.893 366s 0.384 -0.978 366s 366s x1: [2, 3, 3, 3] 366s -0.545 0.609 366s 0.173 -0.315 366s -0.985 0.744 366s 366s y: [4, 3, 3, 3] 366s -0.622 0.081 0.118 -0.805 366s 0.281 -0.049 -0.146 0.375 366s -0.881 0.078 -0.106 -1.106 366s test-mul-mat0: iter:208/500 366s testing: mul_mat, [1, 3, 1, 1] = [3, 1, 1, 1] * [3, 3, 1, 1] 366s x0: [3, 1, 1, 1] 366s 0.198 -0.086 -0.338 366s 366s x1: [3, 3, 1, 1] 366s 0.348 0.420 0.376 366s 0.982 0.511 0.064 366s 0.788 -0.898 0.355 366s 366s y: [1, 3, 1, 1] 366s -0.094 366s 0.129 366s 0.114 366s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 366s x0: [3, 2, 1, 1] 366s -0.720 0.553 -0.924 366s -0.149 -0.922 0.104 366s 366s x1: [3, 1, 1, 1] 366s 0.611 0.429 0.854 366s 366s y: [2, 1, 1, 1] 366s -0.992 -0.398 366s testing: mul_mat, [3, 2, 1, 2] = [3, 3, 1, 2] * [3, 2, 1, 2] 366s x0: [3, 3, 1, 2] 366s -0.108 -0.044 0.004 366s -0.753 0.867 -0.798 366s 0.161 -0.470 0.813 366s 366s x1: [3, 2, 1, 2] 366s 0.034 0.450 0.275 366s -0.594 -0.920 0.839 366s 366s y: [3, 2, 1, 2] 366s -0.023 0.145 0.018 366s 0.108 -1.021 1.019 366s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 366s x0: [3, 3, 1, 1] 366s 0.824 -0.284 -0.037 366s -0.564 0.391 0.259 366s 0.423 -0.573 -0.371 366s 366s x1: [3,ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s 3, 1, 1] 366s -0.245 0.464 -0.757 366s 0.161 0.544 -0.918 366s 0.131 0.996 0.268 366s 366s y: [3, 3, 1, 1] 366s -0.306 0.124 -0.089 366s 0.012 -0.116 0.097 366s -0.185 0.385 -0.615 366s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 366s x0: [3, 2, 1, 1] 366s -0.877 -0.941 0.307 366s 0.063 0.587 0.220 366s 366s x1: [3, 3, 1, 1] 366s -0.876 0.789 -0.558 366s -0.286 -0.828 -0.769 366s 0.584 -0.891 -0.905 366s 366s y: [2, 3, 1, 1] 366s -0.145 0.284 366s 0.794 -0.674 366s 0.049 -0.686 366s testing: mul_mat, [1, 3, 1, 2] = [2, 1, 1, 2] * [2, 3, 1, 2] 366s x0: [2, 1, 1, 2] 366s 0.742 0.352 366s 366s x1: [2, 3, 1, 2] 366s 0.130 0.388 366s -0.648 0.126 366s -0.344 -0.127 366s 366s y: [1, 3, 1, 2] 366s 0.233 366s -0.437 366s -0.300 366s test-mul-mat0: iter:209/500 366s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 366s x0: [4, 2, 1, 1] 366s 0.183 -0.584 0.521 -0.511 366s 0.636 -0.349 0.878 0.988 366s 366s x1: [4, 1, 1, 1] 366s 0.052 0.811 -0.881 -0.644 366s 366s y: [2, 1, 1, 1] 366s -0.595 -1.658 366s testing: mul_mat, [2, 2, 3, 1] = [4, 2, 3, 1] * [4, 2, 3, 1] 366s x0: [4, 2, 3, 1] 366s -0.649 -0.159 -0.294 0.002 366s -0.281 -0.306 -0.221 0.253 366s 366s x1: [4, 2, 3, 1] 366s 0.777 -0.466 -0.139 -0.272 366s 0.626 0.157 -0.606 -0.891 366s 366s y: [2, 2, 3, 1] 366s -0.390 -0.114 366s -0.255 -0.315 366s testing: mul_mat, [3, 2, 3, 1] = [4, 3, 3, 1] * [4, 2, 3, 1] 366s x0: [4, 3, 3, 1] 366s -0.802 0.157 -0.099 -0.853 366s -0.539 -0.200 -0.607 -0.340 366s 0.869 0.293 0.690 0.290 366s 366s x1: [4, 2, 3, 1] 366s -0.754 -0.800 0.069 -0.100 366s 0.030 0.421 0.969 0.380 366s 366s y: [3, 2, 3, 1] 366s 0.558 0.559 -0.871 366s -0.378 -0.818 0.928 366s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 366s x0: [4, 4, 1, 1] 366s 0.885 0.302 0.713 -0.779 366s -0.981 0.736 -0.924 -0.949 366s 0.857 0.982 -0.496 -0.301 366s -0.884 0.155 -0.372 0.267 366s 366s x1: [4, 3, 1, 1] 366s 0.846 -0.914 0.192 0.715 366s 0.379 -0.118 0.005 -0.359 366s 0.952 -0.443 -0.422 0.405 366s 366s y: [4, 3, 1, 1] 366s 0.053 -2.359 -0.483 -0.770 366s 0.583 -0.123 0.314 -0.451 366s 0.093 -1.255 0.468 -0.645 366s testing: mul_mat, [2, 4, 3, 1] = [4, 2, 3, 1] * [4, 4, 3, 1] 366s x0: [4, 2, 3, 1] 366s 0.499 -0.961 -0.211 -0.601 366s 0.652 0.911 -0.174 -0.571 366s 366s x1: [4, 4, 3, 1] 366s 0.744 0.231 0.114 0.831 366s -0.577 -0.171 0.210 0.305 366s 0.834 0.851 0.257 -0.609 366s -0.571 -0.339 0.482 -0.686 366s 366s y: [2, 4, 3, 1] 366s -0.373 0.202 366s -0.351 -0.743 366s -0.090 1.622 366s 0.351 -0.373 366s testing: mul_mat, [3, 4, 3, 1] = [2, 3, 3, 1] * [2, 4, 3, 1] 366s x0: [2, 3, 3, 1] 366s 0.719 0.566 366s -0.806 0.817 366s 0.753 -0.378 366s 366s x1: [2, 4, 3, 1] 366s 0.992 -0.622 366s 0.859 0.619 366s 0.448 0.530 366s -0.416 -0.053 366s 366s y: [3, 4, 3, 1] 366s 0.361 -1.308 0.982 366s 0.968 -0.188 0.413 366s 0.622 0.072 0.136 366s -0.329 0.292 -0.293 366s test-mul-mat0: iter:210/500 366s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 366s x0: [3, 3, 1, 1] 366s 0.250 -0.722 0.429 366s 0.067 -0.101 -0.012 366s 0.262 0.380 -0.835 366s 366s x1: [3, 3, 1, 1] 366s 0.070 -0.916 0.567 366s -0.480 -0.840 0.312 366s -0.856 0.057 -0.475 366s 366s y: [3, 3, 1, 1] 366s 0.922 0.090 -0.804 366s 0.621 0.049 -0.706 366s -0.459 -0.057 0.194 366s testing: mul_mat, [1, 3, 1, 1] = [3, 1, 1, 1] * [3, 3, 1, 1] 366s x0: [3, 1, 1, 1] 366s 0.311 0.662 -0.286 366s 366s x1: [3, 3, 1, 1] 366s -0.096 0.391 0.925 366s 0.493 -0.416 -0.692 366s -0.059 0.747 -0.860 366s 366s y: [1, 3, 1, 1] 366s -0.035 366s 0.075 366s 0.722 366s testing: mul_mat, [3, 1, 1, 2] = [3, 3, 1, 2] * [3, 1, 1, 2] 366s x0: [3, 3, 1, 2] 366s 0.066 0.241 0.862 366s 0.344 -0.330 -0.071 366s -0.757 0.658 -0.809 366s 366s x1: [3, 1, 1, 2] 366s 0.395 0.229 0.234 366s 366s y: [3, 1, 1, 2] 366s 0.283 0.044 -0.338 366s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 366s x0: [3, 2, 1, 1] 366s 0.628 -0.782 0.742 366s -0.499 -0.306 -0.920 366s 366s x1: [3, 3, 1, 1] 366s 0.345 -0.803 0.174 366s -0.344 0.858 0.888 366s -0.948 0.087 0.122 366s 366s y: [2, 3, 1, 1] 366s 0.974 -0.086 366s -0.228 -0.909 366s -0.573 0.334 366s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 366s x0: [2, 1, 1, 1] 366s 0.893 -0.706 366s 366s x1: [2, 3, 1, 1] 366s -0.963 -0.588 366s 0.008 -0.719 366s -0.930 0.199 366s 366s y: [1, 3, 1, 1] 366s -0.444 366s 0.515 366s -0.971 366s testing: mul_mat, [3, 2, 1, 2] = [1, 3, 1, 2] * [1, 2, 1, 2] 366s x0: [1, 3, 1, 2] 366s -0.589 366s 0.861 366s 0.894 366s 366s x1: [1, 2, 1, 2] 366s -0.082 366s 0.641 366s 366s y: [3, 2, 1, 2] 366s 0.048 -0.071 -0.073 366s -0.377 0.552 0.573 366s test-mul-mat0: iter:211/500 366s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 366s x0: [4, 1, 1, 1] 366s 0.482 0.479 0.321 -0.319 366s 366s x1: [4, 2, 1, 1] 366s -0.824 -0.533 0.097 -0.606 366s 0.161 -0.162 -0.527 0.198 366s 366s y: [1, 2, 1, 1] 366s -0.428 366s -0.232 366s testing: mul_mat, [3, 1, 1, 1] = [4, 3, 1, 1] * [4, 1, 1, 1] 366s x0: [4, 3, 1, 1] 366s 0.858 -0.183 -0.448 -0.731 366s -0.322 -0.555 0.854 0.196 366s -0.802 0.328 0.765 0.037 366s 366s x1: [4, 1, 1, 1] 366s 0.383 0.214 -0.025 -0.699 366s 366s y: [3, 1, 1, 1] 366s 0.811 -0.400 -0.282 366s testing: mul_mat, [2, 3, 1, 4] = [4, 2, 1, 4] * [4, 3, 1, 4] 366s x0: [4, 2, 1, 4] 366s 0.741 0.576 0.693 -0.209 366s 0.649 -0.787 -0.713 0.576 366s 366s x1: [4, 3, 1, 4] 366s 0.924 0.941 0.504 0.021 366s -0.665 -0.336 0.859 -0.192 366s 0.863 0.110 -0.710 0.341 366s 366s y: [2, 3, 1, 4] 366s 1.572 -0.488 366s -0.051 -0.891 366s 0.139 1.177 366s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 366s x0: [4, 4, 1, 1] 366s 0.237 0.457 -0.426 0.163 366s -0.374 -0.268 0.726 0.202 366s 0.788 0.458 0.196 0.882 366s 0.248 -0.956 0.177 -0.802 366s 366s x1: [4, 2, 1, 1] 366s 0.181 0.076 -0.189 -0.170 366s 0.289 0.097 -0.594 0.217 366s 366s y: [4, 2, 1, 1] 366s 0.131 -0.260 -0.010 0.075 366s 0.402 -0.522 0.348 -0.301 366s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 366s x0: [4, 2, 1, 1] 366s 0.269 -0.536 -0.002 -0.425 366s 0.540 -0.999 -0.492 -0.276 366s 366s x1: [4, 4, 1, 1] 366s 0.035 0.414 -0.524 -0.361 366s -0.202 -0.504 0.820 0.874 366s 0.306 -0.351 0.163 -0.596 366s 0.055 -0.620 0.481 -0.708 366s 366s y: [2, 4, 1, 1] 366s -0.058 -0.038 366s -0.158 -0.250 366s 0.524 0.600 366s 0.647 0.608 366s testing: mul_mat, [3, 4, 1, 4] = [2, 3, 1, 4] * [2, 4, 1, 4] 366s x0: [2, 3, 1, 4] 366s 0.546 0.631 366s 0.853 0.794 366s 0.478 -0.474 366s 366s x1: [2, 4, 1, 4] 366s 0.704 0.751 366s 0.887 -0.093 366s 0.634 -0.915 366s 0.942 0.048 366s 366s y: [3, 4, 1, 4] 366s 0.859 1.198 -0.019 366s 0.425 0.682 0.468 366s -0.231 -0.186 0.736 366s 0.544 0.841 0.428 366s test-mul-mat0: iter:212/500 366s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 366s x0: [1, 2, 1, 1] 366s 0.084 366s -0.966 366s 366s x1: [1, 4, 1, 1] 366s -0.142 366s 0.425 366s 0.606 366s -0.597 366s 366s y: [2, 4, 1, 1] 366s -0.012 0.138 366s 0.036 -0.410 366s 0.051 -0.585 366s -0.050 0.576 366s testing: mul_mat, [4, 2, 3, 1] = [1, 4, 3, 1] * [1, 2, 3, 1] 366s x0: [1, 4, 3, 1] 366s -0.927 366s 0.092 366s -0.953 366s 0.047 366s 366s x1: [1, 2, 3, 1] 366s 0.072 366s 0.611 366s 366s y: [4, 2, 3, 1] 366s -0.067 0.007 -0.069 0.003 366s -0.566 0.056 -0.582 0.028 366s testing: mul_mat, [4, 4, 3, 1] = [1, 4, 3, 1] * [1, 4, 3, 1] 366s x0: [1, 4, 3, 1] 366s 0.984 366s 0.289 366s -0.275 366s 0.967 366s 366s x1: [1, 4, 3, 1] 366s 0.024 366s 0.922 366s 0.766 366s 0.503 366s 366s y: [4, 4, 3, 1] 366s 0.024 0.007 -0.007 0.023 366s 0.908 0.266 -0.254 0.891 366s 0.754 0.221 -0.211 0.741 366s 0.495 0.145 -0.138 0.486 366s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 366s x0: [1, 1, 1, 1] 366s 0.843 366s 366s x1: [1, 4, 1, 1] 366s -0.631 366s 0.438 366s 0.275 366s -0.845 366s 366s y: [1, 4, 1, 1] 366s -0.532 366s 0.369 366s 0.232 366s -0.712 366s testing: mul_mat, [2, 1, 3, 1] = [1, 2, 3, 1] * [1, 1, 3, 1] 366s x0: [1, 2, 3, 1] 366s 0.298 366s -0.767 366s 366s x1: [1, 1, 3, 1] 366s 0.179 366s 366s y: [2, 1, 3, 1] 366s 0.053 -0.137 366s testing: mul_mat, [2, 1, 3, 1] = [2, 2, 3, 1] * [2, 1, 3, 1] 366s x0: [2, 2, 3, 1] 366s -0.623 -0.048 366s -0.234 -0.764 366s 366s x1: [2, 1, 3, 1] 366s -0.039 -0.678 366s 366s y: [2, 1, 3, 1] 366s 0.057 0.528 366s test-mul-mat0: iter:213/500 366s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 366s x0: [4, 1, 1, 1] 366s -0.802 -0.048 0.850 0.158 366s 366s x1: [4, 1, 1, 1] 366s 0.459 -0.576 0.611 0.691 366s 366s y: [1, 1, 1, 1] 366s 0.289 366s testing: mul_mat, [4, 1, 3, 1] = [4, 4, 3, 1] * [4, 1, 3, 1] 366s x0: [4, 4, 3, 1] 366s -0.981 -0.586 -0.927 -0.974 366s -0.023 -0.228 0.875 -0.281 366s 0.085 0.334 0.144 -0.304 366s 0.025 0.178 -0.106 0.977 366s 366s x1: [4, 1, 3, 1] 366s 0.273 -0.056 -0.896 -0.116 366s 366s y: [4, 1, 3, 1] 366s 0.710 -0.746 -0.089 -0.022 366s testing: mul_mat, [2, 4, 3, 4] = [4, 2, 3, 4] * [4, 4, 3, 4] 366s x0: [4, 2, 3, 4] 366s -0.238 -0.790 0.486 -0.983 366s -0.534 0.925 -0.122 -0.460 366s 366s x1: [4, 4, 3, 4] 366s 0.529 0.056 -0.424 0.685 366s 0.190 0.310 -0.144 -0.416 366s 0.571 0.994 0.476 0.120 366s 0.139 0.433 -0.312 -0.302 366s 366s y: [2, 4, 3, 4] 366s -1.049 -0.494 366s 0.048 0.394 366s -0.808 0.501 366s -0.230 0.503 366s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 366s x0: [4, 4, 1, 1] 366s 0.921 -0.819 -0.274 -0.921 366s -0.668 -0.887 -0.825 0.832 366s -0.994 0.245 -0.037 -0.433 366s -0.185 -0.316 -0.599 -0.528 366s 366s x1: [4, 2, 1, 1] 366s 0.500 -0.528 0.876 -0.007 366s -0.778 -0.482 0.425 -0.191 366s 366s y: [4, 2, 1, 1] 366s 0.659 -0.593 -0.657 -0.447 366s -0.262 0.437 0.722 0.143 366s testing: mul_mat, [1, 4, 3, 1] = [4, 1, 3, 1] * [4, 4, 3, 1] 366s x0: [4, 1, 3, 1] 366s -0.793 -0.414 0.262 0.915 366s 366s x1: [4, 4, 3, 1] 366s 0.718 -0.487 -0.374 0.808 366s 0.740 0.300 0.309 -0.788 366s 0.176 -0.698 -0.566 0.694 366s 0.727 0.243 0.303 0.647 366s 366s y: [1, 4, 3, 1] 366s 0.274 366s -1.351 366s 0.636 366s -0.005 366s testing: mul_mat, [4, 4, 3, 4] = [1, 4, 3, 4] * [1, 4, 3, 4] 366s x0: [1, 4, 3, 4] 366s 0.601 366s -0.873 366s 0.514 366s 0.618 366s 366s x1: [1, 4, 3, 4] 366s -0.806 366s -0.386 366s -0.301 366s 0.254 366s 366s y: [4, 4, 3, 4] 366s -0.485 0.704 -0.415 -0.499 366s -0.232 0.337 -0.199 -0.239 366s -0.181 0.263 -0.155 -0.186 366s 0.153 -0.222 0.131 0.157 366s test-mul-mat0: iter:214/500 366s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 366s x0: [3, 2, 1, 1] 366s -0.173 0.680 0.664 366s 0.913 0.731 0.322 366s 366s x1: [3, 3, 1, 1] 366s -0.137 -0.528 0.761 366s 0.512 0.493 0.518 366s 0.602 0.359 0.536 366s 366s y: [2, 3, 1, 1] 366s 0.171 -0.266 366s 0.590 0.995 366s 0.496 0.985 366s testing: mul_mat, [3, 2, 1, 1] = [3, 3, 1, 1] * [3, 2, 1, 1] 366s x0: [3, 3, 1, 1] 366s -0.667 -0.402 0.527 366s -0.820 -0.256 -0.735 366s 0.564 0.352 0.128 366s 366s x1: [3, 2, 1, 1] 366s -0.956 0.267 0.632 366s -0.397 0.624 -0.962 366s 366s y: [3, 2, 1, 1] 366s 0.863 0.251 -0.365 366s -0.494 0.872 -0.127 366s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 366s x0: [3, 2, 1, 1] 366s 0.847 -0.218 0.627 366s 0.578 -0.896 0.671 366s 366s x1: [3, 3, 1, 1] 366s -0.964 0.113 -0.360 366s 0.529 -0.369 -0.758 366s -0.112 -0.833 0.118 366s 366s y: [2, 3, 1, 1] 366s -1.066 -0.900 366s 0.053 0.127 366s 0.161 0.761 366s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 366s x0: [3, 1, 1, 1] 366s -0.628 0.848 -0.018 366s 366s x1: [3, 2, 1, 1] 366s -0.155 0.736 -0.725 366s -0.531 0.774 -0.679 366s 366s y: [1, 2, 1, 1] 366s 0.735 366s 1.002 366s testing: mul_mat, [2, 3, 1, 1] = [1, 2, 1, 1] * [1, 3, 1, 1] 366s x0: [1, 2, 1, 1] 366s 0.241 366s -0.418 366s 366s x1: [1, 3, 1, 1] 366s 0.116 366s -0.887 366s -0.454 366s 366s y: [2, 3, 1, 1] 366s 0.028 -0.048 366s -0.214 0.370 366s -0.109 0.189 366s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 366s x0: [2, 3, 1, 1] 366s -0.788 -0.622 366s -0.877 0.241 366s -0.001 -0.286 366s 366s x1: [2, 1, 1, 1] 366s 0.581 0.881 366s 366s y: [3, 1, 1, 1] 366s -1.005 -0.297 -0.253 366s test-mul-mat0: iter:215/500 366s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 366s x0: [2, 4, 1, 1] 366s 0.863 -0.714 366s 0.116 0.638 366s -0.393 0.918 366s -0.990 -0.544 366s 366s x1: [2, 1, 1, 1] 366s -0.873 -0.988 366s 366s y: [4, 1, 1, 1] 366s -0.049 -0.731 -0.565 1.403 366s testing: mul_mat, [1, 4, 2, 1] = [2, 1, 2, 1] * [2, 4, 2, 1] 366s x0: [2, 1, 2, 1] 366s 0.077 -0.806 366s 366s x1: [2, 4, 2, 1] 366s -0.100 0.125 366s -0.431 0.447 366s 0.593 0.810 366s -0.971 0.173 366s 366s y: [1, 4, 2, 1] 366s -0.108 366s -0.393 366s -0.607 366s -0.214 366s testing: mul_mat, [2, 1, 2, 3] = [2, 2, 2, 3] * [2, 1, 2, 3] 366s x0: [2, 2, 2, 3] 366s 0.820 0.432 366s 0.933 0.849 366s 366s x1: [2, 1, 2, 3] 366s -0.794 -0.562 366s 366s y: [2, 1, 2, 3] 366s -0.894 -1.218 366s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 366s x0: [2, 2, 1, 1] 366s 0.096 0.191 366s -0.349 -0.472 366s 366s x1: [2, 2, 1, 1] 366s -0.028 -0.002 366s 0.257 -0.473 366s 366s y: [2, 2, 1, 1] 366s -0.003 0.011 366s -0.065 0.133 366s testing: mul_mat, [2, 2, 2, 1] = [2, 2, 2, 1] * [2, 2, 2, 1] 366s x0: [2, 2, 2, 1] 366s -0.864 -0.381 366s 0.139 0.703 366s 366s x1: [2, 2, 2, 1] 366s -0.416 0.040 366s 0.133 -0.792 366s 366s y: [2, 2, 2, 1] 366s 0.344 -0.030 366s 0.187 -0.539 366s testing: mul_mat, [3, 2, 2, 3] = [2, 3, 2, 3] * [2, 2, 2, 3] 366s x0: [2, 3, 2, 3] 366s 0.490 0.641 366s 0.994 -0.532 366s -0.112 -0.417 366s 366s x1: [2, 2, 2, 3] 366s -0.877 -0.687 366s 0.568 0.648 366s 366s y: [3, 2, 2, 3] 366s -0.870 -0.506 0.384 366s 0.694 0.219 -0.334 366s test-mul-mat0: iter:216/500 366s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 366s x0: [4, 4, 1, 1] 366s -0.413 -0.569 -0.495 -0.851 366s -0.128 -0.249 -0.418 0.604 366s 0.571 -0.495 -0.670 0.279 366s -0.931 0.524 0.570 0.397 366s 366s x1: [4, 3, 1, 1] 366s -0.260 0.612 0.133 -0.201 366s 0.362 -0.109 -0.124 -0.678 366s -0.634 -0.839 0.019 -0.856 366s 366s y: [4, 3, 1, 1] 366s -0.136 -0.296 -0.596 0.558 366s 0.551 -0.377 0.155 -0.735 366s 1.459 -0.236 -0.199 -0.179 366s testing: mul_mat, [3, 4, 4, 1] = [4, 3, 4, 1] * [4, 4, 4, 1] 366s x0: [4, 3, 4, 1] 366s 0.573 -0.788 -0.053 0.961 366s -0.248 -0.062 0.144 0.693 366s -0.096 -0.859 0.180 -0.463 366s 366s x1: [4, 4, 4, 1] 366s 0.499 -0.987 -0.862 0.111 366s 0.145 -0.063 -0.527 -0.964 366s 0.813 -0.205 -0.598 0.974 366s 0.814 -0.455 0.668 -0.600 366s 366s y: [3, 4, 4, 1] 366s 1.216 -0.111 0.593 366s -0.766 -0.777 0.392 366s 1.595 0.399 -0.461 366s 0.213 -0.4ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s 93 0.710 366s testing: mul_mat, [1, 3, 4, 3] = [4, 1, 4, 3] * [4, 3, 4, 3] 366s x0: [4, 1, 4, 3] 366s -0.576 0.777 -0.157 -0.466 366s 366s x1: [4, 3, 4, 3] 366s -0.254 -0.816 -0.478 0.051 366s 0.465 0.788 -0.640 0.168 366s 0.907 -0.982 -0.601 0.683 366s 366s y: [1, 3, 4, 3] 366s -0.436 366s 0.366 366s -1.509 366s testing: mul_mat, [3, 1, 1, 1] = [4, 3, 1, 1] * [4, 1, 1, 1] 366s x0: [4, 3, 1, 1] 366s -0.418 -0.240 0.621 -0.093 366s 0.186 0.840 -0.245 0.448 366s -0.530 0.233 -0.826 0.959 366s 366s x1: [4, 1, 1, 1] 366s -0.077 -0.514 0.051 0.614 366s 366s y: [3, 1, 1, 1] 366s 0.131 -0.183 0.468 366s testing: mul_mat, [2, 4, 4, 1] = [3, 2, 4, 1] * [3, 4, 4, 1] 366s x0: [3, 2, 4, 1] 366s 0.555 -0.056 0.905 366s -0.727 -0.488 -0.266 366s 366s x1: [3, 4, 4, 1] 366s -0.273 -0.538 -0.319 366s 0.594 0.080 -0.824 366s -0.376 -0.212 0.671 366s -0.565 0.547 -0.772 366s 366s y: [2, 4, 4, 1] 366s -0.410 0.546 366s -0.421 -0.252 366s 0.410 0.199 366s -1.043 0.349 366s testing: mul_mat, [4, 3, 4, 3] = [2, 4, 4, 3] * [2, 3, 4, 3] 366s x0: [2, 4, 4, 3] 366s -0.332 -0.423 366s -0.211 0.038 366s -0.529 0.716 366s 0.882 0.322 366s 366s x1: [2, 3, 4, 3] 366s 0.502 0.394 366s -0.433 0.731 366s -0.538 0.967 366s 366s y: [4, 3, 4, 3] 366s -0.333 -0.091 0.017 0.570 366s -0.166 0.119 0.753 -0.147 366s -0.231 0.151 0.978 -0.164 366s test-mul-mat0: iter:217/500 366s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 366s x0: [3, 1, 1, 1] 366s 0.879 0.051 0.085 366s 366s x1: [3, 1, 1, 1] 366s -0.835 -0.849 0.527 366s 366s y: [1, 1, 1, 1] 366s -0.732 366s testing: mul_mat, [1, 1, 4, 1] = [3, 1, 4, 1] * [3, 1, 4, 1] 366s x0: [3, 1, 4, 1] 366s 0.835 -0.787 -0.014 366s 366s x1: [3, 1, 4, 1] 366s 0.960 0.960 0.814 366s 366s y: [1, 1, 4, 1] 366s 0.035 366s testing: mul_mat, [2, 1, 4, 1] = [3, 2, 4, 1] * [3, 1, 4, 1] 366s x0: [3, 2, 4, 1] 366s -0.779 0.905 0.667 366s -0.972 0.342 -0.950 366s 366s x1: [3, 1, 4, 1] 366s 0.731 -0.903 -0.096 366s 366s y: [2, 1, 4, 1] 366s -1.451 -0.928 366s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 366s x0: [3, 2, 1, 1] 366s -0.332 -0.691 0.723 366s -0.620 -0.304 -0.641 366s 366s x1: [3, 2, 1, 1] 366s 0.870 -0.069 -0.341 366s 0.505 -0.553 -0.525 366s 366s y: [2, 2, 1, 1] 366s -0.488 -0.300 366s -0.165 0.191 366s testing: mul_mat, [4, 3, 4, 1] = [2, 4, 4, 1] * [2, 3, 4, 1] 366s x0: [2, 4, 4, 1] 366s 0.205 -0.657 366s -0.379 0.548 366s 0.189 -0.539 366s 0.901 0.621 366s 366s x1: [2, 3, 4, 1] 366s -0.440 0.278 366s 0.395 0.529 366s -0.929 0.480 366s 366s y: [4, 3, 4, 1] 366s -0.273 0.319 -0.233 -0.224 366s -0.266 0.140 -0.211 0.685 366s -0.505 0.615 -0.434 -0.540 366s testing: mul_mat, [1, 2, 4, 1] = [4, 1, 4, 1] * [4, 2, 4, 1] 366s x0: [4, 1, 4, 1] 366s 0.746 0.001 -0.533 -0.135 366s 366s x1: [4, 2, 4, 1] 366s 0.214 0.235 -0.342 0.557 366s -0.217 0.119 0.179 0.726 366s 366s y: [1, 2, 4, 1] 366s 0.267 366s -0.356 366s test-mul-mat0: iter:218/500 366s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 366s x0: [1, 1, 1, 1] 366s -0.850 366s 366s x1: [1, 1, 1, 1] 366s 0.537 366s 366s y: [1, 1, 1, 1] 366s -0.456 366s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 366s x0: [1, 4, 1, 1] 366s -0.518 366s 0.404 366s 0.274 366s 0.570 366s 366s x1: [1, 1, 1, 1] 366s 0.174 366s 366s y: [4, 1, 1, 1] 366s -0.090 0.070 0.048 0.099 366s testing: mul_mat, [2, 4, 1, 2] = [1, 2, 1, 2] * [1, 4, 1, 2] 366s x0: [1, 2, 1, 2] 366s 0.079 366s 0.960 366s 366s x1: [1, 4, 1, 2] 366s -0.152 366s -0.041 366s 0.316 366s 0.849 366s 366s y: [2, 4, 1, 2] 366s -0.012 -0.146 366s -0.003 -0.040 366s 0.025 0.303 366s 0.067 0.815 366s testing: mul_mat, [3, 2, 1, 1] = [1, 3, 1, 1] * [1, 2, 1, 1] 366s x0: [1, 3, 1, 1] 366s 0.846 366s 0.773 366s 0.144 366s 366s x1: [1, 2, 1, 1] 366s -0.707 366s -0.102 366s 366s y: [3, 2, 1, 1] 366s -0.598 -0.546 -0.102 366s -0.086 -0.079 -0.015 366s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 366s x0: [3, 4, 1, 1] 366s 0.378 0.147 0.061 366s -0.335 -0.787 -0.564 366s -0.191 0.477 -0.968 366s -0.798 -0.283 -0.090 366s 366s x1: [3, 1, 1, 1] 366s -0.902 -0.044 -0.485 366s 366s y: [4, 1, 1, 1] 366s -0.376 0.610 0.621 0.776 366s testing: mul_mat, [3, 3, 1, 2] = [4, 3, 1, 2] * [4, 3, 1, 2] 366s x0: [4, 3, 1, 2] 366s 0.186 0.622 0.483 -0.037 366s -0.485 -0.453 -0.240 0.126 366s 0.393 -0.698 0.241 0.258 366s 366s x1: [4, 3, 1, 2] 366s 0.861 -0.787 -0.062 0.480 366s -0.634 -0.983 0.439 -0.498 366s -0.211 0.732 0.400 -0.673 366s 366s y: [3, 3, 1, 2] 366s -0.377 0.014 0.996 366s -0.498 0.584 0.414 366s 0.634 -0.410 -0.671 366s test-mul-mat0: iter:219/500 366s testing: mul_mat, [4, 4, 1, 1] = [4, 4, 1, 1] * [4, 4, 1, 1] 366s x0: [4, 4, 1, 1] 366s 0.393 0.337 0.882 0.766 366s 0.050 -0.917 0.610 -0.562 366s -0.586 0.439 0.138 -0.748 366s 0.919 0.957 0.799 -0.803 366s 366s x1: [4, 4, 1, 1] 366s 0.547 -0.722 0.152 -0.267 366s -0.208 -0.455 -0.645 0.339 ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s 366s -0.153 0.839 -0.901 -0.912 366s -0.199 0.225 0.346 0.203 366s 366s y: [4, 4, 1, 1] 366s -0.099 0.932 -0.418 0.147 366s -0.544 -0.177 -0.421 -1.415 366s -1.271 -0.813 1.016 0.674 366s 0.459 -0.119 0.111 0.146 366s testing: mul_mat, [1, 4, 1, 1] = [4, 1, 1, 1] * [4, 4, 1, 1] 366s x0: [4, 1, 1, 1] 366s -0.612 0.093 0.118 -0.529 366s 366s x1: [4, 4, 1, 1] 366s 0.108 -0.467 -0.011 0.653 366s -0.112 -0.672 -0.500 -0.273 366s -0.573 -0.412 0.528 0.652 366s 0.934 -0.269 -0.494 0.327 366s 366s y: [1, 4, 1, 1] 366s -0.457 366s 0.091 366s 0.030 366s -0.828 366s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 366s x0: [4, 2, 1, 1] 366s 0.138 -0.938 -0.349 -0.063 366s -0.741 0.759 0.470 0.248 366s 366s x1: [4, 1, 1, 1] 366s -0.297 0.556 -0.115 -0.857 366s 366s y: [2, 1, 1, 1] 366s -0.469 0.376 366s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 366s x0: [4, 4, 1, 1] 366s -0.565 0.413 0.021 0.701 366s 0.345 -0.450 0.558 -0.218 366s -0.838 -0.145 -0.912 0.044 366s 0.762 -0.468 0.907 0.763 366s 366s x1: [4, 2, 1, 1] 366s 0.412 -0.642 0.576 0.913 366s 0.085 -0.996 -0.499 -0.387 366s 366s y: [4, 2, 1, 1] 366s 0.154 0.554 -0.738 1.834 366s -0.742 0.283 0.511 -0.217 366s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 366s x0: [4, 2, 1, 1] 366s -0.331 -0.734 -0.931 -0.759 366s 0.214 0.219 -0.202 0.628 366s 366s x1: [4, 4, 1, 1] 366s 0.433 0.982 -0.978 0.192 366s 0.452 0.270 -0.396 0.810 366s -0.153 -0.483 -0.105 -0.149 366s 0.018 0.508 -0.493 0.452 366s 366s y: [2, 4, 1, 1] 366s -0.099 0.626 366s -0.594 0.744 366s 0.616 -0.211 366s -0.263 0.499 366s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 366s x0: [2, 1, 1, 1] 366s -0.707 -0.375 366s 366s x1: [2, 4, 1, 1] 366s -0.114 -0.853 366s 0.329 0.668 366s 0.192 0.092 366s 0.101 0.173 366s 366s y: [1, 4, 1, 1] 366s 0.400 366s -0.483 366s -0.170 366s -0.136 366s test-mul-mat0: iter:220/500 366s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 366s x0: [4, 4, 1, 1] 366s -0.244 -0.281 -0.205 0.642 366s -0.134 -0.876 0.310 -0.942 366s 0.216 -0.589 0.231 -0.670 366s -0.296 0.856 -0.285 0.601 366s 366s x1: [4, 3, 1, 1] 366s 0.414 0.330 -0.917 -0.568 366s -0.162 -0.410 0.884 0.691 366s 0.259 0.098 0.957 -0.522 366s 366s y: [4, 3, 1, 1] 366s -0.371 -0.093 0.064 0.079 366s 0.417 0.004 -0.052 -0.140 366s -0.622 0.668 0.569 -0.579 366s testing: mul_mat, [1, 4, 3, 1] = [4, 1, 3, 1] * [4, 4, 3, 1] 366s x0: [4, 1, 3, 1] 366s 0.076 -0.375 0.254 -0.110 366s 366s x1: [4, 4, 3, 1] 366s -0.710 0.947 0.015 0.620 366s -0.970 0.447 -0.542 -0.381 366s 0.331 -0.851 0.878 -0.571 366s -0.894 -0.643 0.596 -0.138 366s 366s y: [1, 4, 3, 1] 366s -0.473 366s -0.337 366s 0.629 366s 0.339 366s testing: mul_mat, [2, 1, 3, 3] = [4, 2, 3, 3] * [4, 1, 3, 3] 366s x0: [4, 2, 3, 3] 366s 0.585 -0.336 -0.105 0.019 366s 0.036 -0.180 0.203 0.119 366s 366s x1: [4, 1, 3, 3] 366s -0.426 -0.682 -0.454 -0.881 366s 366s y: [2, 1, 3, 3] 366s 0.011 -0.089 366s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 366s x0: [4, 4, 1, 1] 366s 0.382 0.054 -0.931 0.001 366s -0.637 -0.974 0.820 0.716 366s -0.123 -0.306 0.451 -0.601 366s -0.960 -0.215 0.228 0.257 366s 366s x1: [4, 2, 1, 1] 366s -0.820 -0.064 0.564 0.210 366s -0.904 -0.888 -0.634 -0.869 366s 366s y: [4, 2, 1, 1] 366s -0.842 1.198 0.249 0.984 366s 0.197 0.299 0.619 0.691 366s testing: mul_mat, [2, 4, 3, 1] = [4, 2, 3, 1] * [4, 4, 3, 1] 366s x0: [4, 2, 3, 1] 366s 0.980 -0.828 -0.585 -0.922 366s 0.465 0.119 -0.145 -0.183 366s 366s x1: [4, 4, 3, 1] 366s -0.385 -0.599 0.790 0.268 366s 0.586 -0.663 0.448 -0.478 366s 0.901 -0.342 -0.382 -0.987 366s 0.023 -0.251 0.711 -0.595 366s 366s y: [2, 4, 3, 1] 366s -0.590 -0.414 366s 1.302 0.216 366s 2.299 0.614 366s 0.363 -0.013 366s testing: mul_mat, [1, 4, 3, 3] = [2, 1, 3, 3] * [2, 4, 3, 3] 366s x0: [2, 1, 3, 3] 366s 0.182 -0.976 366s 366s x1: [2, 4, 3, 3] 366s -0.892 -0.651 366s -0.472 0.077 366s 0.884 0.827 366s -0.366 0.864 366s 366s y: [1, 4, 3, 3] 366s 0.473 366s -0.161 366s -0.646 366s -0.909 366s test-mul-mat0: iter:221/500 366s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 366s x0: [1, 3, 1, 1] 366s -0.750 366s -0.164 366s -0.453 366s 366s x1: [1, 4, 1, 1] 366s -0.330 366s -0.083 366s 0.305 366s 0.623 366s 366s y: [3, 4, 1, 1] 366s 0.247 0.054 0.149 366s 0.062 0.014 0.038 366s -0.229 -0.050 -0.138 366s -0.467 -0.102 -0.282 366s testing: mul_mat, [1, 3, 2, 1] = [1, 1, 2, 1] * [1, 3, 2, 1] 366s x0: [1, 1, 2, 1] 366s 0.315 366s 366s x1: [1, 3, 2, 1] 366s 0.399 366s -0.982 366s -0.428 366s 366s y: [1, 3, 2, 1] 366s 0.126 366s -0.309 366s -0.135 366s testing: mul_mat, [1, 1, 2, 1] = [1, 1, 2, 1] * [1, 1, 2, 1] 366s x0: [1, 1, 2, 1] 366s -0.068 366s 366s x1: [1, 1, 2, 1] 366s -0.906 366s 366s y: [1, 1, 2, 1] 366s 0.062 366s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s [1, 1, 1, 1] 366s x0: [1, 1, 1, 1] 366s -0.106 366s 366s x1: [1, 1, 1, 1] 366s -0.218 366s 366s y: [1, 1, 1, 1] 366s 0.023 366s testing: mul_mat, [2, 1, 2, 1] = [1, 2, 2, 1] * [1, 1, 2, 1] 366s x0: [1, 2, 2, 1] 366s 0.319 366s 0.204 366s 366s x1: [1, 1, 2, 1] 366s -0.369 366s 366s y: [2, 1, 2, 1] 366s -0.118 -0.075 366s testing: mul_mat, [4, 1, 2, 1] = [2, 4, 2, 1] * [2, 1, 2, 1] 366s x0: [2, 4, 2, 1] 366s -0.552 0.122 366s -0.449 -0.540 366s -0.353 -0.842 366s -0.534 0.505 366s 366s x1: [2, 1, 2, 1] 366s -0.879 -0.002 366s 366s y: [4, 1, 2, 1] 366s 0.485 0.396 0.313 0.468 366s test-mul-mat0: iter:222/500 366s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 366s x0: [1, 2, 1, 1] 366s -0.124 366s -0.027 366s 366s x1: [1, 1, 1, 1] 366s 0.769 366s 366s y: [2, 1, 1, 1] 366s -0.096 -0.021 366s testing: mul_mat, [2, 2, 4, 1] = [1, 2, 4, 1] * [1, 2, 4, 1] 366s x0: [1, 2, 4, 1] 366s 0.640 366s 0.391 366s 366s x1: [1, 2, 4, 1] 366s -0.252 366s -0.135 366s 366s y: [2, 2, 4, 1] 366s -0.161 -0.098 366s -0.087 -0.053 366s testing: mul_mat, [4, 2, 4, 1] = [1, 4, 4, 1] * [1, 2, 4, 1] 366s x0: [1, 4, 4, 1] 366s 0.887 366s 0.563 366s -0.443 366s -0.237 366s 366s x1: [1, 2, 4, 1] 366s 0.261 366s -0.697 366s 366s y: [4, 2, 4, 1] 366s 0.232 0.147 -0.116 -0.062 366s -0.618 -0.392 0.309 0.165 366s testing: mul_mat, [4, 4, 1, 1] = [1, 4, 1, 1] * [1, 4, 1, 1] 366s x0: [1, 4, 1, 1] 366s -0.009 366s -0.319 366s 0.243 366s 0.496 366s 366s x1: [1, 4, 1, 1] 366s 0.145 366s 0.785 366s 0.969 366s 0.420 366s 366s y: [4, 4, 1, 1] 366s -0.001 -0.046 0.035 0.072 366s -0.007 -0.251 0.191 0.390 366s -0.009 -0.309 0.235 0.481 366s -0.004 -0.134 0.102 0.208 366s testing: mul_mat, [2, 1, 4, 1] = [4, 2, 4, 1] * [4, 1, 4, 1] 366s x0: [4, 2, 4, 1] 366s -0.487 -0.749 -0.753 0.983 366s 0.828 -0.803 0.396 -0.784 366s 366s x1: [4, 1, 4, 1] 366s -0.915 0.627 -0.947 -0.584 366s 366s y: [2, 1, 4, 1] 366s 0.115 -1.177 366s testing: mul_mat, [2, 4, 4, 1] = [2, 2, 4, 1] * [2, 4, 4, 1] 366s x0: [2, 2, 4, 1] 366s -0.457 0.981 366s -0.512 0.544 366s 366s x1: [2, 4, 4, 1] 366s -0.312 -0.009 366s 0.019 -0.065 366s -0.614 0.001 366s 0.152 0.202 366s 366s y: [2, 4, 4, 1] 366s 0.133 0.155 366s -0.072 -0.045 366s 0.281 0.315 366s 0.129 0.032 366s test-mul-mat0: iter:223/500 366s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 366s x0: [2, 3, 1, 1] 366s -0.036 -0.319 366s -0.333 0.829 366s 0.285 -0.809 366s 366s x1: [2, 1, 1, 1] 366s -0.041 -0.184 366s 366s y: [3, 1, 1, 1] 366s 0.060 -0.139 0.137 366s testing: mul_mat, [1, 3, 2, 1] = [2, 1, 2, 1] * [2, 3, 2, 1] 366s x0: [2, 1, 2, 1] 366s -0.915 -0.260 366s 366s x1: [2, 3, 2, 1] 366s 0.382 -0.121 366s -0.265 0.869 366s -0.140 -0.721 366s 366s y: [1, 3, 2, 1] 366s -0.318 366s 0.016 366s 0.316 366s testing: mul_mat, [3, 1, 2, 3] = [2, 3, 2, 3] * [2, 1, 2, 3] 366s x0: [2, 3, 2, 3] 366s -0.150 -0.328 366s -0.256 0.585 366s -0.458 0.604 366s 366s x1: [2, 1, 2, 3] 366s 0.268 -0.538 366s 366s y: [3, 1, 2, 3] 366s 0.136 -0.383 -0.448 366s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 366s x0: [2, 1, 1, 1] 366s 0.540 0.913 366s 366s x1: [2, 3, 1, 1] 366s -0.303 0.016 366s -0.769 -0.655 366s -0.374 0.788 366s 366s y: [1, 3, 1, 1] 366s -0.149 366s -1.013 366s 0.518 366s testing: mul_mat, [1, 2, 2, 1] = [1, 1, 2, 1] * [1, 2, 2, 1] 366s x0: [1, 1, 2, 1] 366s 0.522 366s 366s x1: [1, 2, 2, 1] 366s 0.573 366s 0.194 366s 366s y: [1, 2, 2, 1] 366s 0.300 366s 0.101 366s testing: mul_mat, [4, 1, 2, 3] = [1, 4, 2, 3] * [1, 1, 2, 3] 366s x0: [1, 4, 2, 3] 366s -0.638 366s -0.669 366s -0.142 366s 0.929 366s 366s x1: [1, 1, 2, 3] 366s -0.880 366s 366s y: [4, 1, 2, 3] 366s 0.561 0.589 0.125 -0.817 366s test-mul-mat0: iter:224/500 366s testing: mul_mat, [3, 3, 1, 1] = [2, 3, 1, 1] * [2, 3, 1, 1] 366s x0: [2, 3, 1, 1] 366s 0.058 0.034 366s 0.698 0.210 366s 0.683 0.794 366s 366s x1: [2, 3, 1, 1] 366s -0.674 -0.243 366s -0.775 -0.312 366s 0.088 0.083 366s 366s y: [3, 3, 1, 1] 366s -0.047 -0.521 -0.653 366s -0.056 -0.606 -0.776 366s 0.008 0.079 0.126 366s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 366s x0: [2, 1, 1, 1] 366s -0.932 -0.763 366s 366s x1: [2, 3, 1, 1] 366s 0.592 -0.324 366s -0.483 0.763 366s 0.585 -0.220 366s 366s y: [1, 3, 1, 1] 366s -0.305 366s -0.132 366s -0.377 366s testing: mul_mat, [1, 1, 1, 3] = [2, 1, 1, 3] * [2, 1, 1, 3] 366s x0: [2, 1, 1, 3] 366s 0.405 0.218 366s 366s x1: [2, 1, 1, 3] 366s 0.418 -0.696 366s 366s y: [1, 1, 1, 3] 366s 0.018 366s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 366s x0: [2, 3, 1, 1] 366s -0.691 0.518 366s 0.561 0.244 366s -0.480 -0.686 366s 366s x1: [2, 1, 1, 1] 366s 0.250 -0.473 366s 366s y: [3, 1, 1, 1] 366s -0.418 0.025 0.204 366s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 366s x0: [3, 2, 1, 1] 366s 0.573 -0.861 0.656 366s 0.847 0.877 -0.350 366s 366s x1: [3, 2, 1, 1] 366s 0.111 0.920 -0.169 366s -0.126 0.505 0.611 366s 366s y: [2, 2, 1, 1] 366s -0.839 0.959 366s -0.106 0.122 366s testing: mul_mat, [1, 3, 1, 3] = [2, 1, 1, 3] * [2, 3, 1, 3] 366s x0: [2, 1, 1, ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s 3] 366s 0.070 -0.948 366s 366s x1: [2, 3, 1, 3] 366s 0.888 0.157 366s 0.841 0.293 366s -0.625 0.359 366s 366s y: [1, 3, 1, 3] 366s -0.087 366s -0.219 366s -0.384 366s test-mul-mat0: iter:225/500 366s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 366s x0: [1, 4, 1, 1] 366s 0.912 366s -0.677 366s 0.869 366s 0.206 366s 366s x1: [1, 1, 1, 1] 366s 0.166 366s 366s y: [4, 1, 1, 1] 366s 0.151 -0.112 0.144 0.034 366s testing: mul_mat, [2, 4, 2, 1] = [1, 2, 2, 1] * [1, 4, 2, 1] 366s x0: [1, 2, 2, 1] 366s -0.231 366s -0.915 366s 366s x1: [1, 4, 2, 1] 366s -0.302 366s 0.227 366s -0.338 366s -0.483 366s 366s y: [2, 4, 2, 1] 366s 0.070 0.276 366s -0.052 -0.208 366s 0.078 0.310 366s 0.112 0.442 366s testing: mul_mat, [2, 2, 2, 4] = [1, 2, 2, 4] * [1, 2, 2, 4] 366s x0: [1, 2, 2, 4] 366s -0.146 366s -0.284 366s 366s x1: [1, 2, 2, 4] 366s 0.306 366s -0.440 366s 366s y: [2, 2, 2, 4] 366s -0.045 -0.087 366s 0.064 0.125 366s testing: mul_mat, [1, 2, 1, 1] = [1, 1, 1, 1] * [1, 2, 1, 1] 366s x0: [1, 1, 1, 1] 366s 0.338 366s 366s x1: [1, 2, 1, 1] 366s 0.773 366s 0.536 366s 366s y: [1, 2, 1, 1] 366s 0.261 366s 0.181 366s testing: mul_mat, [2, 1, 2, 1] = [1, 2, 2, 1] * [1, 1, 2, 1] 366s x0: [1, 2, 2, 1] 366s 0.086 366s 0.832 366s 366s x1: [1, 1, 2, 1] 366s -0.444 366s 366s y: [2, 1, 2, 1] 366s -0.038 -0.369 366s testing: mul_mat, [4, 1, 2, 4] = [2, 4, 2, 4] * [2, 1, 2, 4] 366s x0: [2, 4, 2, 4] 366s -0.507 -0.054 366s 0.410 0.705 366s 0.884 0.604 366s -0.733 0.502 366s 366s x1: [2, 1, 2, 4] 366s 0.554 -0.566 366s 366s y: [4, 1, 2, 4] 366s -0.250 -0.171 0.148 -0.690 366s test-mul-mat0: iter:226/500 366s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 366s x0: [1, 1, 1, 1] 366s 0.504 366s 366s x1: [1, 4, 1, 1] 366s -0.544 366s -0.404 366s -0.355 366s -0.537 366s 366s y: [1, 4, 1, 1] 366s -0.274 366s -0.204 366s -0.179 366s -0.271 366s testing: mul_mat, [4, 1, 3, 1] = [1, 4, 3, 1] * [1, 1, 3, 1] 366s x0: [1, 4, 3, 1] 366s 0.343 366s -0.948 366s 0.716 366s -0.486 366s 366s x1: [1, 1, 3, 1] 366s 0.910 366s 366s y: [4, 1, 3, 1] 366s 0.312 -0.863 0.652 -0.442 366s testing: mul_mat, [1, 4, 3, 4] = [1, 1, 3, 4] * [1, 4, 3, 4] 366s x0: [1, 1, 3, 4] 366s 0.640 366s 366s x1: [1, 4, 3, 4] 366s -0.123 366s -0.005 366s 0.480 366s -0.768 366s 366s y: [1, 4, 3, 4] 366s -0.079 366s -0.003 366s 0.307 366s -0.492 366s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 366s x0: [1, 2, 1, 1] 366s -0.271 366s -0.808 366s 366s x1: [1, 1, 1, 1] 366s -0.849 366s 366s y: [2, 1, 1, 1] 366s 0.230 0.686 366s testing: mul_mat, [4, 1, 3, 1] = [2, 4, 3, 1] * [2, 1, 3, 1] 366s x0: [2, 4, 3, 1] 366s 0.599 -0.479 366s -0.080 -0.727 366s -0.970 0.649 366s 0.319 -0.081 366s 366s x1: [2, 1, 3, 1] 366s -0.907 0.873 366s 366s y: [4, 1, 3, 1] 366s -0.962 -0.561 1.447 -0.360 366s testing: mul_mat, [1, 2, 3, 4] = [4, 1, 3, 4] * [4, 2, 3, 4] 366s x0: [4, 1, 3, 4] 366s 0.676 0.841 -0.210 -0.176 366s 366s x1: [4, 2, 3, 4] 366s -0.567 0.864 -0.540 -0.440 366s -0.505 -0.016 -0.489 -0.906 366s 366s y: [1, 2, 3, 4] 366s 0.535 366s -0.093 366s test-mul-mat0: iter:227/500 366s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 366s x0: [2, 3, 1, 1] 366s -0.426 0.791 366s 0.633 -0.546 366s -0.064 -0.158 366s 366s x1: [2, 2, 1, 1] 366s -0.533 -0.280 366s 0.054 0.464 366s 366s y: [3, 2, 1, 1] 366s 0.005 -0.184 0.079 366s 0.344 -0.219 -0.077 366s testing: mul_mat, [2, 3, 4, 1] = [2, 2, 4, 1] * [2, 3, 4, 1] 366s x0: [2, 2, 4, 1] 366s -0.278 -0.867 366s -0.109 0.176 366s 366s x1: [2, 3, 4, 1] 366s -0.825 -0.130 366s -0.125 -0.988 366s -0.405 0.918 366s 366s y: [2, 3, 4, 1] 366s 0.342 0.067 366s 0.891 -0.160 366s -0.684 0.206 366s testing: mul_mat, [4, 2, 4, 1] = [2, 4, 4, 1] * [2, 2, 4, 1] 366s x0: [2, 4, 4, 1] 366s 0.517 -0.072 366s -0.993 0.249 366s -0.721 0.945 366s 0.857 0.643 366s 366s x1: [2, 2, 4, 1] 366s -0.163 -0.943 366s -0.166 0.995 366s 366s y: [4, 2, 4, 1] 366s -0.016 -0.073 -0.774 -0.747 366s -0.158 0.412 1.060 0.498 366s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 366s x0: [2, 4, 1, 1] 366s 0.559 0.393 366s -0.409 -0.951 366s -0.173 -0.912 366s 0.406 -0.810 366s 366s x1: [2, 4, 1, 1] 366s 0.172 -0.800 366s 0.422 0.452 366s -0.855 0.279 366s 0.095 -0.375 366s 366s y: [4, 4, 1, 1] 366s -0.218 0.690 0.700 0.718 366s 0.414 -0.602 -0.485 -0.194 366s -0.368 0.085 -0.106 -0.573 366s -0.094 0.318 0.326 0.342 366s testing: mul_mat, [4, 2, 4, 1] = [4, 4, 4, 1] * [4, 2, 4, 1] 366s x0: [4, 4, 4, 1] 366s -0.754 0.063 -0.383 -0.663 366s -0.818 -0.274 -0.810 -0.944 366s 0.492 0.214 0.528 -0.832 366s -0.517 0.199 -0.319 0.937 366s 366s x1: [4, 2, 4, 1] 366s -0.552 -0.459 -0.227 -0.568 366s 0.204 -0.762 -0.119 0.234 366s 366s y: [4, 2, 4, 1] 366s 0.850 1.297 -0.017 -0.265 366s -0.312 -0.083 -0.321 0.000 366s testing: mul_mat, [4, 4, 4, 1] = [4, 4, 4, 1] * [4, 4, 4, 1] 366s x0: [4, 4, 4, 1] 366s 0.949 0.173 0.580 0.859 366s -0.759 -0.699 -0.442 0.779 366s 0.957 0.115 0.545 -0.337 366s -0.666 0.247 -0.784 0.574 366s 366s x1: [4, 4, 4, 1] 366s -0.915 -0.428 0.499 ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s -0.117 366s 0.190 0.607 -0.701 0.997 366s 0.243 -0.604 0.142 -0.065 366s -0.339 -0.776 0.358 0.858 366s 366s y: [4, 4, 4, 1] 366s -0.753 0.682 -0.614 0.046 366s 0.735 0.519 -0.466 1.145 366s 0.152 0.124 0.262 -0.460 366s 0.489 1.310 -0.508 0.246 366s test-mul-mat0: iter:228/500 366s testing: mul_mat, [1, 3, 1, 1] = [1, 1, 1, 1] * [1, 3, 1, 1] 366s x0: [1, 1, 1, 1] 366s -0.034 366s 366s x1: [1, 3, 1, 1] 366s 0.992 366s 0.828 366s 0.575 366s 366s y: [1, 3, 1, 1] 366s -0.034 366s -0.028 366s -0.019 366s testing: mul_mat, [1, 1, 3, 1] = [1, 1, 3, 1] * [1, 1, 3, 1] 366s x0: [1, 1, 3, 1] 366s -0.697 366s 366s x1: [1, 1, 3, 1] 366s -0.960 366s 366s y: [1, 1, 3, 1] 366s 0.669 366s testing: mul_mat, [4, 1, 3, 4] = [1, 4, 3, 4] * [1, 1, 3, 4] 366s x0: [1, 4, 3, 4] 366s -0.659 366s -0.766 366s -0.087 366s 0.884 366s 366s x1: [1, 1, 3, 4] 366s 0.611 366s 366s y: [4, 1, 3, 4] 366s -0.403 -0.468 -0.053 0.541 366s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 366s x0: [1, 2, 1, 1] 366s 0.327 366s 0.578 366s 366s x1: [1, 4, 1, 1] 366s 0.153 366s 0.507 366s -0.715 366s 0.445 366s 366s y: [2, 4, 1, 1] 366s 0.050 0.089 366s 0.166 0.293 366s -0.234 -0.413 366s 0.145 0.257 366s testing: mul_mat, [4, 1, 3, 1] = [2, 4, 3, 1] * [2, 1, 3, 1] 366s x0: [2, 4, 3, 1] 366s 0.815 -0.418 366s 0.776 -0.019 366s 0.132 -0.674 366s -0.868 0.685 366s 366s x1: [2, 1, 3, 1] 366s -0.611 0.289 366s 366s y: [4, 1, 3, 1] 366s -0.619 -0.480 -0.275 0.728 366s testing: mul_mat, [2, 2, 3, 4] = [4, 2, 3, 4] * [4, 2, 3, 4] 366s x0: [4, 2, 3, 4] 366s 0.255 0.642 0.445 0.160 366s 0.424 0.358 0.288 -0.289 366s 366s x1: [4, 2, 3, 4] 366s 0.863 -0.532 -0.778 -0.307 366s 0.850 0.661 -0.783 0.665 366s 366s y: [2, 2, 3, 4] 366s -0.518 0.041 366s 0.399 0.180 366s test-mul-mat0: iter:229/500 366s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 366s x0: [2, 4, 1, 1] 366s 0.934 -0.386 366s 0.326 0.611 366s -0.180 -0.395 366s 0.390 0.372 366s 366s x1: [2, 2, 1, 1] 366s -0.545 0.792 366s 0.682 -0.908 366s 366s y: [4, 2, 1, 1] 366s -0.815 0.306 -0.215 0.083 366s 0.988 -0.332 0.236 -0.072 366s testing: mul_mat, [4, 4, 2, 1] = [2, 4, 2, 1] * [2, 4, 2, 1] 366s x0: [2, 4, 2, 1] 366s -0.592 0.829 366s 0.846 -0.910 366s 0.921 -0.177 366s -0.976 -0.465 366s 366s x1: [2, 4, 2, 1] 366s -0.657 0.045 366s -0.550 -0.186 366s -0.527 0.324 366s 0.914 -0.584 366s 366s y: [4, 4, 2, 1] 366s 0.426 -0.597 -0.613 0.620 366s 0.172 -0.296 -0.473 0.623 366s 0.581 -0.741 -0.543 0.364 366s -1.025 1.305 0.945 -0.620 366s testing: mul_mat, [4, 4, 2, 4] = [2, 4, 2, 4] * [2, 4, 2, 4] 366s x0: [2, 4, 2, 4] 366s 0.260 0.212 366s 0.716 0.011 366s -0.586 0.162 366s -0.407 0.129 366s 366s x1: [2, 4, 2, 4] 366s 0.177 0.911 366s -0.458 -0.499 366s 0.825 -0.042 366s -0.180 0.991 366s 366s y: [4, 4, 2, 4] 366s 0.239 0.137 0.043 0.045 366s -0.225 -0.333 0.188 0.122 366s 0.206 0.590 -0.490 -0.341 366s 0.163 -0.119 0.266 0.201 366s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 366s x0: [2, 4, 1, 1] 366s 0.688 0.879 366s 0.277 -0.235 366s -0.442 0.015 366s -0.228 0.890 366s 366s x1: [2, 4, 1, 1] 366s -0.401 -0.700 366s -0.239 0.934 366s -0.837 0.191 366s -0.372 -0.625 366s 366s y: [4, 4, 1, 1] 366s -0.891 0.054 0.166 -0.532 366s 0.656 -0.285 0.120 0.886 366s -0.408 -0.276 0.373 0.361 366s -0.805 0.044 0.155 -0.471 366s testing: mul_mat, [3, 2, 2, 1] = [4, 3, 2, 1] * [4, 2, 2, 1] 366s x0: [4, 3, 2, 1] 366s -0.416 0.212 -0.100 0.671 366s -0.774 -0.399 0.878 0.757 366s 0.288 0.639 -0.802 -0.037 366s 366s x1: [4, 2, 2, 1] 366s 0.095 0.180 0.350 -0.805 366s -0.596 0.654 0.965 0.302 366s 366s y: [3, 2, 2, 1] 366s -0.577 -0.448 -0.108 366s 0.493 1.277 -0.539 366s testing: mul_mat, [2, 4, 2, 4] = [3, 2, 2, 4] * [3, 4, 2, 4] 366s x0: [3, 2, 2, 4] 366s 0.375 -0.840 -0.091 366s -0.206 0.687 -0.537 366s 366s x1: [3, 4, 2, 4] 366s 0.448 -0.675 -0.284 366s -0.215 -0.025 -0.221 366s -0.118 0.559 0.005 366s -0.830 -0.229 0.606 366s 366s y: [2, 4, 2, 4] 366s 0.761 -0.403 366s -0.040 0.146 366s -0.514 0.406 366s -0.174 -0.312 366s test-mul-mat0: iter:230/500 366s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 366s x0: [3, 2, 1, 1] 366s -0.326 -0.396 0.770 366s -0.445 -0.044 -0.392 366s 366s x1: [3, 4, 1, 1] 366s -0.893 0.448 0.458 366s 0.397 0.096 -0.100 366s 0.135 0.476 0.969 366s 0.752 0.231 0.832 366s 366s y: [2, 4, 1, 1] 366s 0.467 0.197 366s -0.244 -0.141 366s 0.513 -0.461 366s 0.303 -0.671 366s testing: mul_mat, [3, 2, 1, 1] = [3, 3, 1, 1] * [3, 2, 1, 1] 366s x0: [3, 3, 1, 1] 366s 0.865 -0.504 0.567 366s 0.864 -0.085 0.674 366s 0.312 -0.627 0.071 366s 366s x1: [3, 2, 1, 1] 366s 0.739 0.595 0.015 366s -0.458 -0.488 -0.227 366s 366s y: [3, 2, 1, 1] 366s 0.348 0.599 -0.141 366s -0.279 -0.508 0.147 366s testing: mul_mat, [1, 3, 1, 4] = [3, 1, 1, 4] * [3, 3, 1, 4] 366s x0: [3, 1, 1, 4] 366s 0.188 -0.975 -0.655 366s 366s x1: [3, 3, 1, 4] 366s -0.592 0.273 -0.795 366s 0.884 0.242 0.957 366s 0.115 0.074 -0.514 366s 366s y: [1, 3, 1, 4] 366s 0.143 366s -0.697 366s 0.286 366s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 366s x0: [3, 4, 1, 1] 366s 0.915 0.204 -0.317 366s 0.142 -0.013 -0.642 366s -0.638 -0.832 0.348 366s -0.615 0.639 -0.416 366s 366s x1: [3, 1, 1, 1] 366s -0.973 0.308 0.995 366s 366s y: [4, 1, 1, 1] 366s -1.143 -0.781 0.710 0.382 366s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 366s x0: [4, 4, 1, 1] 366s -0.069 -0.810 0.120 -0.293 366s -0.805 -0.242 -0.461 -0.825 366s 0.346 -0.846 -0.464 -0.210 366s -0.760 0.332 -0.893 0.065 366s 366s x1: [4, 3, 1, 1] 366s -0.568 -0.701 -0.558 0.620 366s -0.676 -0.213 -0.403 -0.973 366s 0.388 0.917 -0.635 -0.148 366s 366s y: [4, 3, 1, 1] 366s 0.358 0.372 0.524 0.738 366s 0.456 1.584 0.338 0.741 366s -0.802 -0.120 -0.315 0.567 366s testing: mul_mat, [1, 4, 1, 4] = [4, 1, 1, 4] * [4, 4, 1, 4] 366s x0: [4, 1, 1, 4] 366s -0.696 -0.343 -0.339 0.068 366s 366s x1: [4, 4, 1, 4] 366s 0.523 0.373 0.496 0.822 366s 0.816 0.117 -0.854 -0.397 366s 0.714 -0.826 0.991 0.630 366s -0.462 -0.157 -0.050 0.469 366s 366s y: [1, 4, 1, 4] 366s -0.605 366s -0.345 366s -0.506 366s 0.424 366s test-mul-mat0: iter:231/500 366s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 366s x0: [4, 3, 1, 1] 366s 0.284 -0.135 -0.803 -0.555 366s -0.130 -0.430 0.738 -0.649 366s -0.362 -0.889 -0.225 0.442 366s 366s x1: [4, 3, 1, 1] 366s -0.215 -0.484 -0.086 0.670 366s -0.455 0.806 0.624 -0.251 366s 0.493 -0.978 -0.377 0.798 366s 366s y: [3, 3, 1, 1] 366s -0.298 -0.262 0.824 366s -0.599 0.336 -0.804 366s 0.132 -0.439 1.129 366s testing: mul_mat, [2, 3, 4, 1] = [4, 2, 4, 1] * [4, 3, 4, 1] 366s x0: [4, 2, 4, 1] 366s 0.778 -0.533 0.640 -0.202 366s -0.921 -0.109 -0.613 0.429 366s 366s x1: [4, 3, 4, 1] 366s 0.861 0.854 -0.789 -0.958 366s 0.450 0.504 -0.172 0.966 366s -0.582 -0.502 -0.489 -0.776 366s 366s y: [2, 3, 4, 1] 366s -0.096 -0.813 366s -0.224 0.051 366s -0.341 0.558 366s testing: mul_mat, [1, 2, 4, 2] = [4, 1, 4, 2] * [4, 2, 4, 2] 366s x0: [4, 1, 4, 2] 366s 0.960 0.992 -0.161 0.349 366s 366s x1: [4, 2, 4, 2] 366s 0.773 -0.980 -0.197 0.852 366s -0.088 0.189 0.281 0.405 366s 366s y: [1, 2, 4, 2] 366s 0.099 366s 0.199 366s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 366s x0: [4, 4, 1, 1] 366s -0.343 -0.229 -0.579 -0.149 366s 0.758 0.063 -0.353 0.785 366s 0.333 -0.738 0.904 0.998 366s 0.508 0.957 -0.443 -0.665 366s 366s x1: [4, 1, 1, 1] 366s -0.597 0.102 0.748 -0.240 366s 366s y: [4, 1, 1, 1] 366s -0.216 -0.898 0.163 -0.377 366s testing: mul_mat, [4, 4, 4, 1] = [4, 4, 4, 1] * [4, 4, 4, 1] 366s x0: [4, 4, 4, 1] 366s -0.616 -0.168 0.049 0.352 366s -0.581 -0.257 0.132 0.633 366s 0.698 0.517 -0.820 0.062 366s 0.850 0.833 -0.956 0.686 366s 366s x1: [4, 4, 4, 1] 366s -0.223 -0.606 -0.215 0.364 366s 0.773 0.894 0.803 -0.816 366s -0.843 -0.799 -0.413 0.260 366s 0.948 0.347 -0.310 -0.395 366s 366s y: [4, 4, 4, 1] 366s 0.356 0.487 -0.270 -0.239 366s -0.874 -1.089 0.293 0.075 366s 0.724 0.804 -0.646 -0.809 366s -0.796 -0.931 1.071 1.121 366s testing: mul_mat, [2, 4, 4, 2] = [4, 2, 4, 2] * [4, 4, 4, 2] 366s x0: [4, 2, 4, 2] 366s 0.324 0.954 -0.524 0.725 366s -0.855 0.815 0.925 0.878 366s 366s x1: [4, 4, 4, 2] 366s -0.697 0.931 -0.689 0.103 366s 0.847 0.768 -0.049 -0.105 366s 0.405 -0.157 0.294 -0.325 366s 0.798 0.380 -0.094 0.917 366s 366s y: [2, 4, 4, 2] 366s 1.098 0.808 366s 0.956 -0.237 366s -0.408 -0.487 366s 1.334 0.346 366s test-mul-mat0: iter:232/500 366s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 366s x0: [1, 2, 1, 1] 366s -0.238 366s -0.909 366s 366s x1: [1, 1, 1, 1] 366s 0.430 366s 366s y: [2, 1, 1, 1] 366s -0.102 -0.391 366s testing: mul_mat, [2, 2, 3, 1] = [1, 2, 3, 1] * [1, 2, 3, 1] 366s x0: [1, 2, 3, 1] 366s -0.997 366s 0.764 366s 366s x1: [1, 2, 3, 1] 366s 0.388 366s -0.810 366s 366s y: [2, 2, 3, 1] 366s -0.387 0.297 366s 0.808 -0.619 366s testing: mul_mat, [4, 2, 3, 4] = [1, 4, 3, 4] * [1, 2, 3, 4] 366s x0: [1, 4, 3, 4] 366s 0.292 366s -0.566 366s 0.431 366s -0.388 366s 366s x1: [1, 2, 3, 4] 366s -0.218 366s 0.420 366s 366s y: [4, 2, 3, 4] 366s -0.064 0.123 -0.094 0.085 366s 0.123 -0.238 0.181 -0.163 366s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 366s x0: [1, 3, 1, 1] 366s 0.630 366s -0.418 366s 0.391 366s 366s x1: [1, 4, 1, 1] 366s -0.794 366s 0.398 366s -0.969 366s -0.837 366s 366s y: [3, 4, 1, 1] 366s -0.500 0.332 -0.310 366s 0.251 -0.166 0.156 366s -0.611 0.405 -0.379 366s -0.528 0.350 -0.327 366s testing: mul_mat, [1, 1, 3, 1] = [3, 1, 3, 1] * [3, 1, 3, 1] 366s x0: [3, 1, 3, 1] 366s 0.258 -0.937 -0.432 366s 366s x1: [3, 1, 3, 1] 366s 0.888 -0.411 0.714 366s 366s y: [1, 1, 3, 1] 366s 0.306 366s testing: mul_mat, [2, 3, 3, 4] = [1, 2, 3, 4] *ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s [1, 3, 3, 4] 366s x0: [1, 2, 3, 4] 366s -0.339 366s 0.596 366s 366s x1: [1, 3, 3, 4] 366s -0.816 366s 0.801 366s 0.357 366s 366s y: [2, 3, 3, 4] 366s 0.276 -0.486 366s -0.271 0.477 366s -0.121 0.213 366s test-mul-mat0: iter:233/500 366s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 366s x0: [2, 1, 1, 1] 366s -0.739 0.046 366s 366s x1: [2, 3, 1, 1] 366s 0.385 -0.682 366s -0.967 -0.954 366s 0.914 0.817 366s 366s y: [1, 3, 1, 1] 366s -0.316 366s 0.671 366s -0.638 366s testing: mul_mat, [4, 1, 4, 1] = [2, 4, 4, 1] * [2, 1, 4, 1] 366s x0: [2, 4, 4, 1] 366s -0.425 0.058 366s 0.995 0.730 366s -0.843 0.848 366s 0.444 0.534 366s 366s x1: [2, 1, 4, 1] 366s 0.494 0.950 366s 366s y: [4, 1, 4, 1] 366s -0.154 1.185 0.389 0.726 366s testing: mul_mat, [3, 4, 4, 3] = [2, 3, 4, 3] * [2, 4, 4, 3] 366s x0: [2, 3, 4, 3] 366s -0.383 -0.363 366s -0.594 -0.922 366s -0.892 0.753 366s 366s x1: [2, 4, 4, 3] 366s 0.489 -0.598 366s 0.716 0.646 366s -0.750 0.160 366s 0.180 0.571 366s 366s y: [3, 4, 4, 3] 366s 0.030 0.260 -0.886 366s -0.508 -1.021 -0.152 366s 0.229 0.299 0.789 366s -0.276 -0.633 0.269 366s testing: mul_mat, [4, 3, 1, 1] = [2, 4, 1, 1] * [2, 3, 1, 1] 366s x0: [2, 4, 1, 1] 366s -0.690 -0.254 366s 0.551 -0.322 366s 0.472 0.402 366s -0.832 -0.548 366s 366s x1: [2, 3, 1, 1] 366s 0.156 0.958 366s -0.148 -0.508 366s -0.510 0.466 366s 366s y: [4, 3, 1, 1] 366s -0.351 -0.222 0.459 -0.655 366s 0.231 0.082 -0.274 0.402 366s 0.234 -0.431 -0.053 0.169 366s testing: mul_mat, [3, 2, 4, 1] = [4, 3, 4, 1] * [4, 2, 4, 1] 366s x0: [4, 3, 4, 1] 366s -0.101 0.955 0.685 -0.773 366s 0.860 0.469 -0.903 -0.222 366s -0.407 0.921 0.102 -0.832 366s 366s x1: [4, 2, 4, 1] 366s 0.981 -0.188 -0.235 0.419 366s 0.241 0.056 0.609 0.329 366s 366s y: [3, 2, 4, 1] 366s -0.764 0.875 -0.946 366s 0.192 -0.389 -0.258 366s testing: mul_mat, [2, 4, 4, 3] = [3, 2, 4, 3] * [3, 4, 4, 3] 366s x0: [3, 2, 4, 3] 366s -0.430 -0.418 -0.600 366s -0.318 -0.401 0.205 366s 366s x1: [3, 4, 4, 3] 366s -0.456 -0.014 0.066 366s 0.959 0.223 0.154 366s 0.155 -0.403 -0.075 366s -0.188 -0.489 -0.169 366s 366s y: [2, 4, 4, 3] 366s 0.162 0.164 366s -0.598 -0.363 366s 0.147 0.097 366s 0.387 0.222 366s test-mul-mat0: iter:234/500 366s testing: mul_mat, [3, 4, 1, 1] = [2, 3, 1, 1] * [2, 4, 1, 1] 366s x0: [2, 3, 1, 1] 366s 0.345 -0.436 366s -0.675 0.495 366s 0.102 -0.703 366s 366s x1: [2, 4, 1, 1] 366s -0.647 -0.414 366s -0.227 0.894 366s -0.072 -0.393 366s 0.265 -0.836 366s 366s y: [3, 4, 1, 1] 366s -0.043 0.232 0.225 366s -0.468 0.596 -0.652 366s 0.146 -0.146 0.269 366s 0.455 -0.593 0.615 366s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 366s x0: [2, 2, 1, 1] 366s 0.861 0.338 366s 0.573 -0.700 366s 366s x1: [2, 3, 1, 1] 366s -0.244 0.339 366s -0.549 0.808 366s 0.857 0.389 366s 366s y: [2, 3, 1, 1] 366s -0.096 -0.377 366s -0.200 -0.880 366s 0.869 0.219 366s testing: mul_mat, [2, 2, 1, 3] = [2, 2, 1, 3] * [2, 2, 1, 3] 366s x0: [2, 2, 1, 3] 366s -0.877 -0.312 366s -0.554 -0.552 366s 366s x1: [2, 2, 1, 3] 366s 0.476 -0.679 366s 0.830 -0.797 366s 366s y: [2, 2, 1, 3] 366s -0.205 0.112 366s -0.479 -0.020 366s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 366s x0: [2, 3, 1, 1] 366s -0.096 -0.266 366s 0.397 0.600 366s 0.692 -0.533 366s 366s x1: [2, 2, 1, 1] 366s -0.416 0.798 366s -0.396 0.922 366s 366s y: [3, 2, 1, 1] 366s -0.172 0.313 -0.713 366s -0.207 0.396 -0.765 366s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 366s x0: [3, 2, 1, 1] 366s 0.987 0.025 -0.257 366s -0.705 -0.567 0.208 366s 366s x1: [3, 2, 1, 1] 366s -0.083 -0.611 -0.984 366s -0.005 -0.894 -0.828 366s 366s y: [2, 2, 1, 1] 366s 0.156 0.200 366s 0.185 0.339 366s testing: mul_mat, [3, 3, 1, 3] = [2, 3, 1, 3] * [2, 3, 1, 3] 366s x0: [2, 3, 1, 3] 366s -0.425 -0.319 366s -0.763 -0.591 366s 0.765 -0.897 366s 366s x1: [2, 3, 1, 3] 366s 0.980 0.487 366s -0.853 -0.133 366s -0.318 -0.107 366s 366s y: [3, 3, 1, 3] 366s -0.572 -1.036 0.313 366s 0.405 0.730 -0.533 366s 0.169 0.306 -0.147 366s test-mul-mat0: iter:235/500 366s testing: mul_mat, [2, 2, 1, 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 366s x0: [4, 2, 1, 1] 366s 0.418 -0.662 0.279 -0.902 366s -0.253 0.382 -0.234 0.451 366s 366s x1: [4, 2, 1, 1] 366s -0.929 -0.872 -0.109 -0.343 366s 0.881 -0.256 -0.157 -0.899 366s 366s y: [2, 2, 1, 1] 366s 0.468 -0.227 366s 1.304 -0.689 366s testing: mul_mat, [1, 2, 4, 1] = [4, 1, 4, 1] * [4, 2, 4, 1] 366s x0: [4, 1, 4, 1] 366s -0.687 0.392 0.000 -0.855 366s 366s x1: [4, 2, 4, 1] 366s -0.490 0.867 -0.102 -0.154 366s -0.052 0.832 0.329 -0.104 366s 366s y: [1, 2, 4, 1] 366s 0.809 366s 0.452 366s testing: mul_mat, [1, 1, 4, 3] = [4, 1, 4, 3] * [4, 1, 4, 3] 366s x0: [4, 1, 4, 3] 366s -0.662 0.500 0.892 0.738 366s 366s x1: [4, 1, 4, 3] 366s 0.872 0.555 0.290 -0.249 366s 366s y: [1, 1, 4, 3] 366s -0.224 366s testing: mul_mat, [3, 1, 1, 1] = [4, 3, 1, 1] * [4, 1, 1, 1] 366s x0: [4, 3, 1, 1] 366s -0.476 0.107 -0.779 0ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s .673 366s 0.962 0.851 -0.044 0.561 366s 0.011 0.655 0.026 -0.711 366s 366s x1: [4, 1, 1, 1] 366s -0.028 0.828 -0.189 0.726 366s 366s y: [3, 1, 1, 1] 366s 0.737 1.094 0.021 366s testing: mul_mat, [2, 4, 4, 1] = [3, 2, 4, 1] * [3, 4, 4, 1] 366s x0: [3, 2, 4, 1] 366s 0.461 -0.125 -0.418 366s -0.841 0.728 -0.954 366s 366s x1: [3, 4, 4, 1] 366s -0.138 0.375 -0.040 366s 0.114 -0.697 0.129 366s -0.920 0.342 -0.418 366s -0.201 0.361 -0.341 366s 366s y: [2, 4, 4, 1] 366s -0.093 0.428 366s 0.086 -0.727 366s -0.292 1.421 366s 0.005 0.757 366s testing: mul_mat, [1, 3, 4, 3] = [2, 1, 4, 3] * [2, 3, 4, 3] 366s x0: [2, 1, 4, 3] 366s 0.637 0.318 366s 366s x1: [2, 3, 4, 3] 366s 0.836 0.330 366s 0.120 -0.965 366s 0.534 0.963 366s 366s y: [1, 3, 4, 3] 366s 0.638 366s -0.230 366s 0.647 366s test-mul-mat0: iter:236/500 366s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 366s x0: [2, 1, 1, 1] 366s -0.731 0.109 366s 366s x1: [2, 4, 1, 1] 366s -0.130 -0.181 366s -0.578 -0.493 366s -0.864 0.999 366s 0.817 -0.949 366s 366s y: [1, 4, 1, 1] 366s 0.075 366s 0.369 366s 0.740 366s -0.701 366s testing: mul_mat, [1, 1, 2, 1] = [2, 1, 2, 1] * [2, 1, 2, 1] 366s x0: [2, 1, 2, 1] 366s -0.160 -0.340 366s 366s x1: [2, 1, 2, 1] 366s 0.187 -0.946 366s 366s y: [1, 1, 2, 1] 366s 0.292 366s testing: mul_mat, [2, 1, 2, 1] = [2, 2, 2, 1] * [2, 1, 2, 1] 366s x0: [2, 2, 2, 1] 366s -0.650 0.025 366s 0.176 0.326 366s 366s x1: [2, 1, 2, 1] 366s 0.277 0.530 366s 366s y: [2, 1, 2, 1] 366s -0.167 0.221 366s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 366s x0: [2, 3, 1, 1] 366s 0.444 -0.447 366s 0.193 -0.620 366s -0.211 -0.156 366s 366s x1: [2, 2, 1, 1] 366s -0.456 -0.297 366s 0.393 -0.457 366s 366s y: [3, 2, 1, 1] 366s -0.070 0.096 0.142 366s 0.379 0.359 -0.011 366s testing: mul_mat, [2, 2, 2, 1] = [3, 2, 2, 1] * [3, 2, 2, 1] 366s x0: [3, 2, 2, 1] 366s 0.413 -0.876 0.847 366s 0.725 0.851 -0.681 366s 366s x1: [3, 2, 2, 1] 366s 0.337 -0.165 0.219 366s -0.823 0.495 0.039 366s 366s y: [2, 2, 2, 1] 366s 0.470 -0.045 366s -0.741 -0.202 366s testing: mul_mat, [1, 3, 2, 1] = [2, 1, 2, 1] * [2, 3, 2, 1] 366s x0: [2, 1, 2, 1] 366s 0.609 -0.997 366s 366s x1: [2, 3, 2, 1] 366s 0.946 -0.872 366s -0.668 -0.501 366s -0.493 0.176 366s 366s y: [1, 3, 2, 1] 366s 1.446 366s 0.092 366s -0.475 366s test-mul-mat0: iter:237/500 366s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 366s x0: [4, 3, 1, 1] 366s 0.317 -0.914 0.998 0.153 366s 0.428 -0.607 0.167 -0.734 366s 0.827 0.402 0.875 0.830 366s 366s x1: [4, 3, 1, 1] 366s 0.605 0.262 -0.876 0.924 366s -0.630 -0.484 -0.053 -0.128 366s 0.451 -0.510 -0.183 0.579 366s 366s y: [3, 3, 1, 1] 366s -0.780 -0.725 0.607 366s 0.170 0.109 -0.868 366s 0.516 0.048 0.488 366s testing: mul_mat, [2, 3, 3, 1] = [4, 2, 3, 1] * [4, 3, 3, 1] 366s x0: [4, 2, 3, 1] 366s -0.537 0.681 0.224 0.558 366s -0.404 -0.476 0.600 -0.618 366s 366s x1: [4, 3, 3, 1] 366s -0.915 0.872 -0.618 0.566 366s 0.282 -0.341 0.171 -0.456 366s -0.217 0.095 -0.086 0.300 366s 366s y: [2, 3, 3, 1] 366s 1.263 -0.766 366s -0.600 0.433 366s 0.329 -0.194 366s testing: mul_mat, [2, 2, 3, 4] = [4, 2, 3, 4] * [4, 2, 3, 4] 366s x0: [4, 2, 3, 4] 366s -0.919 -0.167 -0.954 0.800 366s -0.370 0.253 0.773 0.238 366s 366s x1: [4, 2, 3, 4] 366s 0.552 0.586 0.809 0.335 366s -0.122 -0.054 0.776 0.341 366s 366s y: [2, 2, 3, 4] 366s -1.109 0.649 366s -0.346 0.712 366s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 366s x0: [4, 4, 1, 1] 366s -0.408 0.781 -0.174 0.695 366s 0.340 -0.206 0.504 0.580 366s -0.939 0.730 0.747 -0.831 366s -0.777 -0.907 -0.340 0.143 366s 366s x1: [4, 2, 1, 1] 366s 0.761 -0.339 -0.207 0.787 366s 0.331 -0.849 -0.475 0.461 366s 366s y: [4, 2, 1, 1] 366s 0.008 0.681 -1.771 -0.101 366s -0.395 0.316 -1.668 0.741 366s testing: mul_mat, [2, 4, 3, 1] = [4, 2, 3, 1] * [4, 4, 3, 1] 366s x0: [4, 2, 3, 1] 366s -0.286 -0.919 -0.002 0.652 366s 0.287 0.431 -0.051 -0.545 366s 366s x1: [4, 4, 3, 1] 366s -0.849 0.853 -0.122 -0.902 366s 0.045 0.978 0.859 0.705 366s -0.230 0.646 0.036 -0.079 366s -0.829 -0.503 0.186 -0.236 366s 366s y: [2, 4, 3, 1] 366s -1.128 0.621 366s -0.453 0.006 366s -0.579 0.253 366s 0.545 -0.335 366s testing: mul_mat, [4, 4, 3, 4] = [2, 4, 3, 4] * [2, 4, 3, 4] 366s x0: [2, 4, 3, 4] 366s 0.088 -0.927 366s 0.743 0.863 366s -0.511 -0.694 366s 0.258 -0.995 366s 366s x1: [2, 4, 3, 4] 366s -0.224 -0.129 366s 0.783 0.569 366s -0.402 0.452 366s 0.609 0.312 366s 366s y: [4, 4, 3, 4] 366s 0.100 -0.278 0.204 0.071 366s -0.459 1.073 -0.794 -0.364 366s -0.454 0.092 -0.108 -0.554 366s -0.235 0.721 -0.527 -0.153 366s test-mul-mat0: iter:238/500 366s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 366s x0: [4, 3, 1, 1] 366s 0.419 0.435 0.938 0.909 366s -0.666 0.873 -0.125 0.975 366s -0.687 -0.842 -0.386 0.318 366s 366s x1: [4, 3, 1, 1] 366s 0.950 -0.985 -0.ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s 840 -0.443 366s 0.238 -0.649 0.492 0.568 366s -0.475 -0.255 -0.154 -0.408 366s 366s y: [3, 3, 1, 1] 366s -1.220 -1.821 0.359 366s 0.794 -0.233 0.374 366s -0.824 -0.284 0.470 366s testing: mul_mat, [4, 3, 3, 1] = [4, 4, 3, 1] * [4, 3, 3, 1] 366s x0: [4, 4, 3, 1] 366s 0.197 0.049 0.383 0.781 366s -0.976 -0.497 0.857 -0.433 366s -0.853 -0.321 0.957 -0.617 366s 0.036 -0.234 -0.945 0.330 366s 366s x1: [4, 3, 3, 1] 366s 0.341 -0.006 -0.791 -0.923 366s 0.723 -0.392 -0.972 0.738 366s -0.232 -0.416 -0.024 0.119 366s 366s y: [4, 3, 3, 1] 366s -0.957 -0.609 -0.477 0.457 366s 0.326 -1.663 -1.877 1.280 366s 0.018 0.360 0.234 0.151 366s testing: mul_mat, [2, 4, 3, 2] = [4, 2, 3, 2] * [4, 4, 3, 2] 366s x0: [4, 2, 3, 2] 366s -0.698 -0.599 0.087 -0.139 366s 0.028 -0.915 -0.705 -0.007 366s 366s x1: [4, 4, 3, 2] 366s -0.071 -0.537 -0.546 -0.031 366s -0.560 0.281 0.603 -0.785 366s 0.199 0.825 0.998 -0.714 366s 0.357 -0.605 -0.431 0.938 366s 366s y: [2, 4, 3, 2] 366s 0.328 0.874 366s 0.384 -0.692 366s -0.446 -1.447 366s -0.054 0.861 366s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 366s x0: [4, 4, 1, 1] 366s 0.023 -0.085 -0.349 -0.213 366s -0.785 0.933 0.987 -0.215 366s -0.924 -0.821 -0.424 0.674 366s 0.400 -0.297 0.664 -0.551 366s 366s x1: [4, 2, 1, 1] 366s -0.477 -0.623 -0.748 0.021 366s 0.741 0.587 0.471 0.500 366s 366s y: [4, 2, 1, 1] 366s 0.299 -0.949 1.284 -0.514 366s -0.304 0.322 -1.030 0.160 366s testing: mul_mat, [3, 4, 3, 1] = [4, 3, 3, 1] * [4, 4, 3, 1] 366s x0: [4, 3, 3, 1] 366s 0.603 -0.065 -0.746 -0.640 366s -0.995 -0.373 0.824 0.807 366s 0.804 0.925 0.709 0.167 366s 366s x1: [4, 4, 3, 1] 366s -0.227 -0.174 0.386 0.715 366s 0.171 0.102 -0.762 0.548 366s 0.353 0.259 0.289 -0.060 366s -0.270 -0.211 0.461 0.753 366s 366s y: [3, 4, 3, 1] 366s -0.872 1.186 0.050 366s 0.314 -0.394 -0.217 366s 0.019 -0.258 0.719 366s -0.975 1.335 0.041 366s testing: mul_mat, [2, 4, 3, 2] = [3, 2, 3, 2] * [3, 4, 3, 2] 366s x0: [3, 2, 3, 2] 366s -0.736 -0.106 -0.434 366s -0.018 -0.978 -0.315 366s 366s x1: [3, 4, 3, 2] 366s -0.511 0.648 0.830 366s -0.802 -0.992 0.636 366s 0.366 -0.078 -0.802 366s -0.182 0.777 0.231 366s 366s y: [2, 4, 3, 2] 366s -0.053 -0.886 366s 0.418 0.785 366s 0.088 0.322 366s -0.048 -0.829 366s test-mul-mat0: iter:239/500 366s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 366s x0: [3, 3, 1, 1] 366s 0.582 0.245 0.929 366s 0.568 0.258 -0.445 366s 0.478 -0.912 -0.896 366s 366s x1: [3, 3, 1, 1] 366s -0.882 0.962 0.753 366s 0.566 0.585 0.429 366s -0.120 -0.839 -0.353 366s 366s y: [3, 3, 1, 1] 366s 0.422 -0.589 -1.974 366s 0.871 0.282 -0.647 366s -0.603 -0.127 1.025 366s testing: mul_mat, [1, 3, 3, 1] = [3, 1, 3, 1] * [3, 3, 3, 1] 366s x0: [3, 1, 3, 1] 366s 0.959 0.127 0.331 366s 366s x1: [3, 3, 3, 1] 366s -0.135 0.954 -0.277 366s -0.483 -0.198 -0.763 366s -0.808 -0.784 -0.564 366s 366s y: [1, 3, 3, 1] 366s -0.099 366s -0.740 366s -1.061 366s testing: mul_mat, [2, 1, 3, 3] = [3, 2, 3, 3] * [3, 1, 3, 3] 366s x0: [3, 2, 3, 3] 366s 0.328 0.236 0.522 366s -0.139 0.336 -0.749 366s 366s x1: [3, 1, 3, 3] 366s 0.533 -0.900 -0.272 366s 366s y: [2, 1, 3, 3] 366s -0.180 -0.173 366s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 366s x0: [3, 4, 1, 1] 366s 0.532 -0.515 0.050 366s -0.436 -0.140 -0.659 366s 0.730 0.531 -0.212 366s -0.594 0.125 0.608 366s 366s x1: [3, 2, 1, 1] 366s -0.827 -0.941 0.773 366s 0.378 -0.820 0.504 366s 366s y: [4, 2, 1, 1] 366s 0.083 -0.016 -1.267 0.843 366s 0.649 -0.382 -0.266 -0.021 366s testing: mul_mat, [4, 3, 3, 1] = [4, 4, 3, 1] * [4, 3, 3, 1] 366s x0: [4, 4, 3, 1] 366s 0.657 0.675 0.388 0.563 366s 0.226 0.068 -0.098 0.089 366s -0.480 -0.823 -0.130 0.601 366s -0.169 -0.818 0.213 0.953 366s 366s x1: [4, 3, 3, 1] 366s -0.267 0.786 0.014 -0.510 366s 0.319 0.077 0.936 0.334 366s -0.479 0.174 -0.552 -0.452 366s 366s y: [4, 3, 3, 1] 366s 0.073 -0.054 -0.826 -1.080 366s 0.813 0.016 -0.138 0.400 366s -0.666 -0.083 -0.113 -0.610 366s testing: mul_mat, [3, 4, 3, 3] = [4, 3, 3, 3] * [4, 4, 3, 3] 366s x0: [4, 3, 3, 3] 366s -0.137 -0.440 -0.840 -0.867 366s -0.347 0.028 -0.297 -0.395 366s -0.743 -0.083 0.506 -0.250 366s 366s x1: [4, 4, 3, 3] 366s -0.403 0.608 -0.645 0.071 366s 0.111 -0.276 -0.668 -0.477 366s -0.676 -0.834 -0.144 0.645 366s 0.892 0.189 -0.378 -0.155 366s 366s y: [3, 4, 3, 3] 366s 0.268 0.321 -0.095 366s 1.082 0.341 -0.279 366s 0.022 -0.001 0.338 366s 0.247 -0.131 -0.831 366s test-mul-mat0: iter:240/500 366s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 366s x0: [4, 2, 1, 1] 366s -0.471 0.535 0.662 0.201 366s -0.532 0.064 -0.958 -0.787 366s 366s x1: [4, 1, 1, 1] 366s -0.141 0.244 0.610 0.887 366s 366s y: [2, 1, 1, 1] 366s 0.779 -1.192 366s testing: mul_mat, [2, 2, 1, ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 366s x0: [4, 2, 1, 1] 366s 0.000 -0.463 0.679 0.162 366s 0.056 -0.646 -0.233 -0.806 366s 366s x1: [4, 2, 1, 1] 366s -0.249 0.728 -0.145 -0.472 366s 0.849 -0.359 -0.093 0.503 366s 366s y: [2, 2, 1, 1] 366s -0.512 -0.070 366s 0.185 -0.104 366s testing: mul_mat, [4, 2, 1, 4] = [4, 4, 1, 4] * [4, 2, 1, 4] 366s x0: [4, 4, 1, 4] 366s -0.366 -0.289 -0.818 0.836 366s 0.796 0.230 -0.300 -0.918 366s -0.254 -0.372 -0.589 -0.526 366s 0.307 -0.533 0.058 -0.485 366s 366s x1: [4, 2, 1, 4] 366s -0.053 0.626 0.439 -0.443 366s 0.513 0.202 0.086 0.048 366s 366s y: [4, 2, 1, 4] 366s -0.891 0.376 -0.245 -0.110 366s -0.276 0.385 -0.281 0.031 366s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 366s x0: [4, 2, 1, 1] 366s -0.901 -0.952 0.814 -0.150 366s -0.659 0.112 0.981 -0.195 366s 366s x1: [4, 4, 1, 1] 366s -0.353 -0.399 0.226 0.792 366s 0.689 0.488 0.684 0.406 366s -0.840 -0.950 -0.570 0.299 366s 0.473 0.969 -0.191 0.179 366s 366s y: [2, 4, 1, 1] 366s 0.763 0.255 366s -0.590 0.192 366s 1.153 -0.171 366s -1.530 -0.425 366s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 366s x0: [2, 4, 1, 1] 366s -0.851 -0.585 366s 0.009 0.387 366s 0.227 0.882 366s 0.833 -0.155 366s 366s x1: [2, 4, 1, 1] 366s -0.021 -0.492 366s 0.468 0.710 366s -0.281 -0.486 366s -0.643 0.320 366s 366s y: [4, 4, 1, 1] 366s 0.306 -0.191 -0.439 0.058 366s -0.813 0.279 0.733 0.280 366s 0.523 -0.191 -0.493 -0.159 366s 0.360 0.118 0.136 -0.585 366s testing: mul_mat, [2, 2, 1, 4] = [4, 2, 1, 4] * [4, 2, 1, 4] 366s x0: [4, 2, 1, 4] 366s 0.187 0.779 0.607 0.136 366s -0.987 -0.863 -0.345 -0.686 366s 366s x1: [4, 2, 1, 4] 366s -0.314 0.356 -0.186 0.496 366s -0.466 -0.050 0.594 -0.125 366s 366s y: [2, 2, 1, 4] 366s 0.173 -0.273 366s 0.218 0.383 366s test-mul-mat0: iter:241/500 366s testing: mul_mat, [3, 1, 1, 1] = [1, 3, 1, 1] * [1, 1, 1, 1] 366s x0: [1, 3, 1, 1] 366s 0.220 366s -0.138 366s -0.590 366s 366s x1: [1, 1, 1, 1] 366s 0.518 366s 366s y: [3, 1, 1, 1] 366s 0.114 -0.071 -0.305 366s testing: mul_mat, [2, 3, 1, 1] = [1, 2, 1, 1] * [1, 3, 1, 1] 366s x0: [1, 2, 1, 1] 366s 0.501 366s -0.774 366s 366s x1: [1, 3, 1, 1] 366s 0.090 366s 0.700 366s -0.516 366s 366s y: [2, 3, 1, 1] 366s 0.045 -0.070 366s 0.351 -0.542 366s -0.258 0.399 366s testing: mul_mat, [3, 2, 1, 2] = [1, 3, 1, 2] * [1, 2, 1, 2] 366s x0: [1, 3, 1, 2] 366s 0.936 366s -0.502 366s -0.792 366s 366s x1: [1, 2, 1, 2] 366s 0.742 366s -0.222 366s 366s y: [3, 2, 1, 2] 366s 0.695 -0.373 -0.588 366s -0.208 0.112 0.176 366s testing: mul_mat, [2, 3, 1, 1] = [1, 2, 1, 1] * [1, 3, 1, 1] 366s x0: [1, 2, 1, 1] 366s -0.182 366s 0.560 366s 366s x1: [1, 3, 1, 1] 366s 0.723 366s 0.796 366s -0.739 366s 366s y: [2, 3, 1, 1] 366s -0.131 0.404 366s -0.145 0.445 366s 0.134 -0.414 366s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 366s x0: [2, 2, 1, 1] 366s 0.187 0.686 366s -0.728 -0.951 366s 366s x1: [2, 1, 1, 1] 366s 0.669 0.977 366s 366s y: [2, 1, 1, 1] 366s 0.795 -1.417 366s testing: mul_mat, [1, 2, 1, 2] = [2, 1, 1, 2] * [2, 2, 1, 2] 366s x0: [2, 1, 1, 2] 366s 0.387 -0.894 366s 366s x1: [2, 2, 1, 2] 366s -0.318 -0.225 366s 0.749 0.166 366s 366s y: [1, 2, 1, 2] 366s 0.078 366s 0.141 366s test-mul-mat0: iter:242/500 366s testing: mul_mat, [3, 2, 1, 1] = [1, 3, 1, 1] * [1, 2, 1, 1] 366s x0: [1, 3, 1, 1] 366s -0.258 366s 0.735 366s 0.179 366s 366s x1: [1, 2, 1, 1] 366s -0.519 366s -0.071 366s 366s y: [3, 2, 1, 1] 366s 0.134 -0.381 -0.093 366s 0.018 -0.052 -0.013 366s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 366s x0: [1, 3, 1, 1] 366s -0.408 366s 0.428 366s -0.547 366s 366s x1: [1, 3, 1, 1] 366s -0.698 366s 0.405 366s 0.156 366s 366s y: [3, 3, 1, 1] 366s 0.285 -0.299 0.382 366s -0.165 0.173 -0.221 366s -0.064 0.067 -0.085 366s testing: mul_mat, [2, 3, 1, 4] = [1, 2, 1, 4] * [1, 3, 1, 4] 366s x0: [1, 2, 1, 4] 366s 0.378 366s 0.282 366s 366s x1: [1, 3, 1, 4] 366s -0.359 366s -0.890 366s 0.229 366s 366s y: [2, 3, 1, 4] 366s -0.136 -0.101 366s -0.336 -0.251 366s 0.086 0.064 366s testing: mul_mat, [4, 2, 1, 1] = [1, 4, 1, 1] * [1, 2, 1, 1] 366s x0: [1, 4, 1, 1] 366s 0.998 366s -0.347 366s -0.458 366s -0.234 366s 366s x1: [1, 2, 1, 1] 366s 0.696 366s 0.248 366s 366s y: [4, 2, 1, 1] 366s 0.695 -0.242 -0.319 -0.163 366s 0.248 -0.086 -0.114 -0.058 366s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 366s x0: [4, 2, 1, 1] 366s -0.552 0.356 0.916 -0.952 366s -0.724 0.477 -0.975 0.585 366s 366s x1: [4, 1, 1, 1] 366s 0.245 0.970 0.220 0.886 366s 366s y: [2, 1, 1, 1] 366s -0.432 0.589 366s testing: mul_mat, [4, 4, 1, 4] = [2, 4, 1, 4] * [2, 4, 1, 4] 366s x0: [2, 4, 1, 4] 366s -0.085 0.001 366s -0.041 -0.782 366s -0.636 0.916 366s 0.939 -0.444 366s 366s x1: [2, 4, 1, 4] 366s -0.919 -0.275 366s -0.449 0.458 366s -0.993 0.024 366s -0.958 0.854 366s 366s y: [4, 4, 1, 4] 366s 0.078 0.253 0.333 -0.741 366s 0.038 -0.340 0.705 -0.625 366s 0.084 0.022 0.653 -0.943 366s 0.082 -0.628 1.392 -1.27ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s 9 366s test-mul-mat0: iter:243/500 366s testing: mul_mat, [2, 2, 1, 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 366s x0: [4, 2, 1, 1] 366s 0.337 0.436 -0.765 0.011 366s -0.205 -0.660 -0.277 -0.739 366s 366s x1: [4, 2, 1, 1] 366s 0.619 -0.118 -0.456 0.897 366s -0.527 0.903 0.623 0.735 366s 366s y: [2, 2, 1, 1] 366s 0.516 -0.586 366s -0.252 -1.204 366s testing: mul_mat, [4, 2, 2, 1] = [4, 4, 2, 1] * [4, 2, 2, 1] 366s x0: [4, 4, 2, 1] 366s 0.945 -0.486 0.808 0.568 366s -0.751 -0.473 -0.095 0.685 366s -0.238 0.916 -0.519 0.102 366s -0.360 -0.258 -0.591 -0.530 366s 366s x1: [4, 2, 2, 1] 366s 0.307 0.831 0.310 0.054 366s -0.642 0.990 0.166 -0.613 366s 366s y: [4, 2, 2, 1] 366s 0.168 -0.616 0.533 -0.537 366s -1.302 -0.423 0.912 0.202 366s testing: mul_mat, [3, 4, 2, 1] = [4, 3, 2, 1] * [4, 4, 2, 1] 366s x0: [4, 3, 2, 1] 366s -0.545 -0.292 0.293 -0.580 366s 0.179 -0.752 -0.698 -0.869 366s -0.695 -0.177 0.824 -0.230 366s 366s x1: [4, 4, 2, 1] 366s -0.463 0.666 0.060 -0.214 366s -0.807 0.965 -0.529 -0.045 366s 0.882 -0.048 -0.943 -0.479 366s 0.694 -0.534 -0.008 0.745 366s 366s y: [3, 4, 2, 1] 366s 0.200 -0.439 0.303 366s 0.029 -0.461 -0.035 366s -0.465 1.269 -1.272 366s -0.657 -0.116 -0.566 366s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 366s x0: [4, 3, 1, 1] 366s 0.046 0.869 0.683 0.318 366s 0.324 0.148 -0.282 -0.925 366s -0.746 0.025 0.724 -0.018 366s 366s x1: [4, 3, 1, 1] 366s 0.874 -0.526 0.386 0.308 366s 0.869 0.308 -0.591 -0.676 366s -0.985 0.702 -0.256 0.194 366s 366s y: [3, 3, 1, 1] 366s -0.055 -0.189 -0.391 366s -0.311 1.120 -1.056 366s 0.451 -0.323 0.563 366s testing: mul_mat, [1, 4, 2, 1] = [3, 1, 2, 1] * [3, 4, 2, 1] 366s x0: [3, 1, 2, 1] 366s -0.130 -0.286 -0.469 366s 366s x1: [3, 4, 2, 1] 366s 0.988 -0.711 0.620 366s -0.715 -0.140 0.621 366s -0.841 0.335 0.007 366s 0.467 0.204 -0.686 366s 366s y: [1, 4, 2, 1] 366s -0.216 366s -0.159 366s 0.010 366s 0.203 366s testing: mul_mat, [1, 3, 2, 1] = [1, 1, 2, 1] * [1, 3, 2, 1] 366s x0: [1, 1, 2, 1] 366s 0.273 366s 366s x1: [1, 3, 2, 1] 366s -0.824 366s 0.078 366s 0.132 366s 366s y: [1, 3, 2, 1] 366s -0.225 366s 0.021 366s 0.036 366s test-mul-mat0: iter:244/500 366s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 366s x0: [2, 4, 1, 1] 366s -0.216 -0.229 366s 0.655 0.655 366s 0.486 -0.814 366s 0.842 -0.725 366s 366s x1: [2, 4, 1, 1] 366s -0.994 -0.596 366s 0.540 -0.721 366s 0.927 0.067 366s 0.597 -0.225 366s 366s y: [4, 4, 1, 1] 366s 0.351 -1.042 0.003 -0.405 366s 0.049 -0.119 0.850 0.978 366s -0.215 0.651 0.396 0.732 366s -0.077 0.244 0.473 0.666 366s testing: mul_mat, [3, 4, 4, 1] = [2, 3, 4, 1] * [2, 4, 4, 1] 366s x0: [2, 3, 4, 1] 366s -0.592 -0.027 366s 0.704 0.699 366s -0.443 -0.197 366s 366s x1: [2, 4, 4, 1] 366s 0.698 -0.981 366s 0.353 -0.170 366s -0.520 0.292 366s 0.584 0.100 366s 366s y: [3, 4, 4, 1] 366s -0.387 -0.195 -0.116 366s -0.204 0.130 -0.123 366s 0.300 -0.162 0.173 366s -0.348 0.480 -0.278 366s testing: mul_mat, [2, 3, 4, 3] = [2, 2, 4, 3] * [2, 3, 4, 3] 366s x0: [2, 2, 4, 3] 366s -0.317 0.834 366s -0.398 0.395 366s 366s x1: [2, 3, 4, 3] 366s 0.675 0.486 366s -0.113 -0.120 366s -0.459 0.030 366s 366s y: [2, 3, 4, 3] 366s 0.191 -0.077 366s -0.064 -0.003 366s 0.170 0.195 366s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 366s x0: [2, 4, 1, 1] 366s 0.749 0.405 366s 0.023 0.042 366s 0.702 0.232 366s 0.275 0.371 366s 366s x1: [2, 2, 1, 1] 366s -0.735 0.927 366s -0.035 -0.413 366s 366s y: [4, 2, 1, 1] 366s -0.176 0.022 -0.302 0.142 366s -0.193 -0.018 -0.120 -0.163 366s testing: mul_mat, [1, 2, 4, 1] = [4, 1, 4, 1] * [4, 2, 4, 1] 366s x0: [4, 1, 4, 1] 366s 0.952 -0.825 -0.289 0.607 366s 366s x1: [4, 2, 4, 1] 366s -0.473 0.872 0.171 -0.387 366s 0.907 0.655 0.979 0.827 366s 366s y: [1, 2, 4, 1] 366s -1.454 366s 0.543 366s testing: mul_mat, [1, 4, 4, 3] = [1, 1, 4, 3] * [1, 4, 4, 3] 366s x0: [1, 1, 4, 3] 366s 0.964 366s 366s x1: [1, 4, 4, 3] 366s -0.647 366s 0.061 366s -0.530 366s 0.335 366s 366s y: [1, 4, 4, 3] 366s -0.624 366s 0.058 366s -0.511 366s 0.323 366s test-mul-mat0: iter:245/500 366s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 366s x0: [3, 2, 1, 1] 366s -0.187 0.707 0.380 366s -0.827 -0.217 -0.228 366s 366s x1: [3, 1, 1, 1] 366s -0.594 0.832 -0.841 366s 366s y: [2, 1, 1, 1] 366s 0.380 0.502 366s testing: mul_mat, [1, 2, 3, 1] = [3, 1, 3, 1] * [3, 2, 3, 1] 366s x0: [3, 1, 3, 1] 366s -0.333 0.388 0.922 366s 366s x1: [3, 2, 3, 1] 366s 0.476 -0.808 0.647 366s -0.105 -0.355 0.611 366s 366s y: [1, 2, 3, 1] 366s 0.125 366s 0.461 366s testing: mul_mat, [3, 1, 3, 4] = [3, 3, 3, 4] * [3, 1, 3, 4] 366s x0: [3, 3, 3, 4] 366s 0.238 -0.473 0.956 366s 0.678 -0.919 -0.513 366s -0.312 0.509 -0.524 366s 366s x1: [3, 1, 3, 4] 366s 0.485 -0.962 0.592 366s 366s y: [3, 1, 3, 4] 366s 1.137 0.910 -0.951 366s testing: mul_mat, [1, 3, 1, 1] = [3, 1, 1, 1] * [3, 3, 1, 1] 366s x0: [3, 1, 1, 1] 366s -0.823 -0.785 -0.701 366s 366s x1: [3, 3, 1, 1] 366s 0.551 0.712 -0.831 366s 0.917 0.622 0.786 366s 0.323 -0.581 0.732 366s 366s y: [1, 3, 1, 1] 366s -0.431 366s -1.795 366s -0.322 366s testing: mul_mat, [2, 3, 3, 1] = [1, 2, 3, 1] * [1, 3, 3, 1] 366s x0: [1, 2, 3, 1] 366s 0.875 366s 0.572 366s 366s x1: [1, 3, 3, 1] 366s -0.649 366s 0.098 366s -0.699 366s 366s y: [2, 3, 3, 1] 366s -0.568 -0.371 366s 0.086 0.056 366s -0.612 -0.399 366s testing: mul_mat, [1, 1, 3, 4] = [2, 1, 3, 4] * [2, 1, 3, 4] 366s x0: [2, 1, 3, 4] 366s 0.170 0.049 366s 366s x1: [2, 1, 3, 4] 366s 0.800 0.819 366s 366s y: [1, 1, 3, 4] 366s 0.176 366s test-mul-mat0: iter:246/500 366s testing: mul_mat, [2, 3, 1, 1] = [1, 2, 1, 1] * [1, 3, 1, 1] 366s x0: [1, 2, 1, 1] 366s -0.621 366s -0.652 366s 366s x1: [1, 3, 1, 1] 366s 0.285 366s 0.330 366s 0.451 366s 366s y: [2, 3, 1, 1] 366s -0.177 -0.186 366s -0.205 -0.215 366s -0.280 -0.294 366s testing: mul_mat, [2, 2, 3, 1] = [1, 2, 3, 1] * [1, 2, 3, 1] 366s x0: [1, 2, 3, 1] 366s -0.672 366s -0.937 366s 366s x1: [1, 2, 3, 1] 366s -0.614 366s 0.154 366s 366s y: [2, 2, 3, 1] 366s 0.412 0.575 366s -0.103 -0.144 366s testing: mul_mat, [2, 2, 3, 4] = [1, 2, 3, 4] * [1, 2, 3, 4] 366s x0: [1, 2, 3, 4] 366s 0.330 366s 0.904 366s 366s x1: [1, 2, 3, 4] 366s -0.456 366s 0.511 366s 366s y: [2, 2, 3, 4] 366s -0.150 -0.412 366s 0.169 0.462 366s testing: mul_mat, [4, 2, 1, 1] = [1, 4, 1, 1] * [1, 2, 1, 1] 366s x0: [1, 4, 1, 1] 366s 0.173 366s 0.843 366s 0.907 366s 0.292 366s 366s x1: [1, 2, 1, 1] 366s -0.585 366s -0.159 366s 366s y: [4, 2, 1, 1] 366s -0.101 -0.493 -0.531 -0.171 366s -0.027 -0.134 -0.144 -0.046 366s testing: mul_mat, [4, 1, 3, 1] = [4, 4, 3, 1] * [4, 1, 3, 1] 366s x0: [4, 4, 3, 1] 366s -0.187 -0.390 -0.347 -0.508 366s 0.985 -0.122 0.651 0.031 366s 0.357 -0.937 -0.035 -0.759 366s -0.423 -0.180 0.068 -0.665 366s 366s x1: [4, 1, 3, 1] 366s 0.173 0.811 0.152 0.187 366s 366s y: [4, 1, 3, 1] 366s -0.497 0.176 -0.845 -0.333 366s testing: mul_mat, [1, 4, 3, 4] = [4, 1, 3, 4] * [4, 4, 3, 4] 366s x0: [4, 1, 3, 4] 366s 0.610 -0.210 0.514 0.247 366s 366s x1: [4, 4, 3, 4] 366s -0.770 0.694 0.510 -0.086 366s -0.611 0.714 -0.718 -0.172 366s -0.473 0.199 0.695 -0.218 366s 0.612 -0.804 0.849 -0.990 366s 366s y: [1, 4, 3, 4] 366s -0.375 366s -0.935 366s -0.027 366s 0.735 366s test-mul-mat0: iter:247/500 366s testing: mul_mat, [3, 2, 1, 1] = [4, 3, 1, 1] * [4, 2, 1, 1] 366s x0: [4, 3, 1, 1] 366s -0.091 0.296 -0.127 -0.622 366s 0.650 -0.694 -0.985 -0.035 366s -0.183 0.070 -0.166 0.235 366s 366s x1: [4, 2, 1, 1] 366s 0.283 -0.170 -0.326 0.297 366s -0.308 -0.606 -0.046 -0.811 366s 366s y: [3, 2, 1, 1] 366s -0.220 0.613 0.060 366s 0.359 0.295 -0.169 366s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 366s x0: [4, 3, 1, 1] 366s 0.846 0.248 0.404 -0.200 366s 0.437 0.642 0.709 -0.267 366s -0.486 -0.913 -0.616 -0.180 366s 366s x1: [4, 3, 1, 1] 366s 0.427 -0.841 0.864 0.953 366s -0.377 0.919 -0.750 0.267 366s 0.726 -0.664 -0.451 -0.444 366s 366s y: [3, 3, 1, 1] 366s 0.311 0.006 -0.143 366s -0.447 -0.177 -0.242 366s 0.356 -0.310 0.612 366s testing: mul_mat, [1, 3, 1, 2] = [4, 1, 1, 2] * [4, 3, 1, 2] 366s x0: [4, 1, 1, 2] 366s 0.136 -0.645 0.708 0.223 366s 366s x1: [4, 3, 1, 2] 366s -0.898 0.349 0.637 0.172 366s -0.817 -0.128 -0.401 -0.658 366s -0.264 -0.448 -0.035 -0.345 366s 366s y: [1, 3, 1, 2] 366s 0.142 366s -0.460 366s 0.151 366s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 366s x0: [4, 4, 1, 1] 366s -0.388 -0.571 0.588 0.216 366s 0.774 -0.550 -0.411 0.931 366s 0.648 0.808 0.747 0.288 366s 0.577 -0.190 0.737 -0.356 366s 366s x1: [4, 1, 1, 1] 366s -0.834 -0.503 -0.730 0.038 366s 366s y: [4, 1, 1, 1] 366s 0.190 -0.034 -1.481 -0.937 366s testing: mul_mat, [1, 4, 1, 1] = [4, 1, 1, 1] * [4, 4, 1, 1] 366s x0: [4, 1, 1, 1] 366s 0.569 0.191 -0.875 -0.981 366s 366s x1: [4, 4, 1, 1] 366s 0.553 -0.518 0.780 0.908 366s -0.810 0.003 -0.354 -0.282 366s -0.672 -0.266 -0.116 -0.175 366s 0.003 0.921 0.921 0.615 366s 366s y: [1, 4, 1, 1] 366s -1.358 366s 0.126 366s -0.160 366s -1.231 366s testing: mul_mat, [2, 4, 1, 2] = [1, 2, 1, 2] * [1, 4, 1, 2] 366s x0: [1, 2, 1, 2] 366s 0.448 366s -0.834 366s 366s x1: [1, 4, 1, 2] 366s -0.000 366s -0.065 366s 0.607 366s 0.588 366s 366s y: [2, 4, 1, 2] 366s -0.000 0.000 366s -0.029 0.054 366s 0.272 -0.506 366s 0.264 -0.491 366s test-mul-mat0: iter:248/500 366s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 366s x0: [4, 1, 1, 1] 366s -0.543 -0.173 0.305 -0.928 366s 366s x1: [4, 1, 1, 1] 366s -0.979 -0.441 0.454 -0.095 366s 366s y: [1, 1, 1, 1] 366s 0.834 366s testing: mul_mat, [1, 1, 4, 1] = [4, 1, 4, 1] * [4, 1, 4, 1] 366s x0: [4, 1, 4, 1] 366s 0.332 0.116 -0.108 0.563 366s 366s x1: [4, 1, 4, 1] 366s -0.478 -0.126 0.263 -0.353 366s 366s y: [1, 1, 4, 1] 366s -0.400 366s testing: mul_mat, [3, 1, 4, 3] = [4, 3, 4, 3] * [4, 1, 4, 3] 366s x0: [4, 3, 4, 3] 366s -0.040 -0.370 -0.391 0.933 366s -0.549 0.220 -0.588 -0.981 366s -0.523 0.703 0.406 -0.099 366s 366s x1: [4, 1, 4, 3] 366s 0.082 0.814 -0.422 0.345 366s 366s y: [3, 1, 4, 3] 366s 0.183 0.043 0.324 366s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 366s x0: [4, 4, 1, 1] 366s -0.348 0.512 0.849 0.369 366s 0.488 0.315 0.876 0.447 366s 0.513 -0.676 -0.660 -0.010 366s 0.247 -0.185 -0.282 -0.482 366s 366s x1: [4, 3, 1, 1] 366s -0.164 0.901 -0.453 -0.243 366s 0.900 0.278 -0.579 0.719 366s 0.732 -0.269 -0.117 0.982 366s 366s y: [4, 3, 1, 1] 366s 0.045 -0.302 -0.392 0.037 366s -0.397 0.341 0.649 -0.012 366s -0.130 0.609 0.625 -0.209 366s testing: mul_mat, [2, 4, 4, 1] = [4, 2, 4, 1] * [4, 4, 4, 1] 366s x0: [4, 2, 4, 1] 366s -0.061 0.129 0.732 0.408 366s 0.358 -0.975 -0.707 0.575 366s 366s x1: [4, 4, 4, 1] 366s -0.223 0.910 0.354 -0.322 366s -0.542 -0.889 -0.422 0.736 366s -0.468 -0.703 0.468 0.263 366s 0.180 0.449 0.886 0.833 366s 366s y: [2, 4, 4, 1] 366s 0.259 -1.402 366s -0.091 1.393 366s 0.388 0.338 366s 1.036 -0.521 366s testing: mul_mat, [4, 4, 4, 3] = [2, 4, 4, 3] * [2, 4, 4, 3] 366s x0: [2, 4, 4, 3] 366s -0.428 -0.481 366s 0.372 -0.774 366s 0.443 0.901 366s -0.032 0.776 366s 366s x1: [2, 4, 4, 3] 366s 0.040 0.295 366s 0.269 -0.227 366s 0.589 -0.323 366s -0.652 -0.690 366s 366s y: [4, 4, 4, 3] 366s -0.159 -0.214 0.284 0.228 366s -0.006 0.276 -0.086 -0.185 366s -0.097 0.469 -0.030 -0.270 366s 0.612 0.292 -0.911 -0.515 366s test-mul-mat0: iter:249/500 366s testing: mul_mat, [3, 4, 1, 1] = [2, 3, 1, 1] * [2, 4, 1, 1] 366s x0: [2, 3, 1, 1] 366s -0.266 -0.944 366s -0.127 -0.650 366s -0.494 -0.106 366s 366s x1: [2, 4, 1, 1] 366s 0.510 0.336 366s 0.726 -0.584 366s -0.168 -0.244 366s 0.290 0.705 366s 366s y: [3, 4, 1, 1] 366s -0.453 -0.283 -0.288 366s 0.358 0.287 -0.297 366s 0.275 0.180 0.109 366s -0.743 -0.495 -0.218 366s testing: mul_mat, [2, 3, 3, 1] = [2, 2, 3, 1] * [2, 3, 3, 1] 366s x0: [2, 2, 3, 1] 366s -0.651 -0.405 366s 0.042 0.638 366s 366s x1: [2, 3, 3, 1] 366s 0.769 0.535 366s 0.026 0.671 366s 0.334 0.196 366s 366s y: [2, 3, 3, 1] 366s -0.718 0.374 366s -0.288 0.429 366s -0.297 0.139 366s testing: mul_mat, [2, 2, 3, 1] = [2, 2, 3, 1] * [2, 2, 3, 1] 366s x0: [2, 2, 3, 1] 366s -0.795 0.490 366s -0.479 -0.862 366s 366s x1: [2, 2, 3, 1] 366s -0.509 -0.602 366s 0.124 -0.838 366s 366s y: [2, 2, 3, 1] 366s 0.109 0.762 366s -0.509 0.663 366s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 366s x0: [2, 2, 1, 1] 366s 0.481 0.766 366s 0.710 -0.395 366s 366s x1: [2, 2, 1, 1] 366s -0.503 -0.495 366s 0.384 0.219 366s 366s y: [2, 2, 1, 1] 366s -0.621 -0.162 366s 0.352 0.186 366s testing: mul_mat, [1, 2, 3, 1] = [2, 1, 3, 1] * [2, 2, 3, 1] 366s x0: [2, 1, 3, 1] 366s -0.234 -0.098 366s 366s x1: [2, 2, 3, 1] 366s 0.872 0.498 366s -0.076 -0.863 366s 366s y: [1, 2, 3, 1] 366s -0.253 366s 0.102 366s testing: mul_mat, [3, 2, 3, 1] = [1, 3, 3, 1] * [1, 2, 3, 1] 366s x0: [1, 3, 3, 1] 366s -0.228 366s -0.727 366s -0.196 366s 366s x1: [1, 2, 3, 1] 366s 0.378 366s -0.101 366s 366s y: [3, 2, 3, 1] 366s -0.086 -0.275 -0.074 366s 0.023 0.073 0.020 366s test-mul-mat0: iter:250/500 366s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 366s x0: [2, 4, 1, 1] 366s -0.639 -0.865 366s 0.124 0.260 366s 0.543 -0.001 366s -0.336 -0.665 366s 366s x1: [2, 4, 1, 1] 366s 0.058 -0.046 366s 0.727 -0.292 366s 0.527 -0.507 366s 0.610 -0.093 366s 366s y: [4, 4, 1, 1] 366s 0.002 -0.005 0.032 0.011 366s -0.212 0.015 0.395 -0.050 366s 0.101 -0.066 0.287 0.160 366s -0.309 0.052 0.332 -0.143 366s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 366s x0: [2, 1, 1, 1] 366s -0.614 -0.313 366s 366s x1: [2, 4, 1, 1] 366s -0.907 0.436 366s -0.392 -0.103 366s 0.974 0.486 366s 0.573 -0.990 366s 366s y: [1, 4, 1, 1] 366s 0.420 366s 0.273 366s -0.750 366s -0.041 366s testing: mul_mat, [4, 1, 1, 3] = [2, 4, 1, 3] * [2, 1, 1, 3] 366s x0: [2, 4, 1, 3] 366s 0.771 -0.993 366s 0.564 -0.323 366s 0.753 0.925 366s -0.188 -0.123 366s 366s x1: [2, 1, 1, 3] 366s -0.813 0.289 366s 366s y: [4, 1, 1, 3] 366s -0.914 -0.552 -0.344 0.117 366s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 366s x0: [2, 1, 1, 1] 366s 0.024 -0.193 366s 366s x1: [2, 4, 1, 1] 366s -0.739 -0.885 366s 0.741 0.507 366s -0.641 0.255 366s 0.460 -0.871 366s 366s y: [1, 4, 1, 1] 366s 0.153 366s -0.080 366s -0.065 366s 0.179 366s testing: mul_mat, [1, 2, 1, 1] = [1, 1, 1, 1] * [1, 2, 1, 1] 366s x0: [1, 1, 1, 1] 366s 0.891 366s 366s x1: [1, 2, 1, 1] 366s 0.014 366s -0.051 366s 366s y: [1, 2, 1, 1] 366s 0.013 366s -0.045 366s testing: mul_mat, [2, 1, 1, 3] = [1, 2, 1, 3] * [1, 1, 1, 3] 366s x0: [1, 2, 1, 3] 366s -0.334 366s 0.737 366s 366s x1: [1, 1, 1, 3] 366s -0.866 366s 366s y: [2, 1, 1, 3] 366s 0.289 -0.639 366s test-mul-mat0: iter:251/500 366s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 366s x0: [3, 2, 1, 1] 366s 0.022 -0.973 -0.103 366s -0.087 -0.839 -0.128 366s 366s x1: ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s [3, 4, 1, 1] 366s 0.868 0.751 0.087 366s 0.983 0.492 -0.406 366s -0.658 -0.253 -0.946 366s -0.528 -0.992 0.078 366s 366s y: [2, 4, 1, 1] 366s -0.720 -0.716 366s -0.415 -0.446 366s 0.329 0.390 366s 0.945 0.868 366s testing: mul_mat, [4, 2, 3, 1] = [3, 4, 3, 1] * [3, 2, 3, 1] 366s x0: [3, 4, 3, 1] 366s 0.723 -0.184 -0.270 366s 0.731 0.895 -0.992 366s -0.247 0.922 -0.095 366s 0.666 -0.917 0.778 366s 366s x1: [3, 2, 3, 1] 366s -0.319 -0.856 0.538 366s -0.582 0.411 0.504 366s 366s y: [4, 2, 3, 1] 366s -0.219 -1.533 -0.761 0.992 366s -0.632 -0.557 0.475 -0.373 366s testing: mul_mat, [1, 4, 3, 2] = [3, 1, 3, 2] * [3, 4, 3, 2] 366s x0: [3, 1, 3, 2] 366s -0.655 0.030 0.197 366s 366s x1: [3, 4, 3, 2] 366s -0.304 0.972 0.720 366s 0.602 0.638 0.803 366s 0.379 0.985 -0.971 366s -0.305 -0.249 0.666 366s 366s y: [1, 4, 3, 2] 366s 0.370 366s -0.217 366s -0.410 366s 0.323 366s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 366s x0: [3, 1, 1, 1] 366s 0.962 0.391 0.096 366s 366s x1: [3, 1, 1, 1] 366s -0.336 0.488 0.861 366s 366s y: [1, 1, 1, 1] 366s -0.049 366s testing: mul_mat, [1, 3, 3, 1] = [1, 1, 3, 1] * [1, 3, 3, 1] 366s x0: [1, 1, 3, 1] 366s -0.202 366s 366s x1: [1, 3, 3, 1] 366s 0.819 366s 0.119 366s 0.434 366s 366s y: [1, 3, 3, 1] 366s -0.165 366s -0.024 366s -0.088 366s testing: mul_mat, [2, 1, 3, 2] = [1, 2, 3, 2] * [1, 1, 3, 2] 366s x0: [1, 2, 3, 2] 366s 0.306 366s 0.343 366s 366s x1: [1, 1, 3, 2] 366s -0.431 366s 366s y: [2, 1, 3, 2] 366s -0.132 -0.148 366s test-mul-mat0: iter:252/500 366s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 366s x0: [4, 1, 1, 1] 366s -0.437 -0.424 0.097 0.906 366s 366s x1: [4, 3, 1, 1] 366s -0.362 0.095 0.339 0.767 366s 0.189 -0.863 0.181 -0.189 366s -0.293 -0.857 -0.834 0.862 366s 366s y: [1, 3, 1, 1] 366s 0.846 366s 0.129 366s 1.192 366s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 366s x0: [4, 1, 1, 1] 366s -0.910 0.439 0.578 0.363 366s 366s x1: [4, 1, 1, 1] 366s 0.884 0.553 0.914 -0.320 366s 366s y: [1, 1, 1, 1] 366s -0.149 366s testing: mul_mat, [3, 1, 1, 4] = [4, 3, 1, 4] * [4, 1, 1, 4] 366s x0: [4, 3, 1, 4] 366s 0.590 0.505 0.595 -0.313 366s 0.411 0.479 -0.760 0.325 366s -0.840 -0.890 0.415 0.598 366s 366s x1: [4, 1, 1, 4] 366s -0.984 -0.595 -0.931 0.681 366s 366s y: [3, 1, 1, 4] 366s -1.648 0.240 1.377 366s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 366s x0: [4, 2, 1, 1] 366s 0.165 0.682 -0.383 0.850 366s -0.143 -0.233 -0.648 -0.384 366s 366s x1: [4, 3, 1, 1] 366s 0.750 -0.042 -0.412 0.821 366s -0.662 -0.384 0.858 -0.831 366s -0.699 -0.330 0.157 0.784 366s 366s y: [2, 3, 1, 1] 366s 0.951 -0.146 366s -1.405 -0.053 366s 0.265 -0.226 366s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 366s x0: [2, 2, 1, 1] 366s -0.023 0.373 366s -0.864 0.564 366s 366s x1: [2, 4, 1, 1] 366s -0.654 0.091 366s -0.274 0.756 366s 0.931 -0.476 366s 0.943 -0.614 366s 366s y: [2, 4, 1, 1] 366s 0.049 0.616 366s 0.288 0.663 366s -0.199 -1.073 366s -0.251 -1.161 366s testing: mul_mat, [4, 2, 1, 4] = [2, 4, 1, 4] * [2, 2, 1, 4] 366s x0: [2, 4, 1, 4] 366s -0.645 0.045 366s 0.290 0.842 366s 0.911 0.604 366s 0.081 -0.012 366s 366s x1: [2, 2, 1, 4] 366s 0.958 0.711 366s -0.819 0.815 366s 366s y: [4, 2, 1, 4] 366s -0.586 0.876 1.302 0.069 366s 0.566 0.449 -0.254 -0.076 366s test-mul-mat0: iter:253/500 366s testing: mul_mat, [2, 2, 1, 1] = [1, 2, 1, 1] * [1, 2, 1, 1] 366s x0: [1, 2, 1, 1] 366s 0.994 366s -0.625 366s 366s x1: [1, 2, 1, 1] 366s 0.765 366s -0.644 366s 366s y: [2, 2, 1, 1] 366s 0.760 -0.478 366s -0.640 0.403 366s testing: mul_mat, [2, 2, 2, 1] = [1, 2, 2, 1] * [1, 2, 2, 1] 366s x0: [1, 2, 2, 1] 366s -0.131 366s 0.594 366s 366s x1: [1, 2, 2, 1] 366s -0.645 366s -0.641 366s 366s y: [2, 2, 2, 1] 366s 0.085 -0.383 366s 0.084 -0.381 366s testing: mul_mat, [4, 2, 2, 3] = [1, 4, 2, 3] * [1, 2, 2, 3] 366s x0: [1, 4, 2, 3] 366s -0.479 366s 0.222 366s 0.559 366s 0.923 366s 366s x1: [1, 2, 2, 3] 366s 0.786 366s 0.443 366s 366s y: [4, 2, 2, 3] 366s -0.376 0.174 0.439 0.725 366s -0.212 0.098 0.247 0.408 366s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 366s x0: [1, 2, 1, 1] 366s 0.327 366s 0.938 366s 366s x1: [1, 4, 1, 1] 366s 0.994 366s -0.645 366s -0.972 366s 0.131 366s 366s y: [2, 4, 1, 1] 366s 0.326 0.933 366s -0.211 -0.605 366s -0.318 -0.911 366s 0.043 0.123 366s testing: mul_mat, [1, 1, 2, 1] = [2, 1, 2, 1] * [2, 1, 2, 1] 366s x0: [2, 1, 2, 1] 366s 0.412 0.276 366s 366s x1: [2, 1, 2, 1] 366s 0.041 -0.451 366s 366s y: [1, 1, 2, 1] 366s -0.108 366s testing: mul_mat, [4, 2, 2, 3] = [1, 4, 2, 3] * [1, 2, 2, 3] 366s x0: [1, 4, 2, 3] 366s -0.743 366s -0.153 366s -0.256 366s -0.388 366s 366s x1: [1, 2, 2, 3] 366s 0.944 366s 0.856 366s 366s y: [4, 2, 2, 3] 366s -0.702 -0.144 -0.241 -0.367 366s -0.637 -0.131 -0.219 -0.333 366s test-mul-mat0: iter:254/500 366s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 366s x0: [3, 1, 1, 1] 366s 0.274 0.566 0.123 366s 366s x1: [3, 1, 1, 1] 366s -0ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s .745 -0.573 -0.178 366s 366s y: [1, 1, 1, 1] 366s -0.551 366s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 366s x0: [3, 2, 1, 1] 366s 0.254 -0.573 0.105 366s -0.435 0.562 0.394 366s 366s x1: [3, 1, 1, 1] 366s -0.850 0.441 -0.745 366s 366s y: [2, 1, 1, 1] 366s -0.548 0.325 366s testing: mul_mat, [3, 2, 1, 1] = [3, 3, 1, 1] * [3, 2, 1, 1] 366s x0: [3, 3, 1, 1] 366s 0.183 0.400 -0.247 366s -0.321 -0.345 0.074 366s -0.249 -0.089 0.501 366s 366s x1: [3, 2, 1, 1] 366s 0.286 0.974 -0.801 366s 0.710 -0.670 -0.139 366s 366s y: [3, 2, 1, 1] 366s 0.640 -0.487 -0.560 366s -0.104 -0.007 -0.187 366s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 366s x0: [3, 3, 1, 1] 366s 0.227 -0.211 -0.236 366s -0.617 0.776 0.976 366s -0.602 0.684 0.394 366s 366s x1: [3, 3, 1, 1] 366s 0.572 0.422 -0.225 366s 0.138 -0.455 -0.076 366s -0.421 -0.200 0.277 366s 366s y: [3, 3, 1, 1] 366s 0.094 -0.245 -0.144 366s 0.145 -0.512 -0.424 366s -0.119 0.375 0.226 366s testing: mul_mat, [1, 3, 1, 1] = [3, 1, 1, 1] * [3, 3, 1, 1] 366s x0: [3, 1, 1, 1] 366s 0.803 -0.387 0.279 366s 366s x1: [3, 3, 1, 1] 366s 0.094 -0.163 0.284 366s -0.723 -0.763 -0.962 366s -0.044 -0.108 0.112 366s 366s y: [1, 3, 1, 1] 366s 0.218 366s -0.553 366s 0.038 366s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 366s x0: [1, 3, 1, 1] 366s -0.688 366s -0.004 366s -0.429 366s 366s x1: [1, 3, 1, 1] 366s 0.225 366s 0.387 366s -0.583 366s 366s y: [3, 3, 1, 1] 366s -0.155 -0.001 -0.097 366s -0.267 -0.002 -0.166 366s 0.401 0.002 0.250 366s test-mul-mat0: iter:255/500 366s testing: mul_mat, [3, 3, 1, 1] = [2, 3, 1, 1] * [2, 3, 1, 1] 366s x0: [2, 3, 1, 1] 366s -0.555 -0.439 366s 0.591 -0.277 366s -0.202 0.629 366s 366s x1: [2, 3, 1, 1] 366s 0.228 -0.413 366s 0.748 -0.088 366s 0.351 0.724 366s 366s y: [3, 3, 1, 1] 366s 0.055 0.249 -0.305 366s -0.376 0.466 -0.206 366s -0.513 0.007 0.384 366s testing: mul_mat, [3, 3, 3, 1] = [2, 3, 3, 1] * [2, 3, 3, 1] 366s x0: [2, 3, 3, 1] 366s 0.446 -0.638 366s 0.047 -0.203 366s -0.914 0.354 366s 366s x1: [2, 3, 3, 1] 366s 0.679 0.691 366s -0.260 0.386 366s 0.494 0.353 366s 366s y: [3, 3, 3, 1] 366s -0.138 -0.108 -0.376 366s -0.362 -0.091 0.374 366s -0.005 -0.048 -0.326 366s testing: mul_mat, [3, 3, 3, 4] = [2, 3, 3, 4] * [2, 3, 3, 4] 366s x0: [2, 3, 3, 4] 366s -0.812 0.200 366s -0.194 -0.256 366s 0.006 0.868 366s 366s x1: [2, 3, 3, 4] 366s 0.695 0.348 366s 0.407 -0.224 366s 0.838 -0.540 366s 366s y: [3, 3, 3, 4] 366s -0.495 -0.224 0.307 366s -0.376 -0.022 -0.192 366s -0.789 -0.024 -0.464 366s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 366s x0: [2, 2, 1, 1] 366s 0.948 -0.815 366s -0.452 -0.978 366s 366s x1: [2, 3, 1, 1] 366s -0.667 -0.721 366s 0.299 0.690 366s 0.972 0.345 366s 366s y: [2, 3, 1, 1] 366s -0.045 1.007 366s -0.278 -0.810 366s 0.641 -0.777 366s testing: mul_mat, [1, 2, 3, 1] = [2, 1, 3, 1] * [2, 2, 3, 1] 366s x0: [2, 1, 3, 1] 366s -0.638 -0.302 366s 366s x1: [2, 2, 3, 1] 366s 0.475 -0.311 366s -0.641 0.903 366s 366s y: [1, 2, 3, 1] 366s -0.209 366s 0.136 366s testing: mul_mat, [1, 2, 3, 4] = [1, 1, 3, 4] * [1, 2, 3, 4] 366s x0: [1, 1, 3, 4] 366s 0.703 366s 366s x1: [1, 2, 3, 4] 366s 0.476 366s 0.621 366s 366s y: [1, 2, 3, 4] 366s 0.335 366s 0.436 366s test-mul-mat0: iter:256/500 366s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 366s x0: [2, 4, 1, 1] 366s -0.872 -0.670 366s 0.200 -0.339 366s -0.968 -0.042 366s 0.229 -0.987 366s 366s x1: [2, 1, 1, 1] 366s -0.003 -0.040 366s 366s y: [4, 1, 1, 1] 366s 0.029 0.013 0.005 0.039 366s testing: mul_mat, [2, 4, 3, 1] = [2, 2, 3, 1] * [2, 4, 3, 1] 366s x0: [2, 2, 3, 1] 366s 0.609 -0.538 366s -0.664 -0.432 366s 366s x1: [2, 4, 3, 1] 366s 0.359 -0.430 366s 0.393 0.263 366s -0.728 0.471 366s 0.328 -0.760 366s 366s y: [2, 4, 3, 1] 366s 0.450 -0.052 366s 0.098 -0.375 366s -0.696 0.280 366s 0.608 0.111 366s testing: mul_mat, [2, 2, 3, 3] = [2, 2, 3, 3] * [2, 2, 3, 3] 366s x0: [2, 2, 3, 3] 366s 0.582 -0.312 366s -0.288 0.447 366s 366s x1: [2, 2, 3, 3] 366s -0.483 -0.226 366s -0.950 -0.301 366s 366s y: [2, 2, 3, 3] 366s -0.210 0.038 366s -0.459 0.139 366s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 366s x0: [2, 2, 1, 1] 366s 0.187 0.423 366s 0.289 -0.504 366s 366s x1: [2, 2, 1, 1] 366s 0.697 0.911 366s 0.758 0.421 366s 366s y: [2, 2, 1, 1] 366s 0.515 -0.257 366s 0.320 0.007 366s testing: mul_mat, [1, 2, 3, 1] = [2, 1, 3, 1] * [2, 2, 3, 1] 366s x0: [2, 1, 3, 1] 366s 0.088 -0.588 366s 366s x1: [2, 2, 3, 1] 366s 0.763 0.248 366s -0.746 -0.465 366s 366s y: [1, 2, 3, 1] 366s -0.078 366s 0.207 366s testing: mul_mat, [4, 2, 3, 3] = [1, 4, 3, 3] * [1, 2, 3, 3] 366s x0: [1, 4, 3, 3] 366s -0.121 366s 0.260 366s 0.866 366s 0.082 366s 366s x1: [1, 2, 3, 3] 366s 0.007 366s -0.866 366s 366s y: [4, 2, 3, 3] 366s -0.001 0.002 0.006 0.001 366s 0.104 -0.225 -0.750 -0.071 366s test-mul-mat0: iter:257/500 366s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 366s x0: [2, 2, 1, 1] 366s 0.117 -0.976 366s -0.397 -0.493 366s 366s x1: [2, 3, 1, 1] 366s 0.465 -0.351 366s -0.325 0.334 366s -0.139 0.920 366s 366s y: [2, 3, 1, 1] 366s 0.397 -0.012 366s -0.364 -0.036 366s -0.915 -0.398 366s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 366s x0: [2, 3, 1, 1] 366s -0.793 -0.650 366s -0.004 -0.322 366s 0.016 0.570 366s 366s x1: [2, 2, 1, 1] 366s -0.827 -0.287 366s -0.545 -0.136 366s 366s y: [3, 2, 1, 1] 366s 0.843 0.096 -0.177 366s 0.521 0.046 -0.087 366s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 366s x0: [2, 1, 1, 1] 366s -0.443 0.429 366s 366s x1: [2, 3, 1, 1] 366s -0.972 -0.057 366s -0.989 -0.539 366s 0.986 -0.102 366s 366s y: [1, 3, 1, 1] 366s 0.406 366s 0.206 366s -0.480 366s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 366s x0: [2, 4, 1, 1] 366s 0.864 0.184 366s -0.933 -0.951 366s -0.975 -0.627 366s 0.770 0.691 366s 366s x1: [2, 1, 1, 1] 366s -0.471 0.206 366s 366s y: [4, 1, 1, 1] 366s -0.369 0.243 0.330 -0.220 366s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 366s x0: [4, 1, 1, 1] 366s -0.547 0.650 0.059 0.396 366s 366s x1: [4, 2, 1, 1] 366s -0.778 0.086 -0.854 0.086 366s 0.084 -0.647 0.436 -0.920 366s 366s y: [1, 2, 1, 1] 366s 0.465 366s -0.805 366s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 366s x0: [1, 1, 1, 1] 366s 0.939 366s 366s x1: [1, 4, 1, 1] 366s 0.661 366s 0.520 366s 0.382 366s -0.441 366s 366s y: [1, 4, 1, 1] 366s 0.621 366s 0.488 366s 0.359 366s -0.414 366s test-mul-mat0: iter:258/500 366s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 366s x0: [3, 4, 1, 1] 366s -0.767 0.477 0.290 366s -0.414 -0.087 0.370 366s 0.616 0.366 0.020 366s -0.325 -0.239 -0.319 366s 366s x1: [3, 3, 1, 1] 366s -0.023 0.212 -0.881 366s -0.253 -0.604 -0.832 366s 0.120 -0.913 -0.610 366s 366s y: [4, 3, 1, 1] 366s -0.137 -0.335 0.046 0.238 366s -0.335 -0.150 -0.393 0.492 366s -0.704 -0.196 -0.272 0.374 366s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 366s x0: [3, 4, 1, 1] 366s -0.164 0.608 -0.263 366s -0.077 0.999 -0.057 366s 0.156 0.475 -0.767 366s 0.742 -0.612 0.602 366s 366s x1: [3, 4, 1, 1] 366s -0.805 -0.857 0.240 366s -0.221 -0.918 0.228 366s 0.893 0.228 0.322 366s -0.130 -0.560 0.440 366s 366s y: [4, 4, 1, 1] 366s -0.452 -0.808 -0.717 0.072 366s -0.582 -0.912 -0.646 0.535 366s -0.092 0.141 0.001 0.717 366s -0.435 -0.574 -0.624 0.511 366s testing: mul_mat, [2, 4, 1, 3] = [3, 2, 1, 3] * [3, 4, 1, 3] 366s x0: [3, 2, 1, 3] 366s -0.313 -0.071 0.369 366s 0.695 0.670 0.140 366s 366s x1: [3, 4, 1, 3] 366s 0.358 0.753 -0.378 366s -0.966 -0.485 0.303 366s -0.771 -0.342 -0.457 366s 0.008 -0.259 0.771 366s 366s y: [2, 4, 1, 3] 366s -0.305 0.701 366s 0.449 -0.954 366s 0.097 -0.829 366s 0.301 -0.060 366s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 366s x0: [3, 2, 1, 1] 366s -0.727 -0.333 -0.307 366s -0.578 -0.269 0.494 366s 366s x1: [3, 2, 1, 1] 366s -0.057 -0.217 -0.338 366s -0.758 0.849 -0.335 366s 366s y: [2, 2, 1, 1] 366s 0.217 -0.075 366s 0.371 0.045 366s testing: mul_mat, [4, 3, 1, 1] = [2, 4, 1, 1] * [2, 3, 1, 1] 366s x0: [2, 4, 1, 1] 366s -0.070 0.002 366s -0.764 0.827 366s -0.926 -0.732 366s -0.023 0.910 366s 366s x1: [2, 3, 1, 1] 366s 0.418 0.622 366s -0.137 0.113 366s 0.292 -0.997 366s 366s y: [4, 3, 1, 1] 366s -0.028 0.196 -0.842 0.556 366s 0.010 0.198 0.044 0.106 366s -0.023 -1.047 0.459 -0.914 366s testing: mul_mat, [2, 2, 1, 3] = [4, 2, 1, 3] * [4, 2, 1, 3] 366s x0: [4, 2, 1, 3] 366s 0.676 -0.701 -0.511 0.207 366s -0.779 0.503 -0.791 -0.784 366s 366s x1: [4, 2, 1, 3] 366s 0.704 0.727 -0.043 -0.353 366s -0.491 0.620 -0.111 -0.642 366s 366s y: [2, 2, 1, 3] 366s -0.085 0.128 366s -0.842 1.285 366s test-mul-mat0: iter:259/500 366s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 366s x0: [2, 4, 1, 1] 366s 0.382 -0.468 366s -0.532 0.591 366s 0.739 -0.317 366s -0.243 0.593 366s 366s x1: [2, 2, 1, 1] 366s 0.353 0.758 366s 0.083 0.028 366s 366s y: [4, 2, 1, 1] 366s -0.220 0.260 0.020 0.364 366s 0.018 -0.027 0.052 -0.003 366s testing: mul_mat, [4, 4, 4, 1] = [2, 4, 4, 1] * [2, 4, 4, 1] 366s x0: [2, 4, 4, 1] 366s 0.474 0.071 366s 0.739 -0.414 366s 0.277 -0.869 366s 0.174 -0.439 366s 366s x1: [2, 4, 4, 1] 366s 0.408 0.543 366s -0.664 -0.508 366s -0.386 0.112 366s -0.794 -0.608 366s 366s y: [4, 4, 4, 1] 366s 0.232 0.077 -0.359 -0.167 366s -0.350 -0.280 0.258 0.108 366s -0.175 -0.332 -0.204 -0.116 366s -0.419 -0.335 0.308 0.129 366s testing: mul_mat, [1, 4, 4, 2] = [2, 1, 4, 2] * [2, 4, 4, 2] 366s x0: [2, 1, 4, 2] 366s 0.595 0.792 366s 366s x1: [2, 4, 4, 2] 366s 0.123 0.539 366s -0.339 -0.599 366s 0.670 0.835 366s -0.038 0.372 366s 366s y: [1, 4, 4, 2] 366s 0.501 366s -0.676 366s 1.060 366s 0.272 366s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 366s x0: [2, 3, 1, 1] 366s -0.621 0.222 366s -0.920 0.921 366s -0.535 0.173 366s 366s x1: [2, 1, 1, 1] 366s 0.518 -0.826 366s 366s y: [3, 1, 1ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s , 1] 366s -0.505 -1.237 -0.420 366s testing: mul_mat, [1, 2, 4, 1] = [3, 1, 4, 1] * [3, 2, 4, 1] 366s x0: [3, 1, 4, 1] 366s 0.595 -0.172 -0.322 366s 366s x1: [3, 2, 4, 1] 366s -0.900 0.496 -0.841 366s -0.864 -0.655 -0.913 366s 366s y: [1, 2, 4, 1] 366s -0.350 366s -0.107 366s testing: mul_mat, [1, 3, 4, 2] = [1, 1, 4, 2] * [1, 3, 4, 2] 366s x0: [1, 1, 4, 2] 366s 0.255 366s 366s x1: [1, 3, 4, 2] 366s -0.215 366s -0.273 366s -0.114 366s 366s y: [1, 3, 4, 2] 366s -0.055 366s -0.070 366s -0.029 366s test-mul-mat0: iter:260/500 366s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 366s x0: [1, 3, 1, 1] 366s -0.612 366s -0.847 366s -0.950 366s 366s x1: [1, 3, 1, 1] 366s 0.506 366s 0.375 366s -0.310 366s 366s y: [3, 3, 1, 1] 366s -0.310 -0.429 -0.481 366s -0.229 -0.318 -0.356 366s 0.190 0.262 0.294 366s testing: mul_mat, [1, 3, 2, 1] = [1, 1, 2, 1] * [1, 3, 2, 1] 366s x0: [1, 1, 2, 1] 366s -0.417 366s 366s x1: [1, 3, 2, 1] 366s 0.099 366s -0.811 366s -0.832 366s 366s y: [1, 3, 2, 1] 366s -0.041 366s 0.338 366s 0.347 366s testing: mul_mat, [4, 1, 2, 1] = [1, 4, 2, 1] * [1, 1, 2, 1] 366s x0: [1, 4, 2, 1] 366s -0.860 366s 0.549 366s 0.239 366s -0.551 366s 366s x1: [1, 1, 2, 1] 366s -0.116 366s 366s y: [4, 1, 2, 1] 366s 0.099 -0.063 -0.028 0.064 366s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 366s x0: [1, 1, 1, 1] 366s -0.068 366s 366s x1: [1, 4, 1, 1] 366s -0.941 366s 0.269 366s 0.557 366s -0.668 366s 366s y: [1, 4, 1, 1] 366s 0.064 366s -0.018 366s -0.038 366s 0.045 366s testing: mul_mat, [2, 1, 2, 1] = [1, 2, 2, 1] * [1, 1, 2, 1] 366s x0: [1, 2, 2, 1] 366s 0.175 366s 0.007 366s 366s x1: [1, 1, 2, 1] 366s 0.022 366s 366s y: [2, 1, 2, 1] 366s 0.004 0.000 366s testing: mul_mat, [1, 1, 2, 1] = [2, 1, 2, 1] * [2, 1, 2, 1] 366s x0: [2, 1, 2, 1] 366s -0.081 0.182 366s 366s x1: [2, 1, 2, 1] 366s 0.175 0.445 366s 366s y: [1, 1, 2, 1] 366s 0.067 366s test-mul-mat0: iter:261/500 366s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 366s x0: [2, 4, 1, 1] 366s 0.007 -0.181 366s 0.056 -0.725 366s -0.624 0.388 366s 0.050 0.309 366s 366s x1: [2, 1, 1, 1] 366s 0.464 -0.846 366s 366s y: [4, 1, 1, 1] 366s 0.156 0.639 -0.618 -0.238 366s testing: mul_mat, [2, 4, 2, 1] = [2, 2, 2, 1] * [2, 4, 2, 1] 366s x0: [2, 2, 2, 1] 366s -0.623 0.477 366s 0.386 0.841 366s 366s x1: [2, 4, 2, 1] 366s -0.589 -0.993 366s -0.371 0.586 366s 0.014 0.049 366s -0.050 -0.812 366s 366s y: [2, 4, 2, 1] 366s -0.107 -1.063 366s 0.510 0.350 366s 0.015 0.047 366s -0.356 -0.702 366s testing: mul_mat, [3, 2, 2, 3] = [2, 3, 2, 3] * [2, 2, 2, 3] 366s x0: [2, 3, 2, 3] 366s -0.727 -0.626 366s -0.562 -0.344 366s 0.223 -0.112 366s 366s x1: [2, 2, 2, 3] 366s 0.439 0.124 366s -0.174 -0.173 366s 366s y: [3, 2, 2, 3] 366s -0.397 -0.289 0.084 366s 0.234 0.157 -0.019 366s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 366s x0: [2, 1, 1, 1] 366s 0.810 0.387 366s 366s x1: [2, 3, 1, 1] 366s -0.458 -0.272 366s -0.836 -0.743 366s 0.650 -0.315 366s 366s y: [1, 3, 1, 1] 366s -0.476 366s -0.964 366s 0.404 366s testing: mul_mat, [3, 2, 2, 1] = [1, 3, 2, 1] * [1, 2, 2, 1] 366s x0: [1, 3, 2, 1] 366s -0.434 366s 0.888 366s 0.366 366s 366s x1: [1, 2, 2, 1] 366s -0.278 366s -0.207 366s 366s y: [3, 2, 2, 1] 366s 0.121 -0.247 -0.102 366s 0.090 -0.184 -0.076 366s testing: mul_mat, [4, 1, 2, 3] = [3, 4, 2, 3] * [3, 1, 2, 3] 366s x0: [3, 4, 2, 3] 366s 0.975 -0.497 -0.877 366s 0.352 -0.243 -0.081 366s 0.207 0.826 0.811 366s 0.260 -0.609 -0.432 366s 366s x1: [3, 1, 2, 3] 366s 0.911 0.107 -0.146 366s 366s y: [4, 1, 2, 3] 366s 0.964 0.307 0.158 0.235 366s test-mul-mat0: iter:262/500 366s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 366s x0: [2, 3, 1, 1] 366s -0.368 0.220 366s -0.542 -0.445 366s 0.715 0.630 366s 366s x1: [2, 2, 1, 1] 366s -0.598 -0.184 366s -0.385 -0.796 366s 366s y: [3, 2, 1, 1] 366s 0.180 0.406 -0.544 366s -0.034 0.563 -0.776 366s testing: mul_mat, [3, 3, 4, 1] = [2, 3, 4, 1] * [2, 3, 4, 1] 366s x0: [2, 3, 4, 1] 366s 0.682 -0.479 366s 0.531 -0.764 366s -0.764 0.161 366s 366s x1: [2, 3, 4, 1] 366s -0.715 -0.790 366s -0.375 -0.588 366s 0.336 0.857 366s 366s y: [3, 3, 4, 1] 366s -0.109 0.224 0.419 366s 0.026 0.250 0.192 366s -0.181 -0.476 -0.120 366s testing: mul_mat, [2, 3, 4, 1] = [2, 2, 4, 1] * [2, 3, 4, 1] 366s x0: [2, 2, 4, 1] 366s 0.541 0.532 366s 0.549 0.638 366s 366s x1: [2, 3, 4, 1] 366s 0.849 -0.120 366s 0.977 -0.583 366s 0.930 0.277 366s 366s y: [2, 3, 4, 1] 366s 0.396 0.389 366s 0.218 0.164 366s 0.651 0.687 366s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 366s x0: [2, 2, 1, 1] 366s -0.438 -0.744 366s 0.920 -0.830 366s 366s x1: [2, 2, 1, 1] 366s -0.752 0.515 366s -0.405 -0.383 366s 366s y: [2, 2, 1, 1] 366s -0.053 -1.119 366s 0.463 -0.055 366s testing: mul_mat, [2, 2, 4, 1] = [2, 2, 4, 1] * [2, 2, 4, 1] 366s x0: [2, 2, 4, 1] 366s 0.382 0.058 366s 0.060 -0.137 366s 366s x1: [2, 2, 4, 1] 366s 0.067 -0.255 366s 0.953 0.099 366s 366s y: [2, 2, 4, 1] 366s 0.011 0.039 366s 0.370 0.044 366s testing: mul_mat, [4, 2, 4, 1] = [2, 4, 4, 1] * [2, 2, 4, 1] 366s x0: [2, 4, 4, 1] 366s -0.497 -0.450 366s -0.828 -0.042 366s -0.110 0.731 366s 0.808 -0.078 366s 366s x1: [2, 2, 4, 1] 366s -0.247 -0.992 366s -0.126 -0.931 366s 366s y: [4, 2, 4, 1] 366s 0.569 0.246 -0.697 -0.123 366s 0.482 0.144 -0.667 -0.030 366s test-mul-mat0: iter:263/500 366s testing: mul_mat, [3, 1, 1, 1] = [3, 3, 1, 1] * [3, 1, 1, 1] 366s x0: [3, 3, 1, 1] 366s -0.533 -0.200 -0.649 366s -0.050 0.724 0.492 366s 0.442 0.522 -0.298 366s 366s x1: [3, 1, 1, 1] 366s -0.624 -0.106 0.249 366s 366s y: [3, 1, 1, 1] 366s 0.192 0.077 -0.406 366s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 366s x0: [3, 4, 1, 1] 366s -0.894 0.115 0.043 366s 0.460 -0.108 -0.539 366s -0.377 0.218 -0.163 366s 0.517 -0.533 -0.200 366s 366s x1: [3, 3, 1, 1] 366s -0.418 0.225 -0.155 366s -0.352 0.846 0.215 366s 0.403 0.254 0.511 366s 366s y: [4, 3, 1, 1] 366s 0.393 -0.133 0.232 -0.305 366s 0.422 -0.369 0.282 -0.676 366s -0.309 -0.117 -0.180 -0.028 366s testing: mul_mat, [2, 4, 1, 2] = [3, 2, 1, 2] * [3, 4, 1, 2] 366s x0: [3, 2, 1, 2] 366s 0.027 -0.859 0.868 366s -0.456 -0.392 -0.332 366s 366s x1: [3, 4, 1, 2] 366s 0.985 0.267 0.152 366s -0.065 -0.009 -0.356 366s -0.623 -0.487 0.346 366s -0.041 0.738 -0.809 366s 366s y: [2, 4, 1, 2] 366s -0.071 -0.604 366s -0.303 0.151 366s 0.702 0.360 366s -1.337 -0.002 366s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 366s x0: [3, 1, 1, 1] 366s -0.298 0.406 0.417 366s 366s x1: [3, 2, 1, 1] 366s 0.841 0.380 0.810 366s -0.200 0.118 -1.000 366s 366s y: [1, 2, 1, 1] 366s 0.241 366s -0.310 366s testing: mul_mat, [4, 3, 1, 1] = [1, 4, 1, 1] * [1, 3, 1, 1] 366s x0: [1, 4, 1, 1] 366s -0.647 366s 0.512 366s -0.114 366s 0.276 366s 366s x1: [1, 3, 1, 1] 366s 0.539 366s -0.678 366s -0.057 366s 366s y: [4, 3, 1, 1] 366s -0.349 0.276 -0.061 0.149 366s 0.439 -0.347 0.077 -0.187 366s 0.037 -0.029 0.006 -0.016 366s testing: mul_mat, [1, 1, 1, 2] = [4, 1, 1, 2] * [4, 1, 1, 2] 366s x0: [4, 1, 1, 2] 366s -0.102 -0.550 -0.232 0.362 366s 366s x1: [4, 1, 1, 2] 366s -0.457 0.913 0.417 -0.590 366s 366s y: [1, 1, 1, 2] 366s -0.766 366s test-mul-mat0: iter:264/500 366s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 366s x0: [2, 4, 1, 1] 366s 0.989 0.253 366s 0.370 -0.124 366s -0.471 0.912 366s -0.211 0.946 366s 366s x1: [2, 2, 1, 1] 366s -0.422 -0.085 366s 0.506 -0.100 366s 366s y: [4, 2, 1, 1] 366s -0.439 -0.145 0.121 0.008 366s 0.476 0.200 -0.330 -0.202 366s testing: mul_mat, [4, 4, 2, 1] = [2, 4, 2, 1] * [2, 4, 2, 1] 366s x0: [2, 4, 2, 1] 366s -0.981 -0.416 366s -0.522 0.526 366s 0.484 -0.664 366s 0.515 -0.263 366s 366s x1: [2, 4, 2, 1] 366s -0.677 0.247 366s 0.971 0.401 366s -0.767 0.872 366s -0.700 -0.316 366s 366s y: [4, 4, 2, 1] 366s 0.561 0.483 -0.492 -0.414 366s -1.119 -0.296 0.203 0.395 366s 0.389 0.859 -0.950 -0.624 366s 0.818 0.199 -0.129 -0.278 366s testing: mul_mat, [3, 4, 2, 4] = [2, 3, 2, 4] * [2, 4, 2, 4] 366s x0: [2, 3, 2, 4] 366s -0.600 -0.605 366s 0.181 0.756 366s -0.290 0.094 366s 366s x1: [2, 4, 2, 4] 366s 0.182 0.342 366s -0.340 0.054 366s 0.642 0.344 366s 0.694 0.304 366s 366s y: [3, 4, 2, 4] 366s -0.316 0.292 -0.021 366s 0.171 -0.021 0.104 366s -0.593 0.376 -0.154 366s -0.600 0.355 -0.173 366s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 366s x0: [2, 1, 1, 1] 366s 0.525 0.658 366s 366s x1: [2, 3, 1, 1] 366s 0.095 0.655 366s 0.893 -0.254 366s -0.282 0.711 366s 366s y: [1, 3, 1, 1] 366s 0.481 366s 0.302 366s 0.319 366s testing: mul_mat, [4, 2, 2, 1] = [1, 4, 2, 1] * [1, 2, 2, 1] 366s x0: [1, 4, 2, 1] 366s 0.430 366s -0.485 366s -0.695 366s -0.727 366s 366s x1: [1, 2, 2, 1] 366s -0.959 366s 0.668 366s 366s y: [4, 2, 2, 1] 366s -0.413 0.466 0.666 0.698 366s 0.287 -0.324 -0.464 -0.486 366s testing: mul_mat, [4, 1, 2, 4] = [4, 4, 2, 4] * [4, 1, 2, 4] 366s x0: [4, 4, 2, 4] 366s 0.606 0.665 -0.738 -0.805 366s 0.580 0.044 -0.377 0.082 366s 0.482 -0.238 -0.229 0.658 366s -0.205 -0.155 -0.803 0.425 366s 366s x1: [4, 1, 2, 4] 366s 0.704 0.107 0.034 -0.262 366s 366s y: [4, 1, 2, 4] 366s 0.683 0.379 0.134 -0.300 366s test-mul-mat0: iter:265/500 366s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 366s x0: [2, 2, 1, 1] 366s -0.664 -0.527 366s 0.662 0.346 366s 366s x1: [2, 2, 1, 1] 366s -0.506 0.638 366s 0.790 -0.251 366s 366s y: [2, 2, 1, 1] 366s -0.000 -0.114 366s -0.392 0.436 366s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 366s x0: [2, 2, 1, 1] 366s 0.027 0.652 366s -0.677 0.465 366s 366s x1: [2, 2, 1, 1] 366s 0.446 -0.650 366s 0.185 -0.239 366s 366s y: [2, 2, 1, 1] 366s -0.412 -0.604 366s -0.151 -0.237 366s testing: mul_mat, [1, 2, 1, 2] = [2, 1, 1, 2] * [2, 2, 1, 2] 366s x0: [2, 1, 1, 2] 366s -0.373 -0.589 366s 366s x1: [2, 2, 1, 2] 366s -0.368 -0.788 366s 0.080 -0.665 366s 366s y: [1, 2, 1, 2] 366s 0.602 366s 0.362 366s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 366s x0: [2, 4, 1, 1] 366s 0.654 -0.037 366s 0.302 -0.693 366s -0.039 0.573 366s -0.873 0.313 366s 366s x1: [2, 1, 1, 1] 366s -0.095 0.080 366s 366s y: [4, 1, 1, 1] 366s -0.065 -0.084 0.050 0.108 366s testing: mul_mat, [3, 2, 1, 1] = [4, 3, 1, 1] * [4, 2, 1, 1] 366s x0: [4, 3, 1, 1] 366s 0.065 0.111 -0.961 0.481 366s -0.048 -0.949 -0.323 0.513 366s -0.842 0.856 -0.733 0.172 366s 366s x1: [4, 2, 1, 1] 366s 0.492 -0.666 0.993 -0.480 366s 0.986 -0.684 0.985 -0.383 366s 366s y: [3, 2, 1, 1] 366s -1.228 0.042 -1.795 366s -1.143 0.088 -2.203 366s testing: mul_mat, [2, 4, 1, 2] = [3, 2, 1, 2] * [3, 4, 1, 2] 366s x0: [3, 2, 1, 2] 366s 0.617 0.258 -0.344 366s 0.143 0.098 -0.570 366s 366s x1: [3, 4, 1, 2] 366s -0.439 -0.863 -0.174 366s 0.863 0.098 -0.047 366s -0.174 0.405 -0.474 366s -0.861 -0.102 -0.141 366s 366s y: [2, 4, 1, 2] 366s -0.433 -0.048 366s 0.574 0.160 366s 0.161 0.285 366s -0.509 -0.053 366s test-mul-mat0: iter:266/500 366s testing: mul_mat, [2, 2, 1, 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 366s x0: [4, 2, 1, 1] 366s 0.482 -0.684 0.180 0.060 366s -0.219 -0.829 -0.324 0.924 366s 366s x1: [4, 2, 1, 1] 366s -0.806 0.594 -0.380 0.612 366s -0.561 0.068 0.015 0.901 366s 366s y: [2, 2, 1, 1] 366s -0.826 0.373 366s -0.261 0.894 366s testing: mul_mat, [2, 2, 3, 1] = [4, 2, 3, 1] * [4, 2, 3, 1] 366s x0: [4, 2, 3, 1] 366s -0.297 0.923 0.087 -0.126 366s -0.400 0.011 -0.698 -0.626 366s 366s x1: [4, 2, 3, 1] 366s 0.429 0.774 -0.420 0.858 366s 0.433 0.373 0.114 -0.045 366s 366s y: [2, 2, 3, 1] 366s 0.442 -0.407 366s 0.231 -0.220 366s testing: mul_mat, [2, 2, 3, 4] = [4, 2, 3, 4] * [4, 2, 3, 4] 366s x0: [4, 2, 3, 4] 366s 0.665 0.962 -0.388 0.418 366s 0.231 -0.159 -0.203 -0.472 366s 366s x1: [4, 2, 3, 4] 366s -0.606 0.743 0.294 -0.616 366s 0.266 -0.843 -0.310 0.969 366s 366s y: [2, 2, 3, 4] 366s -0.060 -0.027 366s -0.108 -0.199 366s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 366s x0: [4, 1, 1, 1] 366s -0.389 0.543 0.197 0.269 366s 366s x1: [4, 2, 1, 1] 366s -0.575 -0.250 -0.388 0.809 366s -0.391 -0.374 0.478 0.796 366s 366s y: [1, 2, 1, 1] 366s 0.229 366s 0.258 366s testing: mul_mat, [2, 4, 3, 1] = [1, 2, 3, 1] * [1, 4, 3, 1] 366s x0: [1, 2, 3, 1] 366s -0.464 366s -0.721 366s 366s x1: [1, 4, 3, 1] 366s -0.026 366s -0.920 366s -0.763 366s -0.638 366s 366s y: [2, 4, 3, 1] 366s 0.012 0.018 366s 0.427 0.663 366s 0.354 0.550 366s 0.296 0.460 366s testing: mul_mat, [3, 1, 3, 4] = [2, 3, 3, 4] * [2, 1, 3, 4] 366s x0: [2, 3, 3, 4] 366s 0.312 0.157 366s -0.644 0.561 366s 0.141 0.222 366s 366s x1: [2, 1, 3, 4] 366s -0.404 0.817 366s 366s y: [3, 1, 3, 4] 366s 0.002 0.718 0.125 366s test-mul-mat0: iter:267/500 366s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 366s x0: [2, 4, 1, 1] 366s -0.640 -0.142 366s 0.901 0.083 366s -0.636 -0.564 366s 0.171 0.571 366s 366s x1: [2, 1, 1, 1] 366s -0.165 0.254 366s 366s y: [4, 1, 1, 1] 366s 0.070 -0.128 -0.039 0.117 366s testing: mul_mat, [4, 4, 4, 1] = [2, 4, 4, 1] * [2, 4, 4, 1] 366s x0: [2, 4, 4, 1] 366s -0.551 -0.838 366s 0.847 0.646 366s -0.162 -0.155 366s 0.887 -0.181 366s 366s x1: [2, 4, 4, 1] 366s -0.863 -0.245 366s -0.312 0.779 366s 0.912 0.197 366s -0.325 -0.002 366s 366s y: [4, 4, 4, 1] 366s 0.681 -0.890 0.178 -0.721 366s -0.481 0.239 -0.070 -0.418 366s -0.668 0.900 -0.179 0.773 366s 0.181 -0.276 0.053 -0.287 366s testing: mul_mat, [1, 4, 4, 3] = [2, 1, 4, 3] * [2, 4, 4, 3] 366s x0: [2, 1, 4, 3] 366s -0.089 -0.850 366s 366s x1: [2, 4, 4, 3] 366s -0.526 0.834 366s -0.136 0.312 366s -0.320 -0.249 366s -0.869 0.099 366s 366s y: [1, 4, 4, 3] 366s -0.662 366s -0.253 366s 0.240 366s -0.007 366s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 366s x0: [2, 3, 1, 1] 366s -0.976 0.962 366s -0.165 0.745 366s 0.936 -0.914 366s 366s x1: [2, 1, 1, 1] 366s 0.909 0.169 366s 366s y: [3, 1, 1, 1] 366s -0.724 -0.024 0.696 366s testing: mul_mat, [1, 2, 4, 1] = [3, 1, 4, 1] * [3, 2, 4, 1] 366s x0: [3, 1, 4, 1] 366s 0.103 0.056 0.136 366s 366s x1: [3, 2, 4, 1] 366s -0.478 0.909 0.021 366s -0.094 0.869 -0.102 366s 366s y: [1, 2, 4, 1] 366s 0.004 366s 0.025 366s testing: mul_mat, [2, 3, 4, 3] = [1, 2, 4, 3] * [1, 3, 4, 3] 366s x0: [1, 2, 4, 3] 366s -0.514 366s 0.256 366s 366s x1: [1, 3, 4, 3] 366s 0.525 366s 0.867 366s 0.702 366s 366s y: [2, 3, 4, 3] 366s -0.270 0.135 366s -0.446 0.222 366s -0.361 0.180 366s test-mul-mat0: iter:268/500 366s testing: mul_mat, [3, 1, 1, 1] = [3, 3, 1, 1] * [3, 1, 1, 1] 366s x0: [3, 3, 1, 1] 366s 0.501 0.558 0.823 366s -0.804 -0.400 0.168 366s 0.500 0.670 -0.007 366s 366s x1: [3, 1, 1, 1] 366s 0.002 -0.755 0.492 366s 366s y: [3, 1, 1, 1] 366s -0.016 0.383 -0.508 366s testing: mul_mat, [3, 3, 2, 1] = [3, 3, 2, 1] * [3, 3, 2, 1] 366s x0: [3, 3, 2, 1] 366s 0.280 -0.430 -0.064 366s -0.219 -0.872 -0.241 366s -0.022 -0.272 0.927 366s 366s x1: [3, 3, 2, 1] 366s -0.510 -0.011 -0.502 366s -0.731 0.613 -0.180 366s 0.903 -0.658 0.180 366s 366s y: [3, 3, 2, 1] 366s -0.106 0.242 -0.451 366s -0.457 -0.331 -0.318 366s 0.525 0.333 0.326 366s testing: mul_mat, [4, 3, 2, 3] = [3, 4, 2, 3] * [3, 3, 2, 3] 366s x0: [3, 4, 2, 3] 366s 0.478 0.997 -0.058 366s 0.435 -0.190 0.357 366s 0.629 -0.237 0.114 366s 0.242 -0.169 -0.900 366s 366s x1: [3, 3, 2, 3] 366s 0.139 0.341 0.418 366s -0.885 0.229 0.240 366s -0.902 -0.940 0.128 366s 366s y: [4, 3, 2, 3] 366s 0.382 0.145 0.054 -0.401 366s -0.208 -0.342 -0.584 -0.469 366s -1.376 -0.168 -0.330 -0.174 366s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 366s x0: [3, 4, 1, 1] 366s -0.793 0.987 -0.343 366s -0.454 0.904 0.165 366s -0.056 0.146 -0.847 366s -0.174 0.542 -0.362 366s 366s x1: [3, 4, 1, 1] 366s 0.900 -0.821 0.131 366s -0.001 -0.049 0.402 366s 0.807 0.009 0.935 366s 0.028 -0.130 0.695 366s 366s y: [4, 4, 1, 1] 366s -1.569 -1.129 -0.281 -0.649 366s -0.185 0.022 -0.347 -0.172 366s -0.951 -0.203 -0.836 -0.474 366s -0.389 -0.016 -0.609 -0.327 366s testing: mul_mat, [4, 3, 2, 1] = [4, 4, 2, 1] * [4, 3, 2, 1] 366s x0: [4, 4, 2, 1] 366s -0.081 -0.195 0.143 -0.346 366s -0.349 0.667 0.265 0.624 366s 0.029 0.775 0.714 0.101 366s -0.916 0.983 0.557 0.042 366s 366s x1: [4, 3, 2, 1] 366s 0.692 0.178 0.476 0.489 366s 0.731 -0.587 0.389 0.910 366s 0.545 -0.612 -0.139 -0.054 366s 366s y: [4, 3, 2, 1] 366s -0.192 0.309 0.548 -0.172 366s -0.204 0.025 -0.064 -0.991 366s 0.074 -0.668 -0.563 -1.180 366s testing: mul_mat, [4, 4, 2, 3] = [4, 4, 2, 3] * [4, 4, 2, 3] 366s x0: [4, 4, 2, 3] 366s 0.620 0.254 0.585 0.930 366s 0.611 -0.988 0.054 -0.393 366s -0.213 0.613 -0.421 -0.867 366s -0.759 0.725 -0.494 -0.463 366s 366s x1: [4, 4, 2, 3] 366s -0.609 0.850 -0.182 0.196 366s 0.517 -0.406 0.179 -0.340 366s 0.858 -0.107 -0.783 -0.487 366s -0.483 0.318 0.555 0.002 366s 366s y: [4, 4, 2, 3] 366s -0.085 -1.298 0.557 1.077 366s 0.006 0.860 -0.140 -0.618 366s -0.407 0.779 0.504 -0.117 366s 0.107 -0.580 0.063 0.323 366s test-mul-mat0: iter:269/500 366s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 366s x0: [2, 4, 1, 1] 366s -0.313 0.321 366s 0.881 -0.250 366s -0.912 -0.498 366s -0.073 0.257 366s 366s x1: [2, 4, 1, 1] 366s -0.280 0.879 366s -0.316 0.793 366s -0.645 0.698 366s -0.905 0.196 366s 366s y: [4, 4, 1, 1] 366s 0.370 -0.467 -0.182 0.247 366s 0.354 -0.477 -0.107 0.227 366s 0.426 -0.743 0.241 0.227 366s 0.346 -0.847 0.727 0.117 366s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 366s x0: [2, 4, 1, 1] 366s -0.577 -0.844 366s 0.895 -0.942 366s 0.783 -0.386 366s 0.936 -0.533 366s 366s x1: [2, 4, 1, 1] 366s -0.918 0.185 366s 0.999 -0.430 366s 0.023 0.768 366s 0.307 0.561 366s 366s y: [4, 4, 1, 1] 366s 0.374 -0.996 -0.790 -0.958 366s -0.214 1.300 0.948 1.165 366s -0.661 -0.703 -0.278 -0.388 366s -0.650 -0.253 0.024 -0.011 366s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 366s x0: [2, 2, 1, 1] 366s 0.940 -0.103 366s 0.754 -0.134 366s 366s x1: [2, 4, 1, 1] 366s -0.592 -0.709 366s -0.835 -0.497 366s 0.487 0.370 366s 0.190 -0.191 366s 366s y: [2, 4, 1, 1] 366s -0.484 -0.352 366s -0.734 -0.563 366s 0.420 0.318 366s 0.198 0.169 366s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 366s x0: [2, 2, 1, 1] 366s 0.075 0.712 366s 0.922 -0.365 366s 366s x1: [2, 2, 1, 1] 366s -0.845 0.835 366s -0.949 -0.846 366s 366s y: [2, 2, 1, 1] 366s 0.532 -1.084 366s -0.673 -0.566 366s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 366s x0: [2, 3, 1, 1] 366s -0.426 0.041 366s -0.942 -0.534 366s -0.871 -0.579 366s 366s x1: [2, 2, 1, 1] 366s 0.549 -0.865 366s -0.209 0.443 366s 366s y: [3, 2, 1, 1] 366s -0.269 -0.055 0.023 366s 0.107 -0.040 -0.075 366s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 366s x0: [3, 1, 1, 1] 366s -0.902 0.614 0.581 366s 366s x1: [3, 2, 1, 1] 366s 0.206 -0.032 0.908 366s -0.424 -0.842 -0.283 366s 366s y: [1, 2, 1, 1] 366s 0.323 366s -0.299 366s test-mul-mat0: iter:270/500 366s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 366s x0: [2, 4, 1, 1] 366s -0.164 0.862 366s -0.908 -0.106 366s 0.992 0.133 366s 0.359 -0.588 366s 366s x1: [2, 4, 1, 1] 366s 0.922 0.822 366s -0.909 0.844 366s 0.534 -0.274 366s 0.393 0.669 366s 366s y: [4, 4, 1, 1] 366s 0.557 -0.925 1.024 -0.152 366s 0.877 0.736 -0.790 -0.823 366s -0.324 -0.456 0.493 0.353 366s 0.513 -0.428 0.478 -0.252 366s testing: mul_mat, [2, 4, 4, 1] = [2, 2, 4, 1] * [2, 4, 4, 1] 366s x0: [2, 2, 4, 1] 366s 0.353 0.377 366s 0.481 -0.609 366s 366s x1: [2, 4, 4, 1] 366s -0.661 -0.672 366s -0.680 -0.085 366s -0.514 0.037 366s -0.257 -0.416 366s 366s y: [2, 4, 4, 1] 366s -0.487 0.092 366s -0.272 -0.275 366s -0.168 -0.270 366s -0.248 0.130 366s testing: mul_mat, [3, 2, 4, 2] = [2, 3, 4, 2] * [2, 2, 4, 2] 366s x0: [2, 3, 4, 2] 366s 0.505 0.033 366s -0.547 0.487 366s 0.277 -0.443 366s 366s x1: [2, 2, 4, 2] 366s -0.152 -0.785 366s 0.567 0.615 366s 366s y: [3, 2, ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s 4, 2] 366s -0.103 -0.299 0.306 366s 0.307 -0.011 -0.115 366s testing: mul_mat, [3, 3, 1, 1] = [2, 3, 1, 1] * [2, 3, 1, 1] 366s x0: [2, 3, 1, 1] 366s -0.688 -0.547 366s -0.022 -0.372 366s 0.284 -0.553 366s 366s x1: [2, 3, 1, 1] 366s -0.994 -0.430 366s 0.501 -0.637 366s 0.296 0.265 366s 366s y: [3, 3, 1, 1] 366s 0.919 0.182 -0.044 366s 0.004 0.226 0.495 366s -0.349 -0.105 -0.063 366s testing: mul_mat, [1, 2, 4, 1] = [3, 1, 4, 1] * [3, 2, 4, 1] 366s x0: [3, 1, 4, 1] 366s 0.964 0.964 0.843 366s 366s x1: [3, 2, 4, 1] 366s 0.932 0.782 -0.357 366s 0.864 -0.576 -0.883 366s 366s y: [1, 2, 4, 1] 366s 1.352 366s -0.466 366s testing: mul_mat, [3, 3, 4, 2] = [1, 3, 4, 2] * [1, 3, 4, 2] 366s x0: [1, 3, 4, 2] 366s -0.010 366s -0.636 366s -0.484 366s 366s x1: [1, 3, 4, 2] 366s -0.434 366s -0.359 366s -0.218 366s 366s y: [3, 3, 4, 2] 366s 0.004 0.276 0.210 366s 0.004 0.228 0.174 366s 0.002 0.139 0.106 366s test-mul-mat0: iter:271/500 366s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 366s x0: [4, 4, 1, 1] 366s -0.148 -0.650 0.258 -0.187 366s -0.211 0.775 0.336 0.693 366s -0.200 0.073 -0.151 -0.449 366s -0.320 -0.387 -0.251 0.180 366s 366s x1: [4, 2, 1, 1] 366s -0.111 0.021 -0.172 0.879 366s 0.385 0.344 0.082 -0.453 366s 366s y: [4, 2, 1, 1] 366s -0.206 0.591 -0.345 0.229 366s -0.175 -0.101 0.139 -0.358 366s testing: mul_mat, [1, 4, 2, 1] = [4, 1, 2, 1] * [4, 4, 2, 1] 366s x0: [4, 1, 2, 1] 366s 0.800 -0.819 -0.615 -0.824 366s 366s x1: [4, 4, 2, 1] 366s -0.363 -0.618 0.131 -0.115 366s 0.413 -0.625 0.774 -0.566 366s 0.203 0.653 0.818 -0.453 366s -0.264 -0.634 0.457 0.587 366s 366s y: [1, 4, 2, 1] 366s 0.230 366s 0.833 366s -0.502 366s -0.457 366s testing: mul_mat, [3, 1, 2, 4] = [4, 3, 2, 4] * [4, 1, 2, 4] 366s x0: [4, 3, 2, 4] 366s -0.487 -0.878 0.697 -0.556 366s 0.840 0.794 -0.225 0.063 366s -0.620 -0.157 0.484 -0.744 366s 366s x1: [4, 1, 2, 4] 366s -0.363 0.925 -0.656 0.003 366s 366s y: [3, 1, 2, 4] 366s -1.094 0.577 -0.239 366s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 366s x0: [4, 1, 1, 1] 366s -0.600 -0.296 -0.146 -0.825 366s 366s x1: [4, 3, 1, 1] 366s 0.249 0.731 -0.796 -0.586 366s -0.333 -0.422 -0.013 -0.065 366s -0.247 -0.311 -0.513 -0.298 366s 366s y: [1, 3, 1, 1] 366s 0.234 366s 0.380 366s 0.562 366s testing: mul_mat, [4, 4, 2, 1] = [1, 4, 2, 1] * [1, 4, 2, 1] 366s x0: [1, 4, 2, 1] 366s -0.918 366s -0.672 366s -0.223 366s -0.275 366s 366s x1: [1, 4, 2, 1] 366s -0.683 366s -0.605 366s 0.917 366s -0.238 366s 366s y: [4, 4, 2, 1] 366s 0.627 0.459 0.152 0.188 366s 0.555 0.406 0.135 0.166 366s -0.842 -0.616 -0.204 -0.252 366s 0.219 0.160 0.053 0.066 366s testing: mul_mat, [3, 1, 2, 4] = [4, 3, 2, 4] * [4, 1, 2, 4] 366s x0: [4, 3, 2, 4] 366s 0.205 -0.558 -0.618 0.024 366s -0.875 -0.760 0.959 -0.467 366s 0.906 -0.550 0.395 -0.485 366s 366s x1: [4, 1, 2, 4] 366s -0.561 0.122 -0.515 0.427 366s 366s y: [3, 1, 2, 4] 366s 0.146 -0.296 -0.985 366s test-mul-mat0: iter:272/500 366s testing: mul_mat, [4, 4, 1, 1] = [1, 4, 1, 1] * [1, 4, 1, 1] 366s x0: [1, 4, 1, 1] 366s -0.532 366s -0.112 366s -0.188 366s 0.738 366s 366s x1: [1, 4, 1, 1] 366s -0.550 366s -0.517 366s 0.031 366s 0.728 366s 366s y: [4, 4, 1, 1] 366s 0.292 0.061 0.104 -0.405 366s 0.275 0.058 0.097 -0.381 366s -0.017 -0.003 -0.006 0.023 366s -0.387 -0.081 -0.137 0.537 366s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 366s x0: [1, 2, 1, 1] 366s 0.992 366s -0.436 366s 366s x1: [1, 4, 1, 1] 366s 0.598 366s -0.081 366s -0.992 366s -0.696 366s 366s y: [2, 4, 1, 1] 366s 0.593 -0.261 366s -0.080 0.035 366s -0.985 0.433 366s -0.691 0.303 366s testing: mul_mat, [4, 2, 1, 2] = [1, 4, 1, 2] * [1, 2, 1, 2] 366s x0: [1, 4, 1, 2] 366s 0.947 366s -0.338 366s -0.173 366s -0.545 366s 366s x1: [1, 2, 1, 2] 366s 0.473 366s -0.324 366s 366s y: [4, 2, 1, 2] 366s 0.447 -0.160 -0.082 -0.258 366s -0.307 0.110 0.056 0.177 366s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 366s x0: [1, 3, 1, 1] 366s -0.099 366s -0.410 366s -0.769 366s 366s x1: [1, 4, 1, 1] 366s 0.147 366s -0.269 366s 0.173 366s 0.630 366s 366s y: [3, 4, 1, 1] 366s -0.014 -0.060 -0.113 366s 0.027 0.110 0.207 366s -0.017 -0.071 -0.133 366s -0.062 -0.259 -0.485 366s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 366s x0: [3, 4, 1, 1] 366s -0.680 0.968 0.003 366s 0.976 -0.745 -0.278 366s 0.691 0.410 -0.587 366s 0.937 -0.356 -0.050 366s 366s x1: [3, 1, 1, 1] 366s 0.789 0.401 0.968 366s 366s y: [4, 1, 1, 1] 366s -0.145 0.203 0.141 0.549 366s testing: mul_mat, [2, 3, 1, 2] = [4, 2, 1, 2] * [4, 3, 1, 2] 366s x0: [4, 2, 1, 2] 366s -0.009 0.476 -0.890 0.097 366s 0.697 0.713 -0.574 0.350 366s 366s x1: [4, 3, 1, 2] 366s 0.384 0.240 0.405 -0.447 366s -0.445 -0.585 0.276 0.702 366s 0.146 -0.551 0.332 -0.091 366s 366s y: [2, 3, 1, 2] 366s -0.293 0.050 366s -0.451 -0.639 366s -0.568 -0.513 366s test-mul-mat0: iter:273/500 366s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s 1] * [3, 2, 1, 1] 366s x0: [3, 2, 1, 1] 366s -0.392 0.409 -0.313 366s 0.081 -0.601 -0.616 366s 366s x1: [3, 2, 1, 1] 366s 0.441 0.864 -0.815 366s -0.450 -0.610 0.828 366s 366s y: [2, 2, 1, 1] 366s 0.435 0.019 366s -0.332 -0.180 366s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 366s x0: [3, 2, 1, 1] 366s -0.141 0.750 0.059 366s 0.784 0.357 -0.548 366s 366s x1: [3, 2, 1, 1] 366s -0.444 -0.888 -0.506 366s -0.018 0.209 0.844 366s 366s y: [2, 2, 1, 1] 366s -0.633 -0.387 366s 0.209 -0.402 366s testing: mul_mat, [4, 2, 1, 4] = [3, 4, 1, 4] * [3, 2, 1, 4] 366s x0: [3, 4, 1, 4] 366s 0.159 0.538 0.236 366s -0.057 -0.105 0.687 366s -0.899 0.192 -0.560 366s 0.983 -0.650 -0.365 366s 366s x1: [3, 2, 1, 4] 366s 0.158 -0.703 -0.247 366s 0.882 0.158 -0.805 366s 366s y: [4, 2, 1, 4] 366s -0.411 -0.105 -0.139 0.702 366s 0.036 -0.620 -0.311 1.059 366s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 366s x0: [3, 4, 1, 1] 366s 0.976 -0.378 -0.588 366s -0.295 -0.217 -0.446 366s -0.655 -0.161 -0.358 366s 0.564 -0.360 -0.608 366s 366s x1: [3, 4, 1, 1] 366s 0.486 -0.797 0.981 366s -0.691 -0.365 -0.597 366s -0.418 0.448 0.917 366s -0.201 0.624 0.731 366s 366s y: [4, 4, 1, 1] 366s 0.200 -0.408 -0.541 -0.035 366s -0.186 0.549 0.725 0.105 366s -1.116 -0.383 -0.128 -0.955 366s -0.862 -0.402 -0.231 -0.783 366s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 366s x0: [4, 4, 1, 1] 366s -0.747 -0.365 -0.043 -0.744 366s -0.097 -0.388 0.417 -0.398 366s 0.809 0.853 0.961 -0.848 366s 0.877 0.340 0.740 -0.190 366s 366s x1: [4, 3, 1, 1] 366s -0.097 -0.628 -0.935 -0.368 366s 0.967 -0.398 -0.881 -0.830 366s -0.417 -0.572 -0.195 -0.014 366s 366s y: [4, 3, 1, 1] 366s 0.615 0.010 -1.201 -0.920 366s 0.079 0.024 0.298 0.219 366s 0.539 0.187 -1.001 -0.702 366s testing: mul_mat, [2, 4, 1, 4] = [4, 2, 1, 4] * [4, 4, 1, 4] 366s x0: [4, 2, 1, 4] 366s -0.851 -0.988 0.417 0.220 366s 0.881 -0.191 0.556 -0.253 366s 366s x1: [4, 4, 1, 4] 366s 0.243 -0.456 0.713 0.615 366s -0.391 -0.655 0.582 0.211 366s -0.536 0.751 0.794 -0.108 366s -0.444 -0.220 0.902 -0.191 366s 366s y: [2, 4, 1, 4] 366s 0.676 0.542 366s 1.269 0.051 366s 0.021 -0.148 366s 0.929 0.201 366s test-mul-mat0: iter:274/500 366s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 366s x0: [2, 2, 1, 1] 366s -0.750 -0.044 366s 0.359 -0.421 366s 366s x1: [2, 1, 1, 1] 366s -0.742 -0.231 366s 366s y: [2, 1, 1, 1] 366s 0.567 -0.169 366s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 366s x0: [2, 4, 1, 1] 366s 0.702 -0.222 366s 0.297 -0.903 366s 0.169 0.212 366s 0.160 -0.267 366s 366s x1: [2, 2, 1, 1] 366s -0.080 -0.991 366s -0.045 0.347 366s 366s y: [4, 2, 1, 1] 366s 0.164 0.872 -0.223 0.252 366s -0.109 -0.327 0.066 -0.100 366s testing: mul_mat, [1, 4, 1, 4] = [2, 1, 1, 4] * [2, 4, 1, 4] 366s x0: [2, 1, 1, 4] 366s -0.220 0.823 366s 366s x1: [2, 4, 1, 4] 366s -0.267 -0.937 366s -0.507 0.326 366s -0.479 0.184 366s -0.933 -0.140 366s 366s y: [1, 4, 1, 4] 366s -0.713 366s 0.379 366s 0.257 366s 0.090 366s testing: mul_mat, [1, 1, 1, 1] = [2, 1, 1, 1] * [2, 1, 1, 1] 366s x0: [2, 1, 1, 1] 366s -0.934 -0.610 366s 366s x1: [2, 1, 1, 1] 366s -0.053 0.996 366s 366s y: [1, 1, 1, 1] 366s -0.557 366s testing: mul_mat, [2, 2, 1, 1] = [1, 2, 1, 1] * [1, 2, 1, 1] 366s x0: [1, 2, 1, 1] 366s 0.493 366s 0.031 366s 366s x1: [1, 2, 1, 1] 366s 0.232 366s -0.456 366s 366s y: [2, 2, 1, 1] 366s 0.115 0.007 366s -0.225 -0.014 366s testing: mul_mat, [3, 1, 1, 4] = [2, 3, 1, 4] * [2, 1, 1, 4] 366s x0: [2, 3, 1, 4] 366s 0.536 -0.515 366s 0.330 -0.890 366s 0.100 -0.077 366s 366s x1: [2, 1, 1, 4] 366s 0.292 0.102 366s 366s y: [3, 1, 1, 4] 366s 0.104 0.006 0.021 366s test-mul-mat0: iter:275/500 366s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 366s x0: [3, 1, 1, 1] 366s 0.897 0.703 -0.440 366s 366s x1: [3, 1, 1, 1] 366s -0.461 -0.433 -0.398 366s 366s y: [1, 1, 1, 1] 366s -0.543 366s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 366s x0: [3, 1, 1, 1] 366s 0.438 -0.197 -0.993 366s 366s x1: [3, 1, 1, 1] 366s -0.993 -0.374 -0.484 366s 366s y: [1, 1, 1, 1] 366s 0.119 366s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 366s x0: [3, 1, 1, 1] 366s 0.335 -0.761 0.406 366s 366s x1: [3, 1, 1, 1] 366s 0.756 -0.964 0.928 366s 366s y: [1, 1, 1, 1] 366s 1.364 366s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 366s x0: [3, 2, 1, 1] 366s -0.119 0.213 0.901 366s -0.623 -0.229 0.213 366s 366s x1: [3, 1, 1, 1] 366s 0.178 0.914 -0.659 366s 366s y: [2, 1, 1, 1] 366s -0.421 -0.460 366s testing: mul_mat, [3, 3, 1, 1] = [2, 3, 1, 1] * [2, 3, 1, 1] 366s x0: [2, 3, 1, 1] 366s -0.628 -0.546 366s -0.857 0.582 366s -0.539 0.264 366s 366s x1: [2, 3, 1, 1] 366s 0.310 -0.532 366s 0.815 0.385 366s -0.635 0.518 366s 366s y: [3, 3, 1, 1] 366s 0.096 -0.575 -0.307 366s -0.722 -0.474 -0.337 366s 0.116 0.845 0.479 366s testing: mul_mat, [1, 2, 1, 1]ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s = [3, 1, 1, 1] * [3, 2, 1, 1] 366s x0: [3, 1, 1, 1] 366s -0.291 0.814 -0.457 366s 366s x1: [3, 2, 1, 1] 366s -0.539 0.338 0.300 366s -0.611 0.303 -0.365 366s 366s y: [1, 2, 1, 1] 366s 0.294 366s 0.591 366s test-mul-mat0: iter:276/500 366s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 366s x0: [3, 4, 1, 1] 366s -0.109 -0.839 0.624 366s -0.164 -0.467 0.768 366s 0.298 -0.013 0.349 366s -0.438 0.449 -0.313 366s 366s x1: [3, 2, 1, 1] 366s 0.079 -0.805 0.328 366s -0.708 0.506 0.796 366s 366s y: [4, 2, 1, 1] 366s 0.871 0.614 0.148 -0.498 366s 0.150 0.491 0.061 0.288 366s testing: mul_mat, [1, 4, 3, 1] = [3, 1, 3, 1] * [3, 4, 3, 1] 366s x0: [3, 1, 3, 1] 366s -0.610 -0.258 0.520 366s 366s x1: [3, 4, 3, 1] 366s 0.862 0.838 0.990 366s -0.503 -0.534 -0.301 366s -0.689 0.009 0.748 366s -0.823 -0.567 0.175 366s 366s y: [1, 4, 3, 1] 366s -0.228 366s 0.288 366s 0.807 366s 0.740 366s testing: mul_mat, [2, 1, 3, 1] = [3, 2, 3, 1] * [3, 1, 3, 1] 366s x0: [3, 2, 3, 1] 366s 0.982 -0.074 0.747 366s 0.702 -0.645 -0.170 366s 366s x1: [3, 1, 3, 1] 366s 0.371 0.937 0.495 366s 366s y: [2, 1, 3, 1] 366s 0.665 -0.429 366s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 366s x0: [3, 2, 1, 1] 366s 0.973 -0.159 0.482 366s -0.465 -0.204 -0.807 366s 366s x1: [3, 2, 1, 1] 366s -0.859 0.074 0.086 366s 0.512 0.011 -0.419 366s 366s y: [2, 2, 1, 1] 366s -0.806 0.315 366s 0.294 0.098 366s testing: mul_mat, [4, 3, 3, 1] = [2, 4, 3, 1] * [2, 3, 3, 1] 366s x0: [2, 4, 3, 1] 366s -0.821 0.097 366s -0.484 0.386 366s -0.151 0.163 366s 0.190 0.632 366s 366s x1: [2, 3, 3, 1] 366s -0.222 -0.592 366s 0.940 -0.521 366s -0.237 -0.230 366s 366s y: [4, 3, 3, 1] 366s 0.125 -0.121 -0.063 -0.416 366s -0.822 -0.656 -0.227 -0.151 366s 0.172 0.026 -0.002 -0.190 366s testing: mul_mat, [1, 2, 3, 1] = [4, 1, 3, 1] * [4, 2, 3, 1] 366s x0: [4, 1, 3, 1] 366s -0.233 0.956 0.687 -0.129 366s 366s x1: [4, 2, 3, 1] 366s -0.019 -0.758 0.691 -0.823 366s 0.347 0.121 0.514 0.526 366s 366s y: [1, 2, 3, 1] 366s -0.140 366s 0.320 366s test-mul-mat0: iter:277/500 366s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 366s x0: [3, 2, 1, 1] 366s 0.387 0.630 0.909 366s -0.133 -0.975 -0.517 366s 366s x1: [3, 1, 1, 1] 366s -0.805 0.020 0.654 366s 366s y: [2, 1, 1, 1] 366s 0.296 -0.251 366s testing: mul_mat, [3, 2, 4, 1] = [3, 3, 4, 1] * [3, 2, 4, 1] 366s x0: [3, 3, 4, 1] 366s 0.058 -0.478 0.694 366s 0.926 -0.452 -0.824 366s 0.795 0.328 -0.552 366s 366s x1: [3, 2, 4, 1] 366s 0.869 -0.219 -0.729 366s 0.156 -0.334 0.039 366s 366s y: [3, 2, 4, 1] 366s -0.350 1.504 1.021 366s 0.196 0.264 -0.007 366s testing: mul_mat, [2, 3, 4, 1] = [3, 2, 4, 1] * [3, 3, 4, 1] 366s x0: [3, 2, 4, 1] 366s -0.314 0.824 0.401 366s -0.439 -0.902 0.092 366s 366s x1: [3, 3, 4, 1] 366s -0.456 0.720 -0.187 366s -0.008 -0.329 -0.193 366s -0.522 -0.266 0.154 366s 366s y: [2, 3, 4, 1] 366s 0.661 -0.466 366s -0.346 0.283 366s 0.007 0.483 366s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 366s x0: [3, 4, 1, 1] 366s 0.013 0.104 -0.183 366s 0.005 0.781 -0.256 366s -0.311 0.168 -0.340 366s 0.574 0.101 -0.586 366s 366s x1: [3, 2, 1, 1] 366s 0.602 -0.880 -0.908 366s 0.041 -0.673 0.182 366s 366s y: [4, 2, 1, 1] 366s 0.083 -0.452 -0.026 0.787 366s -0.103 -0.572 -0.187 -0.151 366s testing: mul_mat, [1, 3, 4, 1] = [4, 1, 4, 1] * [4, 3, 4, 1] 366s x0: [4, 1, 4, 1] 366s -0.886 -0.082 -0.999 0.762 366s 366s x1: [4, 3, 4, 1] 366s -0.413 -0.042 0.844 -0.782 366s -0.114 -0.954 -0.947 0.118 366s 0.022 0.028 -0.978 -0.304 366s 366s y: [1, 3, 4, 1] 366s -1.070 366s 1.216 366s 0.724 366s testing: mul_mat, [2, 4, 4, 1] = [1, 2, 4, 1] * [1, 4, 4, 1] 366s x0: [1, 2, 4, 1] 366s 0.079 366s 0.686 366s 366s x1: [1, 4, 4, 1] 366s -0.508 366s 0.583 366s 0.155 366s -0.682 366s 366s y: [2, 4, 4, 1] 366s -0.040 -0.348 366s 0.046 0.400 366s 0.012 0.106 366s -0.054 -0.468 366s test-mul-mat0: iter:278/500 366s testing: mul_mat, [1, 1, 1, 1] = [2, 1, 1, 1] * [2, 1, 1, 1] 366s x0: [2, 1, 1, 1] 366s -0.002 -0.825 366s 366s x1: [2, 1, 1, 1] 366s 0.415 0.020 366s 366s y: [1, 1, 1, 1] 366s -0.017 366s testing: mul_mat, [4, 1, 4, 1] = [2, 4, 4, 1] * [2, 1, 4, 1] 366s x0: [2, 4, 4, 1] 366s 0.036 -0.608 366s 0.030 -0.040 366s -0.075 -0.891 366s -0.354 0.433 366s 366s x1: [2, 1, 4, 1] 366s 0.401 0.385 366s 366s y: [4, 1, 4, 1] 366s -0.220 -0.003 -0.373 0.025 366s testing: mul_mat, [1, 4, 4, 1] = [2, 1, 4, 1] * [2, 4, 4, 1] 366s x0: [2, 1, 4, 1] 366s 0.967 0.152 366s 366s x1: [2, 4, 4, 1] 366s -0.235 0.235 366s -0.123 0.690 366s 0.344 0.523 366s 0.123 -0.389 366s 366s y: [1, 4, 4, 1] 366s -0.191 366s -0.013 366s 0.412 366s 0.060 366s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 366s x0: [2, 4, 1, 1] 366s 0.663 -0.018 366s -0.435 -0.770 366s -0.322 0.503 366s 0.776 -0.149 366s 366s x1: [2, 1, 1, 1] 366s -0.834 0.423 366s 366s y: [4, 1, 1, 1] 366s -0.560 0.038 0.481 -0.710 366s testing: mul_mat, [2, 2, 4, 1] =ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s [4, 2, 4, 1] * [4, 2, 4, 1] 366s x0: [4, 2, 4, 1] 366s -0.695 0.171 0.746 -0.916 366s 0.418 0.854 -0.578 0.165 366s 366s x1: [4, 2, 4, 1] 366s -0.134 -0.770 0.073 0.157 366s -0.446 0.549 -0.671 0.252 366s 366s y: [2, 2, 4, 1] 366s -0.127 -0.729 366s -0.328 0.712 366s testing: mul_mat, [4, 4, 4, 1] = [2, 4, 4, 1] * [2, 4, 4, 1] 366s x0: [2, 4, 4, 1] 366s 0.976 -0.866 366s 0.675 0.501 366s 0.422 -0.943 366s -0.099 0.144 366s 366s x1: [2, 4, 4, 1] 366s 0.548 -0.480 366s -0.158 0.295 366s -0.057 -0.074 366s -0.541 -0.921 366s 366s y: [4, 4, 4, 1] 366s 0.950 0.130 0.684 -0.123 366s -0.409 0.041 -0.345 0.058 366s 0.009 -0.076 0.046 -0.005 366s 0.270 -0.826 0.641 -0.079 366s test-mul-mat0: iter:279/500 366s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 366s x0: [3, 4, 1, 1] 366s 0.643 0.931 -0.505 366s 0.664 0.196 -0.756 366s -0.706 -0.394 -0.176 366s -0.118 0.150 0.785 366s 366s x1: [3, 4, 1, 1] 366s 0.989 0.477 0.914 366s -0.429 -0.674 0.491 366s 0.378 -0.120 -0.660 366s 0.234 0.497 -0.870 366s 366s y: [4, 4, 1, 1] 366s 0.619 0.059 -1.047 0.672 366s -1.152 -0.789 0.482 0.335 366s 0.464 0.727 -0.104 -0.581 366s 1.053 0.911 -0.208 -0.636 366s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 366s x0: [3, 2, 1, 1] 366s 0.772 -0.065 0.611 366s 0.269 0.065 0.059 366s 366s x1: [3, 4, 1, 1] 366s 0.949 0.669 -0.580 366s -0.562 0.090 0.190 366s -0.573 -0.433 0.104 366s -0.002 -0.108 -0.405 366s 366s y: [2, 4, 1, 1] 366s 0.335 0.265 366s -0.324 -0.134 366s -0.351 -0.176 366s -0.242 -0.032 366s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 366s x0: [3, 1, 1, 1] 366s -0.202 -0.378 0.752 366s 366s x1: [3, 2, 1, 1] 366s -0.088 -0.004 0.554 366s -0.424 -0.808 0.798 366s 366s y: [1, 2, 1, 1] 366s 0.436 366s 0.991 366s testing: mul_mat, [3, 1, 1, 1] = [3, 3, 1, 1] * [3, 1, 1, 1] 366s x0: [3, 3, 1, 1] 366s -0.173 -0.983 0.121 366s -0.861 -0.434 0.564 366s -0.293 0.274 -0.834 366s 366s x1: [3, 1, 1, 1] 366s 0.948 -0.593 0.702 366s 366s y: [3, 1, 1, 1] 366s 0.504 -0.163 -1.025 366s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 366s x0: [3, 2, 1, 1] 366s -0.836 -0.096 0.566 366s -0.304 0.962 0.702 366s 366s x1: [3, 3, 1, 1] 366s 0.716 0.940 0.938 366s -0.349 0.550 0.208 366s 0.716 -0.390 -0.880 366s 366s y: [2, 3, 1, 1] 366s -0.157 1.346 366s 0.357 0.781 366s -1.060 -1.212 366s testing: mul_mat, [3, 3, 1, 1] = [2, 3, 1, 1] * [2, 3, 1, 1] 366s x0: [2, 3, 1, 1] 366s 0.160 0.726 366s 0.974 0.247 366s 0.835 -0.045 366s 366s x1: [2, 3, 1, 1] 366s -0.416 0.319 366s 0.650 -0.009 366s 0.020 0.267 366s 366s y: [3, 3, 1, 1] 366s 0.165 -0.326 -0.361 366s 0.097 0.631 0.543 366s 0.197 0.086 0.005 366s test-mul-mat0: iter:280/500 366s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 366s x0: [1, 4, 1, 1] 366s -0.220 366s -0.441 366s 0.039 366s 0.389 366s 366s x1: [1, 1, 1, 1] 366s 0.352 366s 366s y: [4, 1, 1, 1] 366s -0.077 -0.155 0.014 0.137 366s testing: mul_mat, [4, 4, 2, 1] = [1, 4, 2, 1] * [1, 4, 2, 1] 366s x0: [1, 4, 2, 1] 366s -0.740 366s -0.992 366s 0.090 366s 0.280 366s 366s x1: [1, 4, 2, 1] 366s -0.321 366s 0.751 366s 0.553 366s 0.375 366s 366s y: [4, 4, 2, 1] 366s 0.238 0.318 -0.029 -0.090 366s -0.556 -0.745 0.068 0.210 366s -0.410 -0.549 0.050 0.155 366s -0.278 -0.372 0.034 0.105 366s testing: mul_mat, [3, 4, 2, 4] = [1, 3, 2, 4] * [1, 4, 2, 4] 366s x0: [1, 3, 2, 4] 366s -0.477 366s -0.539 366s -0.582 366s 366s x1: [1, 4, 2, 4] 366s -0.258 366s -0.834 366s -0.503 366s 0.698 366s 366s y: [3, 4, 2, 4] 366s 0.123 0.139 0.150 366s 0.398 0.450 0.486 366s 0.240 0.271 0.293 366s -0.333 -0.376 -0.406 366s testing: mul_mat, [1, 3, 1, 1] = [1, 1, 1, 1] * [1, 3, 1, 1] 366s x0: [1, 1, 1, 1] 366s -0.329 366s 366s x1: [1, 3, 1, 1] 366s -0.859 366s -0.356 366s -0.092 366s 366s y: [1, 3, 1, 1] 366s 0.283 366s 0.117 366s 0.030 366s testing: mul_mat, [3, 1, 2, 1] = [1, 3, 2, 1] * [1, 1, 2, 1] 366s x0: [1, 3, 2, 1] 366s -0.867 366s -0.683 366s 0.627 366s 366s x1: [1, 1, 2, 1] 366s 0.672 366s 366s y: [3, 1, 2, 1] 366s -0.583 -0.459 0.421 366s testing: mul_mat, [3, 1, 2, 4] = [3, 3, 2, 4] * [3, 1, 2, 4] 366s x0: [3, 3, 2, 4] 366s -0.637 0.326 -0.189 366s -0.469 0.951 0.937 366s 0.707 0.029 0.866 366s 366s x1: [3, 1, 2, 4] 366s 0.485 0.987 -0.163 366s 366s y: [3, 1, 2, 4] 366s 0.043 0.558 0.231 366s test-mul-mat0: iter:281/500 366s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 366s x0: [4, 4, 1, 1] 366s 0.238 0.820 0.485 0.525 366s 0.032 -0.750 -0.649 -0.909 366s -0.241 0.946 0.208 0.093 366s -0.328 -0.084 -0.041 0.857 366s 366s x1: [4, 1, 1, 1] 366s -0.390 -0.691 -0.067 -0.862 366s 366s y: [4, 1, 1, 1] 366s -1.144 1.333 -0.653 -0.550 366s testing: mul_mat, [3, 4, 3, 1] = [4, 3, 3, 1] * [4, 4, 3, 1] 366s x0: [4, 3, 3, 1] 366s 0.993 -0.547 0.464 0.448 366s -0.620 0.536 -0.152 0.514 366s -0.300 -0.079 0.566 -0.176 366s 366s x1: [4, 4, 3, 1] 366s 0.166 0.385 -0.690 0.896 366s -0.762 0.581 -0.551 -0.764 366s 0.691 0.408 -0.154 -1.000 366s -0.660 -0.016 0.577 0.579 366s 366s y: [3, 4, 3, 1] 366s 0.035 0.668 -0.628 366s -1.673 0.476 0.005 366s -0.056 -0.701 -0.150 366s -0.119 0.611 0.423 366s testing: mul_mat, [1, 3, 3, 1] = [4, 1, 3, 1] * [4, 3, 3, 1] 366s x0: [4, 1, 3, 1] 366s -0.359 -0.717 0.909 -0.327 366s 366s x1: [4, 3, 3, 1] 366s -0.824 -0.085 0.665 -0.124 366s 0.836 0.230 0.700 -0.314 366s -0.987 -0.968 0.777 0.910 366s 366s y: [1, 3, 3, 1] 366s 1.001 366s 0.273 366s 1.456 366s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 366s x0: [4, 2, 1, 1] 366s -0.326 -0.770 0.703 -0.701 366s -0.147 0.453 -0.882 -0.297 366s 366s x1: [4, 1, 1, 1] 366s -0.986 -0.091 -0.485 0.774 366s 366s y: [2, 1, 1, 1] 366s -0.492 0.301 366s testing: mul_mat, [2, 4, 3, 1] = [2, 2, 3, 1] * [2, 4, 3, 1] 366s x0: [2, 2, 3, 1] 366s -0.269 -0.480 366s 0.800 -0.151 366s 366s x1: [2, 4, 3, 1] 366s -0.817 -0.940 366s 0.368 0.123 366s 0.657 0.263 366s 0.764 0.941 366s 366s y: [2, 4, 3, 1] 366s 0.672 -0.512 366s -0.158 0.276 366s -0.304 0.486 366s -0.658 0.470 366s testing: mul_mat, [3, 2, 3, 1] = [2, 3, 3, 1] * [2, 2, 3, 1] 366s x0: [2, 3, 3, 1] 366s -0.361 0.140 366s 0.024 0.371 366s 0.905 -0.279 366s 366s x1: [2, 2, 3, 1] 366s 0.919 0.758 366s 0.026 -0.643 366s 366s y: [3, 2, 3, 1] 366s -0.226 0.303 0.621 366s -0.099 -0.238 0.203 366s test-mul-mat0: iter:282/500 366s testing: mul_mat, [4, 4, 1, 1] = [4, 4, 1, 1] * [4, 4, 1, 1] 366s x0: [4, 4, 1, 1] 366s 0.322 -0.240 -0.666 0.786 366s -0.489 0.765 -0.160 -0.583 366s 0.014 0.750 -0.757 -0.980 366s 0.685 -0.574 -0.216 0.638 366s 366s x1: [4, 4, 1, 1] 366s -0.235 0.952 0.760 0.273 366s -0.854 0.513 -0.608 0.497 366s 0.455 0.031 -0.479 0.360 366s -0.829 0.892 -0.919 -0.959 366s 366s y: [4, 4, 1, 1] 366s -0.596 0.564 -0.132 -0.698 366s 0.397 0.618 0.346 -0.432 366s 0.741 -0.332 0.039 0.627 366s -0.623 1.793 2.293 -1.494 366s testing: mul_mat, [2, 4, 4, 1] = [4, 2, 4, 1] * [4, 4, 4, 1] 366s x0: [4, 2, 4, 1] 366s -0.354 0.217 -0.668 0.737 366s 0.877 -0.187 0.342 -0.812 366s 366s x1: [4, 4, 4, 1] 366s -0.814 -0.943 0.783 0.700 366s -0.551 0.280 0.155 0.480 366s 0.801 -0.486 0.651 0.693 366s -0.405 0.692 -0.244 0.917 366s 366s y: [2, 4, 4, 1] 366s 0.075 -0.837 366s 0.506 -0.873 366s -0.313 0.454 366s 1.132 -1.312 366s testing: mul_mat, [3, 2, 4, 1] = [4, 3, 4, 1] * [4, 2, 4, 1] 366s x0: [4, 3, 4, 1] 366s -0.907 0.992 -0.641 -0.708 366s -0.191 0.532 -0.967 0.817 366s -0.379 0.433 0.473 0.644 366s 366s x1: [4, 2, 4, 1] 366s -0.249 0.374 -0.143 -0.372 366s -0.813 -0.801 -0.183 0.173 366s 366s y: [3, 2, 4, 1] 366s 0.951 0.082 -0.050 366s -0.062 0.048 -0.014 366s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 366s x0: [4, 3, 1, 1] 366s 0.708 0.429 -0.317 -0.442 366s -0.643 0.397 -0.530 -0.024 366s -0.024 0.278 0.398 -0.009 366s 366s x1: [4, 3, 1, 1] 366s 0.866 -0.870 -0.769 -0.685 366s -0.235 0.534 0.841 0.156 366s -0.047 0.980 -0.655 0.608 366s 366s y: [3, 3, 1, 1] 366s 0.785 -0.478 -0.563 366s -0.272 -0.087 0.488 366s 0.326 0.751 0.008 366s testing: mul_mat, [4, 4, 4, 1] = [3, 4, 4, 1] * [3, 4, 4, 1] 366s x0: [3, 4, 4, 1] 366s -0.499 -0.621 -0.875 366s 0.189 0.226 0.054 366s -0.203 -0.155 -0.316 366s -0.966 -0.748 0.999 366s 366s x1: [3, 4, 4, 1] 366s -0.923 0.796 0.778 366s 0.073 0.657 -0.651 366s -0.061 0.787 -0.420 366s 0.254 -0.448 -0.886 366s 366s y: [4, 4, 4, 1] 366s -0.715 0.047 -0.182 1.074 366s 0.125 0.127 0.089 -1.213 366s -0.091 0.144 0.023 -0.949 366s 0.927 -0.101 0.298 -0.795 366s testing: mul_mat, [4, 3, 4, 1] = [4, 4, 4, 1] * [4, 3, 4, 1] 366s x0: [4, 4, 4, 1] 366s -0.433 -0.175 0.569 0.912 366s 0.625 0.401 0.374 0.340 366s 0.801 0.889 0.349 0.860 366s -0.316 0.510 0.536 -0.205 366s 366s x1: [4, 3, 4, 1] 366s -0.848 -0.643 -0.654 -0.734 366s -0.178 0.413 -0.953 0.308 366s 0.349 0.203 -0.118 0.230 366s 366s y: [4, 3, 4, 1] 366s -0.562 -1.283 -2.111 -0.260 366s -0.257 -0.198 0.156 -0.307 366s -0.044 0.334 0.617 -0.118 366s test-mul-mat0: iter:283/500 366s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 366s x0: [4, 3, 1, 1] 366s -0.146 0.274 -0.020 -0.332 366s 0.313 0.196 0.181 0.703 366s 0.511 0.248 -0.648 -0.560 366s 366s x1: [4, 3, 1, 1] 366s 0.686 0.462 -0.456 0.754 366s 0.031 0.464 0.735 0.941 366s -0.757 -0.490 -0.553 -0.971 366s 366s y: [3, 3, 1, 1] 366s -0.215 0.753 0.338 366s -0.205 0.895 -0.872 366s 0.310 -1.116 0.394 366s testing: mul_mat, [4, 3, 4, 1] = [4, 4, 4, 1] * [4, 3, 4, 1] 366s x0: [4, 4, 4, 1] 366s 0.006 -0.473 -0.311 -0.525 366s 0.059 -0.501 0.139 -0.228 366s -0.989 0.314 0.734 0.876 366s 0.929 -0.870 -0.375 -0.673 366s 366s x1: [4, 3, 4, 1] 366s 0.435 -0.115 0.783 -0.573 366s -ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s 0.872 -0.057 -0.887 0.590 366s 0.486 0.867 -0.378 -0.050 366s 366s y: [4, 3, 4, 1] 366s 0.114 0.323 -0.393 0.596 366s -0.012 -0.280 0.710 -0.825 366s -0.263 -0.447 -0.530 -0.127 366s testing: mul_mat, [3, 4, 4, 3] = [4, 3, 4, 3] * [4, 4, 4, 3] 366s x0: [4, 3, 4, 3] 366s -0.989 -0.880 -0.796 -0.862 366s 0.779 -0.126 0.909 -0.603 366s 0.978 -0.226 -0.402 0.067 366s 366s x1: [4, 4, 4, 3] 366s 0.042 0.573 -0.390 0.664 366s 0.810 -0.658 -0.703 0.361 366s 0.807 -0.194 -0.159 0.403 366s 0.940 -0.353 0.953 -0.424 366s 366s y: [3, 4, 4, 3] 366s -0.807 -0.795 0.112 366s 0.027 -0.143 1.247 366s -0.848 0.265 0.924 366s -1.012 1.898 0.587 366s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 366s x0: [4, 3, 1, 1] 366s 0.030 -0.922 -0.367 -0.929 366s -0.114 -0.211 -0.806 0.268 366s -0.696 0.400 0.354 0.299 366s 366s x1: [4, 3, 1, 1] 366s 0.092 0.260 -0.503 -0.537 366s 0.412 0.480 0.450 0.401 366s 0.641 0.524 0.815 -0.294 366s 366s y: [3, 3, 1, 1] 366s 0.446 0.196 -0.299 366s -0.968 -0.404 0.185 366s -0.489 -0.920 -0.036 366s testing: mul_mat, [2, 4, 4, 1] = [3, 2, 4, 1] * [3, 4, 4, 1] 366s x0: [3, 2, 4, 1] 366s -0.142 -0.936 0.889 366s -0.239 -0.996 0.571 366s 366s x1: [3, 4, 4, 1] 366s -0.076 -0.807 -0.808 366s 0.416 -0.869 0.507 366s -0.492 0.391 -0.995 366s -0.029 -0.197 0.485 366s 366s y: [2, 4, 4, 1] 366s 0.048 0.361 366s 1.205 1.056 366s -1.181 -0.840 366s 0.619 0.480 366s testing: mul_mat, [4, 3, 4, 3] = [2, 4, 4, 3] * [2, 3, 4, 3] 366s x0: [2, 4, 4, 3] 366s -0.034 -0.021 366s 0.872 0.402 366s -0.062 -0.136 366s 0.890 0.327 366s 366s x1: [2, 3, 4, 3] 366s 0.231 -0.189 366s -0.443 -0.255 366s -0.172 -0.731 366s 366s y: [4, 3, 4, 3] 366s -0.004 0.125 0.011 0.144 366s 0.020 -0.488 0.062 -0.477 366s 0.021 -0.444 0.110 -0.392 366s test-mul-mat0: iter:284/500 366s testing: mul_mat, [3, 4, 1, 1] = [4, 3, 1, 1] * [4, 4, 1, 1] 366s x0: [4, 3, 1, 1] 366s 0.513 -0.088 0.905 -0.930 366s -0.355 0.046 0.909 -0.601 366s 0.623 -0.768 0.896 0.333 366s 366s x1: [4, 4, 1, 1] 366s 0.866 0.300 -0.136 0.364 366s -0.627 -0.301 -0.153 -0.508 366s 0.161 -0.996 0.164 -0.650 366s 0.151 -0.072 0.973 0.369 366s 366s y: [3, 4, 1, 1] 366s -0.044 -0.636 0.308 366s 0.039 0.374 -0.466 366s 0.923 0.437 0.796 366s 0.621 0.606 1.144 366s testing: mul_mat, [1, 3, 2, 1] = [4, 1, 2, 1] * [4, 3, 2, 1] 366s x0: [4, 1, 2, 1] 366s -0.741 0.890 0.952 -0.509 366s 366s x1: [4, 3, 2, 1] 366s 0.641 0.720 0.199 -0.059 366s -0.416 -0.437 0.314 0.283 366s 0.409 0.806 -0.556 0.413 366s 366s y: [1, 3, 2, 1] 366s 0.386 366s 0.074 366s -0.326 366s testing: mul_mat, [3, 1, 2, 1] = [4, 3, 2, 1] * [4, 1, 2, 1] 366s x0: [4, 3, 2, 1] 366s 0.869 0.161 -0.389 -0.566 366s 0.059 0.378 -0.633 -0.563 366s 0.657 0.213 0.720 -0.993 366s 366s x1: [4, 1, 2, 1] 366s -0.516 0.072 -0.910 0.047 366s 366s y: [3, 1, 2, 1] 366s -0.109 0.546 -1.026 366s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 366s x0: [4, 1, 1, 1] 366s 0.945 0.195 -0.187 0.158 366s 366s x1: [4, 3, 1, 1] 366s 0.977 -0.182 -0.654 0.350 366s 0.274 -0.463 0.167 0.144 366s 0.699 0.778 0.577 -0.243 366s 366s y: [1, 3, 1, 1] 366s 1.065 366s 0.161 366s 0.665 366s testing: mul_mat, [1, 4, 2, 1] = [1, 1, 2, 1] * [1, 4, 2, 1] 366s x0: [1, 1, 2, 1] 366s -0.746 366s 366s x1: [1, 4, 2, 1] 366s -0.085 366s -0.180 366s 0.862 366s -0.035 366s 366s y: [1, 4, 2, 1] 366s 0.063 366s 0.135 366s -0.643 366s 0.026 366s testing: mul_mat, [4, 1, 2, 1] = [1, 4, 2, 1] * [1, 1, 2, 1] 366s x0: [1, 4, 2, 1] 366s -0.299 366s -0.157 366s 0.034 366s -0.948 366s 366s x1: [1, 1, 2, 1] 366s -0.276 366s 366s y: [4, 1, 2, 1] 366s 0.082 0.043 -0.009 0.262 366s test-mul-mat0: iter:285/500 366s testing: mul_mat, [3, 1, 1, 1] = [4, 3, 1, 1] * [4, 1, 1, 1] 366s x0: [4, 3, 1, 1] 366s -0.863 0.785 0.803 0.102 366s -0.130 -0.741 -0.931 -0.289 366s 0.096 -0.678 0.440 0.821 366s 366s x1: [4, 1, 1, 1] 366s -0.848 -0.216 0.223 -0.035 366s 366s y: [3, 1, 1, 1] 366s 0.737 0.073 0.134 366s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 366s x0: [4, 3, 1, 1] 366s -0.967 -0.059 -0.807 0.255 366s 0.906 -0.865 0.393 0.691 366s 0.938 -0.505 -0.439 -0.803 366s 366s x1: [4, 3, 1, 1] 366s -0.653 -0.871 -0.478 0.191 366s 0.163 -0.426 0.309 -0.265 366s 0.792 -0.430 -0.995 -0.211 366s 366s y: [3, 3, 1, 1] 366s 1.117 0.107 -0.115 366s -0.449 0.454 0.446 366s 0.008 0.553 1.567 366s testing: mul_mat, [3, 3, 1, 2] = [4, 3, 1, 2] * [4, 3, 1, 2] 366s x0: [4, 3, 1, 2] 366s 0.424 0.329 0.978 0.918 366s 0.890 -0.825 -0.518 0.161 366s 0.468 0.367 0.873 -0.418 366s 366s x1: [4, 3, 1, 2] 366s -0.437 0.272 0.293 -0.114 366s -0.288 0.114 0.233 -0.160 366s 0.636 -0.576 -0.997 -0.790 366s 366s y: [3, 3, 1, 2] 366s 0.086 -0.783 0.199 366s -0.004 -0.497 0.177 366s -1.621 1.431 -0.454 366s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 366s x0: [4, 4, 1, 1] 366s -0.174 -0.154 0.137 -0.425 366s -0.027 -0.081 0.818 0.729 366s -0.663 -0.206 -0.725 0.994 366s -0.806 0.559 0.564 0.634 366s 366s x1: [4, 3, 1, 1] 366s -0.228 -0.028 -0.047 -0.323 366s 0.003 -0.408 -0.283 -0.574 366s 0.921 -0.305 -0.655 0.811 366s 366s y: [4, 3, 1, 1] 366s 0.175 -0.265 -0.130 -0.063 366s 0.268 -0.617 -0.283 -0.754 366s -0.547 0.054 0.734 -0.769 366s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 366s x0: [4, 2, 1, 1] 366s 0.700 -0.689 -0.516 0.756 366s 0.690 -0.381 0.870 -0.697 366s 366s x1: [4, 4, 1, 1] 366s 0.925 0.215 -0.594 -0.103 366s -0.831 0.083 0.900 -0.239 366s 0.800 -0.674 -0.318 -0.506 366s -0.330 -0.507 0.364 0.497 366s 366s y: [2, 4, 1, 1] 366s 0.728 0.111 366s -1.284 0.344 366s 0.806 0.885 366s 0.306 -0.065 366s testing: mul_mat, [4, 4, 1, 2] = [2, 4, 1, 2] * [2, 4, 1, 2] 366s x0: [2, 4, 1, 2] 366s -0.706 0.088 366s 0.592 -0.900 366s -0.365 0.358 366s 0.657 -0.221 366s 366s x1: [2, 4, 1, 2] 366s -0.855 0.320 366s -0.123 0.874 366s 0.315 -0.489 366s 0.799 -0.470 366s 366s y: [4, 4, 1, 2] 366s 0.632 -0.794 0.427 -0.633 366s 0.164 -0.859 0.358 -0.274 366s -0.265 0.626 -0.290 0.315 366s -0.606 0.896 -0.460 0.629 366s test-mul-mat0: iter:286/500 366s testing: mul_mat, [1, 3, 1, 1] = [1, 1, 1, 1] * [1, 3, 1, 1] 366s x0: [1, 1, 1, 1] 366s 0.597 366s 366s x1: [1, 3, 1, 1] 366s -0.900 366s -0.102 366s 0.871 366s 366s y: [1, 3, 1, 1] 366s -0.537 366s -0.061 366s 0.520 366s testing: mul_mat, [2, 1, 4, 1] = [1, 2, 4, 1] * [1, 1, 4, 1] 366s x0: [1, 2, 4, 1] 366s 0.198 366s 0.964 366s 366s x1: [1, 1, 4, 1] 366s 0.871 366s 366s y: [2, 1, 4, 1] 366s 0.173 0.840 366s testing: mul_mat, [1, 2, 4, 2] = [1, 1, 4, 2] * [1, 2, 4, 2] 366s x0: [1, 1, 4, 2] 366s -0.394 366s 366s x1: [1, 2, 4, 2] 366s 0.773 366s -0.655 366s 366s y: [1, 2, 4, 2] 366s -0.305 366s 0.258 366s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 366s x0: [1, 1, 1, 1] 366s -0.002 366s 366s x1: [1, 1, 1, 1] 366s 0.864 366s 366s y: [1, 1, 1, 1] 366s -0.002 366s testing: mul_mat, [3, 1, 4, 1] = [1, 3, 4, 1] * [1, 1, 4, 1] 366s x0: [1, 3, 4, 1] 366s -0.066 366s 0.668 366s 0.599 366s 366s x1: [1, 1, 4, 1] 366s -0.141 366s 366s y: [3, 1, 4, 1] 366s 0.009 -0.094 -0.085 366s testing: mul_mat, [2, 1, 4, 2] = [3, 2, 4, 2] * [3, 1, 4, 2] 366s x0: [3, 2, 4, 2] 366s -0.386 -0.835 0.236 366s 0.923 -0.909 0.626 366s 366s x1: [3, 1, 4, 2] 366s -0.514 0.184 0.713 366s 366s y: [2, 1, 4, 2] 366s 0.213 -0.195 366s test-mul-mat0: iter:287/500 366s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 366s x0: [3, 4, 1, 1] 366s -0.891 -0.169 -0.562 366s -0.519 -0.169 0.077 366s 0.429 0.250 0.055 366s -0.953 0.015 0.728 366s 366s x1: [3, 3, 1, 1] 366s -0.298 0.744 -0.007 366s 0.887 -0.837 0.226 366s -0.423 0.829 -0.933 366s 366s y: [4, 3, 1, 1] 366s 0.144 0.028 0.057 0.290 366s -0.775 -0.301 0.184 -0.692 366s 0.760 0.007 -0.025 -0.264 366s testing: mul_mat, [3, 4, 4, 1] = [3, 3, 4, 1] * [3, 4, 4, 1] 366s x0: [3, 3, 4, 1] 366s -0.701 0.839 -0.309 366s -0.083 -0.585 -0.981 366s -0.388 -0.580 -0.403 366s 366s x1: [3, 4, 4, 1] 366s 0.088 -0.539 -0.589 366s -0.834 0.302 -0.717 366s 0.589 -0.750 0.381 366s -0.709 0.994 -0.626 366s 366s y: [3, 4, 4, 1] 366s -0.333 0.885 0.515 366s 1.060 0.595 0.437 366s -1.160 0.016 0.053 366s 1.525 0.092 -0.049 366s testing: mul_mat, [4, 3, 4, 3] = [3, 4, 4, 3] * [3, 3, 4, 3] 366s x0: [3, 4, 4, 3] 366s 0.807 0.259 0.804 366s -0.953 0.544 0.474 366s -0.139 0.825 0.521 366s -0.745 -0.445 -0.579 366s 366s x1: [3, 3, 4, 3] 366s -0.186 -0.684 -0.018 366s 0.411 0.678 0.455 366s 0.601 0.519 0.068 366s 366s y: [4, 3, 4, 3] 366s -0.342 -0.203 -0.548 0.454 366s 0.873 0.193 0.740 -0.872 366s 0.675 -0.258 0.380 -0.719 366s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 366s x0: [3, 2, 1, 1] 366s 0.562 -0.915 -0.587 366s -0.252 -0.915 0.084 366s 366s x1: [3, 4, 1, 1] 366s 0.326 0.604 -0.644 366s 0.554 -0.428 -0.728 366s -0.260 -0.288 0.390 366s -0.015 0.506 0.108 366s 366s y: [2, 4, 1, 1] 366s 0.009 -0.689 366s 1.130 0.191 366s -0.111 0.362 366s -0.535 -0.451 366s testing: mul_mat, [4, 3, 4, 1] = [2, 4, 4, 1] * [2, 3, 4, 1] 366s x0: [2, 4, 4, 1] 366s 0.723 0.017 366s -0.396 -0.340 366s 0.576 -0.795 366s 0.807 -0.736 366s 366s x1: [2, 3, 4, 1] 366s -0.602 0.246 366s 0.811 -0.635 366s -0.542 -0.496 366s 366s y: [4, 3, 4, 1] 366s -0.431 0.155 -0.543 -0.668 366s 0.575 -0.105 0.972 1.123 366s -0.400 0.383 0.082 -0.072 366s testing: mul_mat, [1, 2, 4, 3] = [4, 1, 4, 3] * [4, 2, 4, 3] 366s x0: [4, 1, 4, 3] 366s 0.983 -0.582 0.198 0.785 366s 366s x1: [4, 2, 4, 3] 366s 0.968 -0.341 -0.730 -0.015 366s 0.319 -0.525 0.249 0.791 366s 366s y: [1, 2, 4, 3] 366s 0.995 366s 1.289 366s test-mul-mat0: iter:288/500 366s testing: mul_mat, [3, 2, 1, 1] = [1, 3, 1, 1] * [1, 2, 1, 1] 366s x0: [1, 3, 1, 1] 366s 0.748 366s -0.609 366s -0.874 366s 366s x1: [1, 2, 1, 1] 366s -0.301 366s 0.461 366s 366s y: [3, 2, 1, 1] 366s -0.225 0.184 0.263 366s 0.345 -0.281 -0.403 366s testing: mul_mat, [1, 3, 1, 1] = [1, 1, 1, 1] * [1, 3, 1, 1] 366s x0: [1, 1, 1, 1] 366s 0.973 366s 366s x1: [1, 3, 1, 1] 366s -0.455 366s 0.190 366s 0.140 366s 366s y: [1, 3, 1, 1] 366s -0.443 366s 0.185 366s 0.137 366s testing: mul_mat, [1, 1, 1, 2] = [1, 1, 1, 2] * [1, 1, 1, 2] 366s x0: [1, 1, 1, 2] 366s 0.160 366s 366s x1: [1, 1, 1, 2] 366s -0.427 366s 366s y: [1, 1, 1, 2] 366s -0.069 366s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 366s x0: [1, 1, 1, 1] 366s 0.933 366s 366s x1: [1, 1, 1, 1] 366s 0.148 366s 366s y: [1, 1, 1, 1] 366s 0.138 366s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 366s x0: [1, 1, 1, 1] 366s 0.626 366s 366s x1: [1, 1, 1, 1] 366s -0.143 366s 366s y: [1, 1, 1, 1] 366s -0.090 366s testing: mul_mat, [4, 1, 1, 2] = [1, 4, 1, 2] * [1, 1, 1, 2] 366s x0: [1, 4, 1, 2] 366s 0.288 366s 0.045 366s -0.183 366s -0.424 366s 366s x1: [1, 1, 1, 2] 366s -0.793 366s 366s y: [4, 1, 1, 2] 366s -0.228 -0.036 0.145 0.336 366s test-mul-mat0: iter:289/500 366s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 366s x0: [4, 4, 1, 1] 366s -0.736 0.802 -0.213 0.425 366s -0.553 0.935 0.191 -0.620 366s -0.208 0.356 -0.993 -0.001 366s 0.478 -0.342 -0.713 -0.477 366s 366s x1: [4, 1, 1, 1] 366s 0.567 -0.727 0.692 0.428 366s 366s y: [4, 1, 1, 1] 366s -0.966 -1.127 -1.064 -0.178 366s testing: mul_mat, [3, 4, 3, 1] = [4, 3, 3, 1] * [4, 4, 3, 1] 366s x0: [4, 3, 3, 1] 366s 0.470 -0.747 -0.723 0.216 366s 0.038 0.838 -0.683 0.828 366s -0.601 -0.103 0.277 0.246 366s 366s x1: [4, 4, 3, 1] 366s 0.475 -0.137 0.753 -0.802 366s -0.862 -0.555 -0.377 0.160 366s 0.527 0.174 -0.272 0.800 366s -0.134 -0.844 -0.953 0.130 366s 366s y: [3, 4, 3, 1] 366s -0.392 -1.275 -0.260 366s 0.317 -0.108 0.511 366s 0.487 1.015 -0.214 366s 1.285 0.046 -0.064 366s testing: mul_mat, [3, 3, 3, 1] = [4, 3, 3, 1] * [4, 3, 3, 1] 366s x0: [4, 3, 3, 1] 366s -0.921 0.114 0.548 -0.414 366s -0.447 -0.239 0.900 -0.855 366s 0.043 0.028 -0.282 0.310 366s 366s x1: [4, 3, 3, 1] 366s 0.905 0.689 -0.587 -0.696 366s -0.414 0.690 0.550 -0.415 366s 0.891 -0.493 0.439 0.214 366s 366s y: [3, 3, 3, 1] 366s -0.788 -0.502 0.008 366s 0.933 0.870 -0.282 366s -0.725 -0.069 -0.033 366s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 366s x0: [4, 3, 1, 1] 366s 0.082 0.303 0.078 -0.182 366s -0.914 -0.979 -0.817 -0.127 366s -0.566 0.642 -0.207 -0.528 366s 366s x1: [4, 3, 1, 1] 366s 0.705 0.255 0.491 -0.252 366s -0.716 -0.791 -0.942 0.993 366s 0.108 0.406 0.828 -0.530 366s 366s y: [3, 3, 1, 1] 366s 0.219 -1.265 -0.204 366s -0.553 2.072 -0.431 366s 0.293 -1.106 0.308 366s testing: mul_mat, [2, 4, 3, 1] = [3, 2, 3, 1] * [3, 4, 3, 1] 366s x0: [3, 2, 3, 1] 366s -0.125 0.879 -0.514 366s 0.948 -0.513 -0.808 366s 366s x1: [3, 4, 3, 1] 366s -0.786 0.642 0.266 366s 0.158 -0.002 -0.388 366s -0.136 -0.746 -0.896 366s 0.612 -0.463 -0.687 366s 366s y: [2, 4, 3, 1] 366s 0.527 -1.290 366s 0.178 0.464 366s -0.179 0.978 366s -0.130 1.373 366s testing: mul_mat, [3, 3, 3, 1] = [2, 3, 3, 1] * [2, 3, 3, 1] 366s x0: [2, 3, 3, 1] 366s -0.600 0.632 366s -0.442 0.932 366s 0.358 -0.137 366s 366s x1: [2, 3, 3, 1] 366s 0.411 0.379 366s 0.662 -0.164 366s -0.983 -0.284 366s 366s y: [3, 3, 3, 1] 366s -0.007 0.171 0.096 366s -0.501 -0.445 0.260 366s 0.410 0.169 -0.313 366s test-mul-mat0: iter:290/500 366s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 366s x0: [1, 2, 1, 1] 366s -0.250 366s 0.209 366s 366s x1: [1, 1, 1, 1] 366s 0.452 366s 366s y: [2, 1, 1, 1] 366s -0.113 0.095 366s testing: mul_mat, [4, 2, 4, 1] = [1, 4, 4, 1] * [1, 2, 4, 1] 366s x0: [1, 4, 4, 1] 366s 0.811 366s -0.731 366s -0.095 366s -0.443 366s 366s x1: [1, 2, 4, 1] 366s 0.779 366s -0.697 366s 366s y: [4, 2, 4, 1] 366s 0.631 -0.569 -0.074 -0.345 366s -0.565 0.509 0.066 0.309 366s testing: mul_mat, [3, 4, 4, 3] = [1, 3, 4, 3] * [1, 4, 4, 3] 366s x0: [1, 3, 4, 3] 366s 0.872 366s 0.992 366s 0.447 366s 366s x1: [1, 4, 4, 3] 366s 0.748 366s 0.418 366s -0.733 366s -0.529 366s 366s y: [3, 4, 4, 3] 366s 0.652 0.742 0.334 366s 0.364 0.414 0.187 366s -0.639 -0.727 -0.327 366s -0.461 -0.524 -0.236 366s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 366s x0: [1, 3, 1, 1] 366s 0.720 366s -0.770 366s 0.495 366s 366s x1: [1, 3, 1, 1] 366s 0.827 366s -0.516 366s -0.498 366s 366s y: [3, 3, 1, 1] 366s 0.595 -0.637 0.409 366s -0.372 0.397 -0.255 366s -0.358 0.383 -0.246 366s testing: mul_mat, [4, 1, 4, 1] = [3, 4, 4, 1] * [3, 1, 4, 1] 366s x0: [3, 4, 4, 1] 366s 0.094 -0.634 0.305 366s 0.513 0.915 -0.737 366s 0.972 0.549 0.295 366s 0.386 0.210 0.131 366s 366s x1: [3, 1, 4, 1] 366s 0.237 0.014 -0.516 366s 366s y: [4, 1, 4, 1] 366s -0.144 0.515 0.086 0.027 366s testing: mul_mat, [3, 3, 4, 3] = [4, 3, 4, 3] * [4, 3, 4, 3] 366s x0: [4, 3, 4, 3] 366s -0.371 -0.522 -0.234 -0.063 366s 0.700 0.874 -0.654 -0.728 366s 0.569 0.441 -0.751 -0.549 366s 366s x1: [4, 3, 4, 3] 366s -0.070 0.853 0.335 -0.545 366s -0.142 0.703 0.531 0.250 366s 0.490 0.772 -0.931 -0.865 366s 366s y: [3, 3, 4, 3] 366s -0.464 0.874 0.385 366s -0.454 -0.015 -0.307 366s -0.313 2.256 1.793 366s test-mul-mat0: iter:291/500 366s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 366s x0: [1, 3, 1, 1] 366s -0.941 366s 0.722 366s 0.535 366s 366s x1: [1, 4, 1, 1] 366s -0.504 366s 0.982 366s -0.843 366s 0.058 366s 366s y: [3, 4, 1, 1] 366s 0.475 -0.364 -0.270 366s -0.924 0.709 0.525 366s 0.793 -0.608 -0.451 366s -0.055 0.042 0.031 366s testing: mul_mat, [2, 3, 1, 1] = [1, 2, 1, 1] * [1, 3, 1, 1] 366s x0: [1, 2, 1, 1] 366s 0.166 366s -0.573 366s 366s x1: [1, 3, 1, 1] 366s -0.136 366s 0.523 366s 0.169 366s 366s y: [2, 3, 1, 1] 366s -0.023 0.078 366s 0.087 -0.299 366s 0.028 -0.097 366s testing: mul_mat, [4, 2, 1, 3] = [1, 4, 1, 3] * [1, 2, 1, 3] 366s x0: [1, 4, 1, 3] 366s -0.115 366s 0.644 366s -0.660 366s -0.669 366s 366s x1: [1, 2, 1, 3] 366s 0.206 366s 0.275 366s 366s y: [4, 2, 1, 3] 366s -0.024 0.132 -0.136 -0.138 366s -0.032 0.177 -0.182 -0.184 366s testing: mul_mat, [4, 4, 1, 1] = [1, 4, 1, 1] * [1, 4, 1, 1] 366s x0: [1, 4, 1, 1] 366s 0.210 366s 0.638 366s -0.235 366s -0.621 366s 366s x1: [1, 4, 1, 1] 366s -0.284 366s 0.520 366s 0.675 366s -0.226 366s 366s y: [4, 4, 1, 1] 366s -0.060 -0.181 0.067 0.176 366s 0.109 0.332 -0.122 -0.323 366s 0.142 0.431 -0.159 -0.419 366s -0.047 -0.144 0.053 0.140 366s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 366s x0: [4, 4, 1, 1] 366s -0.898 0.163 0.264 0.948 366s 0.887 -0.067 0.982 -0.117 366s -0.952 0.241 0.073 0.438 366s 0.597 0.503 -0.120 -0.590 366s 366s x1: [4, 1, 1, 1] 366s 0.389 0.931 -0.194 -0.405 366s 366s y: [4, 1, 1, 1] 366s -0.632 0.140 -0.337 0.963 366s testing: mul_mat, [4, 4, 1, 3] = [4, 4, 1, 3] * [4, 4, 1, 3] 366s x0: [4, 4, 1, 3] 366s 0.613 -0.068 -0.308 -0.398 366s 0.828 -0.506 -0.421 0.901 366s 0.177 -0.035 -0.832 -0.883 366s 0.298 0.108 -0.672 -0.445 366s 366s x1: [4, 4, 1, 3] 366s -0.326 -0.846 0.930 -0.651 366s -0.072 0.172 0.559 -0.434 366s 0.936 0.938 0.955 0.867 366s -0.256 -0.450 0.073 -0.154 366s 366s y: [4, 4, 1, 3] 366s -0.169 -0.821 -0.227 -0.524 366s -0.055 -0.772 -0.101 -0.186 366s -0.129 0.679 -1.428 -0.647 366s -0.088 -0.154 0.046 -0.106 366s test-mul-mat0: iter:292/500 366s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 366s x0: [2, 4, 1, 1] 366s 0.064 -0.932 366s -0.653 -0.560 366s -0.140 -0.344 366s 0.644 -0.172 366s 366s x1: [2, 2, 1, 1] 366s -0.271 -0.876 366s 0.914 -0.602 366s 366s y: [4, 2, 1, 1] 366s 0.799 0.667 0.339 -0.024 366s 0.619 -0.260 0.079 0.692 366s testing: mul_mat, [4, 4, 3, 1] = [2, 4, 3, 1] * [2, 4, 3, 1] 366s x0: [2, 4, 3, 1] 366s -0.913 0.938 366s -0.912 -0.257 366s 0.582 -0.083 366s 0.006 0.362 366s 366s x1: [2, 4, 3, 1] 366s -0.737 0.780 366s 0.291 -0.165 366s 0.024 -0.554 366s 0.985 -0.564 366s 366s y: [4, 4, 3, 1] 366s 1.404 0.472 -0.494 0.278 366s -0.421 -0.223 0.183 -0.058 366s -0.541 0.121 0.060 -0.200 366s -1.428 -0.753 0.620 -0.198 366s testing: mul_mat, [4, 4, 3, 4] = [2, 4, 3, 4] * [2, 4, 3, 4] 366s x0: [2, 4, 3, 4] 366s -0.141 0.187 366s 0.441 0.614 366s 0.766 0.822 366s 0.569 -0.554 366s 366s x1: [2, 4, 3, 4] 366s -0.597 0.670 366s 0.031 0.563 366s 0.411 -0.472 366s -0.208 0.498 366s 366s y: [4, 4, 3, 4] 366s 0.210 0.148 0.093 -0.711 366s 0.101 0.359 0.486 -0.295 366s -0.147 -0.109 -0.073 0.495 366s 0.123 0.214 0.249 -0.395 366s testing: mul_mat, [3, 4, 1, 1] = [2, 3, 1, 1] * [2, 4, 1, 1] 366s x0: [2, 3, 1, 1] 366s 0.746 0.280 366s -0.656 0.051 366s -0.719 0.103 366s 366s x1: [2, 4, 1, 1] 366s 0.367 -0.899 366s -0.479 0.882 366s 0.712 -0.643 366s -0.293 0.749 366s 366s y: [3, 4, 1, 1] 366s 0.022 -0.286 -0.357 366s -0.111 0.359 0.436 366s 0.351 -0.500 -0.579 366s -0.009 0.230 0.288 366s testing: mul_mat, [4, 2, 3, 1] = [3, 4, 3, 1] * [3, 2, 3, 1] 366s x0: [3, 4, 3, 1] 366s 0.334 0.636 0.768 366s 0.887 0.487 0.066 366s 0.402 0.787 0.031 366s -0.959 0.980 -0.129 366s 366s x1: [3, 2, 3, 1] 366s -0.109 0.948 -0.293 366s -0.537 0.551 0.095 366s 366s y: [4, 2, 3, 1] 366s 0.341 0.345 0.693 1.072 366s 0.244 -0.202 0.221 1.044 366s testing: mul_mat, [4, 3, 3, 4] = [4, 4, 3, 4] * [4, 3, 3, 4] 366s x0: [4, 4, 3, 4] 366s -0.056 -0.377 -0.830 -0.293 366s 0.243 0.742 0.676 -0.702 366s 0.059 0.340 0.741 -0.381 366s 0.968 0.021 -0.040 -0.481 366s 366s x1: [4, 3, 3, 4] 366s -0.187 0.218 -0.789 -0.420 366s -0.716 0.242 0.452 -0.759 366s -0.779 0.030 -0.339 -0.249 366s 366s y: [4, 3, 3, 4] 366s 0.706 -0.123 -0.362 0.057 366s -0.204 0.844 0.664 -0.340 366s 0.387 -0.222 -0.192 -0.620 366s test-mul-mat0: iter:293/500 366s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s 1, 1] 366s x0: [4, 2, 1, 1] 366s 0.898 -0.354 0.184 -0.986 366s 0.842 0.098 0.575 -0.074 366s 366s x1: [4, 1, 1, 1] 366s 0.265 0.012 -0.370 0.050 366s 366s y: [2, 1, 1, 1] 366s 0.116 0.008 366s testing: mul_mat, [4, 2, 2, 1] = [4, 4, 2, 1] * [4, 2, 2, 1] 366s x0: [4, 4, 2, 1] 366s -0.453 0.088 0.243 0.177 366s -0.862 0.387 0.075 -0.216 366s -0.429 0.089 -0.375 0.668 366s -0.336 0.551 -0.331 -0.611 366s 366s x1: [4, 2, 2, 1] 366s 0.001 0.726 0.299 -0.213 366s 0.557 -0.703 0.690 -0.876 366s 366s y: [4, 2, 2, 1] 366s 0.098 0.349 -0.190 0.431 366s -0.301 -0.512 -1.145 -0.268 366s testing: mul_mat, [3, 4, 2, 4] = [4, 3, 2, 4] * [4, 4, 2, 4] 366s x0: [4, 3, 2, 4] 366s 0.097 0.836 0.193 -0.264 366s -0.794 -0.780 0.836 0.277 366s -0.919 -0.010 -0.580 0.903 366s 366s x1: [4, 4, 2, 4] 366s -0.152 -0.140 -0.509 -0.015 366s -0.753 0.566 0.769 -0.182 366s -0.345 -0.605 -0.514 0.319 366s 0.946 0.155 0.708 -0.204 366s 366s y: [3, 4, 2, 4] 366s -0.226 -0.200 0.423 366s 0.597 0.750 0.076 366s -0.723 0.405 0.909 366s 0.412 -0.337 -1.465 366s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 366s x0: [4, 3, 1, 1] 366s 0.956 0.437 -0.016 0.840 366s -0.835 0.913 0.858 -0.176 366s -0.757 -0.034 -0.614 0.764 366s 366s x1: [4, 3, 1, 1] 366s -0.522 -0.771 0.524 0.495 366s -0.895 -0.437 -0.551 -0.863 366s -0.254 0.785 0.273 -0.357 366s 366s y: [3, 3, 1, 1] 366s -0.429 0.094 0.478 366s -1.763 0.028 0.371 366s -0.205 1.226 -0.274 366s testing: mul_mat, [4, 4, 2, 1] = [3, 4, 2, 1] * [3, 4, 2, 1] 366s x0: [3, 4, 2, 1] 366s 0.430 -0.196 0.783 366s 0.981 -0.063 -0.753 366s 0.174 -0.445 0.337 366s 0.270 -0.972 -0.649 366s 366s x1: [3, 4, 2, 1] 366s 0.285 0.224 -0.155 366s 0.691 0.309 -0.431 366s -0.831 0.538 -0.908 366s 0.664 0.643 -0.344 366s 366s y: [4, 4, 2, 1] 366s -0.042 0.382 -0.102 -0.040 366s -0.101 0.983 -0.163 0.166 366s -1.173 -0.165 -0.690 -0.159 366s -0.109 0.871 -0.287 -0.223 366s testing: mul_mat, [2, 3, 2, 4] = [4, 2, 2, 4] * [4, 3, 2, 4] 366s x0: [4, 2, 2, 4] 366s 0.019 0.853 0.323 -0.422 366s 0.288 0.303 -0.621 -0.369 366s 366s x1: [4, 3, 2, 4] 366s 0.670 -0.638 -0.244 0.492 366s -0.192 -0.121 -0.720 -0.762 366s -0.141 0.454 0.508 0.664 366s 366s y: [2, 3, 2, 4] 366s -0.819 -0.030 366s -0.018 0.636 366s 0.268 -0.463 366s test-mul-mat0: iter:294/500 366s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 366s x0: [4, 4, 1, 1] 366s -0.196 -0.091 -0.228 -0.306 366s -0.317 0.719 -0.382 0.605 366s 0.181 -0.207 -0.460 -0.527 366s -0.762 -0.224 0.448 -0.744 366s 366s x1: [4, 3, 1, 1] 366s 0.168 -0.886 -0.616 0.225 366s -0.725 -0.524 0.882 0.610 366s 0.851 -0.077 -0.711 0.095 366s 366s y: [4, 3, 1, 1] 366s 0.119 -0.319 0.379 -0.373 366s -0.198 -0.115 -0.750 0.612 366s -0.026 0.003 0.448 -1.020 366s testing: mul_mat, [4, 4, 2, 1] = [4, 4, 2, 1] * [4, 4, 2, 1] 366s x0: [4, 4, 2, 1] 366s -0.249 -0.813 0.430 0.399 366s -0.789 -0.226 0.930 0.845 366s 0.970 0.384 0.768 -0.740 366s 0.865 -0.505 0.460 0.119 366s 366s x1: [4, 4, 2, 1] 366s 0.635 -0.515 0.424 0.749 366s -0.131 -0.351 -0.976 0.345 366s -0.469 0.634 0.196 0.454 366s 0.923 -0.710 -0.519 -0.273 366s 366s y: [4, 4, 2, 1] 366s 0.742 0.642 0.189 1.093 366s 0.037 -0.434 -1.266 -0.344 366s -0.133 0.792 -0.397 -0.582 366s 0.015 -1.282 0.426 0.887 366s testing: mul_mat, [3, 4, 2, 4] = [4, 3, 2, 4] * [4, 4, 2, 4] 366s x0: [4, 3, 2, 4] 366s 0.618 0.541 0.273 -0.822 366s -0.362 0.829 -0.409 0.872 366s -0.910 -0.494 0.761 0.788 366s 366s x1: [4, 4, 2, 4] 366s -0.718 -0.061 0.336 -0.507 366s 0.714 0.266 -0.662 0.684 366s -0.349 -0.895 0.944 -0.484 366s -0.400 0.404 0.635 -0.483 366s 366s y: [3, 4, 2, 4] 366s 0.032 -0.370 0.539 366s -0.158 0.829 -0.746 366s -0.044 -1.423 1.097 366s 0.542 -0.202 0.267 366s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 366s x0: [4, 3, 1, 1] 366s 0.836 -0.091 0.239 -0.548 366s -0.411 -0.755 0.826 0.632 366s -0.430 0.539 0.846 0.262 366s 366s x1: [4, 3, 1, 1] 366s -0.956 0.017 0.623 -0.170 366s -0.471 0.121 0.034 -0.268 366s -0.195 0.952 -0.076 -0.092 366s 366s y: [3, 3, 1, 1] 366s -0.558 0.787 0.903 366s -0.250 -0.039 0.227 366s -0.217 -0.760 0.508 366s testing: mul_mat, [3, 4, 2, 1] = [3, 3, 2, 1] * [3, 4, 2, 1] 366s x0: [3, 3, 2, 1] 366s -0.838 -0.386 0.686 366s -0.367 -0.736 -0.194 366s -0.053 -0.791 0.202 366s 366s x1: [3, 4, 2, 1] 366s 0.072 -0.458 -0.007 366s 0.244 -0.024 0.516 366s 0.288 0.993 0.139 366s -0.882 -0.478 -0.740 366s 366s y: [3, 4, 2, 1] 366s 0.112 0.312 0.357 366s 0.159 -0.172 0.110 366s -0.530 -0.863 -0.772 366s 0.417 0.819 0.276 366s testing: mul_mat, [1, 3, 2, 4] = [3, 1, 2, 4] * [3, 3, 2, 4] 366s x0: [3, 1, 2, 4] 366s 0.945 -0.811 0.013 366s 366s x1: [3, 3, 2, 4] 366s 0.4ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s 86 0.551 0.182 366s -0.410 0.728 0.155 366s 0.581 0.742 0.717 366s 366s y: [1, 3, 2, 4] 366s 0.015 366s -0.976 366s -0.043 366s test-mul-mat0: iter:295/500 366s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 366s x0: [1, 1, 1, 1] 366s 0.630 366s 366s x1: [1, 4, 1, 1] 366s -0.858 366s 0.718 366s -0.383 366s -0.913 366s 366s y: [1, 4, 1, 1] 366s -0.541 366s 0.452 366s -0.241 366s -0.575 366s testing: mul_mat, [2, 1, 3, 1] = [1, 2, 3, 1] * [1, 1, 3, 1] 366s x0: [1, 2, 3, 1] 366s -0.793 366s -0.119 366s 366s x1: [1, 1, 3, 1] 366s 0.059 366s 366s y: [2, 1, 3, 1] 366s -0.046 -0.007 366s testing: mul_mat, [4, 2, 3, 3] = [1, 4, 3, 3] * [1, 2, 3, 3] 366s x0: [1, 4, 3, 3] 366s -0.767 366s -0.590 366s 0.132 366s 0.170 366s 366s x1: [1, 2, 3, 3] 366s -0.406 366s 0.203 366s 366s y: [4, 2, 3, 3] 366s 0.312 0.240 -0.054 -0.069 366s -0.156 -0.120 0.027 0.034 366s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 366s x0: [1, 3, 1, 1] 366s 0.642 366s 0.099 366s -0.825 366s 366s x1: [1, 4, 1, 1] 366s 0.364 366s -0.813 366s 0.375 366s 0.552 366s 366s y: [3, 4, 1, 1] 366s 0.234 0.036 -0.301 366s -0.522 -0.080 0.670 366s 0.241 0.037 -0.309 366s 0.354 0.055 -0.455 366s testing: mul_mat, [1, 1, 3, 1] = [3, 1, 3, 1] * [3, 1, 3, 1] 366s x0: [3, 1, 3, 1] 366s 0.820 -0.119 0.386 366s 366s x1: [3, 1, 3, 1] 366s 0.111 0.260 -0.330 366s 366s y: [1, 1, 3, 1] 366s -0.067 366s testing: mul_mat, [4, 3, 3, 3] = [1, 4, 3, 3] * [1, 3, 3, 3] 366s x0: [1, 4, 3, 3] 366s 0.413 366s -0.556 366s -0.748 366s -0.398 366s 366s x1: [1, 3, 3, 3] 366s 0.189 366s 0.801 366s 0.245 366s 366s y: [4, 3, 3, 3] 366s 0.078 -0.105 -0.141 -0.075 366s 0.331 -0.445 -0.599 -0.319 366s 0.101 -0.136 -0.183 -0.097 366s test-mul-mat0: iter:296/500 366s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 366s x0: [2, 3, 1, 1] 366s 0.483 -0.463 366s -0.683 0.380 366s 0.552 -0.737 366s 366s x1: [2, 2, 1, 1] 366s -0.758 0.347 366s 0.748 0.027 366s 366s y: [3, 2, 1, 1] 366s -0.527 0.649 -0.674 366s 0.349 -0.500 0.393 366s testing: mul_mat, [4, 3, 4, 1] = [2, 4, 4, 1] * [2, 3, 4, 1] 366s x0: [2, 4, 4, 1] 366s -0.890 0.171 366s -0.830 0.490 366s -0.277 -0.566 366s -0.466 -0.376 366s 366s x1: [2, 3, 4, 1] 366s 0.044 0.901 366s 0.982 -0.205 366s 0.886 0.293 366s 366s y: [4, 3, 4, 1] 366s 0.115 0.405 -0.522 -0.359 366s -0.909 -0.915 -0.156 -0.381 366s -0.739 -0.591 -0.412 -0.524 366s testing: mul_mat, [2, 4, 4, 4] = [2, 2, 4, 4] * [2, 4, 4, 4] 366s x0: [2, 2, 4, 4] 366s -0.386 0.511 366s 0.874 0.160 366s 366s x1: [2, 4, 4, 4] 366s -0.651 0.456 366s -0.594 0.071 366s 0.890 -0.060 366s 0.695 -0.694 366s 366s y: [2, 4, 4, 4] 366s 0.485 -0.496 366s 0.266 -0.507 366s -0.374 0.768 366s -0.623 0.496 366s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 366s x0: [2, 2, 1, 1] 366s 0.746 0.722 366s -0.987 0.110 366s 366s x1: [2, 2, 1, 1] 366s -0.220 -0.358 366s 0.608 -0.111 366s 366s y: [2, 2, 1, 1] 366s -0.422 0.177 366s 0.374 -0.612 366s testing: mul_mat, [1, 2, 4, 1] = [2, 1, 4, 1] * [2, 2, 4, 1] 366s x0: [2, 1, 4, 1] 366s -0.361 -0.905 366s 366s x1: [2, 2, 4, 1] 366s -0.089 0.593 366s -0.028 -0.540 366s 366s y: [1, 2, 4, 1] 366s -0.504 366s 0.499 366s testing: mul_mat, [4, 2, 4, 4] = [1, 4, 4, 4] * [1, 2, 4, 4] 366s x0: [1, 4, 4, 4] 366s 0.797 366s -0.400 366s -0.502 366s 0.828 366s 366s x1: [1, 2, 4, 4] 366s 0.686 366s 0.453 366s 366s y: [4, 2, 4, 4] 366s 0.547 -0.275 -0.345 0.568 366s 0.361 -0.181 -0.228 0.375 366s test-mul-mat0: iter:297/500 366s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 366s x0: [1, 4, 1, 1] 366s -0.086 366s 0.546 366s 0.747 366s -0.375 366s 366s x1: [1, 1, 1, 1] 366s -0.532 366s 366s y: [4, 1, 1, 1] 366s 0.046 -0.290 -0.397 0.199 366s testing: mul_mat, [3, 4, 3, 1] = [1, 3, 3, 1] * [1, 4, 3, 1] 366s x0: [1, 3, 3, 1] 366s -0.522 366s 0.669 366s 0.102 366s 366s x1: [1, 4, 3, 1] 366s 0.885 366s 0.618 366s -0.226 366s -0.327 366s 366s y: [3, 4, 3, 1] 366s -0.462 0.592 0.090 366s -0.323 0.413 0.063 366s 0.118 -0.151 -0.023 366s 0.171 -0.219 -0.033 366s testing: mul_mat, [1, 3, 3, 2] = [1, 1, 3, 2] * [1, 3, 3, 2] 366s x0: [1, 1, 3, 2] 366s 0.416 366s 366s x1: [1, 3, 3, 2] 366s -0.257 366s 0.462 366s 0.854 366s 366s y: [1, 3, 3, 2] 366s -0.107 366s 0.192 366s 0.355 366s testing: mul_mat, [3, 1, 1, 1] = [1, 3, 1, 1] * [1, 1, 1, 1] 366s x0: [1, 3, 1, 1] 366s -0.327 366s -0.217 366s 0.464 366s 366s x1: [1, 1, 1, 1] 366s -0.401 366s 366s y: [3, 1, 1, 1] 366s 0.131 0.087 -0.186 366s testing: mul_mat, [2, 1, 3, 1] = [3, 2, 3, 1] * [3, 1, 3, 1] 366s x0: [3, 2, 3, 1] 366s -0.863 0.005 -0.831 366s 0.968 0.452 -0.963 366s 366s x1: [3, 1, 3, 1] 366s -0.176 0.526 -0.074 366s 366s y: [2, 1, 3, 1] 366s 0.217 0.139 366s testing: mul_mat, [1, 3, 3, 2] = [2, 1, 3, 2] * [2, 3, 3, 2] 366s x0: [2, 1, 3, 2] 366s -0.280 0.621 366s 366s x1: [2, 3, 3, 2] 366s -0.432 -0.428 366s 0.370 0.392 366s -0.902 -0.704 366s 366s y: [1, 3, 3, 2] 366s -0.145 366s 0.140 366s -0.185 366s test-mul-mat0: iter:298/500 366s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 366s x0: [3, 4, 1, 1] 366s -0.031 -0.294 -0.975 366s 0.085 0.260 0.283 366s 0.773 0.183 0.933 366s -0.441 0.403 -0.347 366s 366s x1: [3, 1, 1, 1] 366s -0.460 -0.358 -0.941 366s 366s y: [4, 1, 1, 1] 366s 1.037 -0.399 -1.300 0.385 366s testing: mul_mat, [3, 4, 4, 1] = [3, 3, 4, 1] * [3, 4, 4, 1] 366s x0: [3, 3, 4, 1] 366s 0.948 -0.744 -0.119 366s -0.984 0.215 -0.233 366s 0.367 0.948 0.836 366s 366s x1: [3, 4, 4, 1] 366s -0.820 0.487 -0.113 366s -0.870 0.395 0.474 366s -0.454 0.097 0.419 366s -0.121 -0.240 -0.242 366s 366s y: [3, 4, 4, 1] 366s -1.126 0.938 0.066 366s -1.175 0.830 0.452 366s -0.552 0.370 0.275 366s 0.093 0.123 -0.474 366s testing: mul_mat, [1, 3, 4, 1] = [3, 1, 4, 1] * [3, 3, 4, 1] 366s x0: [3, 1, 4, 1] 366s -0.842 -0.217 -0.738 366s 366s x1: [3, 3, 4, 1] 366s -0.855 0.920 -0.446 366s 0.981 -0.428 0.793 366s -0.712 0.161 -0.495 366s 366s y: [1, 3, 4, 1] 366s 0.849 366s -1.318 366s 0.929 366s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 366s x0: [3, 2, 1, 1] 366s -0.143 -0.656 0.194 366s -0.383 0.222 0.149 366s 366s x1: [3, 1, 1, 1] 366s -0.347 -0.011 -0.423 366s 366s y: [2, 1, 1, 1] 366s -0.025 0.068 366s testing: mul_mat, [2, 3, 4, 1] = [2, 2, 4, 1] * [2, 3, 4, 1] 366s x0: [2, 2, 4, 1] 366s 0.990 0.542 366s -0.701 0.212 366s 366s x1: [2, 3, 4, 1] 366s -0.034 -0.677 366s -0.441 -0.145 366s 0.373 -0.477 366s 366s y: [2, 3, 4, 1] 366s -0.400 -0.120 366s -0.515 0.279 366s 0.111 -0.363 366s testing: mul_mat, [3, 2, 4, 1] = [2, 3, 4, 1] * [2, 2, 4, 1] 366s x0: [2, 3, 4, 1] 366s 0.339 -0.970 366s -0.643 -0.464 366s -0.223 -0.407 366s 366s x1: [2, 2, 4, 1] 366s 0.962 -0.405 366s -0.761 0.097 366s 366s y: [3, 2, 4, 1] 366s 0.720 -0.430 -0.050 366s -0.353 0.444 0.130 366s test-mul-mat0: iter:299/500 366s testing: mul_mat, [4, 4, 1, 1] = [4, 4, 1, 1] * [4, 4, 1, 1] 366s x0: [4, 4, 1, 1] 366s -0.270 -0.837 0.196 0.756 366s -0.923 -0.872 -0.340 -0.812 366s 0.394 0.585 0.602 0.261 366s 0.666 -0.998 -0.075 -0.108 366s 366s x1: [4, 4, 1, 1] 366s 0.665 0.226 0.684 0.004 366s 0.584 -0.539 0.034 -0.881 366s 0.053 -0.688 0.688 -0.165 366s 0.363 0.016 -0.198 -0.535 366s 366s y: [4, 4, 1, 1] 366s -0.231 -1.047 0.807 0.166 366s -0.365 0.635 -0.295 1.019 366s 0.572 0.450 -0.010 0.688 366s -0.554 0.153 -0.107 0.298 366s testing: mul_mat, [4, 4, 3, 1] = [4, 4, 3, 1] * [4, 4, 3, 1] 366s x0: [4, 4, 3, 1] 366s 0.935 0.707 -0.842 0.919 366s 0.238 0.161 0.489 -0.629 366s 0.425 -0.275 0.506 -0.386 366s -0.147 -0.672 0.733 0.458 366s 366s x1: [4, 4, 3, 1] 366s 0.686 0.930 -0.524 -0.853 366s -0.037 -0.405 0.200 0.275 366s -0.717 -0.964 -0.362 0.299 366s -0.162 0.103 -0.714 0.568 366s 366s y: [4, 4, 3, 1] 366s 0.956 0.594 0.100 -1.500 366s -0.236 -0.149 0.091 0.550 366s -0.773 -0.691 -0.338 0.625 366s 1.045 -0.729 -0.678 -0.309 366s testing: mul_mat, [1, 4, 3, 4] = [4, 1, 3, 4] * [4, 4, 3, 4] 366s x0: [4, 1, 3, 4] 366s -0.789 0.030 0.427 -0.440 366s 366s x1: [4, 4, 3, 4] 366s 0.444 -0.642 -0.458 -0.384 366s -0.503 -0.789 0.783 0.418 366s -0.078 -0.951 0.407 -0.757 366s -0.321 0.945 0.500 0.018 366s 366s y: [1, 4, 3, 4] 366s -0.396 366s 0.523 366s 0.539 366s 0.487 366s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 366s x0: [4, 4, 1, 1] 366s -0.557 0.240 0.862 -0.476 366s 0.408 -0.346 -0.307 -0.924 366s 0.536 0.421 -0.279 -0.199 366s -0.639 -0.535 -0.502 0.148 366s 366s x1: [4, 1, 1, 1] 366s -0.399 -0.756 -0.900 0.449 366s 366s y: [4, 1, 1, 1] 366s -0.949 -0.039 -0.370 1.178 366s testing: mul_mat, [4, 4, 3, 1] = [4, 4, 3, 1] * [4, 4, 3, 1] 366s x0: [4, 4, 3, 1] 366s -0.117 -0.310 0.359 -0.101 366s 0.294 -0.765 -0.901 0.777 366s 0.609 -0.553 -0.002 -0.926 366s 0.318 -0.194 0.283 -0.250 366s 366s x1: [4, 4, 3, 1] 366s -0.414 -0.133 0.050 -0.976 366s -0.467 -0.459 -0.909 -0.768 366s 0.763 -0.889 -0.166 -0.992 366s -0.789 -0.717 -0.426 -0.347 366s 366s y: [4, 4, 3, 1] 366s 0.206 -0.822 0.725 0.152 366s -0.052 0.436 0.682 -0.125 366s 0.227 0.284 1.876 0.617 366s 0.197 0.432 0.238 -0.146 366s testing: mul_mat, [1, 4, 3, 4] = [4, 1, 3, 4] * [4, 4, 3, 4] 366s x0: [4, 1, 3, 4] 366s 0.751 -0.612 -0.239 -0.475 366s 366s x1: [4, 4, 3, 4] 366s 0.001 0.100 0.999 0.956 366s -0.602 -0.523 0.302 0.857 366s -0.830 0.321 -0.946 0.732 366s 0.703 0.096 -0.823 0.379 366s 366s y: [1, 4, 3, 4] 366s -0.754 366s -0.612 366s -0.941 366s 0.486 366s test-mul-mat0: iter:300/500 366s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 366s x0: [2, 3, 1, 1] 366s 0.468 0.785 366s -0.862 0.166 366s 0.120 -0.483 366s 366s x1: [2, 1, 1, 1] 366s -0.306 -0.046 366s 366s y: [3, 1, 1, 1] 366s -0.179 0.256 -0.015 366s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 366s x0: [2, 1, 1, 1] 366s 0.315 0.618 366s 366s x1: [2, 3, 1, 1] 366s -0.774 -0.794 366s 0.284 -0.953 366s 0.816 -0.512 366s 366s y: [1, 3, 1, 1] 366s -0.735 366s -0.500 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s -0.059 366s testing: mul_mat, [1, 1, 1, 4] = [2, 1, 1, 4] * [2, 1, 1, 4] 366s x0: [2, 1, 1, 4] 366s 0.099 0.481 366s 366s x1: [2, 1, 1, 4] 366s -0.722 0.530 366s 366s y: [1, 1, 1, 4] 366s 0.184 366s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 366s x0: [2, 3, 1, 1] 366s -0.549 -0.343 366s 0.093 -0.622 366s 0.036 0.083 366s 366s x1: [2, 1, 1, 1] 366s 0.331 -0.424 366s 366s y: [3, 1, 1, 1] 366s -0.036 0.295 -0.023 366s testing: mul_mat, [3, 2, 1, 1] = [3, 3, 1, 1] * [3, 2, 1, 1] 366s x0: [3, 3, 1, 1] 366s -0.186 0.103 0.911 366s -0.199 0.359 -0.542 366s 0.265 -0.133 -0.651 366s 366s x1: [3, 2, 1, 1] 366s -0.527 -0.134 -0.539 366s 0.789 -0.516 -0.261 366s 366s y: [3, 2, 1, 1] 366s -0.407 0.348 0.229 366s -0.437 -0.201 0.447 366s testing: mul_mat, [2, 3, 1, 4] = [3, 2, 1, 4] * [3, 3, 1, 4] 366s x0: [3, 2, 1, 4] 366s -0.249 -0.783 0.116 366s 0.848 -0.419 -0.771 366s 366s x1: [3, 3, 1, 4] 366s -0.498 0.160 -0.476 366s -0.952 0.598 -0.833 366s 0.379 -0.825 -0.023 366s 366s y: [2, 3, 1, 4] 366s -0.057 -0.122 366s -0.328 -0.416 366s 0.549 0.685 366s test-mul-mat0: iter:301/500 366s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 366s x0: [4, 2, 1, 1] 366s -0.608 0.891 0.712 -0.340 366s 0.068 -0.444 -0.263 0.342 366s 366s x1: [4, 3, 1, 1] 366s -0.491 0.393 0.238 0.260 366s 0.241 0.455 0.841 -0.643 366s 0.685 -0.091 0.119 0.009 366s 366s y: [2, 3, 1, 1] 366s 0.730 -0.181 366s 1.076 -0.627 366s -0.416 0.059 366s testing: mul_mat, [3, 2, 2, 1] = [4, 3, 2, 1] * [4, 2, 2, 1] 366s x0: [4, 3, 2, 1] 366s -0.570 -0.590 0.193 0.549 366s 0.420 -0.871 0.941 0.310 366s 0.841 -0.399 -0.622 -0.604 366s 366s x1: [4, 2, 2, 1] 366s -0.268 0.726 -0.927 0.109 366s 0.537 0.719 -0.966 -0.327 366s 366s y: [3, 2, 2, 1] 366s -0.394 -1.583 -0.004 366s -1.095 -1.412 0.962 366s testing: mul_mat, [2, 3, 2, 1] = [4, 2, 2, 1] * [4, 3, 2, 1] 366s x0: [4, 2, 2, 1] 366s 0.382 -0.155 -0.621 -0.343 366s 0.797 -0.318 0.113 0.581 366s 366s x1: [4, 3, 2, 1] 366s -0.544 0.784 0.574 -0.101 366s 0.857 -0.701 0.183 -0.712 366s -0.291 -0.624 0.837 -0.871 366s 366s y: [2, 3, 2, 1] 366s -0.651 -0.676 366s 0.567 0.513 366s -0.235 -0.444 366s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 366s x0: [4, 4, 1, 1] 366s 0.436 0.521 0.517 0.370 366s -0.811 0.139 -0.744 0.744 366s 0.031 0.600 -0.837 -0.545 366s -0.782 -0.634 -0.801 0.382 366s 366s x1: [4, 2, 1, 1] 366s 0.562 0.393 -0.925 0.340 366s -0.167 -0.580 0.719 -0.350 366s 366s y: [4, 2, 1, 1] 366s 0.098 0.539 0.842 0.181 366s -0.132 -0.739 -0.765 -0.211 366s testing: mul_mat, [3, 4, 2, 1] = [4, 3, 2, 1] * [4, 4, 2, 1] 366s x0: [4, 3, 2, 1] 366s -0.633 -0.485 -0.261 0.382 366s 0.223 0.692 0.502 0.031 366s 0.431 0.099 0.600 -0.249 366s 366s x1: [4, 4, 2, 1] 366s 0.182 0.010 -0.423 -0.093 366s -0.725 -0.852 -0.531 0.668 366s -0.777 0.810 -0.499 -0.357 366s 0.529 0.151 0.805 -0.035 366s 366s y: [3, 4, 2, 1] 366s -0.045 -0.168 -0.151 366s 1.266 -0.997 -0.882 366s 0.093 0.126 -0.465 366s -0.631 0.625 0.734 366s testing: mul_mat, [4, 4, 2, 1] = [3, 4, 2, 1] * [3, 4, 2, 1] 366s x0: [3, 4, 2, 1] 366s 0.410 -0.396 -0.146 366s 0.603 -0.848 0.406 366s -0.417 -0.921 0.215 366s -0.101 -0.986 0.872 366s 366s x1: [3, 4, 2, 1] 366s 0.180 0.410 -0.298 366s 0.327 0.200 0.018 366s -0.517 0.568 -0.759 366s 0.914 -0.917 0.933 366s 366s y: [4, 4, 2, 1] 366s -0.045 -0.360 -0.517 -0.682 366s 0.052 0.034 -0.317 -0.215 366s -0.326 -1.102 -0.470 -1.170 366s 0.601 1.708 0.663 1.625 366s test-mul-mat0: iter:302/500 366s testing: mul_mat, [1, 4, 1, 1] = [3, 1, 1, 1] * [3, 4, 1, 1] 366s x0: [3, 1, 1, 1] 366s 0.485 -0.889 -0.624 366s 366s x1: [3, 4, 1, 1] 366s -0.307 -0.950 -0.350 366s -0.897 0.653 0.233 366s 0.002 -0.743 0.385 366s 0.081 -0.729 -0.761 366s 366s y: [1, 4, 1, 1] 366s 0.914 366s -1.161 366s 0.422 366s 1.163 366s testing: mul_mat, [1, 1, 2, 1] = [3, 1, 2, 1] * [3, 1, 2, 1] 366s x0: [3, 1, 2, 1] 366s 0.750 -0.575 -0.289 366s 366s x1: [3, 1, 2, 1] 366s -0.732 -0.776 -0.820 366s 366s y: [1, 1, 2, 1] 366s 0.134 366s testing: mul_mat, [3, 1, 2, 2] = [3, 3, 2, 2] * [3, 1, 2, 2] 366s x0: [3, 3, 2, 2] 366s -0.030 0.359 0.821 366s 0.081 0.735 -0.911 366s 0.304 0.915 0.222 366s 366s x1: [3, 1, 2, 2] 366s -0.894 0.463 -0.314 366s 366s y: [3, 1, 2, 2] 366s -0.065 0.555 0.082 366s testing: mul_mat, [1, 3, 1, 1] = [3, 1, 1, 1] * [3, 3, 1, 1] 366s x0: [3, 1, 1, 1] 366s 0.479 -0.308 -0.836 366s 366s x1: [3, 3, 1, 1] 366s 0.399 0.562 0.891 366s -0.379 0.135 -0.198 366s -0.336 -0.541 -0.862 366s 366s y: [1, 3, 1, 1] 366s -0.726 366s -0.058 366s 0.726 366s testing: mul_mat, [4, 3, 2, 1] = [1, 4, 2, 1] * [1, 3, 2, 1] 366s x0: [1, 4, 2, 1] 366s -0.686 366s 0.828 366s -0.870 366s -0.479 366s 366s x1: [1, 3, 2, 1] 366s -0.395 366s -0.292 366s 0.597 366s 366s y: [4, 3, 2, 1] 366s 0.271 -0.327 0.343 0.189 366s 0.200 -0.242 0.254 0ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s .140 366s -0.409 0.494 -0.519 -0.285 366s testing: mul_mat, [4, 1, 2, 2] = [4, 4, 2, 2] * [4, 1, 2, 2] 366s x0: [4, 4, 2, 2] 366s 0.159 -0.390 -0.810 -0.468 366s -0.207 -0.940 -0.167 -0.898 366s 0.084 -0.689 0.149 -0.261 366s 0.997 -0.562 0.809 -0.512 366s 366s x1: [4, 1, 2, 2] 366s -0.070 0.957 -0.071 -0.671 366s 366s y: [4, 1, 2, 2] 366s -0.012 -0.269 -0.500 -0.322 366s test-mul-mat0: iter:303/500 366s testing: mul_mat, [1, 3, 1, 1] = [1, 1, 1, 1] * [1, 3, 1, 1] 366s x0: [1, 1, 1, 1] 366s 0.159 366s 366s x1: [1, 3, 1, 1] 366s 0.049 366s -0.692 366s 0.630 366s 366s y: [1, 3, 1, 1] 366s 0.008 366s -0.110 366s 0.100 366s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 366s x0: [1, 4, 1, 1] 366s -0.888 366s 0.785 366s -0.451 366s -0.905 366s 366s x1: [1, 1, 1, 1] 366s -0.662 366s 366s y: [4, 1, 1, 1] 366s 0.588 -0.520 0.298 0.599 366s testing: mul_mat, [4, 4, 1, 1] = [1, 4, 1, 1] * [1, 4, 1, 1] 366s x0: [1, 4, 1, 1] 366s 0.232 366s -0.001 366s -0.327 366s 0.767 366s 366s x1: [1, 4, 1, 1] 366s -0.699 366s 0.559 366s 0.928 366s -0.166 366s 366s y: [4, 4, 1, 1] 366s -0.162 0.001 0.229 -0.536 366s 0.129 -0.001 -0.183 0.428 366s 0.215 -0.001 -0.303 0.711 366s -0.038 0.000 0.054 -0.127 366s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 366s x0: [1, 1, 1, 1] 366s 0.044 366s 366s x1: [1, 4, 1, 1] 366s 0.828 366s -0.517 366s 0.985 366s -0.482 366s 366s y: [1, 4, 1, 1] 366s 0.037 366s -0.023 366s 0.043 366s -0.021 366s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 366s x0: [1, 1, 1, 1] 366s -0.057 366s 366s x1: [1, 1, 1, 1] 366s 0.861 366s 366s y: [1, 1, 1, 1] 366s -0.049 366s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 366s x0: [1, 1, 1, 1] 366s 0.991 366s 366s x1: [1, 1, 1, 1] 366s -0.532 366s 366s y: [1, 1, 1, 1] 366s -0.528 366s test-mul-mat0: iter:304/500 366s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 366s x0: [3, 4, 1, 1] 366s -0.323 -0.930 0.498 366s 0.906 0.302 -0.503 366s -0.421 0.068 -0.674 366s 0.062 0.053 -0.156 366s 366s x1: [3, 2, 1, 1] 366s -0.422 -0.430 -0.952 366s -0.636 0.119 -0.858 366s 366s y: [4, 2, 1, 1] 366s 0.062 -0.033 0.790 0.100 366s -0.332 -0.110 0.854 0.101 366s testing: mul_mat, [1, 4, 1, 1] = [3, 1, 1, 1] * [3, 4, 1, 1] 366s x0: [3, 1, 1, 1] 366s -0.376 0.932 -0.409 366s 366s x1: [3, 4, 1, 1] 366s -0.081 -0.903 -0.295 366s -0.202 0.040 0.173 366s 0.408 0.031 0.949 366s 0.649 -0.946 0.884 366s 366s y: [1, 4, 1, 1] 366s -0.690 366s 0.042 366s -0.513 366s -1.487 366s testing: mul_mat, [2, 1, 1, 2] = [3, 2, 1, 2] * [3, 1, 1, 2] 366s x0: [3, 2, 1, 2] 366s 0.446 0.251 -0.095 366s -0.485 0.576 0.967 366s 366s x1: [3, 1, 1, 2] 366s 0.742 -0.926 0.255 366s 366s y: [2, 1, 1, 2] 366s 0.075 -0.647 366s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 366s x0: [3, 4, 1, 1] 366s -0.869 0.559 -0.425 366s -0.032 -0.844 -0.718 366s -0.587 -0.863 0.568 366s -0.937 -0.186 -0.098 366s 366s x1: [3, 2, 1, 1] 366s -0.296 -0.702 0.092 366s 0.735 -0.754 -0.259 366s 366s y: [4, 2, 1, 1] 366s -0.175 0.536 0.832 0.398 366s -0.950 0.799 0.071 -0.524 366s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 366s x0: [4, 2, 1, 1] 366s -0.763 0.914 -0.822 -0.386 366s 0.437 -0.517 0.704 0.146 366s 366s x1: [4, 3, 1, 1] 366s 0.728 -0.182 0.807 -0.758 366s -0.606 0.774 0.810 -0.186 366s -0.340 0.475 0.939 -0.656 366s 366s y: [2, 3, 1, 1] 366s -1.093 0.869 366s 0.576 -0.122 366s 0.174 0.172 366s testing: mul_mat, [4, 4, 1, 2] = [2, 4, 1, 2] * [2, 4, 1, 2] 366s x0: [2, 4, 1, 2] 366s 0.114 -0.664 366s 0.991 -0.470 366s -0.649 0.290 366s 0.053 0.772 366s 366s x1: [2, 4, 1, 2] 366s -0.883 -0.323 366s -0.295 -0.728 366s -0.186 0.519 366s -0.152 0.096 366s 366s y: [4, 4, 1, 2] 366s 0.114 -0.723 0.480 -0.296 366s 0.450 0.050 -0.020 -0.578 366s -0.366 -0.428 0.271 0.391 366s -0.081 -0.196 0.127 0.066 366s test-mul-mat0: iter:305/500 366s testing: mul_mat, [3, 4, 1, 1] = [2, 3, 1, 1] * [2, 4, 1, 1] 366s x0: [2, 3, 1, 1] 366s -0.979 0.742 366s 0.727 -0.988 366s -0.908 -0.220 366s 366s x1: [2, 4, 1, 1] 366s 0.384 -0.024 366s -0.592 0.134 366s 0.800 -0.687 366s -0.310 -0.134 366s 366s y: [3, 4, 1, 1] 366s -0.395 0.304 -0.343 366s 0.679 -0.562 0.508 366s -1.292 1.260 -0.574 366s 0.204 -0.093 0.311 366s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 366s x0: [2, 2, 1, 1] 366s 0.213 -0.769 366s 0.166 0.478 366s 366s x1: [2, 3, 1, 1] 366s -0.653 -0.378 366s -0.930 -0.880 366s 0.066 0.844 366s 366s y: [2, 3, 1, 1] 366s 0.151 -0.289 366s 0.478 -0.575 366s -0.634 0.414 366s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 366s x0: [2, 2, 1, 1] 366s 0.399 -0.849 366s -0.303 -0.468 366s 366s x1: [2, 2, 1, 1] 366s 0.296 -0.032 366s -0.986 0.176 366s 366s y: [2, 2, 1, 1] 366s 0.145 -0.075 366s -0.542 0.217 366s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 366s x0: [2, 2, 1, 1] 366s 0.243 0.350 366s -0.442 0.255 366s 366s x1: [2, 2, 1, 1] 366s 0.951 0.01ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s 0 366s 0.222 -0.183 366s 366s y: [2, 2, 1, 1] 366s 0.235 -0.417 366s -0.010 -0.145 366s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 366s x0: [2, 2, 1, 1] 366s 0.722 -0.957 366s -0.661 0.792 366s 366s x1: [2, 2, 1, 1] 366s -0.349 -0.871 366s 0.602 0.273 366s 366s y: [2, 2, 1, 1] 366s 0.581 -0.458 366s 0.174 -0.182 366s testing: mul_mat, [1, 2, 1, 1] = [2, 1, 1, 1] * [2, 2, 1, 1] 366s x0: [2, 1, 1, 1] 366s 0.926 0.044 366s 366s x1: [2, 2, 1, 1] 366s 0.552 -0.725 366s -0.042 0.030 366s 366s y: [1, 2, 1, 1] 366s 0.479 366s -0.037 366s test-mul-mat0: iter:306/500 366s testing: mul_mat, [3, 3, 1, 1] = [2, 3, 1, 1] * [2, 3, 1, 1] 366s x0: [2, 3, 1, 1] 366s 0.440 -0.776 366s -0.107 -0.305 366s -0.126 0.023 366s 366s x1: [2, 3, 1, 1] 366s -0.089 -0.025 366s -0.151 0.921 366s -0.803 0.665 366s 366s y: [3, 3, 1, 1] 366s -0.019 0.017 0.011 366s -0.782 -0.265 0.040 366s -0.870 -0.117 0.116 366s testing: mul_mat, [1, 3, 4, 1] = [2, 1, 4, 1] * [2, 3, 4, 1] 366s x0: [2, 1, 4, 1] 366s 0.165 0.493 366s 366s x1: [2, 3, 4, 1] 366s -0.703 -0.853 366s 0.223 -0.981 366s -0.514 0.266 366s 366s y: [1, 3, 4, 1] 366s -0.537 366s -0.447 366s 0.046 366s testing: mul_mat, [1, 1, 4, 1] = [2, 1, 4, 1] * [2, 1, 4, 1] 366s x0: [2, 1, 4, 1] 366s -0.657 0.452 366s 366s x1: [2, 1, 4, 1] 366s -0.575 -0.824 366s 366s y: [1, 1, 4, 1] 366s 0.005 366s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 366s x0: [2, 4, 1, 1] 366s -0.975 0.325 366s 0.832 0.392 366s 0.212 0.050 366s 0.190 0.693 366s 366s x1: [2, 1, 1, 1] 366s -0.700 -0.438 366s 366s y: [4, 1, 1, 1] 366s 0.541 -0.754 -0.170 -0.437 366s testing: mul_mat, [1, 2, 4, 1] = [4, 1, 4, 1] * [4, 2, 4, 1] 366s x0: [4, 1, 4, 1] 366s 0.064 -0.733 -0.501 -0.727 366s 366s x1: [4, 2, 4, 1] 366s -0.406 0.208 -0.792 0.020 366s 0.384 0.209 -0.289 0.371 366s 366s y: [1, 2, 4, 1] 366s 0.204 366s -0.254 366s testing: mul_mat, [3, 4, 4, 1] = [1, 3, 4, 1] * [1, 4, 4, 1] 366s x0: [1, 3, 4, 1] 366s -0.518 366s -0.664 366s -0.901 366s 366s x1: [1, 4, 4, 1] 366s 0.137 366s 0.644 366s -0.566 366s 0.143 366s 366s y: [3, 4, 4, 1] 366s -0.071 -0.091 -0.123 366s -0.333 -0.427 -0.580 366s 0.293 0.376 0.510 366s -0.074 -0.095 -0.128 366s test-mul-mat0: iter:307/500 366s testing: mul_mat, [1, 3, 1, 1] = [1, 1, 1, 1] * [1, 3, 1, 1] 366s x0: [1, 1, 1, 1] 366s 0.177 366s 366s x1: [1, 3, 1, 1] 366s -0.594 366s -0.803 366s -0.213 366s 366s y: [1, 3, 1, 1] 366s -0.105 366s -0.142 366s -0.038 366s testing: mul_mat, [2, 1, 2, 1] = [1, 2, 2, 1] * [1, 1, 2, 1] 366s x0: [1, 2, 2, 1] 366s 0.348 366s -0.665 366s 366s x1: [1, 1, 2, 1] 366s 0.741 366s 366s y: [2, 1, 2, 1] 366s 0.258 -0.493 366s testing: mul_mat, [1, 2, 2, 1] = [1, 1, 2, 1] * [1, 2, 2, 1] 366s x0: [1, 1, 2, 1] 366s 0.421 366s 366s x1: [1, 2, 2, 1] 366s -0.682 366s -0.617 366s 366s y: [1, 2, 2, 1] 366s -0.287 366s -0.260 366s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 366s x0: [1, 4, 1, 1] 366s 0.505 366s 0.310 366s -0.335 366s 0.435 366s 366s x1: [1, 1, 1, 1] 366s 0.577 366s 366s y: [4, 1, 1, 1] 366s 0.291 0.179 -0.193 0.251 366s testing: mul_mat, [3, 1, 2, 1] = [4, 3, 2, 1] * [4, 1, 2, 1] 366s x0: [4, 3, 2, 1] 366s 0.376 -0.157 0.178 0.495 366s 0.012 -0.668 -0.348 0.035 366s -0.493 0.854 -0.944 0.379 366s 366s x1: [4, 1, 2, 1] 366s -0.510 -0.417 -0.115 -0.538 366s 366s y: [3, 1, 2, 1] 366s -0.413 0.293 -0.200 366s testing: mul_mat, [2, 4, 2, 1] = [3, 2, 2, 1] * [3, 4, 2, 1] 366s x0: [3, 2, 2, 1] 366s -0.950 0.760 0.849 366s -0.282 -0.561 -0.751 366s 366s x1: [3, 4, 2, 1] 366s 0.274 0.593 -0.874 366s 0.253 -0.539 0.961 366s -0.981 0.836 -0.027 366s -0.474 -0.321 0.305 366s 366s y: [2, 4, 2, 1] 366s -0.552 0.246 366s 0.164 -0.491 366s 1.545 -0.172 366s 0.466 0.085 366s test-mul-mat0: iter:308/500 366s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 366s x0: [2, 4, 1, 1] 366s -0.671 0.627 366s 0.562 -0.856 366s 0.274 -0.692 366s -0.241 -0.630 366s 366s x1: [2, 2, 1, 1] 366s -0.582 -0.387 366s 0.892 -0.725 366s 366s y: [4, 2, 1, 1] 366s 0.148 0.004 0.108 0.384 366s -1.053 1.122 0.746 0.242 366s testing: mul_mat, [4, 4, 4, 1] = [2, 4, 4, 1] * [2, 4, 4, 1] 366s x0: [2, 4, 4, 1] 366s -0.581 0.183 366s -0.852 0.278 366s -0.866 -0.229 366s 0.849 -0.472 366s 366s x1: [2, 4, 4, 1] 366s -0.615 -0.386 366s -0.580 0.103 366s -0.625 -0.141 366s -0.048 -0.377 366s 366s y: [4, 4, 4, 1] 366s 0.286 0.417 0.620 -0.340 366s 0.356 0.523 0.479 -0.541 366s 0.337 0.494 0.574 -0.464 366s -0.041 -0.063 0.128 0.137 366s testing: mul_mat, [4, 4, 4, 4] = [2, 4, 4, 4] * [2, 4, 4, 4] 366s x0: [2, 4, 4, 4] 366s -0.271 0.434 366s 0.102 0.837 366s -0.958 0.898 366s -0.137 0.050 366s 366s x1: [2, 4, 4, 4] 366s 0.164 0.611 366s 0.719 0.299 366s -0.618 0.569 366s 0.827 -0.848 366s 366s y: [4, 4, 4, 4] 366s 0.220 0.528 0.391 0.008 366s -0.065 0.323 -0.420 -0.084 366s 0.414 0.413 1.102 0.113 366s -0.592 -0.626 -1.553 -0.156 366s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 366s x0: [2, 4, 1, 1] 366s 0.783 -0.013 ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s 366s 0.998 -0.202 366s -0.760 0.544 366s 0.913 0.515 366s 366s x1: [2, 4, 1, 1] 366s -0.936 -0.062 366s -0.867 0.094 366s -0.426 -0.305 366s -0.930 -0.293 366s 366s y: [4, 4, 1, 1] 366s -0.733 -0.922 0.678 -0.887 366s -0.680 -0.884 0.710 -0.742 366s -0.330 -0.363 0.158 -0.546 366s -0.725 -0.869 0.548 -1.000 366s testing: mul_mat, [1, 2, 4, 1] = [4, 1, 4, 1] * [4, 2, 4, 1] 366s x0: [4, 1, 4, 1] 366s -0.331 -0.223 0.838 -0.040 366s 366s x1: [4, 2, 4, 1] 366s -0.476 0.993 0.870 -0.272 366s -0.465 -0.708 0.511 0.096 366s 366s y: [1, 2, 4, 1] 366s 0.676 366s 0.736 366s testing: mul_mat, [4, 4, 4, 4] = [1, 4, 4, 4] * [1, 4, 4, 4] 366s x0: [1, 4, 4, 4] 366s 0.801 366s 0.727 366s 0.209 366s -0.603 366s 366s x1: [1, 4, 4, 4] 366s -0.699 366s -0.071 366s 0.618 366s -0.402 366s 366s y: [4, 4, 4, 4] 366s -0.560 -0.508 -0.146 0.421 366s -0.057 -0.051 -0.015 0.043 366s 0.495 0.450 0.129 -0.373 366s -0.322 -0.292 -0.084 0.242 366s test-mul-mat0: iter:309/500 366s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 366s x0: [3, 3, 1, 1] 366s 0.054 0.839 0.459 366s 0.536 0.762 0.292 366s 0.775 -0.075 0.539 366s 366s x1: [3, 4, 1, 1] 366s 0.026 0.634 0.486 366s 0.168 0.521 0.051 366s 0.619 -0.811 -0.244 366s -0.625 0.335 0.782 366s 366s y: [3, 4, 1, 1] 366s 0.756 0.639 0.234 366s 0.470 0.502 0.118 366s -0.759 -0.357 0.409 366s 0.606 0.149 -0.087 366s testing: mul_mat, [1, 3, 3, 1] = [3, 1, 3, 1] * [3, 3, 3, 1] 366s x0: [3, 1, 3, 1] 366s 0.882 0.497 0.734 366s 366s x1: [3, 3, 3, 1] 366s 0.868 -0.831 -0.830 366s 0.617 0.221 -0.606 366s 0.419 -0.162 0.895 366s 366s y: [1, 3, 3, 1] 366s -0.257 366s 0.209 366s 0.947 366s testing: mul_mat, [2, 1, 3, 4] = [3, 2, 3, 4] * [3, 1, 3, 4] 366s x0: [3, 2, 3, 4] 366s -0.572 0.713 0.941 366s -0.495 -0.914 0.137 366s 366s x1: [3, 1, 3, 4] 366s -0.214 0.725 -0.806 366s 366s y: [2, 1, 3, 4] 366s -0.119 -0.667 366s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 366s x0: [3, 1, 1, 1] 366s 0.913 0.440 -0.409 366s 366s x1: [3, 2, 1, 1] 366s -0.764 -0.677 -0.150 366s 0.497 -0.321 0.376 366s 366s y: [1, 2, 1, 1] 366s -0.934 366s 0.159 366s testing: mul_mat, [3, 3, 3, 1] = [1, 3, 3, 1] * [1, 3, 3, 1] 366s x0: [1, 3, 3, 1] 366s 0.720 366s 0.622 366s -0.809 366s 366s x1: [1, 3, 3, 1] 366s -0.261 366s -0.094 366s -0.810 366s 366s y: [3, 3, 3, 1] 366s -0.188 -0.162 0.211 366s -0.067 -0.058 0.076 366s -0.583 -0.504 0.656 366s testing: mul_mat, [2, 1, 3, 4] = [3, 2, 3, 4] * [3, 1, 3, 4] 366s x0: [3, 2, 3, 4] 366s 0.616 0.076 0.416 366s 0.914 -0.773 0.952 366s 366s x1: [3, 1, 3, 4] 366s 0.695 0.837 -0.427 366s 366s y: [2, 1, 3, 4] 366s 0.314 -0.418 366s test-mul-mat0: iter:310/500 366s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 366s x0: [4, 1, 1, 1] 366s 0.350 0.356 0.966 0.485 366s 366s x1: [4, 3, 1, 1] 366s 0.153 -0.631 -0.307 -0.184 366s 0.418 -0.402 0.198 -0.929 366s 0.407 0.626 -0.525 -0.979 366s 366s y: [1, 3, 1, 1] 366s -0.557 366s -0.257 366s -0.617 366s testing: mul_mat, [2, 1, 4, 1] = [4, 2, 4, 1] * [4, 1, 4, 1] 366s x0: [4, 2, 4, 1] 366s 0.570 -0.234 -0.550 -0.233 366s -0.163 0.857 -0.607 0.312 366s 366s x1: [4, 1, 4, 1] 366s 0.031 0.643 0.956 0.270 366s 366s y: [2, 1, 4, 1] 366s -0.722 0.050 366s testing: mul_mat, [1, 2, 4, 4] = [4, 1, 4, 4] * [4, 2, 4, 4] 366s x0: [4, 1, 4, 4] 366s 0.732 0.596 0.943 -0.435 366s 366s x1: [4, 2, 4, 4] 366s -0.859 0.107 -0.712 -0.022 366s -0.036 -0.319 -0.711 -0.159 366s 366s y: [1, 2, 4, 4] 366s -1.226 366s -0.817 366s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 366s x0: [4, 4, 1, 1] 366s -0.421 0.288 -0.367 0.014 366s 0.326 0.122 0.581 0.905 366s 0.380 0.582 0.982 -0.792 366s 0.919 0.354 0.184 0.428 366s 366s x1: [4, 1, 1, 1] 366s -0.397 -0.209 -0.973 -0.514 366s 366s y: [4, 1, 1, 1] 366s 0.457 -1.186 -0.821 -0.838 366s testing: mul_mat, [1, 4, 4, 1] = [4, 1, 4, 1] * [4, 4, 4, 1] 366s x0: [4, 1, 4, 1] 366s -0.622 -0.197 -0.169 0.887 366s 366s x1: [4, 4, 4, 1] 366s -0.566 -0.782 -0.536 0.414 366s 0.121 -0.924 0.813 -0.097 366s 0.995 0.227 0.506 -0.213 366s 0.254 0.992 -0.544 0.833 366s 366s y: [1, 4, 4, 1] 366s 0.963 366s -0.117 366s -0.938 366s 0.478 366s testing: mul_mat, [3, 4, 4, 4] = [1, 3, 4, 4] * [1, 4, 4, 4] 366s x0: [1, 3, 4, 4] 366s 0.266 366s -0.756 366s -0.094 366s 366s x1: [1, 4, 4, 4] 366s -0.822 366s 0.445 366s -0.079 366s -0.212 366s 366s y: [3, 4, 4, 4] 366s -0.218 0.622 0.077 366s 0.118 -0.337 -0.042 366s -0.021 0.060 0.007 366s -0.056 0.160 0.020 366s test-mul-mat0: iter:311/500 366s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 366s x0: [2, 2, 1, 1] 366s 0.767 -0.313 366s 0.898 -0.896 366s 366s x1: [2, 3, 1, 1] 366s -0.487 -0.709 366s -0.921 -0.030 366s -0.304 0.246 366s 366s y: [2, 3, 1, 1] 366s -0.151 0.199 366s -0.697 -0.800 366s -0.311 -0.494 366s testing: mul_mat, [2, 2, 4, 1] = [2, 2, 4, 1] * [2, 2, 4, 1] 366s x0: [2, 2, 4, 1] 366s -0.069 -0.171 366s -0.430 0.44ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s 5 366s 366s x1: [2, 2, 4, 1] 366s -0.629 0.701 366s 0.647 0.536 366s 366s y: [2, 2, 4, 1] 366s -0.077 0.582 366s -0.136 -0.040 366s testing: mul_mat, [1, 2, 4, 3] = [2, 1, 4, 3] * [2, 2, 4, 3] 366s x0: [2, 1, 4, 3] 366s -0.301 0.319 366s 366s x1: [2, 2, 4, 3] 366s 0.333 -0.512 366s 0.962 0.411 366s 366s y: [1, 2, 4, 3] 366s -0.264 366s -0.158 366s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 366s x0: [2, 3, 1, 1] 366s 0.619 -0.521 366s -0.142 0.557 366s 0.320 -0.360 366s 366s x1: [2, 1, 1, 1] 366s -0.224 0.962 366s 366s y: [3, 1, 1, 1] 366s -0.640 0.568 -0.418 366s testing: mul_mat, [1, 2, 4, 1] = [3, 1, 4, 1] * [3, 2, 4, 1] 366s x0: [3, 1, 4, 1] 366s -0.836 0.645 0.723 366s 366s x1: [3, 2, 4, 1] 366s -0.996 -0.527 -0.607 366s -0.910 0.022 -0.838 366s 366s y: [1, 2, 4, 1] 366s 0.054 366s 0.169 366s testing: mul_mat, [3, 3, 4, 3] = [1, 3, 4, 3] * [1, 3, 4, 3] 366s x0: [1, 3, 4, 3] 366s 0.400 366s -0.062 366s -0.942 366s 366s x1: [1, 3, 4, 3] 366s -0.945 366s 0.213 366s -0.447 366s 366s y: [3, 3, 4, 3] 366s -0.378 0.058 0.891 366s 0.085 -0.013 -0.201 366s -0.179 0.028 0.422 366s test-mul-mat0: iter:312/500 366s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 366s x0: [2, 1, 1, 1] 366s 0.008 -0.638 366s 366s x1: [2, 4, 1, 1] 366s 0.680 -0.807 366s 0.275 0.010 366s 0.336 0.767 366s 0.332 -0.490 366s 366s y: [1, 4, 1, 1] 366s 0.520 366s -0.004 366s -0.487 366s 0.315 366s testing: mul_mat, [4, 1, 3, 1] = [2, 4, 3, 1] * [2, 1, 3, 1] 366s x0: [2, 4, 3, 1] 366s -0.990 0.272 366s -0.335 -0.317 366s 0.576 -0.276 366s 0.414 0.995 366s 366s x1: [2, 1, 3, 1] 366s 0.362 -0.206 366s 366s y: [4, 1, 3, 1] 366s -0.414 -0.056 0.265 -0.055 366s testing: mul_mat, [4, 4, 3, 3] = [2, 4, 3, 3] * [2, 4, 3, 3] 366s x0: [2, 4, 3, 3] 366s -0.223 0.376 366s -0.389 0.582 366s 0.596 0.387 366s -0.684 -0.789 366s 366s x1: [2, 4, 3, 3] 366s -0.079 0.437 366s 0.104 0.758 366s 0.047 0.473 366s 0.303 0.057 366s 366s y: [4, 4, 3, 3] 366s 0.182 0.285 0.122 -0.291 366s 0.262 0.401 0.355 -0.670 366s 0.167 0.257 0.211 -0.406 366s -0.046 -0.085 0.202 -0.252 366s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 366s x0: [2, 4, 1, 1] 366s 0.726 -0.322 366s 0.145 0.353 366s 0.755 0.069 366s 0.150 0.498 366s 366s x1: [2, 4, 1, 1] 366s -0.152 0.416 366s -0.930 0.300 366s 0.082 0.926 366s -0.324 -0.375 366s 366s y: [4, 4, 1, 1] 366s -0.244 0.125 -0.086 0.184 366s -0.772 -0.029 -0.681 0.010 366s -0.239 0.339 0.126 0.473 366s -0.114 -0.179 -0.270 -0.235 366s testing: mul_mat, [4, 2, 3, 1] = [4, 4, 3, 1] * [4, 2, 3, 1] 366s x0: [4, 4, 3, 1] 366s 0.252 -0.536 -0.187 0.725 366s 0.395 -0.271 0.424 0.712 366s 0.231 0.291 -0.730 -0.097 366s -0.395 -0.765 -0.375 -0.694 366s 366s x1: [4, 2, 3, 1] 366s -0.314 0.771 -0.916 -0.868 366s -0.965 0.812 -0.698 0.442 366s 366s y: [4, 2, 3, 1] 366s -0.950 -1.339 0.905 0.480 366s -0.227 -0.582 0.480 -0.284 366s testing: mul_mat, [4, 4, 3, 3] = [4, 4, 3, 3] * [4, 4, 3, 3] 366s x0: [4, 4, 3, 3] 366s -0.621 0.257 -0.734 0.247 366s -0.501 -0.646 -0.425 -0.995 366s 0.129 -0.255 -0.277 -0.573 366s -0.655 -0.722 -0.191 0.254 366s 366s x1: [4, 4, 3, 3] 366s 0.937 0.487 -0.236 0.076 366s 0.780 -0.339 -0.943 -0.144 366s 0.457 0.557 0.342 0.749 366s 0.302 -0.619 0.294 -0.559 366s 366s y: [4, 4, 3, 3] 366s -0.265 -0.759 0.019 -0.900 366s 0.085 0.372 0.531 -0.122 366s -0.206 -1.480 -0.607 -0.576 366s -0.701 0.680 0.435 0.051 366s test-mul-mat0: iter:313/500 366s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 366s x0: [2, 2, 1, 1] 366s 0.902 0.166 366s 0.573 0.522 366s 366s x1: [2, 1, 1, 1] 366s -0.244 0.548 366s 366s y: [2, 1, 1, 1] 366s -0.129 0.146 366s testing: mul_mat, [3, 2, 3, 1] = [2, 3, 3, 1] * [2, 2, 3, 1] 366s x0: [2, 3, 3, 1] 366s 0.248 0.364 366s -0.319 -0.599 366s 0.862 -0.136 366s 366s x1: [2, 2, 3, 1] 366s 0.147 0.061 366s -0.943 -0.177 366s 366s y: [3, 2, 3, 1] 366s 0.059 -0.083 0.118 366s -0.299 0.406 -0.789 366s testing: mul_mat, [2, 3, 3, 1] = [2, 2, 3, 1] * [2, 3, 3, 1] 366s x0: [2, 2, 3, 1] 366s 0.179 0.550 366s 0.373 0.351 366s 366s x1: [2, 3, 3, 1] 366s -0.968 -0.250 366s -0.420 -0.145 366s -0.960 -0.280 366s 366s y: [2, 3, 3, 1] 366s -0.311 -0.449 366s -0.155 -0.208 366s -0.326 -0.456 366s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 366s x0: [2, 2, 1, 1] 366s 0.298 -0.482 366s 0.150 -0.118 366s 366s x1: [2, 2, 1, 1] 366s 0.986 0.424 366s -0.422 -0.159 366s 366s y: [2, 2, 1, 1] 366s 0.090 0.098 366s -0.049 -0.045 366s testing: mul_mat, [4, 2, 3, 1] = [2, 4, 3, 1] * [2, 2, 3, 1] 366s x0: [2, 4, 3, 1] 366s -0.557 0.389 366s -0.595 -0.741 366s -0.215 0.468 366s 0.513 -0.938 366s 366s x1: [2, 2, 3, 1] 366s -0.739 -0.983 366s 0.054 0.745 366s 366s y: [4, 2, 3, 1] 366s 0.029 1.168 -0.301 0.542 366s 0.260 -0.584 0.337 -0.671 366s testing: mul_mat, [4, 2, 3, 1] = [4, 4, 3, 1] * [4, 2, 3, 1] 366s x0: [4, 4, 3, 1] 366s 0.582 0.675 0.270 0.310 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s 0.425 -0.375 -0.303 -0.204 366s 0.967 -0.604 0.096 -0.964 366s 0.034 0.839 0.451 -0.292 366s 366s x1: [4, 2, 3, 1] 366s 0.362 -0.836 -0.356 -0.699 366s 0.257 -0.819 -0.387 0.700 366s 366s y: [4, 2, 3, 1] 366s -0.666 0.718 1.495 -0.645 366s -0.291 0.391 0.032 -1.058 366s test-mul-mat0: iter:314/500 366s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 366s x0: [4, 1, 1, 1] 366s 0.186 0.298 -0.573 0.329 366s 366s x1: [4, 3, 1, 1] 366s 0.501 -0.663 -0.214 -0.470 366s 0.593 -0.172 -0.559 -0.276 366s -0.191 -0.750 0.438 -0.784 366s 366s y: [1, 3, 1, 1] 366s -0.136 366s 0.288 366s -0.768 366s testing: mul_mat, [2, 1, 2, 1] = [4, 2, 2, 1] * [4, 1, 2, 1] 366s x0: [4, 2, 2, 1] 366s 0.872 0.546 -0.604 0.384 366s -0.238 0.897 0.721 0.548 366s 366s x1: [4, 1, 2, 1] 366s -0.780 -0.902 -0.832 -0.763 366s 366s y: [2, 1, 2, 1] 366s -0.963 -1.642 366s testing: mul_mat, [3, 2, 2, 2] = [4, 3, 2, 2] * [4, 2, 2, 2] 366s x0: [4, 3, 2, 2] 366s -0.326 -0.220 0.072 -0.336 366s 0.699 -0.260 -0.177 -0.145 366s 0.628 0.299 0.060 -0.107 366s 366s x1: [4, 2, 2, 2] 366s 0.401 0.144 -0.338 -0.301 366s 0.571 0.991 -0.081 0.668 366s 366s y: [3, 2, 2, 2] 366s -0.086 0.346 0.307 366s -0.634 0.059 0.578 366s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 366s x0: [4, 1, 1, 1] 366s -0.655 0.981 -0.343 0.824 366s 366s x1: [4, 3, 1, 1] 366s 0.718 0.171 -0.435 -0.887 366s -0.519 -0.157 -0.997 0.031 366s 0.657 -0.426 0.937 -0.518 366s 366s y: [1, 3, 1, 1] 366s -0.885 366s 0.553 366s -1.596 366s testing: mul_mat, [2, 4, 2, 1] = [1, 2, 2, 1] * [1, 4, 2, 1] 366s x0: [1, 2, 2, 1] 366s -0.350 366s 0.004 366s 366s x1: [1, 4, 2, 1] 366s -0.856 366s 0.788 366s -0.957 366s 0.130 366s 366s y: [2, 4, 2, 1] 366s 0.299 -0.004 366s -0.276 0.003 366s 0.335 -0.004 366s -0.045 0.001 366s testing: mul_mat, [4, 1, 2, 2] = [2, 4, 2, 2] * [2, 1, 2, 2] 366s x0: [2, 4, 2, 2] 366s -0.148 -0.296 366s 0.479 0.846 366s 0.501 0.314 366s 0.333 0.361 366s 366s x1: [2, 1, 2, 2] 366s 0.787 -0.211 366s 366s y: [4, 1, 2, 2] 366s -0.054 0.198 0.328 0.186 366s test-mul-mat0: iter:315/500 366s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 366s x0: [1, 2, 1, 1] 366s 0.442 366s -0.301 366s 366s x1: [1, 1, 1, 1] 366s 0.029 366s 366s y: [2, 1, 1, 1] 366s 0.013 -0.009 366s testing: mul_mat, [3, 2, 2, 1] = [1, 3, 2, 1] * [1, 2, 2, 1] 366s x0: [1, 3, 2, 1] 366s -0.987 366s 0.631 366s 0.259 366s 366s x1: [1, 2, 2, 1] 366s 0.040 366s -0.409 366s 366s y: [3, 2, 2, 1] 366s -0.039 0.025 0.010 366s 0.403 -0.258 -0.106 366s testing: mul_mat, [4, 3, 2, 2] = [1, 4, 2, 2] * [1, 3, 2, 2] 366s x0: [1, 4, 2, 2] 366s -0.607 366s -0.285 366s 0.987 366s 0.422 366s 366s x1: [1, 3, 2, 2] 366s 0.546 366s 0.820 366s -0.458 366s 366s y: [4, 3, 2, 2] 366s -0.332 -0.156 0.539 0.231 366s -0.498 -0.234 0.809 0.346 366s 0.278 0.130 -0.452 -0.193 366s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 366s x0: [1, 1, 1, 1] 366s 0.973 366s 366s x1: [1, 4, 1, 1] 366s -0.119 366s -0.199 366s -0.665 366s -0.299 366s 366s y: [1, 4, 1, 1] 366s -0.116 366s -0.194 366s -0.647 366s -0.291 366s testing: mul_mat, [2, 1, 2, 1] = [1, 2, 2, 1] * [1, 1, 2, 1] 366s x0: [1, 2, 2, 1] 366s 0.634 366s -0.714 366s 366s x1: [1, 1, 2, 1] 366s -0.892 366s 366s y: [2, 1, 2, 1] 366s -0.566 0.637 366s testing: mul_mat, [1, 1, 2, 2] = [2, 1, 2, 2] * [2, 1, 2, 2] 366s x0: [2, 1, 2, 2] 366s 0.082 -0.525 366s 366s x1: [2, 1, 2, 2] 366s -0.267 0.040 366s 366s y: [1, 1, 2, 2] 366s -0.043 366s test-mul-mat0: iter:316/500 366s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 366s x0: [1, 3, 1, 1] 366s 0.722 366s 0.558 366s 0.966 366s 366s x1: [1, 3, 1, 1] 366s -0.776 366s -0.051 366s -0.047 366s 366s y: [3, 3, 1, 1] 366s -0.560 -0.433 -0.750 366s -0.037 -0.029 -0.050 366s -0.034 -0.026 -0.045 366s testing: mul_mat, [2, 3, 4, 1] = [1, 2, 4, 1] * [1, 3, 4, 1] 366s x0: [1, 2, 4, 1] 366s 0.688 366s -0.205 366s 366s x1: [1, 3, 4, 1] 366s -0.992 366s -0.309 366s -0.190 366s 366s y: [2, 3, 4, 1] 366s -0.683 0.203 366s -0.212 0.063 366s -0.131 0.039 366s testing: mul_mat, [3, 2, 4, 2] = [1, 3, 4, 2] * [1, 2, 4, 2] 366s x0: [1, 3, 4, 2] 366s 0.967 366s 0.308 366s 0.651 366s 366s x1: [1, 2, 4, 2] 366s 0.226 366s -0.892 366s 366s y: [3, 2, 4, 2] 366s 0.219 0.070 0.147 366s -0.862 -0.275 -0.580 366s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 366s x0: [1, 3, 1, 1] 366s -0.486 366s -0.200 366s 0.046 366s 366s x1: [1, 3, 1, 1] 366s 0.801 366s 0.304 366s -0.804 366s 366s y: [3, 3, 1, 1] 366s -0.389 -0.160 0.037 366s -0.148 -0.061 0.014 366s 0.390 0.161 -0.037 366s testing: mul_mat, [4, 1, 4, 1] = [3, 4, 4, 1] * [3, 1, 4, 1] 366s x0: [3, 4, 4, 1] 366s -0.741 0.285 -0.408 366s 0.178 0.578 -0.067 366s 0.343 0.107 -0.186 366s -0.931 0.175 -0.607 366s 366s x1: [3, 1, 4, 1] 366s -0.519 -0.891 -0.303 366s 366s y: [4, 1, 4, 1] 366s 0.253 -0.587 -0.217 0.510 366s testing: mul_mat, [2, 3, 4, 2] = [4, 2, 4, 2] * [4, 3, 4, 2] 366s x0: [4, 2, 4, 2] 366s -0.283 0.071 -0.565 -0.438 366s -0.741 -0.814ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 366s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 366s 0.154 0.090 366s 366s x1: [4, 3, 4, 2] 366s 0.084 -0.374 0.632 -0.450 366s 0.520 -0.548 0.379 -0.055 366s 0.759 0.298 0.465 0.793 366s 366s y: [2, 3, 4, 2] 366s -0.210 0.299 366s -0.376 0.114 366s -0.803 -0.662 366s test-mul-mat0: iter:317/500 366s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 366s x0: [4, 2, 1, 1] 366s -0.727 -0.010 0.143 -0.505 366s 0.816 -0.287 -0.216 -0.857 366s 366s x1: [4, 4, 1, 1] 366s -0.716 0.224 -0.475 -0.713 366s 0.382 0.915 -0.045 -0.924 366s -0.340 0.979 -0.537 0.515 366s -0.942 -0.059 0.348 0.475 366s 366s y: [2, 4, 1, 1] 366s 0.810 0.064 366s 0.173 0.849 366s -0.099 -0.884 366s 0.496 -1.234 366s testing: mul_mat, [3, 2, 3, 1] = [4, 3, 3, 1] * [4, 2, 3, 1] 366s x0: [4, 3, 3, 1] 366s -0.886 -0.602 0.662 -0.173 366s 0.182 0.805 -0.631 -0.422 366s -0.153 -0.814 0.348 -0.486 366s 366s x1: [4, 2, 3, 1] 366s 0.542 0.397 0.042 0.817 366s -0.230 0.667 -0.899 0.994 366s 366s y: [3, 2, 3, 1] 366s -0.833 0.047 -0.788 366s -0.965 0.644 -1.304 366s testing: mul_mat, [2, 3, 3, 3] = [4, 2, 3, 3] * [4, 3, 3, 3] 366s x0: [4, 2, 3, 3] 366s -0.190 0.673 -0.535 0.502 366s 0.113 -0.098 -0.567 0.607 366s 366s x1: [4, 3, 3, 3] 366s -0.109 0.221 0.550 0.739 366s 0.407 -0.102 -0.747 0.694 366s -0.760 -0.042 -0.631 -0.042 366s 366s y: [2, 3, 3, 3] 366s 0.246 0.103 366s 0.602 0.901 366s 0.433 0.251 366s testing: mul_mat, [3, 2, 1, 1] = [4, 3, 1, 1] * [4, 2, 1, 1] 366s x0: [4, 3, 1, 1] 366s -0.223 0.211 -0.161 0.985 366s 0.602 0.804 0.323 0.381 366s -0.533 0.358 0.197 0.423 366s 366s x1: [4, 2, 1, 1] 366s -0.291 -0.808 -0.869 0.859 366s -0.317 0.841 0.951 0.105 366s 366s y: [3, 2, 1, 1] 366s 0.880 -0.778 0.058 366s 0.198 0.832 0.701 366s testing: mul_mat, [1, 4, 3, 1] = [3, 1, 3, 1] * [3, 4, 3, 1] 366s x0: [3, 1, 3, 1] 366s 0.655 0.536 0.628 366s 366s x1: [3, 4, 3, 1] 366s 0.459 0.653 -0.498 366s -0.239 0.250 0.335 366s -0.675 0.780 -0.236 366s 0.210 -0.511 -0.044 366s 366s y: [1, 4, 3, 1] 366s 0.337 366s 0.188 366s -0.172 366s -0.164 366s testing: mul_mat, [3, 3, 3, 3] = [1, 3, 3, 3] * [1, 3, 3, 3] 366s x0: [1, 3, 3, 3] 366s -0.107 366s 0.984 366s -0.121 366s 366s x1: [1, 3, 3, 3] 366s 0.386 366s -0.328 366s -0.034 366s 366s y: [3, 3, 3, 3] 366s -0.041 0.380 -0.047 366s 0.035 -0.323 0.040 366s 0.004 -0.034 0.004 366s test-mul-mat0: iter:318/500 366s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 366s x0: [3, 4, 1, 1] 366s 0.518 0.238 -0.198 366s 0.921 -0.606 -0.937 366s -0.642 0.011 -0.255 366s 0.955 -0.439 -0.547 366s 366s x1: [3, 1, 1, 1] 366s -0.722 -0.138 0.392 366s 366s y: [4, 1, 1, 1] 366s -0.484 -0.949 0.362 -0.843 366s testing: mul_mat, [4, 4, 3, 1] = [3, 4, 3, 1] * [3, 4, 3, 1] 366s x0: [3, 4, 3, 1] 366s 0.716 -0.058 0.479 366s 0.813 0.420 -0.329 366s 0.296 -0.224 -0.625 366s 0.057 -0.006 0.472 366s 366s x1: [3, 4, 3, 1] 366s 0.821 0.899 -0.788 366s -0.474 0.531 0.551 366s 0.400 0.212 0.335 366s -0.902 -0.522 0.191 366s 366s y: [4, 4, 3, 1] 366s 0.157 1.305 0.534 -0.331 366s -0.106 -0.344 -0.604 0.230 366s 0.435 0.304 -0.138 0.180 366s -0.524 -1.016 -0.270 0.042 366s testing: mul_mat, [1, 4, 3, 1] = [3, 1, 3, 1] * [3, 4, 3, 1] 366s x0: [3, 1, 3, 1] 366s 0.852 -0.262 0.137 366s 366s x1: [3, 4, 3, 1] 366s -0.125 0.300 0.641 366s 0.250 -0.642 -0.365 366s -0.278 0.733 0.364 366s -0.472 -0.772 0.749 366s 366s y: [1, 4, 3, 1] 366s -0.098 366s 0.331 366s -0.379 366s -0.097 366s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 366s x0: [3, 4, 1, 1] 366s 0.879 0.382 0.190 366s -0.770 -0.441 0.766 366s 0.373 -0.240 -0.928 366s 0.957 -0.275 -0.313 366s 366s x1: [3, 1, 1, 1] 366s -0.221 -0.619 0.553 366s 366s y: [4, 1, 1, 1] 366s -0.325 0.866 -0.447 -0.214 366s testing: mul_mat, [1, 3, 3, 1] = [4, 1, 3, 1] * [4, 3, 3, 1] 366s x0: [4, 1, 3, 1] 366s -0.903 0.524 0.886 0.107 366s 366s x1: [4, 3, 3, 1] 366s 0.647 -0.280 0.824 -0.222 366s 0.725 0.597 -0.370 -0.537 366s -0.266 0.010 0.428 0.620 366s 366s y: [1, 3, 3, 1] 366s -0.024 366s -0.727 366s 0.692 366s testing: mul_mat, [3, 4, 3, 1] = [1, 3, 3, 1] * [1, 4, 3, 1] 366s x0: [1, 3, 3, 1] 366s -0.981 366s 0.126 366s -0.803 366s 366s x1: [1, 4, 3, 1] 366s -0.284 366s -0.507 366s -0.193 366s -0.989 366s 366s y: [3, 4, 3, 1] 366s 0.278 -0.036 0.228 366s 0.498 -0.064 0.408 366s 0.189 -0.024 0.155 366s 0.970 -0.124 0.794 366s test-mul-mat0: iter:319/500 366s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 366s x0: [2, 3, 1, 1] 366s 0.761 -0.658 366s -0.921 0.773 366s 0.351 -0.294 366s 366s x1: [2, 2, 1, 1] 366s 0.917 -0.097 366s 0.045 0.849 366s 366s y: [3, 2, 1, 1] 366s 0.761 -0.919 0.351 366s -0.525 0.615 -0.234 366s testing: mul_mat, [3, 3, 3, 1] = [2, 3, 3, 1] * [2, 3, 3, 1] 366s x0: [2, 3, 3, 1] 366s -0.005 0.273 366s 0.616 -0.961 366s 0.122 -0.112 366s 366s x1: [2, 3, 3, 1] 366s 0.638 0.638 366s 0.878 -0.452 366s -0.033 0.667 366s 366s y: [3, 3, 3, 1] 366s 0.171 -0.219 0.006 366s -0.128 0.975 0.158 366s 0.182 -0.661 -0.079 366s testing: mul_mat, [2, 3, 3, 1] = [2, 2, 3, 1] * [2, 3, 3, 1] 366s x0: [2, 2, 3, 1] 366s -0.492 -0.103 366s 0.225 -0.919 366s 366s x1: [2, 3, 3, 1] 366s -0.051 -0.595 366s -0.990 -0.665 366s -0.124 -0.010 366s 366s y: [2, 3, 3, 1] 366s 0.087 0.535 366s 0.555 0.388 366s 0.062 -0.019 366s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 366s x0: [2, 2, 1, 1] 366s -0.333 0.793 366s 0.408 -0.651 366s 366s x1: [2, 2, 1, 1] 366s -0.998 0.988 366s 0.677 -0.663 366s 366s y: [2, 2, 1, 1] 366s 1.116 -1.050 366s -0.751 0.707 366s testing: mul_mat, [2, 2, 3, 1] = [2, 2, 3, 1] * [2, 2, 3, 1] 366s x0: [2, 2, 3, 1] 366s 0.432 0.956 366s 0.739 -0.275 366s 366s x1: [2, 2, 3, 1] 366s -0.550 0.467 366s 0.387 -0.796 366s 366s y: [2, 2, 3, 1] 366s 0.209 -0.535 366s -0.594 0.505 366s testing: mul_mat, [1, 2, 3, 1] = [2, 1, 3, 1] * [2, 2, 3, 1] 366s x0: [2, 1, 3, 1] 366s -0.425 -0.020 366s 366s x1: [2, 2, 3, 1] 366s -0.884 -0.662 366s -0.586 0.524 366s 366s y: [1, 2, 3, 1] 366s 0.389 366s 0.239 366s test-mul-mat0: iter:320/500 366s testing: mul_mat, [1, 1, 1, 1] = [2, 1, 1, 1] * [2, 1, 1, 1] 366s x0: [2, 1, 1, 1] 366s -0.863 0.077 366s 366s x1: [2, 1, 1, 1] 366s -0.349 0.006 366s 366s y: [1, 1, 1, 1] 366s 0.302 366s testing: mul_mat, [3, 1, 4, 1] = [2, 3, 4, 1] * [2, 1, 4, 1] 366s x0: [2, 3, 4, 1] 366s 0.252 0.678 366s -0.847 0.430 366s 0.411 -0.998 366s 366s x1: [2, 1, 4, 1] 366s 0.638 0.611 366s 366s y: [3, 1, 4, 1] 366s 0.576 -0.278 -0.348 366s testing: mul_mat, [2, 3, 4, 4] = [2, 2, 4, 4] * [2, 3, 4, 4] 366s x0: [2, 2, 4, 4] 366s 0.645 -0.548 366s 0.200 0.706 366s 366s x1: [2, 3, 4, 4] 366s -0.076 -0.396 366s 0.086 -0.837 366s 0.883 0.166 366s 366s y: [2, 3, 4, 4] 366s 0.168 -0.294 366s 0.514 -0.574 366s 0.478 0.293 366s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 366s x0: [2, 2, 1, 1] 366s -0.459 -0.367 366s 0.370 -0.147 366s 366s x1: [2, 2, 1, 1] 366s 0.128 0.079 366s 0.366 0.832 366s 366s y: [2, 2, 1, 1] 366s -0.088 0.036 366s -0.473 0.013 366s testing: mul_mat, [3, 2, 4, 1] = [2, 3, 4, 1] * [2, 2, 4, 1] 366s x0: [2, 3, 4, 1] 366s -0.459 -0.044 366s -0.324 -0.819 367s 0.045 -0.827 367s 367s x1: [2, 2, 4, 1] 367s 0.305 -0.473 367s 0.946 -0.244 367s 367s y: [3, 2, 4, 1] 367s -0.119 0.289 0.405 367s -0.423 -0.107 0.244 367s testing: mul_mat, [1, 2, 4, 4] = [3, 1, 4, 4] * [3, 2, 4, 4] 367s x0: [3, 1, 4, 4] 367s -0.752 0.163 0.580 367s 367s x1: [3, 2, 4, 4] 367s -0.748 -0.379 0.222 367s 0.345 0.707 0.088 367s 367s y: [1, 2, 4, 4] 367s 0.630 367s -0.093 367s test-mul-mat0: iter:321/500 367s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 367s x0: [4, 2, 1, 1] 367s -0.507 0.710 0.514 -0.755 367s 0.415 0.387 -0.437 -0.004 367s 367s x1: [4, 3, 1, 1] 367s -0.171 0.358 0.205 -0.577 367s 0.939 -0.281 0.085 -0.200 367s 0.625 -0.813 0.909 0.694 367s 367s y: [2, 3, 1, 1] 367s 0.882 -0.019 367s -0.481 0.244 367s -0.951 -0.456 367s testing: mul_mat, [2, 2, 1, 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 367s x0: [4, 2, 1, 1] 367s -0.397 -0.863 -0.335 0.808 367s -0.440 -0.397 -0.472 0.645 367s 367s x1: [4, 2, 1, 1] 367s -0.698 -0.647 -0.330 0.636 367s 0.041 0.201 -0.954 0.307 367s 367s y: [2, 2, 1, 1] 367s 1.461 1.130 367s 0.378 0.551 367s testing: mul_mat, [4, 2, 1, 2] = [4, 4, 1, 2] * [4, 2, 1, 2] 367s x0: [4, 4, 1, 2] 367s 0.541 -0.993 0.839 0.588 367s 0.314 0.145 -0.809 0.451 367s 0.810 0.999 -0.989 -0.587 367s -0.473 0.655 0.817 -0.320 367s 367s x1: [4, 2, 1, 2] 367s 0.403 -0.847 0.832 0.312 367s 0.847 -0.297 0.805 0.557 367s 367s y: [4, 2, 1, 2] 367s 1.942 -0.529 -1.526 -0.166 367s 1.757 -0.177 -0.735 -0.116 367s testing: mul_mat, [3, 4, 1, 1] = [4, 3, 1, 1] * [4, 4, 1, 1] 367s x0: [4, 3, 1, 1] 367s -0.031 0.053 -0.890 0.134 367s -0.322 0.084 0.094 0.010 367s -0.251 0.733 0.330 0.892 367s 367s x1: [4, 4, 1, 1] 367s -0.431 0.501 0.044 0.883 367s -0.353 0.235 0.334 -0.543 367s 0.235 0.344 -0.130 0.762 367s -0.000 -0.314 -0.558 -0.513 367s 367s y: [3, 4, 1, 1] 367s 0.119 0.194 1.279 367s -0.347 0.160 -0.114 367s 0.229 -0.051 0.831 367s 0.411 -0.084 -0.871 367s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 367s x0: [3, 4, 1, 1] 367s 0.743 -0.490 0.522 367s -0.977 -0.227 -0.793 367s -0.042 0.154 -0.728 367s -0.815 0.188 0.255 367s 367s x1: [3, 4, 1, 1] 367s -0.203 -0.533 -0.539 367s -0.702 0.511 -0.655 367s -0.055 -0.254 0.678 367s 0.401 0.981 0.023 367s 367s y: [4, 4, 1, 1] 367s -0.171 0.747 0.319 -0.072 367s -1.114 1.090 0.585 0.501 367s 0.437 -0.426 -0.530 0.170 367s -0.171 -0.633 0.117 -0.137 367s testing: mul_mat, [4, 3, 1, 2] = [4, 4, 1, 2] * [4, 3, 1, 2] 367s x0: [4, 4, 1, 2] 367s -0.838 0.977 -0.744 -0.615 367s 0.705 0.671 -0.863 -0.437 367s -0.584 -0.314 0.720 0.673 367s -0.631 0.344 -0.266 -0.070 367s 367s x1: [4, 3, 1, 2] 367s -0.683 ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s 0.366 -0.415 0.451 367s -0.624 -0.522 -0.752 -0.157 367s -0.061 -0.454 -0.646 0.284 367s 367s y: [4, 3, 1, 2] 367s 0.962 -0.075 0.289 0.636 367s 0.669 -0.072 -0.119 0.425 367s -0.087 0.086 -0.096 0.034 367s test-mul-mat0: iter:322/500 367s testing: mul_mat, [3, 1, 1, 1] = [1, 3, 1, 1] * [1, 1, 1, 1] 367s x0: [1, 3, 1, 1] 367s 0.455 367s 0.834 367s -0.719 367s 367s x1: [1, 1, 1, 1] 367s 0.578 367s 367s y: [3, 1, 1, 1] 367s 0.263 0.482 -0.415 367s testing: mul_mat, [1, 3, 4, 1] = [1, 1, 4, 1] * [1, 3, 4, 1] 367s x0: [1, 1, 4, 1] 367s 0.021 367s 367s x1: [1, 3, 4, 1] 367s 0.175 367s -0.432 367s -0.334 367s 367s y: [1, 3, 4, 1] 367s 0.004 367s -0.009 367s -0.007 367s testing: mul_mat, [2, 1, 4, 4] = [1, 2, 4, 4] * [1, 1, 4, 4] 367s x0: [1, 2, 4, 4] 367s 0.975 367s -0.977 367s 367s x1: [1, 1, 4, 4] 367s -0.598 367s 367s y: [2, 1, 4, 4] 367s -0.582 0.584 367s testing: mul_mat, [3, 2, 1, 1] = [1, 3, 1, 1] * [1, 2, 1, 1] 367s x0: [1, 3, 1, 1] 367s -0.283 367s 0.118 367s 0.957 367s 367s x1: [1, 2, 1, 1] 367s 0.496 367s 0.248 367s 367s y: [3, 2, 1, 1] 367s -0.140 0.058 0.474 367s -0.070 0.029 0.237 367s testing: mul_mat, [1, 1, 4, 1] = [3, 1, 4, 1] * [3, 1, 4, 1] 367s x0: [3, 1, 4, 1] 367s 0.132 -0.546 -0.730 367s 367s x1: [3, 1, 4, 1] 367s 0.056 -0.683 0.022 367s 367s y: [1, 1, 4, 1] 367s 0.364 367s testing: mul_mat, [1, 3, 4, 4] = [1, 1, 4, 4] * [1, 3, 4, 4] 367s x0: [1, 1, 4, 4] 367s 0.634 367s 367s x1: [1, 3, 4, 4] 367s -0.729 367s 0.782 367s -0.168 367s 367s y: [1, 3, 4, 4] 367s -0.462 367s 0.496 367s -0.106 367s test-mul-mat0: iter:323/500 367s testing: mul_mat, [3, 1, 1, 1] = [3, 3, 1, 1] * [3, 1, 1, 1] 367s x0: [3, 3, 1, 1] 367s 0.421 -0.837 -0.266 367s -0.107 -0.250 -0.181 367s 0.310 -0.616 0.336 367s 367s x1: [3, 1, 1, 1] 367s -0.567 -0.321 0.108 367s 367s y: [3, 1, 1, 1] 367s 0.001 0.121 0.058 367s testing: mul_mat, [3, 3, 4, 1] = [3, 3, 4, 1] * [3, 3, 4, 1] 367s x0: [3, 3, 4, 1] 367s -0.191 -0.707 -0.108 367s 0.755 0.514 0.136 367s -0.601 -0.085 0.179 367s 367s x1: [3, 3, 4, 1] 367s 0.127 0.335 0.895 367s -0.653 0.691 0.800 367s -0.892 0.595 0.826 367s 367s y: [3, 3, 4, 1] 367s -0.358 0.390 0.055 367s -0.450 -0.029 0.477 367s -0.339 -0.255 0.634 367s testing: mul_mat, [2, 3, 4, 4] = [3, 2, 4, 4] * [3, 3, 4, 4] 367s x0: [3, 2, 4, 4] 367s 0.499 0.641 -0.379 367s 0.563 0.357 0.599 367s 367s x1: [3, 3, 4, 4] 367s -0.495 0.630 -0.258 367s 0.685 -0.870 -0.418 367s -0.870 0.163 -0.975 367s 367s y: [2, 3, 4, 4] 367s 0.255 -0.208 367s -0.058 -0.176 367s 0.040 -1.016 367s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 367s x0: [3, 2, 1, 1] 367s -0.351 0.855 -0.039 367s -0.492 -0.673 -0.020 367s 367s x1: [3, 2, 1, 1] 367s -0.696 0.735 0.421 367s 0.683 -0.280 -0.220 367s 367s y: [2, 2, 1, 1] 367s 0.856 -0.161 367s -0.471 -0.143 367s testing: mul_mat, [4, 3, 4, 1] = [2, 4, 4, 1] * [2, 3, 4, 1] 367s x0: [2, 4, 4, 1] 367s -0.445 -0.822 367s 0.217 -0.391 367s 0.629 0.317 367s -0.118 -0.835 367s 367s x1: [2, 3, 4, 1] 367s -0.565 0.987 367s -0.142 0.171 367s -0.490 -0.008 367s 367s y: [4, 3, 4, 1] 367s -0.559 -0.509 -0.043 -0.757 367s -0.077 -0.098 -0.035 -0.126 367s 0.225 -0.103 -0.311 0.064 367s testing: mul_mat, [1, 2, 4, 4] = [4, 1, 4, 4] * [4, 2, 4, 4] 367s x0: [4, 1, 4, 4] 367s 0.317 0.542 0.796 -0.290 367s 367s x1: [4, 2, 4, 4] 367s 0.053 0.896 -0.987 0.232 367s -0.495 0.329 0.397 0.972 367s 367s y: [1, 2, 4, 4] 367s -0.351 367s 0.056 367s test-mul-mat0: iter:324/500 367s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 367s x0: [4, 1, 1, 1] 367s 0.568 -0.302 -0.226 0.868 367s 367s x1: [4, 3, 1, 1] 367s 0.223 0.438 -0.671 0.983 367s -0.677 0.009 -0.309 0.003 367s -0.590 -0.151 -0.597 0.333 367s 367s y: [1, 3, 1, 1] 367s 1.000 367s -0.315 367s 0.134 367s testing: mul_mat, [4, 1, 2, 1] = [4, 4, 2, 1] * [4, 1, 2, 1] 367s x0: [4, 4, 2, 1] 367s 0.440 -0.474 0.179 0.689 367s 0.790 -0.598 0.127 -0.881 367s -0.615 0.450 0.128 0.077 367s -0.547 0.538 0.925 -0.143 367s 367s x1: [4, 1, 2, 1] 367s -0.785 -0.378 0.239 -0.636 367s 367s y: [4, 1, 2, 1] 367s -0.562 0.197 0.295 0.538 367s testing: mul_mat, [1, 4, 2, 4] = [4, 1, 2, 4] * [4, 4, 2, 4] 367s x0: [4, 1, 2, 4] 367s 0.544 -0.303 -0.548 -0.806 367s 367s x1: [4, 4, 2, 4] 367s 0.750 -0.938 -0.798 0.540 367s -0.536 0.328 0.659 -0.151 367s -0.222 -0.213 0.925 0.231 367s -0.675 0.851 -0.912 0.195 367s 367s y: [1, 4, 2, 4] 367s 0.694 367s -0.630 367s -0.750 367s -0.283 367s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 367s x0: [4, 4, 1, 1] 367s -0.207 -0.375 -0.059 0.522 367s 0.119 0.093 0.805 0.266 367s -0.624 -0.871 0.628 0.950 367s -0.924 -0.806 0.216 0.602 367s 367s x1: [4, 1, 1, 1] 367s 0.734 0.695 -0.343 0.713 367s 367s y: [4, 1, 1, 1] 367s -0.020 0.065 -0.602 -0.884 367s testing: mul_mat, [2, 4, 2, 1] = [4, 2, 2, 1] * [4, 4, 2, 1] 367s x0: [4, 2, 2, 1] 367s 0.758 -0.065 -0.429 0.105 367s 0.353 -0.190 0.964 -0.748 367s 367s x1: [4, 4, 2, 1] 367s 0.747 0.186 0.493 -0.701 367s 0.992 0.052 0.218 0.029 367s -0.705 0.681 -0.237 0.990 367s -0.662 -0.524 -0.819 0.131 367s 367s y: [2, 4, 2, 1] 367s 0.269 1.228 367s 0.658 0.529 367s -0.373 -1.348 367s -0.102 -1.022 367s testing: mul_mat, [1, 4, 2, 4] = [2, 1, 2, 4] * [2, 4, 2, 4] 367s x0: [2, 1, 2, 4] 367s 0.349 -0.437 367s 367s x1: [2, 4, 2, 4] 367s -0.072 0.016 367s 0.482 -0.736 367s 0.429 -0.926 367s 0.404 0.470 367s 367s y: [1, 4, 2, 4] 367s -0.032 367s 0.490 367s 0.555 367s -0.065 367s test-mul-mat0: iter:325/500 367s testing: mul_mat, [3, 4, 1, 1] = [2, 3, 1, 1] * [2, 4, 1, 1] 367s x0: [2, 3, 1, 1] 367s -0.397 0.363 367s 0.637 0.166 367s -0.211 0.900 367s 367s x1: [2, 4, 1, 1] 367s 0.525 0.952 367s 0.856 -0.995 367s 0.724 -0.551 367s -0.307 0.700 367s 367s y: [3, 4, 1, 1] 367s 0.138 0.493 0.746 367s -0.701 0.380 -1.076 367s -0.487 0.369 -0.649 367s 0.376 -0.079 0.695 367s testing: mul_mat, [3, 3, 4, 1] = [2, 3, 4, 1] * [2, 3, 4, 1] 367s x0: [2, 3, 4, 1] 367s 0.291 0.199 367s 0.700 -0.543 367s 0.988 0.601 367s 367s x1: [2, 3, 4, 1] 367s 0.852 0.856 367s 0.144 0.438 367s -0.795 0.698 367s 367s y: [3, 3, 4, 1] 367s 0.418 0.132 1.356 367s 0.129 -0.137 0.405 367s -0.092 -0.936 -0.366 367s testing: mul_mat, [3, 3, 4, 1] = [2, 3, 4, 1] * [2, 3, 4, 1] 367s x0: [2, 3, 4, 1] 367s 0.785 0.806 367s 0.061 -0.365 367s -0.265 0.935 367s 367s x1: [2, 3, 4, 1] 367s -0.824 -0.076 367s 0.010 0.952 367s 0.611 -0.154 367s 367s y: [3, 3, 4, 1] 367s -0.708 -0.023 0.147 367s 0.775 -0.347 0.887 367s 0.356 0.094 -0.306 367s testing: mul_mat, [3, 3, 1, 1] = [2, 3, 1, 1] * [2, 3, 1, 1] 367s x0: [2, 3, 1, 1] 367s 0.725 -0.641 367s -0.265 0.469 367s -0.368 -0.433 367s 367s x1: [2, 3, 1, 1] 367s -0.627 0.534 367s -0.938 -0.029 367s 0.940 -0.071 367s 367s y: [3, 3, 1, 1] 367s -0.797 0.417 -0.000 367s -0.661 0.235 0.358 367s 0.727 -0.283 -0.315 367s testing: mul_mat, [3, 2, 4, 1] = [3, 3, 4, 1] * [3, 2, 4, 1] 367s x0: [3, 3, 4, 1] 367s 0.117 0.476 -0.353 367s -0.848 0.622 -0.250 367s -0.570 -0.003 -0.491 367s 367s x1: [3, 2, 4, 1] 367s 0.170 0.128 0.512 367s 0.933 -0.602 0.304 367s 367s y: [3, 2, 4, 1] 367s -0.100 -0.193 -0.349 367s -0.285 -1.242 -0.679 367s testing: mul_mat, [3, 3, 4, 1] = [3, 3, 4, 1] * [3, 3, 4, 1] 367s x0: [3, 3, 4, 1] 367s -0.172 0.204 0.542 367s -0.909 0.581 -0.420 367s 0.653 0.910 0.288 367s 367s x1: [3, 3, 4, 1] 367s 0.100 0.124 0.575 367s 0.609 0.704 -0.278 367s 0.422 -0.005 -0.561 367s 367s y: [3, 3, 4, 1] 367s 0.320 -0.261 0.344 367s -0.112 -0.028 0.958 367s -0.377 -0.152 0.109 367s test-mul-mat0: iter:326/500 367s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 367s x0: [4, 1, 1, 1] 367s 0.401 -0.973 0.131 -0.193 367s 367s x1: [4, 1, 1, 1] 367s -0.405 -0.815 -0.213 0.393 367s 367s y: [1, 1, 1, 1] 367s 0.528 367s testing: mul_mat, [1, 1, 2, 1] = [4, 1, 2, 1] * [4, 1, 2, 1] 367s x0: [4, 1, 2, 1] 367s -0.302 -0.684 -0.705 -0.072 367s 367s x1: [4, 1, 2, 1] 367s 0.546 0.655 -0.994 0.156 367s 367s y: [1, 1, 2, 1] 367s 0.077 367s testing: mul_mat, [4, 1, 2, 3] = [4, 4, 2, 3] * [4, 1, 2, 3] 367s x0: [4, 4, 2, 3] 367s -0.208 -0.956 0.010 0.137 367s -0.146 0.034 0.113 0.459 367s 0.054 0.301 0.145 0.670 367s 0.674 0.618 0.679 0.246 367s 367s x1: [4, 1, 2, 3] 367s 0.976 -0.396 -0.980 -0.812 367s 367s y: [4, 1, 2, 3] 367s 0.054 -0.639 -0.752 -0.451 367s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 367s x0: [4, 2, 1, 1] 367s -0.349 0.453 -0.904 -0.944 367s 0.488 0.330 -0.201 0.944 367s 367s x1: [4, 4, 1, 1] 367s -0.310 0.251 -0.722 -0.230 367s -0.138 -0.900 -0.286 0.540 367s -0.203 -0.805 -0.095 -0.951 367s -0.541 -0.552 0.568 0.455 367s 367s y: [2, 4, 1, 1] 367s 1.092 -0.140 367s -0.610 0.203 367s 0.690 -1.243 367s -1.004 -0.131 367s testing: mul_mat, [3, 4, 2, 1] = [2, 3, 2, 1] * [2, 4, 2, 1] 367s x0: [2, 3, 2, 1] 367s -0.975 0.593 367s 0.902 0.357 367s -0.036 0.307 367s 367s x1: [2, 4, 2, 1] 367s 0.902 -0.604 367s 0.140 0.074 367s 0.959 -0.199 367s 0.765 0.210 367s 367s y: [3, 4, 2, 1] 367s -1.238 0.598 -0.218 367s -0.092 0.153 0.018 367s -1.053 0.794 -0.096 367s -0.621 0.765 0.037 367s testing: mul_mat, [1, 2, 2, 3] = [3, 1, 2, 3] * [3, 2, 2, 3] 367s x0: [3, 1, 2, 3] 367s -0.860 -0.696 -0.721 367s 367s x1: [3, 2, 2, 3] 367s 0.631 -0.116 0.461 367s -0.973 -0.976 -0.465 367s 367s y: [1, 2, 2, 3] 367s -0.794 367s 1.851 367s test-mul-mat0: iter:327/500 367s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 367s x0: [1, 1, 1, 1] 367s -0.242 367s 367s x1: [1, 4, 1, 1] 367s 0.532 367s 0.135 367s 0.313 367s 0.660 367s 367s y: [1, 4, 1, 1] 367s -0.129 367s -0.033 367s -0.076 367s -0.160 367s testing: mul_mat, [2, 1, 4, 1] = [1, 2, 4, 1] * [1, 1, 4, 1] 367s x0: [1, 2, 4, 1] 367s 0.017 367s 0.291 367s 367s x1: [1, 1, 4, 1] 367s 0.814 367s 367s y: [2, 1, 4, 1] 367s 0.014 0.237 367s testing: mul_mat, [2, 2, 4, 3] = [1, 2, 4, 3] * [1, 2, 4, 3] 367s x0ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s : [1, 2, 4, 3] 367s -0.552 367s 0.681 367s 367s x1: [1, 2, 4, 3] 367s -0.777 367s -0.950 367s 367s y: [2, 2, 4, 3] 367s 0.429 -0.529 367s 0.524 -0.647 367s testing: mul_mat, [3, 2, 1, 1] = [1, 3, 1, 1] * [1, 2, 1, 1] 367s x0: [1, 3, 1, 1] 367s -0.293 367s 0.721 367s 0.113 367s 367s x1: [1, 2, 1, 1] 367s -0.062 367s 0.853 367s 367s y: [3, 2, 1, 1] 367s 0.018 -0.045 -0.007 367s -0.250 0.615 0.096 367s testing: mul_mat, [1, 1, 4, 1] = [3, 1, 4, 1] * [3, 1, 4, 1] 367s x0: [3, 1, 4, 1] 367s 0.208 -0.488 -0.812 367s 367s x1: [3, 1, 4, 1] 367s 0.767 -0.831 -0.206 367s 367s y: [1, 1, 4, 1] 367s 0.733 367s testing: mul_mat, [1, 3, 4, 3] = [1, 1, 4, 3] * [1, 3, 4, 3] 367s x0: [1, 1, 4, 3] 367s 0.642 367s 367s x1: [1, 3, 4, 3] 367s 0.173 367s 0.751 367s -0.791 367s 367s y: [1, 3, 4, 3] 367s 0.111 367s 0.482 367s -0.508 367s test-mul-mat0: iter:328/500 367s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 367s x0: [3, 2, 1, 1] 367s 0.276 -0.845 -0.035 367s -0.521 0.977 -0.040 367s 367s x1: [3, 2, 1, 1] 367s 0.202 -0.729 0.578 367s 0.909 0.825 -0.457 367s 367s y: [2, 2, 1, 1] 367s 0.652 -0.841 367s -0.429 0.351 367s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 367s x0: [3, 1, 1, 1] 367s 0.556 -0.953 0.523 367s 367s x1: [3, 2, 1, 1] 367s 0.154 0.121 -0.398 367s 0.469 -0.620 -0.387 367s 367s y: [1, 2, 1, 1] 367s -0.238 367s 0.650 367s testing: mul_mat, [2, 1, 1, 4] = [3, 2, 1, 4] * [3, 1, 1, 4] 367s x0: [3, 2, 1, 4] 367s -0.841 0.097 0.987 367s -0.685 -0.938 -0.534 367s 367s x1: [3, 1, 1, 4] 367s 0.510 -0.461 0.528 367s 367s y: [2, 1, 1, 4] 367s 0.047 -0.199 367s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 367s x0: [3, 4, 1, 1] 367s -0.556 -0.064 0.501 367s -0.126 0.881 -0.537 367s 0.415 0.050 0.582 367s -0.241 0.958 -0.411 367s 367s x1: [3, 2, 1, 1] 367s 0.832 -0.331 0.529 367s -0.282 -0.715 0.776 367s 367s y: [4, 2, 1, 1] 367s -0.177 -0.681 0.637 -0.735 367s 0.591 -1.011 0.299 -0.936 367s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 367s x0: [4, 1, 1, 1] 367s -0.774 -0.105 -0.771 -0.489 367s 367s x1: [4, 3, 1, 1] 367s -0.745 0.819 0.593 0.604 367s -0.897 -0.880 -0.538 -0.074 367s -0.825 0.679 0.557 0.366 367s 367s y: [1, 3, 1, 1] 367s -0.262 367s 1.238 367s -0.041 367s testing: mul_mat, [3, 4, 1, 4] = [1, 3, 1, 4] * [1, 4, 1, 4] 367s x0: [1, 3, 1, 4] 367s 0.085 367s 0.943 367s -0.243 367s 367s x1: [1, 4, 1, 4] 367s -0.329 367s 0.657 367s -0.045 367s 0.545 367s 367s y: [3, 4, 1, 4] 367s -0.028 -0.310 0.080 367s 0.056 0.620 -0.160 367s -0.004 -0.043 0.011 367s 0.046 0.513 -0.132 367s test-mul-mat0: iter:329/500 367s testing: mul_mat, [1, 1, 1, 1] = [2, 1, 1, 1] * [2, 1, 1, 1] 367s x0: [2, 1, 1, 1] 367s 0.831 0.889 367s 367s x1: [2, 1, 1, 1] 367s -0.883 0.408 367s 367s y: [1, 1, 1, 1] 367s -0.371 367s testing: mul_mat, [1, 1, 4, 1] = [2, 1, 4, 1] * [2, 1, 4, 1] 367s x0: [2, 1, 4, 1] 367s -0.216 -0.612 367s 367s x1: [2, 1, 4, 1] 367s -0.350 -0.405 367s 367s y: [1, 1, 4, 1] 367s 0.324 367s testing: mul_mat, [3, 1, 4, 1] = [2, 3, 4, 1] * [2, 1, 4, 1] 367s x0: [2, 3, 4, 1] 367s 0.357 0.235 367s 0.185 0.188 367s 0.124 0.835 367s 367s x1: [2, 1, 4, 1] 367s 0.425 -0.229 367s 367s y: [3, 1, 4, 1] 367s 0.098 0.036 -0.138 367s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 367s x0: [2, 1, 1, 1] 367s 0.214 0.684 367s 367s x1: [2, 3, 1, 1] 367s -0.712 -0.382 367s -0.378 0.152 367s 0.858 -0.551 367s 367s y: [1, 3, 1, 1] 367s -0.414 367s 0.023 367s -0.193 367s testing: mul_mat, [4, 2, 4, 1] = [1, 4, 4, 1] * [1, 2, 4, 1] 367s x0: [1, 4, 4, 1] 367s 0.688 367s -0.306 367s -0.952 367s 0.559 367s 367s x1: [1, 2, 4, 1] 367s 0.826 367s -0.598 367s 367s y: [4, 2, 4, 1] 367s 0.569 -0.253 -0.787 0.462 367s -0.411 0.183 0.569 -0.334 367s testing: mul_mat, [3, 1, 4, 1] = [4, 3, 4, 1] * [4, 1, 4, 1] 367s x0: [4, 3, 4, 1] 367s 0.449 0.614 -0.103 -0.280 367s 0.024 0.996 -0.146 -0.630 367s -0.429 0.788 0.395 0.289 367s 367s x1: [4, 1, 4, 1] 367s 0.047 -0.700 -0.414 0.688 367s 367s y: [3, 1, 4, 1] 367s -0.559 -1.069 -0.536 367s test-mul-mat0: iter:330/500 367s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 367s x0: [4, 2, 1, 1] 367s 0.461 0.522 0.557 -0.394 367s 0.352 -0.225 -0.529 0.552 367s 367s x1: [4, 3, 1, 1] 367s -0.640 0.610 0.120 0.353 367s 0.394 0.611 -0.766 -0.421 367s 0.566 0.698 0.213 -0.454 367s 367s y: [2, 3, 1, 1] 367s -0.049 -0.231 367s 0.240 0.174 367s 0.923 -0.321 367s testing: mul_mat, [3, 2, 3, 1] = [4, 3, 3, 1] * [4, 2, 3, 1] 367s x0: [4, 3, 3, 1] 367s -0.607 -0.910 0.117 0.168 367s -0.439 -0.330 -0.549 -0.939 367s -0.071 -0.423 -0.950 -0.009 367s 367s x1: [4, 2, 3, 1] 367s 0.283 0.499 -0.740 -0.874 367s 0.990 -0.939 -0.006 0.457 367s 367s y: [3, 2, 3, 1] 367s -0.859 0.938 0.480 367s 0.329 -0.550 0.328 367s testing: mul_mat, [1, 3, 3, 2] = [4, 1, 3, 2] * [4, 3, 3, 2] 367s x0: [4, 1, 3, 2] 367s -0.637 -0.510 0.719 0.200 367s 367s x1: [4, 3, 3, 2] 367s 0.114 0.847 -0.681 -0.957 367s -0.576 -0.631ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s 0.034 0.995 367s 0.876 0.252 -0.033 0.200 367s 367s y: [1, 3, 3, 2] 367s -1.186 367s 0.913 367s -0.671 367s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 367s x0: [4, 4, 1, 1] 367s -0.616 0.167 0.469 0.698 367s 0.447 0.839 0.775 -0.601 367s -0.361 0.439 0.465 -0.786 367s -0.126 -0.293 -0.882 -0.368 367s 367s x1: [4, 1, 1, 1] 367s -0.222 0.057 -0.626 0.470 367s 367s y: [4, 1, 1, 1] 367s 0.182 -0.819 -0.555 0.390 367s testing: mul_mat, [1, 4, 3, 1] = [4, 1, 3, 1] * [4, 4, 3, 1] 367s x0: [4, 1, 3, 1] 367s 0.675 -0.747 -0.887 0.281 367s 367s x1: [4, 4, 3, 1] 367s 0.794 -0.588 0.696 -0.620 367s -0.237 -0.729 0.998 0.166 367s -0.947 -0.465 0.944 0.110 367s -0.091 0.414 0.386 0.293 367s 367s y: [1, 4, 3, 1] 367s 0.183 367s -0.454 367s -1.099 367s -0.630 367s testing: mul_mat, [2, 4, 3, 2] = [1, 2, 3, 2] * [1, 4, 3, 2] 367s x0: [1, 2, 3, 2] 367s -0.628 367s 0.443 367s 367s x1: [1, 4, 3, 2] 367s 0.452 367s -0.819 367s -0.367 367s -0.141 367s 367s y: [2, 4, 3, 2] 367s -0.284 0.200 367s 0.515 -0.363 367s 0.231 -0.163 367s 0.089 -0.063 367s test-mul-mat0: iter:331/500 367s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 367s x0: [4, 1, 1, 1] 367s -0.085 -0.923 0.631 -0.642 367s 367s x1: [4, 3, 1, 1] 367s 0.121 0.594 -0.668 -0.633 367s -0.618 0.664 0.121 0.394 367s 0.245 0.549 -0.340 -0.061 367s 367s y: [1, 3, 1, 1] 367s -0.573 367s -0.737 367s -0.703 367s testing: mul_mat, [3, 1, 4, 1] = [4, 3, 4, 1] * [4, 1, 4, 1] 367s x0: [4, 3, 4, 1] 367s -0.055 -0.236 0.913 -0.934 367s -0.842 0.158 0.615 -0.182 367s -0.903 -0.125 0.733 -0.826 367s 367s x1: [4, 1, 4, 1] 367s -0.890 0.743 0.589 -0.300 367s 367s y: [3, 1, 4, 1] 367s 0.691 1.284 1.390 367s testing: mul_mat, [3, 3, 4, 2] = [4, 3, 4, 2] * [4, 3, 4, 2] 367s x0: [4, 3, 4, 2] 367s -0.980 0.137 0.466 0.634 367s 0.667 -0.192 0.639 0.976 367s -0.533 0.231 0.351 -0.909 367s 367s x1: [4, 3, 4, 2] 367s -0.247 0.894 -0.380 0.660 367s 0.181 -0.332 0.185 0.158 367s -0.827 0.497 0.052 -0.743 367s 367s y: [3, 3, 4, 2] 367s 0.606 0.065 -0.395 367s -0.037 0.457 -0.252 367s 0.432 -1.338 1.248 367s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 367s x0: [4, 2, 1, 1] 367s 0.403 0.930 0.165 -0.003 367s 0.207 0.185 0.108 0.487 367s 367s x1: [4, 3, 1, 1] 367s 0.412 -0.662 -0.945 0.049 367s -0.706 -0.925 0.287 0.782 367s -0.321 0.786 -0.027 -0.186 367s 367s y: [2, 3, 1, 1] 367s -0.606 -0.115 367s -1.100 0.094 367s 0.598 -0.015 367s testing: mul_mat, [1, 4, 4, 1] = [2, 1, 4, 1] * [2, 4, 4, 1] 367s x0: [2, 1, 4, 1] 367s -0.466 -0.197 367s 367s x1: [2, 4, 4, 1] 367s 0.315 0.169 367s 0.443 -0.944 367s 0.779 -0.566 367s 0.082 -0.076 367s 367s y: [1, 4, 4, 1] 367s -0.180 367s -0.021 367s -0.251 367s -0.023 367s testing: mul_mat, [4, 2, 4, 2] = [1, 4, 4, 2] * [1, 2, 4, 2] 367s x0: [1, 4, 4, 2] 367s -0.413 367s -0.160 367s 0.448 367s 0.678 367s 367s x1: [1, 2, 4, 2] 367s -0.824 367s 0.298 367s 367s y: [4, 2, 4, 2] 367s 0.340 0.132 -0.369 -0.559 367s -0.123 -0.048 0.134 0.202 367s test-mul-mat0: iter:332/500 367s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 367s x0: [2, 2, 1, 1] 367s 0.586 0.288 367s 0.471 0.357 367s 367s x1: [2, 2, 1, 1] 367s -0.552 0.146 367s 0.310 0.949 367s 367s y: [2, 2, 1, 1] 367s -0.281 -0.207 367s 0.455 0.484 367s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 367s x0: [2, 4, 1, 1] 367s -0.888 0.188 367s -0.010 0.252 367s 0.188 0.419 367s -0.063 -0.728 367s 367s x1: [2, 2, 1, 1] 367s -0.197 0.913 367s -0.714 0.879 367s 367s y: [4, 2, 1, 1] 367s 0.346 0.232 0.346 -0.652 367s 0.800 0.228 0.235 -0.595 367s testing: mul_mat, [1, 4, 1, 4] = [2, 1, 1, 4] * [2, 4, 1, 4] 367s x0: [2, 1, 1, 4] 367s -0.583 0.475 367s 367s x1: [2, 4, 1, 4] 367s -0.644 0.607 367s -0.235 -0.228 367s -0.137 -0.903 367s -0.364 0.798 367s 367s y: [1, 4, 1, 4] 367s 0.664 367s 0.029 367s -0.349 367s 0.591 367s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 367s x0: [2, 2, 1, 1] 367s -0.661 -0.450 367s 0.771 0.080 367s 367s x1: [2, 1, 1, 1] 367s 0.068 0.858 367s 367s y: [2, 1, 1, 1] 367s -0.432 0.121 367s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 367s x0: [2, 4, 1, 1] 367s 0.671 0.183 367s -0.736 -0.750 367s -0.130 -0.054 367s -0.401 0.040 367s 367s x1: [2, 2, 1, 1] 367s 0.041 0.674 367s 0.734 -0.403 367s 367s y: [4, 2, 1, 1] 367s 0.151 -0.536 -0.042 0.010 367s 0.419 -0.238 -0.073 -0.311 367s testing: mul_mat, [4, 2, 1, 4] = [4, 4, 1, 4] * [4, 2, 1, 4] 367s x0: [4, 4, 1, 4] 367s -0.380 0.732 -0.174 -0.214 367s -0.100 -0.604 -0.575 -0.691 367s 0.162 -0.597 -0.684 -0.307 367s 0.450 -0.905 0.485 -0.774 367s 367s x1: [4, 2, 1, 4] 367s 0.902 0.648 -0.497 -0.682 367s 0.123 -0.171 -0.168 0.234 367s 367s y: [4, 2, 1, 4] 367s 0.365 0.275 0.308 0.106 367s -0.193 0.025 0.164 -0.053 367s test-mul-mat0: iter:333/500 367s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 367s x0: [4, 2, 1, 1] 367s -0.763 0.717 0.522 0.439 367s 0.108 -0.105 0.470 -0.383 367s 367s x1: [4, 4, 1, 1] 367s -0.273 -0.224 -0.033 0.689 367s -0.352 -0.045 0.437 -0.171 367s 0.634 0.411 -0.647 0.890 367s -0.214 -0.151 0.280 -0.594 367s 367s y: [2, 4, 1, 1] 367s 0.333 -0.285 367s 0.389 0.238 367s -0.137 -0.620 367s -0.059 0.352 367s testing: mul_mat, [2, 2, 3, 1] = [4, 2, 3, 1] * [4, 2, 3, 1] 367s x0: [4, 2, 3, 1] 367s 0.638 -0.728 -0.579 -0.467 367s 0.742 0.037 0.484 -0.029 367s 367s x1: [4, 2, 3, 1] 367s -0.049 0.229 0.406 0.141 367s -0.552 0.871 0.868 0.224 367s 367s y: [2, 2, 3, 1] 367s -0.498 0.164 367s -1.593 0.036 367s testing: mul_mat, [4, 2, 3, 4] = [4, 4, 3, 4] * [4, 2, 3, 4] 367s x0: [4, 4, 3, 4] 367s 0.221 -0.913 -0.155 0.117 367s 0.433 -0.022 -0.978 -0.378 367s -0.901 0.878 0.758 0.655 367s 0.249 -0.635 0.408 0.196 367s 367s x1: [4, 2, 3, 4] 367s -0.363 -0.828 -0.715 -0.766 367s -0.713 0.118 0.547 0.925 367s 367s y: [4, 2, 3, 4] 367s 0.697 0.851 -1.444 -0.007 367s -0.242 -1.195 1.767 0.152 367s testing: mul_mat, [1, 4, 1, 1] = [4, 1, 1, 1] * [4, 4, 1, 1] 367s x0: [4, 1, 1, 1] 367s 0.319 0.670 0.175 -0.657 367s 367s x1: [4, 4, 1, 1] 367s 0.213 0.933 -0.194 0.976 367s -0.756 0.537 -0.053 -0.907 367s 0.093 -0.906 -0.507 0.178 367s -0.175 0.172 -0.572 -0.243 367s 367s y: [1, 4, 1, 1] 367s 0.018 367s 0.706 367s -0.784 367s 0.119 367s testing: mul_mat, [4, 4, 3, 1] = [1, 4, 3, 1] * [1, 4, 3, 1] 367s x0: [1, 4, 3, 1] 367s 0.614 367s 0.257 367s -0.339 367s -0.439 367s 367s x1: [1, 4, 3, 1] 367s 0.200 367s 0.231 367s -0.102 367s -0.033 367s 367s y: [4, 4, 3, 1] 367s 0.123 0.051 -0.068 -0.088 367s 0.142 0.059 -0.078 -0.101 367s -0.063 -0.026 0.035 0.045 367s -0.020 -0.009 0.011 0.015 367s testing: mul_mat, [4, 1, 3, 4] = [4, 4, 3, 4] * [4, 1, 3, 4] 367s x0: [4, 4, 3, 4] 367s 0.124 -0.263 -0.032 -0.735 367s 0.015 0.529 -0.945 0.068 367s 0.933 0.205 -0.027 -0.828 367s 0.287 0.115 0.423 0.155 367s 367s x1: [4, 1, 3, 4] 367s -0.762 -0.707 -0.321 0.908 367s 367s y: [4, 1, 3, 4] 367s -0.566 -0.020 -1.599 -0.295 367s test-mul-mat0: iter:334/500 367s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 367s x0: [3, 2, 1, 1] 367s 0.193 0.395 0.690 367s 0.506 0.923 -0.784 367s 367s x1: [3, 4, 1, 1] 367s -0.353 -0.821 0.623 367s 0.670 0.575 0.283 367s 0.541 -0.781 -0.569 367s 0.252 -0.560 -0.418 367s 367s y: [2, 4, 1, 1] 367s 0.038 -1.425 367s 0.551 0.648 367s -0.596 -0.001 367s -0.461 -0.061 367s testing: mul_mat, [3, 2, 3, 1] = [3, 3, 3, 1] * [3, 2, 3, 1] 367s x0: [3, 3, 3, 1] 367s 0.709 -0.885 0.220 367s -0.851 -0.303 0.318 367s 0.342 -0.909 0.008 367s 367s x1: [3, 2, 3, 1] 367s 0.798 0.874 -0.944 367s -0.928 -0.188 -0.953 367s 367s y: [3, 2, 3, 1] 367s -0.416 -1.244 -0.528 367s -0.702 0.543 -0.155 367s testing: mul_mat, [4, 3, 3, 2] = [3, 4, 3, 2] * [3, 3, 3, 2] 367s x0: [3, 4, 3, 2] 367s 0.653 -0.304 -0.488 367s 0.392 0.114 0.839 367s 0.248 0.273 0.851 367s -0.095 0.143 -0.743 367s 367s x1: [3, 3, 3, 2] 367s 0.742 -0.316 0.801 367s 0.710 -0.607 0.916 367s -0.070 -0.457 -0.387 367s 367s y: [4, 3, 3, 2] 367s 0.189 0.927 0.779 -0.711 367s 0.201 0.978 0.790 -0.835 367s 0.282 -0.404 -0.472 0.229 367s testing: mul_mat, [1, 4, 1, 1] = [3, 1, 1, 1] * [3, 4, 1, 1] 367s x0: [3, 1, 1, 1] 367s 0.194 -0.621 0.925 367s 367s x1: [3, 4, 1, 1] 367s -0.483 0.529 0.680 367s 0.877 -0.524 -0.736 367s -0.773 0.462 -0.599 367s 0.870 -0.948 0.701 367s 367s y: [1, 4, 1, 1] 367s 0.207 367s -0.185 367s -0.991 367s 1.406 367s testing: mul_mat, [1, 3, 3, 1] = [1, 1, 3, 1] * [1, 3, 3, 1] 367s x0: [1, 1, 3, 1] 367s 0.792 367s 367s x1: [1, 3, 3, 1] 367s 0.936 367s 0.962 367s -0.490 367s 367s y: [1, 3, 3, 1] 367s 0.742 367s 0.762 367s -0.388 367s testing: mul_mat, [2, 1, 3, 2] = [1, 2, 3, 2] * [1, 1, 3, 2] 367s x0: [1, 2, 3, 2] 367s -0.212 367s -0.764 367s 367s x1: [1, 1, 3, 2] 367s 0.578 367s 367s y: [2, 1, 3, 2] 367s -0.123 -0.441 367s test-mul-mat0: iter:335/500 367s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 367s x0: [3, 3, 1, 1] 367s -0.948 -0.970 -0.654 367s -0.939 0.613 0.134 367s -0.703 -0.343 -0.677 367s 367s x1: [3, 3, 1, 1] 367s 0.124 0.789 0.554 367s 0.894 -0.695 0.346 367s -0.021 0.920 -0.076 367s 367s y: [3, 3, 1, 1] 367s -1.245 0.442 -0.733 367s -0.400 -1.218 -0.624 367s -0.822 0.574 -0.249 367s testing: mul_mat, [3, 3, 4, 1] = [3, 3, 4, 1] * [3, 3, 4, 1] 367s x0: [3, 3, 4, 1] 367s 0.010 0.967 -0.126 367s -0.143 -0.666 -0.352 367s 0.828 -0.509 0.976 367s 367s x1: [3, 3, 4, 1] 367s -0.597 0.753 0.213 367s -0.903 -0.956 -0.519 367s 0.717 -0.729 0.713 367s 367s y: [3, 3, 4, 1] 367s 0.695 -0.491 -0.670 367s -0.868 0.948 -0.767 367s -0.787 0.132 1.660 367s testing: mul_mat, [3, 3, 4, 4] = [3, 3, 4, 4] * [3, 3, 4, 4] 367s x0: [3, 3, 4, 4] 367s 0.612 0.766 -0.100 367s 0.090 -0.241 -0.111 367s 0.124 -0.757 0.102 367s 367s x1: [3, 3, 4, 4]ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s 367s -0.812 0.243 0.157 367s -0.836 0.935 -0.889 367s -0.185 -0.531 0.447 367s 367s y: [3, 3, 4, 4] 367s -0.326 -0.149 -0.268 367s 0.294 -0.202 -0.902 367s -0.565 0.062 0.425 367s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 367s x0: [3, 3, 1, 1] 367s 0.043 0.140 0.149 367s 0.341 -0.312 -0.313 367s -0.095 -0.232 -0.285 367s 367s x1: [3, 3, 1, 1] 367s 0.133 1.000 -0.130 367s 0.332 0.474 -0.132 367s 0.072 -0.964 0.101 367s 367s y: [3, 3, 1, 1] 367s 0.126 -0.226 -0.207 367s 0.061 0.007 -0.104 367s -0.117 0.294 0.188 367s testing: mul_mat, [2, 3, 4, 1] = [3, 2, 4, 1] * [3, 3, 4, 1] 367s x0: [3, 2, 4, 1] 367s -0.538 -0.076 -0.442 367s 0.596 0.179 0.912 367s 367s x1: [3, 3, 4, 1] 367s 0.172 0.538 -0.298 367s 0.118 0.196 0.386 367s -0.632 -0.681 0.423 367s 367s y: [2, 3, 4, 1] 367s -0.002 -0.073 367s -0.249 0.458 367s 0.204 -0.112 367s testing: mul_mat, [1, 3, 4, 4] = [2, 1, 4, 4] * [2, 3, 4, 4] 367s x0: [2, 1, 4, 4] 367s 0.279 0.853 367s 367s x1: [2, 3, 4, 4] 367s 0.528 0.377 367s 0.969 0.643 367s 0.472 0.246 367s 367s y: [1, 3, 4, 4] 367s 0.469 367s 0.818 367s 0.341 367s test-mul-mat0: iter:336/500 367s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 367s x0: [4, 2, 1, 1] 367s -0.858 -0.644 0.706 -0.305 367s 0.529 0.602 -0.308 0.889 367s 367s x1: [4, 1, 1, 1] 367s 0.076 -0.327 0.189 0.605 367s 367s y: [2, 1, 1, 1] 367s 0.095 0.323 367s testing: mul_mat, [1, 2, 3, 1] = [4, 1, 3, 1] * [4, 2, 3, 1] 367s x0: [4, 1, 3, 1] 367s -0.907 -0.659 -0.687 0.695 367s 367s x1: [4, 2, 3, 1] 367s -0.342 0.215 -0.736 -0.648 367s 0.189 -0.117 -0.997 0.781 367s 367s y: [1, 2, 3, 1] 367s 0.224 367s 1.134 367s testing: mul_mat, [1, 1, 3, 4] = [4, 1, 3, 4] * [4, 1, 3, 4] 367s x0: [4, 1, 3, 4] 367s 0.416 0.204 -0.808 0.441 367s 367s x1: [4, 1, 3, 4] 367s -0.292 0.218 -0.379 0.004 367s 367s y: [1, 1, 3, 4] 367s 0.231 367s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 367s x0: [4, 4, 1, 1] 367s 0.368 -0.134 -0.845 0.846 367s 0.758 -0.040 0.011 -0.289 367s -0.706 0.618 -0.705 0.816 367s 0.641 -0.022 0.542 -0.650 367s 367s x1: [4, 1, 1, 1] 367s -0.482 -0.094 0.871 0.549 367s 367s y: [4, 1, 1, 1] 367s -0.436 -0.511 0.116 -0.192 367s testing: mul_mat, [1, 4, 3, 1] = [4, 1, 3, 1] * [4, 4, 3, 1] 367s x0: [4, 1, 3, 1] 367s 0.413 -0.649 -0.592 0.118 367s 367s x1: [4, 4, 3, 1] 367s -0.527 0.920 0.216 -0.464 367s -0.766 -0.514 -0.441 -0.646 367s -0.177 0.207 -0.129 0.730 367s 0.078 -0.580 -0.436 -0.554 367s 367s y: [1, 4, 3, 1] 367s -0.998 367s 0.202 367s -0.046 367s 0.602 367s testing: mul_mat, [2, 4, 3, 4] = [1, 2, 3, 4] * [1, 4, 3, 4] 367s x0: [1, 2, 3, 4] 367s -0.080 367s 0.697 367s 367s x1: [1, 4, 3, 4] 367s 0.308 367s 0.217 367s -0.525 367s 0.436 367s 367s y: [2, 4, 3, 4] 367s -0.025 0.214 367s -0.017 0.151 367s 0.042 -0.366 367s -0.035 0.304 367s test-mul-mat0: iter:337/500 367s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 367s x0: [3, 2, 1, 1] 367s -0.398 -0.872 0.316 367s -0.090 0.635 0.046 367s 367s x1: [3, 2, 1, 1] 367s -0.981 -0.181 0.485 367s -0.060 -0.483 -0.025 367s 367s y: [2, 2, 1, 1] 367s 0.701 -0.004 367s 0.437 -0.303 367s testing: mul_mat, [3, 2, 3, 1] = [3, 3, 3, 1] * [3, 2, 3, 1] 367s x0: [3, 3, 3, 1] 367s -0.700 -0.853 -0.926 367s -0.183 0.121 0.319 367s 0.419 0.250 -0.365 367s 367s x1: [3, 2, 3, 1] 367s 0.835 0.212 0.662 367s -0.316 -0.926 0.638 367s 367s y: [3, 2, 3, 1] 367s -1.378 0.084 0.161 367s 0.420 0.149 -0.597 367s testing: mul_mat, [1, 3, 3, 4] = [3, 1, 3, 4] * [3, 3, 3, 4] 367s x0: [3, 1, 3, 4] 367s 0.407 0.164 -0.446 367s 367s x1: [3, 3, 3, 4] 367s -0.586 0.477 0.292 367s 0.548 0.777 0.438 367s 0.622 -0.406 -0.440 367s 367s y: [1, 3, 3, 4] 367s -0.291 367s 0.155 367s 0.383 367s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 367s x0: [3, 4, 1, 1] 367s -0.199 0.963 -0.961 367s -0.769 -0.359 0.232 367s 0.001 0.199 0.397 367s 0.172 -0.759 0.789 367s 367s x1: [3, 1, 1, 1] 367s 0.724 -0.651 0.484 367s 367s y: [4, 1, 1, 1] 367s -1.236 -0.210 0.063 1.000 367s testing: mul_mat, [1, 3, 3, 1] = [4, 1, 3, 1] * [4, 3, 3, 1] 367s x0: [4, 1, 3, 1] 367s -0.267 0.360 0.595 -0.269 367s 367s x1: [4, 3, 3, 1] 367s 0.860 0.120 0.157 -0.151 367s -0.526 0.885 -0.282 -0.762 367s -0.107 0.998 0.409 0.063 367s 367s y: [1, 3, 3, 1] 367s -0.053 367s 0.496 367s 0.614 367s testing: mul_mat, [1, 4, 3, 4] = [1, 1, 3, 4] * [1, 4, 3, 4] 367s x0: [1, 1, 3, 4] 367s -0.209 367s 367s x1: [1, 4, 3, 4] 367s -0.691 367s 0.929 367s -0.505 367s 0.501 367s 367s y: [1, 4, 3, 4] 367s 0.145 367s -0.195 367s 0.106 367s -0.105 367s test-mul-mat0: iter:338/500 367s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 367s x0: [3, 4, 1, 1] 367s -0.417 0.694 -0.344 367s -0.408 0.715 0.446 367s -0.297 -0.090 -0.786 367s 0.645 -0.950 0.618 367s 367s x1: [3, 3, 1, 1] 367s 0.001 0.547 -0.494 367s 0.659 -0.349 0.975 367s 0.522 0.941 0.488 367s 367s y: [4, 3, 1, 1ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ] 367s 0.549 0.170 0.339 -0.824 367s -0.853 -0.083 -0.931 1.359 367s 0.268 0.678 -0.623 -0.256 367s testing: mul_mat, [3, 4, 3, 1] = [3, 3, 3, 1] * [3, 4, 3, 1] 367s x0: [3, 3, 3, 1] 367s -0.334 0.891 -0.020 367s 0.179 0.114 -0.450 367s 0.351 -0.011 -0.906 367s 367s x1: [3, 4, 3, 1] 367s 0.982 -0.544 -0.472 367s 0.835 -0.644 0.512 367s 0.511 0.677 0.064 367s -0.488 0.223 0.570 367s 367s y: [3, 4, 3, 1] 367s -0.803 0.326 0.778 367s -0.863 -0.155 -0.163 367s 0.431 0.140 0.114 367s 0.350 -0.318 -0.689 367s testing: mul_mat, [2, 3, 3, 1] = [3, 2, 3, 1] * [3, 3, 3, 1] 367s x0: [3, 2, 3, 1] 367s 0.785 -0.032 -0.400 367s 0.952 -0.333 0.725 367s 367s x1: [3, 3, 3, 1] 367s -0.833 0.698 0.125 367s -0.917 -0.635 0.016 367s 0.063 0.543 -0.870 367s 367s y: [2, 3, 3, 1] 367s -0.726 -0.935 367s -0.706 -0.650 367s 0.380 -0.752 367s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 367s x0: [3, 2, 1, 1] 367s -0.024 0.562 0.982 367s 0.557 -0.001 0.251 367s 367s x1: [3, 2, 1, 1] 367s 0.251 -0.668 0.038 367s 0.940 -0.160 -0.645 367s 367s y: [2, 2, 1, 1] 367s -0.344 0.150 367s -0.746 0.362 367s testing: mul_mat, [1, 3, 3, 1] = [2, 1, 3, 1] * [2, 3, 3, 1] 367s x0: [2, 1, 3, 1] 367s -0.570 -0.869 367s 367s x1: [2, 3, 3, 1] 367s -0.216 -0.049 367s 0.851 -0.264 367s 0.617 0.577 367s 367s y: [1, 3, 3, 1] 367s 0.166 367s -0.256 367s -0.852 367s testing: mul_mat, [2, 2, 3, 1] = [1, 2, 3, 1] * [1, 2, 3, 1] 367s x0: [1, 2, 3, 1] 367s -0.024 367s 0.778 367s 367s x1: [1, 2, 3, 1] 367s -0.753 367s -0.956 367s 367s y: [2, 2, 3, 1] 367s 0.018 -0.586 367s 0.022 -0.744 367s test-mul-mat0: iter:339/500 367s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 367s x0: [3, 4, 1, 1] 367s -0.018 0.255 0.135 367s 0.229 0.685 0.267 367s 0.415 -0.045 0.753 367s -0.472 0.202 0.797 367s 367s x1: [3, 1, 1, 1] 367s -0.138 0.985 0.156 367s 367s y: [4, 1, 1, 1] 367s 0.275 0.685 0.016 0.389 367s testing: mul_mat, [2, 4, 2, 1] = [3, 2, 2, 1] * [3, 4, 2, 1] 367s x0: [3, 2, 2, 1] 367s 0.798 -0.849 -0.007 367s -0.674 0.353 -0.210 367s 367s x1: [3, 4, 2, 1] 367s 0.203 -0.553 0.823 367s 0.129 -0.523 -0.201 367s -0.093 0.243 0.393 367s -0.739 -0.220 -0.365 367s 367s y: [2, 4, 2, 1] 367s 0.626 -0.504 367s 0.549 -0.229 367s -0.284 0.066 367s -0.400 0.497 367s testing: mul_mat, [3, 2, 2, 4] = [3, 3, 2, 4] * [3, 2, 2, 4] 367s x0: [3, 3, 2, 4] 367s 0.577 -0.945 0.458 367s 0.176 -0.626 -0.537 367s 0.565 -0.777 0.268 367s 367s x1: [3, 2, 2, 4] 367s 0.565 0.521 -0.194 367s 0.826 -0.699 0.441 367s 367s y: [3, 2, 2, 4] 367s -0.255 -0.122 -0.138 367s 1.339 0.346 1.128 367s testing: mul_mat, [1, 3, 1, 1] = [3, 1, 1, 1] * [3, 3, 1, 1] 367s x0: [3, 1, 1, 1] 367s -0.546 -0.085 0.875 367s 367s x1: [3, 3, 1, 1] 367s -0.130 0.513 0.485 367s -0.723 -0.827 -0.301 367s -0.618 -0.311 -0.225 367s 367s y: [1, 3, 1, 1] 367s 0.452 367s 0.202 367s 0.168 367s testing: mul_mat, [1, 3, 2, 1] = [1, 1, 2, 1] * [1, 3, 2, 1] 367s x0: [1, 1, 2, 1] 367s -0.586 367s 367s x1: [1, 3, 2, 1] 367s 0.518 367s 0.116 367s 0.081 367s 367s y: [1, 3, 2, 1] 367s -0.303 367s -0.068 367s -0.047 367s testing: mul_mat, [4, 1, 2, 4] = [1, 4, 2, 4] * [1, 1, 2, 4] 367s x0: [1, 4, 2, 4] 367s 0.454 367s -0.457 367s 0.889 367s 0.730 367s 367s x1: [1, 1, 2, 4] 367s 0.932 367s 367s y: [4, 1, 2, 4] 367s 0.423 -0.426 0.829 0.681 367s test-mul-mat0: iter:340/500 367s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 367s x0: [4, 1, 1, 1] 367s 0.495 0.857 0.759 0.352 367s 367s x1: [4, 3, 1, 1] 367s -0.842 -0.547 0.181 0.521 367s 0.408 0.039 0.799 -0.762 367s -0.585 0.194 -0.851 0.536 367s 367s y: [1, 3, 1, 1] 367s -0.565 367s 0.574 367s -0.580 367s testing: mul_mat, [2, 1, 3, 1] = [4, 2, 3, 1] * [4, 1, 3, 1] 367s x0: [4, 2, 3, 1] 367s -0.133 0.383 0.243 -0.951 367s -0.096 -0.349 0.088 -0.297 367s 367s x1: [4, 1, 3, 1] 367s -0.890 -0.977 -0.302 -0.317 367s 367s y: [2, 1, 3, 1] 367s -0.028 0.494 367s testing: mul_mat, [4, 2, 3, 3] = [4, 4, 3, 3] * [4, 2, 3, 3] 367s x0: [4, 4, 3, 3] 367s -0.184 -0.515 0.021 0.601 367s 0.693 -0.475 -0.287 0.492 367s 0.154 -0.680 -0.587 0.802 367s -0.654 -0.657 0.476 -0.261 367s 367s x1: [4, 2, 3, 3] 367s 0.528 0.488 -0.786 0.977 367s -0.421 0.440 -0.232 0.447 367s 367s y: [4, 2, 3, 3] 367s 0.222 0.839 0.994 -1.294 367s 0.114 -0.214 0.131 -0.241 367s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 367s x0: [4, 2, 1, 1] 367s -0.877 -0.891 -0.310 0.298 367s 0.099 -0.291 0.251 0.297 367s 367s x1: [4, 4, 1, 1] 367s 0.382 -0.394 0.721 0.165 367s -0.783 -0.428 -0.341 0.276 367s -0.718 -0.024 -0.479 -0.806 367s 0.704 0.995 0.932 -0.397 367s 367s y: [2, 4, 1, 1] 367s -0.159 0.382 367s 1.256 0.043 367s 0.559 -0.424 367s -1.911 -0.103 367s testing: mul_mat, [3, 4, 3, 1] = [2, 3, 3, 1] * [2, 4, 3, 1] 367s x0: [2, 3, 3, 1] 367s -0.813 0.438 367s 0.729 0.027 367s 0.377 -0.326 367s 367s x1: [2, 4, 3, 1] 367s -0.096 0.901 367s 0.124 0ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s .099 367s 0.893 -0.806 367s -0.519 -0.501 367s 367s y: [3, 4, 3, 1] 367s 0.473 -0.045 -0.330 367s -0.058 0.093 0.014 367s -1.079 0.629 0.599 367s 0.202 -0.392 -0.032 367s testing: mul_mat, [3, 2, 3, 3] = [3, 3, 3, 3] * [3, 2, 3, 3] 367s x0: [3, 3, 3, 3] 367s -0.874 -0.709 -0.148 367s -0.219 -0.193 -0.292 367s 0.656 0.565 0.500 367s 367s x1: [3, 2, 3, 3] 367s 0.875 -0.973 0.166 367s 0.156 -0.461 -0.870 367s 367s y: [3, 2, 3, 3] 367s -0.100 -0.052 0.108 367s 0.319 0.309 -0.593 367s test-mul-mat0: iter:341/500 367s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 367s x0: [3, 2, 1, 1] 367s -0.744 -0.423 0.667 367s 0.619 -0.958 -0.992 367s 367s x1: [3, 2, 1, 1] 367s -0.590 0.247 -0.868 367s 0.663 -0.983 0.371 367s 367s y: [2, 2, 1, 1] 367s -0.244 0.259 367s 0.170 0.983 367s testing: mul_mat, [1, 2, 3, 1] = [3, 1, 3, 1] * [3, 2, 3, 1] 367s x0: [3, 1, 3, 1] 367s -0.953 -0.053 -0.953 367s 367s x1: [3, 2, 3, 1] 367s 0.671 -0.381 0.551 367s 0.913 -0.114 -0.938 367s 367s y: [1, 2, 3, 1] 367s -1.145 367s 0.030 367s testing: mul_mat, [4, 1, 3, 3] = [3, 4, 3, 3] * [3, 1, 3, 3] 367s x0: [3, 4, 3, 3] 367s -0.295 0.475 -0.733 367s -0.356 -0.589 0.810 367s -0.745 0.967 -0.095 367s 0.778 -0.590 -0.938 367s 367s x1: [3, 1, 3, 3] 367s 0.939 -0.063 0.543 367s 367s y: [4, 1, 3, 3] 367s -0.705 0.144 -0.813 0.258 367s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 367s x0: [3, 3, 1, 1] 367s 0.366 0.186 0.697 367s 0.252 -0.443 -0.450 367s -0.438 0.999 -0.520 367s 367s x1: [3, 4, 1, 1] 367s -0.077 0.199 0.604 367s -0.290 0.612 0.636 367s 0.799 0.355 -0.661 367s 0.740 0.465 0.941 367s 367s y: [3, 4, 1, 1] 367s 0.430 -0.380 -0.082 367s 0.451 -0.631 0.407 367s -0.103 0.342 0.348 367s 1.013 -0.443 -0.349 367s testing: mul_mat, [2, 3, 3, 1] = [3, 2, 3, 1] * [3, 3, 3, 1] 367s x0: [3, 2, 3, 1] 367s -0.058 0.545 0.191 367s 0.296 -0.130 0.676 367s 367s x1: [3, 3, 3, 1] 367s 0.927 0.895 -0.870 367s -0.318 0.665 0.140 367s -0.613 0.063 0.418 367s 367s y: [2, 3, 3, 1] 367s 0.268 -0.430 367s 0.407 -0.086 367s 0.150 0.093 367s testing: mul_mat, [3, 3, 3, 3] = [2, 3, 3, 3] * [2, 3, 3, 3] 367s x0: [2, 3, 3, 3] 367s 0.699 -0.491 367s 0.224 -0.675 367s -0.938 -0.657 367s 367s x1: [2, 3, 3, 3] 367s 0.088 0.282 367s -0.089 -0.353 367s 0.976 0.849 367s 367s y: [3, 3, 3, 3] 367s -0.077 -0.171 -0.268 367s 0.111 0.218 0.315 367s 0.266 -0.354 -1.473 367s test-mul-mat0: iter:342/500 367s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 367s x0: [3, 1, 1, 1] 367s -0.464 -0.656 0.632 367s 367s x1: [3, 1, 1, 1] 367s 0.860 -0.489 0.803 367s 367s y: [1, 1, 1, 1] 367s 0.429 367s testing: mul_mat, [2, 1, 4, 1] = [3, 2, 4, 1] * [3, 1, 4, 1] 367s x0: [3, 2, 4, 1] 367s 0.009 -0.201 -0.644 367s 0.660 -0.753 0.965 367s 367s x1: [3, 1, 4, 1] 367s 0.070 0.210 0.891 367s 367s y: [2, 1, 4, 1] 367s -0.616 0.748 367s testing: mul_mat, [2, 2, 4, 2] = [3, 2, 4, 2] * [3, 2, 4, 2] 367s x0: [3, 2, 4, 2] 367s -0.954 0.576 0.377 367s 0.269 -0.013 0.993 367s 367s x1: [3, 2, 4, 2] 367s -0.722 -0.435 -0.372 367s -0.265 0.958 -0.075 367s 367s y: [2, 2, 4, 2] 367s 0.298 -0.557 367s 0.777 -0.158 367s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 367s x0: [3, 4, 1, 1] 367s -0.503 -0.668 0.644 367s -0.028 0.581 -0.798 367s 0.095 -0.352 -0.586 367s -0.529 0.739 -0.306 367s 367s x1: [3, 2, 1, 1] 367s 0.296 -0.846 -0.313 367s 0.172 -0.348 0.816 367s 367s y: [4, 2, 1, 1] 367s 0.215 -0.251 0.510 -0.686 367s 0.672 -0.859 -0.339 -0.598 367s testing: mul_mat, [4, 3, 4, 1] = [4, 4, 4, 1] * [4, 3, 4, 1] 367s x0: [4, 4, 4, 1] 367s -0.823 0.331 -0.258 -0.950 367s 0.547 -0.096 -0.656 -0.220 367s -0.194 0.331 -0.602 0.838 367s 0.963 0.764 -0.007 0.867 367s 367s x1: [4, 3, 4, 1] 367s 0.748 -0.406 -0.889 -0.445 367s 0.416 -0.917 0.131 -0.021 367s 0.076 -0.449 -0.989 -0.709 367s 367s y: [4, 3, 4, 1] 367s -0.098 1.130 -0.118 0.031 367s -0.660 0.234 -0.481 -0.319 367s 0.717 0.889 -0.162 -0.878 367s testing: mul_mat, [4, 4, 4, 2] = [4, 4, 4, 2] * [4, 4, 4, 2] 367s x0: [4, 4, 4, 2] 367s -0.693 0.517 -0.325 -0.188 367s 0.340 -0.410 -0.331 0.586 367s 0.327 -0.326 -0.032 0.205 367s 0.796 -0.996 -0.684 0.402 367s 367s x1: [4, 4, 4, 2] 367s -0.744 -0.791 0.753 0.260 367s -0.966 -0.366 0.006 -0.798 367s 0.912 0.744 -0.579 0.970 367s -0.678 -0.492 0.744 -0.002 367s 367s y: [4, 4, 4, 2] 367s -0.188 -0.026 0.043 -0.216 367s 0.628 -0.648 -0.360 -0.729 367s -0.241 0.765 0.273 0.770 367s -0.026 -0.277 -0.086 -0.560 367s test-mul-mat0: iter:343/500 367s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 367s x0: [2, 1, 1, 1] 367s -0.543 -0.600 367s 367s x1: [2, 3, 1, 1] 367s -0.761 0.729 367s -0.473 -0.786 367s 0.254 -0.805 367s 367s y: [1, 3, 1, 1] 367s -0.025 367s 0.728 367s 0.345 367s testing: mul_mat, [4, 1, 2, 1] = [2, 4, 2, 1] * [2, 1, 2, 1] 367s x0: [2, 4, 2, 1] 367s 0.142 0.324 367s 0.104 0.895 367s 0.220 0.765 367s 0.369 0.346 367s 367s x1: [2, 1, 2, 1] 367s 0.523 -0.080 367s 367s y: [4, 1, 2, 1] 367s 0.048 -0.017 0.054 0.165 367s testing: mul_mat, [3, 4, 2, 4] = [2, 3, 2, 4] * [2, 4, 2, 4] 367s x0: [2, 3, 2, 4] 367s -0.904 -0.144 367s -0.876 -0.741 367s -0.589 0.976 367s 367s x1: [2, 4, 2, 4] 367s -0.069 0.872 367s -0.409 0.459 367s -0.914 0.845 367s 0.654 -0.760 367s 367s y: [3, 4, 2, 4] 367s -0.064 -0.586 0.891 367s 0.303 0.018 0.688 367s 0.704 0.174 1.363 367s -0.482 -0.010 -1.127 367s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 367s x0: [2, 1, 1, 1] 367s -0.913 0.780 367s 367s x1: [2, 3, 1, 1] 367s -0.251 0.929 367s 0.947 -0.222 367s -0.871 0.594 367s 367s y: [1, 3, 1, 1] 367s 0.954 367s -1.038 367s 1.260 367s testing: mul_mat, [2, 2, 2, 1] = [1, 2, 2, 1] * [1, 2, 2, 1] 367s x0: [1, 2, 2, 1] 367s 0.424 367s 0.578 367s 367s x1: [1, 2, 2, 1] 367s 0.054 367s -0.084 367s 367s y: [2, 2, 2, 1] 367s 0.023 0.031 367s -0.036 -0.049 367s testing: mul_mat, [3, 1, 2, 4] = [2, 3, 2, 4] * [2, 1, 2, 4] 367s x0: [2, 3, 2, 4] 367s -0.181 -0.095 367s 0.858 0.638 367s -0.937 0.117 367s 367s x1: [2, 1, 2, 4] 367s 0.679 0.352 367s 367s y: [3, 1, 2, 4] 367s -0.156 0.807 -0.595 367s test-mul-mat0: iter:344/500 367s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 367s x0: [3, 2, 1, 1] 367s 0.947 -0.291 0.521 367s -0.935 -0.369 -0.742 367s 367s x1: [3, 2, 1, 1] 367s 0.629 -0.962 -0.156 367s -0.524 -0.353 -0.355 367s 367s y: [2, 2, 1, 1] 367s 0.794 -0.117 367s -0.579 0.884 367s testing: mul_mat, [4, 2, 4, 1] = [3, 4, 4, 1] * [3, 2, 4, 1] 367s x0: [3, 4, 4, 1] 367s -0.466 0.141 -0.769 367s -0.400 0.772 -0.511 367s 0.688 0.964 -0.399 367s 0.621 -0.092 0.571 367s 367s x1: [3, 2, 4, 1] 367s 0.089 -0.808 -0.888 367s 0.933 -0.056 -0.030 367s 367s y: [4, 2, 4, 1] 367s 0.528 -0.206 -0.364 -0.378 367s -0.419 -0.401 0.600 0.567 367s testing: mul_mat, [3, 4, 4, 4] = [3, 3, 4, 4] * [3, 4, 4, 4] 367s x0: [3, 3, 4, 4] 367s -0.151 -0.698 -0.949 367s 0.283 0.145 -0.889 367s -0.960 0.722 -0.191 367s 367s x1: [3, 4, 4, 4] 367s 0.314 0.255 0.699 367s -0.012 0.643 -0.614 367s 0.606 0.418 0.013 367s -0.989 0.056 -0.772 367s 367s y: [3, 4, 4, 4] 367s -0.888 -0.495 -0.251 367s 0.136 0.636 0.593 367s -0.396 0.220 -0.283 367s 0.843 0.414 1.138 367s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 367s x0: [3, 3, 1, 1] 367s -0.584 -0.893 -0.763 367s -0.292 0.303 0.405 367s -0.964 0.085 0.115 367s 367s x1: [3, 3, 1, 1] 367s -0.945 0.803 0.021 367s 0.846 -0.652 -0.160 367s -0.317 -0.431 -0.489 367s 367s y: [3, 3, 1, 1] 367s -0.181 0.528 0.982 367s 0.210 -0.510 -0.889 367s 0.943 -0.237 0.212 367s testing: mul_mat, [4, 3, 4, 1] = [3, 4, 4, 1] * [3, 3, 4, 1] 367s x0: [3, 4, 4, 1] 367s -0.376 0.303 0.695 367s 0.140 -0.391 -0.524 367s -0.456 -0.390 0.582 367s -0.410 0.269 0.622 367s 367s x1: [3, 3, 4, 1] 367s -0.412 -0.408 -0.798 367s 0.861 0.731 -0.202 367s -0.466 -0.037 0.769 367s 367s y: [4, 3, 4, 1] 367s -0.524 0.519 -0.118 -0.437 367s -0.242 -0.059 -0.795 -0.282 367s 0.699 -0.453 0.675 0.659 367s testing: mul_mat, [4, 3, 4, 4] = [4, 4, 4, 4] * [4, 3, 4, 4] 367s x0: [4, 4, 4, 4] 367s 0.856 -0.694 -0.671 -0.250 367s -0.697 0.344 0.021 0.216 367s -0.379 0.899 0.795 0.458 367s -0.817 0.912 0.789 -0.473 367s 367s x1: [4, 3, 4, 4] 367s 0.064 0.166 -0.180 -0.283 367s 0.968 0.630 -0.777 -0.453 367s -0.330 -0.824 0.102 0.801 367s 367s y: [4, 3, 4, 4] 367s 0.132 -0.053 -0.148 0.091 367s 1.027 -0.572 -0.626 -0.615 367s 0.021 0.121 -0.168 -0.781 367s test-mul-mat0: iter:345/500 367s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 367s x0: [3, 2, 1, 1] 367s -0.316 0.435 0.550 367s -0.547 0.817 0.259 367s 367s x1: [3, 3, 1, 1] 367s 0.376 0.053 0.155 367s 0.529 0.112 -0.319 367s -0.980 -0.862 0.804 367s 367s y: [2, 3, 1, 1] 367s -0.011 -0.123 367s -0.294 -0.280 367s 0.377 0.040 367s testing: mul_mat, [2, 2, 2, 1] = [3, 2, 2, 1] * [3, 2, 2, 1] 367s x0: [3, 2, 2, 1] 367s 0.169 -0.413 0.014 367s -0.778 0.742 -0.456 367s 367s x1: [3, 2, 2, 1] 367s 0.934 0.810 0.717 367s 0.251 0.892 -0.994 367s 367s y: [2, 2, 2, 1] 367s -0.166 -0.453 367s -0.340 0.920 367s testing: mul_mat, [3, 2, 2, 1] = [3, 3, 2, 1] * [3, 2, 2, 1] 367s x0: [3, 3, 2, 1] 367s 0.353 -0.163 0.061 367s 0.164 0.350 -0.301 367s -0.667 0.937 0.713 367s 367s x1: [3, 2, 2, 1] 367s -0.844 0.661 -0.623 367s -0.391 0.478 0.636 367s 367s y: [3, 2, 2, 1] 367s -0.444 0.281 0.738 367s -0.178 -0.088 1.162 367s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 367s x0: [3, 3, 1, 1] 367s 0.934 0.287 -0.549 367s -0.628 0.209 -0.441 367s 0.098 -0.842 -0.143 367s 367s x1: [3, 3, 1, 1] 367s 0.518 0.988 -0.975 367s -0.872 0.466 0.661 367s -0.329 -0.246 0.013 367s 367s y: [3, 3, 1, 1] 367s 1.303 0.311 -0.641 367s -1.043 0.354 -0.572 367s -0.385 0.149 0.1ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s 73 367s testing: mul_mat, [3, 3, 2, 1] = [3, 3, 2, 1] * [3, 3, 2, 1] 367s x0: [3, 3, 2, 1] 367s -0.189 -0.256 -0.969 367s 0.992 -0.636 0.573 367s 0.374 -0.528 -0.370 367s 367s x1: [3, 3, 2, 1] 367s -0.216 -0.504 -0.430 367s 0.339 -0.825 -0.174 367s 0.229 -0.725 0.646 367s 367s y: [3, 3, 2, 1] 367s 0.587 -0.141 0.345 367s 0.315 0.762 0.627 367s -0.484 1.058 0.229 367s testing: mul_mat, [4, 3, 2, 1] = [3, 4, 2, 1] * [3, 3, 2, 1] 367s x0: [3, 4, 2, 1] 367s 0.914 -0.819 0.700 367s -0.952 0.644 -0.732 367s -0.843 0.663 0.612 367s -0.603 -0.191 -0.417 367s 367s x1: [3, 3, 2, 1] 367s 0.835 0.079 -0.472 367s -0.574 0.681 0.822 367s -0.464 -0.728 0.140 367s 367s y: [4, 3, 2, 1] 367s 0.369 -0.399 -0.940 -0.322 367s -0.508 0.384 1.439 -0.126 367s 0.270 -0.129 -0.005 0.361 367s test-mul-mat0: iter:346/500 367s testing: mul_mat, [3, 2, 1, 1] = [1, 3, 1, 1] * [1, 2, 1, 1] 367s x0: [1, 3, 1, 1] 367s 0.910 367s -0.174 367s -0.742 367s 367s x1: [1, 2, 1, 1] 367s 0.996 367s -0.222 367s 367s y: [3, 2, 1, 1] 367s 0.906 -0.173 -0.739 367s -0.202 0.039 0.165 367s testing: mul_mat, [1, 3, 1, 1] = [1, 1, 1, 1] * [1, 3, 1, 1] 367s x0: [1, 1, 1, 1] 367s 0.816 367s 367s x1: [1, 3, 1, 1] 367s -0.693 367s 0.007 367s 0.902 367s 367s y: [1, 3, 1, 1] 367s -0.566 367s 0.006 367s 0.736 367s testing: mul_mat, [2, 1, 1, 4] = [1, 2, 1, 4] * [1, 1, 1, 4] 367s x0: [1, 2, 1, 4] 367s -0.880 367s 0.296 367s 367s x1: [1, 1, 1, 4] 367s 0.602 367s 367s y: [2, 1, 1, 4] 367s -0.530 0.179 367s testing: mul_mat, [4, 2, 1, 1] = [1, 4, 1, 1] * [1, 2, 1, 1] 367s x0: [1, 4, 1, 1] 367s 0.289 367s -0.778 367s -0.202 367s 0.863 367s 367s x1: [1, 2, 1, 1] 367s 0.771 367s -0.890 367s 367s y: [4, 2, 1, 1] 367s 0.223 -0.600 -0.156 0.666 367s -0.258 0.692 0.180 -0.768 367s testing: mul_mat, [3, 1, 1, 1] = [4, 3, 1, 1] * [4, 1, 1, 1] 367s x0: [4, 3, 1, 1] 367s -0.778 -0.876 0.726 0.911 367s -0.565 -0.596 -0.358 -0.018 367s 0.409 0.225 -0.346 -0.227 367s 367s x1: [4, 1, 1, 1] 367s -0.782 0.576 0.964 -0.872 367s 367s y: [3, 1, 1, 1] 367s 0.009 -0.231 -0.327 367s testing: mul_mat, [4, 4, 1, 4] = [3, 4, 1, 4] * [3, 4, 1, 4] 367s x0: [3, 4, 1, 4] 367s 0.929 -0.119 -0.980 367s 0.862 0.459 -0.705 367s 0.539 0.995 -0.718 367s -0.650 0.626 0.564 367s 367s x1: [3, 4, 1, 4] 367s 0.207 0.852 0.266 367s 0.437 0.786 -0.597 367s 0.740 0.557 -0.487 367s -0.765 -0.154 -0.265 367s 367s y: [4, 4, 1, 4] 367s -0.169 0.382 0.769 0.548 367s 0.898 1.159 1.447 -0.130 367s 1.099 1.237 1.303 -0.408 367s -0.433 -0.543 -0.375 0.251 367s test-mul-mat0: iter:347/500 367s testing: mul_mat, [1, 4, 1, 1] = [3, 1, 1, 1] * [3, 4, 1, 1] 367s x0: [3, 1, 1, 1] 367s 0.462 -0.776 0.609 367s 367s x1: [3, 4, 1, 1] 367s -0.218 -0.785 0.685 367s -0.528 -0.714 -0.557 367s 0.105 0.275 -0.832 367s 0.761 -0.807 0.755 367s 367s y: [1, 4, 1, 1] 367s 0.926 367s -0.028 367s -0.671 367s 1.437 367s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 367s x0: [3, 4, 1, 1] 367s -0.305 -0.479 0.210 367s 0.216 -0.280 0.843 367s 0.183 0.060 -0.077 367s 0.636 -0.719 0.705 367s 367s x1: [3, 1, 1, 1] 367s -0.274 -0.681 0.283 367s 367s y: [4, 1, 1, 1] 367s 0.470 0.370 -0.113 0.515 367s testing: mul_mat, [4, 4, 1, 3] = [3, 4, 1, 3] * [3, 4, 1, 3] 367s x0: [3, 4, 1, 3] 367s 0.006 0.424 -0.739 367s -0.389 0.757 0.260 367s 0.678 -0.687 0.868 367s 0.471 0.188 0.758 367s 367s x1: [3, 4, 1, 3] 367s 0.851 0.966 -0.823 367s -0.863 -0.591 0.283 367s 0.412 -0.423 0.043 367s 0.605 -0.668 -0.001 367s 367s y: [4, 4, 1, 3] 367s 1.023 0.186 -0.800 -0.041 367s -0.465 -0.038 0.066 -0.303 367s -0.209 -0.469 0.608 0.147 367s -0.279 -0.741 0.868 0.159 367s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 367s x0: [3, 3, 1, 1] 367s 0.282 0.299 0.074 367s 0.748 -0.132 0.445 367s 0.037 0.003 0.406 367s 367s x1: [3, 4, 1, 1] 367s -0.551 0.466 -0.255 367s -0.684 -0.063 0.933 367s -0.926 -0.516 -0.309 367s -0.756 0.318 -0.765 367s 367s y: [3, 4, 1, 1] 367s -0.035 -0.587 -0.123 367s -0.143 -0.088 0.354 367s -0.438 -0.762 -0.161 367s -0.175 -0.948 -0.338 367s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 367s x0: [3, 3, 1, 1] 367s -0.953 -0.016 0.468 367s -0.489 -0.556 0.135 367s 0.665 0.739 0.983 367s 367s x1: [3, 3, 1, 1] 367s -0.217 0.616 -0.342 367s 0.711 -0.207 0.333 367s -0.100 -0.419 0.766 367s 367s y: [3, 3, 1, 1] 367s 0.036 -0.283 -0.025 367s -0.518 -0.187 0.647 367s 0.461 0.385 0.376 367s testing: mul_mat, [3, 3, 1, 3] = [3, 3, 1, 3] * [3, 3, 1, 3] 367s x0: [3, 3, 1, 3] 367s -0.081 0.706 0.775 367s -0.180 -0.810 0.308 367s -0.430 -0.903 -0.338 367s 367s x1: [3, 3, 1, 3] 367s -0.213 0.370 0.527 367s -0.931 0.118 -0.436 367s 0.368 0.986 0.567 367s 367s y: [3, 3, 1, 3] 367s 0.687 -0.099 -0.421 367s -0.179 -0.063 0.441 367s 1.106 -0.690 -1.240 367s test-mul-mat0: iter:348/500 367s testing: mul_mat, [3, 4, 1, 1] = [2, 3, 1, 1] * [2, 4, 1, 1] 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s x0: [2, 3, 1, 1] 367s 0.493 0.068 367s 0.763 0.078 367s 0.793 -0.035 367s 367s x1: [2, 4, 1, 1] 367s -0.133 -0.725 367s -0.225 -0.427 367s -0.535 -0.127 367s -0.651 0.773 367s 367s y: [3, 4, 1, 1] 367s -0.115 -0.158 -0.080 367s -0.140 -0.205 -0.163 367s -0.272 -0.418 -0.420 367s -0.268 -0.436 -0.543 367s testing: mul_mat, [1, 3, 3, 1] = [2, 1, 3, 1] * [2, 3, 3, 1] 367s x0: [2, 1, 3, 1] 367s 0.091 -0.915 367s 367s x1: [2, 3, 3, 1] 367s 0.890 0.565 367s 0.798 0.637 367s 0.176 0.596 367s 367s y: [1, 3, 3, 1] 367s -0.435 367s -0.510 367s -0.529 367s testing: mul_mat, [4, 1, 3, 4] = [2, 4, 3, 4] * [2, 1, 3, 4] 367s x0: [2, 4, 3, 4] 367s 0.629 -0.815 367s -0.782 0.069 367s -0.956 0.214 367s -0.998 -0.029 367s 367s x1: [2, 1, 3, 4] 367s -0.067 -0.073 367s 367s y: [4, 1, 3, 4] 367s 0.018 0.047 0.048 0.069 367s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 367s x0: [2, 1, 1, 1] 367s -0.691 0.718 367s 367s x1: [2, 4, 1, 1] 367s 0.766 0.620 367s 0.262 0.567 367s -0.191 -0.597 367s 0.109 -0.817 367s 367s y: [1, 4, 1, 1] 367s -0.085 367s 0.226 367s -0.296 367s -0.662 367s testing: mul_mat, [2, 2, 3, 1] = [1, 2, 3, 1] * [1, 2, 3, 1] 367s x0: [1, 2, 3, 1] 367s -0.831 367s -0.321 367s 367s x1: [1, 2, 3, 1] 367s 0.957 367s -0.382 367s 367s y: [2, 2, 3, 1] 367s -0.795 -0.307 367s 0.317 0.123 367s testing: mul_mat, [3, 1, 3, 4] = [2, 3, 3, 4] * [2, 1, 3, 4] 367s x0: [2, 3, 3, 4] 367s 0.172 -0.133 367s -0.667 -0.327 367s -0.055 0.063 367s 367s x1: [2, 1, 3, 4] 367s -0.684 0.689 367s 367s y: [3, 1, 3, 4] 367s -0.209 0.231 0.081 367s test-mul-mat0: iter:349/500 367s testing: mul_mat, [2, 2, 1, 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 367s x0: [4, 2, 1, 1] 367s 0.677 -0.047 0.334 0.573 367s 0.012 0.147 0.005 -0.801 367s 367s x1: [4, 2, 1, 1] 367s -0.771 -0.865 -0.907 0.555 367s 0.051 -0.315 0.662 0.133 367s 367s y: [2, 2, 1, 1] 367s -0.467 -0.585 367s 0.347 -0.150 367s testing: mul_mat, [3, 2, 3, 1] = [4, 3, 3, 1] * [4, 2, 3, 1] 367s x0: [4, 3, 3, 1] 367s -0.809 0.303 0.665 0.338 367s -0.693 0.865 -0.202 -0.387 367s 0.326 0.936 -0.238 0.186 367s 367s x1: [4, 2, 3, 1] 367s 0.460 -0.694 0.461 0.138 367s -0.851 0.860 0.367 -0.104 367s 367s y: [3, 2, 3, 1] 367s -0.229 -1.065 -0.584 367s 1.158 1.300 0.422 367s testing: mul_mat, [4, 3, 3, 2] = [4, 4, 3, 2] * [4, 3, 3, 2] 367s x0: [4, 4, 3, 2] 367s 0.534 0.707 -0.895 0.918 367s 0.894 -0.938 0.717 -0.188 367s -0.665 0.446 0.872 -0.002 367s -0.526 -0.789 -0.224 -0.637 367s 367s x1: [4, 3, 3, 2] 367s -0.744 0.697 0.362 0.581 367s 0.633 -0.876 -0.232 -0.064 367s 0.782 -0.714 0.847 0.732 367s 367s y: [4, 3, 3, 2] 367s 0.304 -1.168 1.121 -0.610 367s -0.131 1.233 -1.014 0.451 367s -0.173 1.839 -0.101 -0.504 367s testing: mul_mat, [3, 4, 1, 1] = [4, 3, 1, 1] * [4, 4, 1, 1] 367s x0: [4, 3, 1, 1] 367s -0.841 -0.018 -0.642 0.178 367s -0.976 0.920 -0.217 -0.765 367s 0.374 -0.355 -0.409 -0.701 367s 367s x1: [4, 4, 1, 1] 367s -0.524 0.395 0.870 0.968 367s 0.233 -0.903 0.628 -0.753 367s 0.445 0.931 -0.353 -0.310 367s 0.240 -0.295 0.893 0.567 367s 367s y: [3, 4, 1, 1] 367s 0.047 -0.054 -1.371 367s -0.717 -0.618 0.679 367s -0.219 0.736 0.198 367s -0.670 -1.133 -0.568 367s testing: mul_mat, [2, 4, 3, 1] = [3, 2, 3, 1] * [3, 4, 3, 1] 367s x0: [3, 2, 3, 1] 367s -0.187 -0.711 0.322 367s 0.909 0.728 -0.410 367s 367s x1: [3, 4, 3, 1] 367s -0.961 -0.999 -0.225 367s 0.435 0.871 -0.257 367s -0.332 0.967 -0.630 367s -0.085 0.412 -0.699 367s 367s y: [2, 4, 3, 1] 367s 0.817 -1.509 367s -0.783 1.135 367s -0.829 0.660 367s -0.502 0.509 367s testing: mul_mat, [2, 3, 3, 2] = [2, 2, 3, 2] * [2, 3, 3, 2] 367s x0: [2, 2, 3, 2] 367s -0.055 0.053 367s -0.636 -0.955 367s 367s x1: [2, 3, 3, 2] 367s 0.422 -0.797 367s 0.259 0.524 367s -0.125 0.329 367s 367s y: [2, 3, 3, 2] 367s -0.065 0.492 367s 0.013 -0.665 367s 0.024 -0.235 367s test-mul-mat0: iter:350/500 367s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 367s x0: [1, 4, 1, 1] 367s -0.760 367s 0.294 367s 0.371 367s 0.293 367s 367s x1: [1, 1, 1, 1] 367s -0.651 367s 367s y: [4, 1, 1, 1] 367s 0.495 -0.191 -0.242 -0.191 367s testing: mul_mat, [1, 4, 3, 1] = [1, 1, 3, 1] * [1, 4, 3, 1] 367s x0: [1, 1, 3, 1] 367s 0.086 367s 367s x1: [1, 4, 3, 1] 367s 0.339 367s 0.467 367s 0.689 367s -0.599 367s 367s y: [1, 4, 3, 1] 367s 0.029 367s 0.040 367s 0.059 367s -0.051 367s testing: mul_mat, [2, 1, 3, 4] = [1, 2, 3, 4] * [1, 1, 3, 4] 367s x0: [1, 2, 3, 4] 367s -0.025 367s -0.516 367s 367s x1: [1, 1, 3, 4] 367s -0.159 367s 367s y: [2, 1, 3, 4] 367s 0.004 0.082 367s testing: mul_mat, [4, 2, 1, 1] = [1, 4, 1, 1] * [1, 2, 1, 1] 367s x0: [1, 4, 1, 1] 367s -0.995 367s -0.133 367s -0.986 367s 0.646 367s 367s x1: [1, 2, 1, 1] 367s 0.657 367s 0.263 367s 367s y: [4, 2, 1, 1] 367s -0.654 -0.088 -0.648 0.425 367s -0.262 -0.035 -0.259 0.170 367s testing: mul_mat, [3, 1, 3, 1] = [4, 3, 3, 1] * [4, 1, 3, 1] 367s x0: [4, 3, 3, 1] 367s -0.908 -0.650 -0.351 0.316 367s 0.868 0.282 -0.208 -0.336 367s 0.608 -0.4ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s 04 -0.745 0.912 367s 367s x1: [4, 1, 3, 1] 367s 0.842 -0.502 -0.374 -0.716 367s 367s y: [3, 1, 3, 1] 367s -0.534 0.908 0.341 367s testing: mul_mat, [1, 4, 3, 4] = [3, 1, 3, 4] * [3, 4, 3, 4] 367s x0: [3, 1, 3, 4] 367s -0.018 -0.514 0.701 367s 367s x1: [3, 4, 3, 4] 367s -0.530 0.188 0.645 367s -0.275 -0.496 -0.691 367s -0.363 0.082 0.796 367s -0.446 -0.472 0.296 367s 367s y: [1, 4, 3, 4] 367s 0.366 367s -0.225 367s 0.523 367s 0.459 367s test-mul-mat0: iter:351/500 367s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 367s x0: [3, 4, 1, 1] 367s -0.618 0.077 0.632 367s -0.821 0.710 -0.274 367s 0.226 0.486 0.748 367s 0.255 0.749 -0.529 367s 367s x1: [3, 4, 1, 1] 367s 0.857 0.249 0.994 367s 0.891 -0.300 -0.147 367s 0.230 -0.925 -0.723 367s -0.474 -0.222 -0.169 367s 367s y: [4, 4, 1, 1] 367s 0.117 -0.799 1.059 -0.121 367s -0.667 -0.905 -0.054 0.080 367s -0.671 -0.647 -0.939 -0.251 367s 0.169 0.277 -0.342 -0.198 367s testing: mul_mat, [4, 4, 4, 1] = [3, 4, 4, 1] * [3, 4, 4, 1] 367s x0: [3, 4, 4, 1] 367s 0.176 0.220 -0.535 367s 0.211 0.646 0.052 367s 0.973 0.298 0.126 367s -0.517 -0.796 -0.052 367s 367s x1: [3, 4, 4, 1] 367s -0.856 0.809 0.465 367s 0.526 0.063 0.844 367s 0.384 -0.687 0.839 367s 0.275 0.013 -0.308 367s 367s y: [4, 4, 4, 1] 367s -0.221 0.366 -0.534 -0.225 367s -0.345 0.196 0.637 -0.366 367s -0.533 -0.319 0.274 0.305 367s 0.216 0.050 0.232 -0.136 367s testing: mul_mat, [2, 4, 4, 2] = [3, 2, 4, 2] * [3, 4, 4, 2] 367s x0: [3, 2, 4, 2] 367s -0.023 0.129 -0.381 367s -0.689 0.132 -0.924 367s 367s x1: [3, 4, 4, 2] 367s 0.968 0.951 0.633 367s -0.749 -0.723 -0.338 367s -0.445 0.422 0.431 367s 0.967 -0.645 -0.029 367s 367s y: [2, 4, 4, 2] 367s -0.141 -1.127 367s 0.053 0.733 367s -0.100 -0.036 367s -0.094 -0.725 367s testing: mul_mat, [3, 2, 1, 1] = [3, 3, 1, 1] * [3, 2, 1, 1] 367s x0: [3, 3, 1, 1] 367s 0.086 0.587 0.864 367s 0.977 -0.596 0.808 367s 0.901 -0.836 0.556 367s 367s x1: [3, 2, 1, 1] 367s 0.306 -0.287 0.035 367s 0.741 -0.837 -0.659 367s 367s y: [3, 2, 1, 1] 367s -0.112 0.498 0.535 367s -0.997 0.690 1.001 367s testing: mul_mat, [1, 3, 4, 1] = [3, 1, 4, 1] * [3, 3, 4, 1] 367s x0: [3, 1, 4, 1] 367s -0.767 -0.510 0.034 367s 367s x1: [3, 3, 4, 1] 367s -0.240 0.523 -0.920 367s 0.732 -0.615 -0.206 367s 0.202 0.609 0.502 367s 367s y: [1, 3, 4, 1] 367s -0.114 367s -0.254 367s -0.449 367s testing: mul_mat, [1, 3, 4, 2] = [1, 1, 4, 2] * [1, 3, 4, 2] 367s x0: [1, 1, 4, 2] 367s 0.619 367s 367s x1: [1, 3, 4, 2] 367s -0.468 367s -0.887 367s -0.970 367s 367s y: [1, 3, 4, 2] 367s -0.290 367s -0.549 367s -0.600 367s test-mul-mat0: iter:352/500 367s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 367s x0: [3, 3, 1, 1] 367s 0.374 0.250 0.582 367s 0.102 0.189 -0.446 367s -0.279 0.400 -0.670 367s 367s x1: [3, 4, 1, 1] 367s -0.427 0.513 0.799 367s -0.815 0.930 -0.273 367s -0.544 0.647 -0.848 367s 0.465 -0.606 -0.544 367s 367s y: [3, 4, 1, 1] 367s 0.433 -0.303 -0.211 367s -0.231 0.215 0.782 367s -0.535 0.446 0.979 367s -0.294 0.175 -0.007 367s testing: mul_mat, [2, 3, 4, 1] = [3, 2, 4, 1] * [3, 3, 4, 1] 367s x0: [3, 2, 4, 1] 367s 0.562 -0.114 -0.612 367s 0.371 -0.811 -0.566 367s 367s x1: [3, 3, 4, 1] 367s -0.319 0.721 0.352 367s 0.174 -0.845 -0.596 367s 0.236 0.790 -0.753 367s 367s y: [2, 3, 4, 1] 367s -0.477 -0.902 367s 0.559 1.087 367s 0.503 -0.127 367s testing: mul_mat, [4, 2, 4, 3] = [3, 4, 4, 3] * [3, 2, 4, 3] 367s x0: [3, 4, 4, 3] 367s -0.950 -0.419 -0.268 367s 0.951 -0.569 0.112 367s 0.872 0.918 0.259 367s 0.391 -0.445 -0.176 367s 367s x1: [3, 2, 4, 3] 367s 0.596 0.599 -0.876 367s 0.333 0.925 0.219 367s 367s y: [4, 2, 4, 3] 367s -0.583 0.127 0.842 0.120 367s -0.763 -0.185 1.197 -0.320 367s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 367s x0: [3, 4, 1, 1] 367s 0.966 0.556 -0.500 367s -0.241 -0.225 0.656 367s -0.006 -0.652 -0.570 367s 0.417 0.549 -0.630 367s 367s x1: [3, 4, 1, 1] 367s -0.694 0.435 -0.454 367s 0.024 0.936 0.577 367s -0.234 0.689 0.924 367s -0.213 0.218 -0.199 367s 367s y: [4, 4, 1, 1] 367s -0.202 -0.228 -0.021 0.235 367s 0.254 0.162 -0.939 0.160 367s -0.305 0.507 -0.975 -0.301 367s 0.015 -0.128 -0.027 0.156 367s testing: mul_mat, [3, 3, 4, 1] = [4, 3, 4, 1] * [4, 3, 4, 1] 367s x0: [4, 3, 4, 1] 367s -0.414 0.252 -0.307 -0.366 367s -0.790 -0.170 0.798 -0.577 367s 0.253 -0.760 -0.223 0.743 367s 367s x1: [4, 3, 4, 1] 367s 0.319 0.102 -0.178 0.271 367s -0.041 0.825 0.577 -0.606 367s -0.629 -0.399 -0.670 0.948 367s 367s y: [3, 3, 4, 1] 367s -0.151 -0.568 0.244 367s 0.270 0.703 -1.216 367s 0.019 -0.517 0.998 367s testing: mul_mat, [2, 4, 4, 3] = [3, 2, 4, 3] * [3, 4, 4, 3] 367s x0: [3, 2, 4, 3] 367s 0.068 -0.610 0.428 367s 0.433 -0.280 -0.832 367s 367s x1: [3, 4, 4, 3] 367s 0.278 -0.335 -0.635 367s 0.538 0.032 -0.870 367s 0.549 -0.324 -0.347 367s 0.330 0.470 -ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s 0.910 367s 367s y: [2, 4, 4, 3] 367s -0.048 0.742 367s -0.355 0.948 367s 0.086 0.617 367s -0.653 0.769 367s test-mul-mat0: iter:353/500 367s testing: mul_mat, [1, 4, 1, 1] = [4, 1, 1, 1] * [4, 4, 1, 1] 367s x0: [4, 1, 1, 1] 367s 0.718 0.003 -0.473 0.574 367s 367s x1: [4, 4, 1, 1] 367s -0.615 0.894 0.189 -0.069 367s 0.463 -0.003 -0.721 0.854 367s 0.131 -0.298 0.510 -0.609 367s -0.133 -0.214 -0.769 0.823 367s 367s y: [1, 4, 1, 1] 367s -0.568 367s 1.164 367s -0.498 367s 0.740 367s testing: mul_mat, [4, 1, 2, 1] = [4, 4, 2, 1] * [4, 1, 2, 1] 367s x0: [4, 4, 2, 1] 367s 0.478 -0.992 0.395 -0.333 367s -0.061 -0.141 0.664 0.218 367s -0.287 -0.206 0.920 -0.777 367s 0.185 -0.213 0.009 0.416 367s 367s x1: [4, 1, 2, 1] 367s 0.834 0.550 -0.693 -0.031 367s 367s y: [4, 1, 2, 1] 367s -0.410 -0.595 -0.966 0.018 367s testing: mul_mat, [4, 4, 2, 3] = [4, 4, 2, 3] * [4, 4, 2, 3] 367s x0: [4, 4, 2, 3] 367s 0.644 0.163 -0.844 -0.748 367s -0.530 -0.590 -0.935 0.722 367s -0.181 0.284 -0.037 -0.012 367s -0.221 0.371 -0.862 -0.382 367s 367s x1: [4, 4, 2, 3] 367s -0.866 0.396 -0.015 0.073 367s -0.746 -0.351 -0.709 -0.033 367s 0.443 -0.789 0.190 -0.372 367s -0.002 -0.800 -0.956 0.608 367s 367s y: [4, 4, 2, 3] 367s -0.535 0.292 0.269 0.323 367s 0.085 1.241 0.062 0.658 367s 0.274 -0.216 -0.307 -0.412 367s 0.221 1.806 -0.199 0.296 367s testing: mul_mat, [1, 4, 1, 1] = [4, 1, 1, 1] * [4, 4, 1, 1] 367s x0: [4, 1, 1, 1] 367s -0.879 0.629 0.198 0.811 367s 367s x1: [4, 4, 1, 1] 367s -0.267 0.486 -0.630 0.623 367s 0.751 0.062 0.320 0.197 367s -0.426 -0.395 0.193 0.045 367s -0.906 -0.429 0.408 0.405 367s 367s y: [1, 4, 1, 1] 367s 0.920 367s -0.399 367s 0.201 367s 0.936 367s testing: mul_mat, [1, 4, 2, 1] = [1, 1, 2, 1] * [1, 4, 2, 1] 367s x0: [1, 1, 2, 1] 367s -0.147 367s 367s x1: [1, 4, 2, 1] 367s 0.256 367s 0.960 367s -0.833 367s 0.678 367s 367s y: [1, 4, 2, 1] 367s -0.038 367s -0.141 367s 0.122 367s -0.100 367s testing: mul_mat, [2, 1, 2, 3] = [1, 2, 2, 3] * [1, 1, 2, 3] 367s x0: [1, 2, 2, 3] 367s -0.141 367s 0.071 367s 367s x1: [1, 1, 2, 3] 367s -0.027 367s 367s y: [2, 1, 2, 3] 367s 0.004 -0.002 367s test-mul-mat0: iter:354/500 367s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 367s x0: [2, 4, 1, 1] 367s -0.525 -0.373 367s -0.053 0.328 367s -0.850 0.920 367s -0.449 0.296 367s 367s x1: [2, 1, 1, 1] 367s 0.132 0.838 367s 367s y: [4, 1, 1, 1] 367s -0.382 0.268 0.659 0.188 367s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 367s x0: [2, 4, 1, 1] 367s 0.366 -0.091 367s -0.433 -0.376 367s -0.659 -0.235 367s 0.080 0.584 367s 367s x1: [2, 4, 1, 1] 367s -0.355 -0.165 367s 0.762 -0.434 367s 0.694 -0.168 367s 0.680 0.598 367s 367s y: [4, 4, 1, 1] 367s -0.115 0.216 0.273 -0.125 367s 0.318 -0.167 -0.400 -0.192 367s 0.269 -0.237 -0.418 -0.042 367s 0.195 -0.519 -0.588 0.404 367s testing: mul_mat, [1, 4, 1, 2] = [2, 1, 1, 2] * [2, 4, 1, 2] 367s x0: [2, 1, 1, 2] 367s -0.469 -0.766 367s 367s x1: [2, 4, 1, 2] 367s -0.780 0.497 367s -0.006 0.352 367s 0.335 -0.547 367s 0.827 0.962 367s 367s y: [1, 4, 1, 2] 367s -0.015 367s -0.266 367s 0.262 367s -1.124 367s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 367s x0: [2, 2, 1, 1] 367s -0.259 -0.798 367s -0.435 0.082 367s 367s x1: [2, 1, 1, 1] 367s 0.832 0.998 367s 367s y: [2, 1, 1, 1] 367s -1.013 -0.280 367s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 367s x0: [2, 2, 1, 1] 367s 0.660 -0.887 367s -0.098 -0.887 367s 367s x1: [2, 2, 1, 1] 367s 0.331 0.282 367s -0.334 0.550 367s 367s y: [2, 2, 1, 1] 367s -0.032 -0.282 367s -0.708 -0.455 367s testing: mul_mat, [1, 2, 1, 2] = [2, 1, 1, 2] * [2, 2, 1, 2] 367s x0: [2, 1, 1, 2] 367s -0.872 -0.236 367s 367s x1: [2, 2, 1, 2] 367s -0.271 -0.925 367s 0.514 0.883 367s 367s y: [1, 2, 1, 2] 367s 0.455 367s -0.656 367s test-mul-mat0: iter:355/500 367s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 367s x0: [4, 2, 1, 1] 367s -0.543 0.841 0.572 0.971 367s 0.724 0.759 -0.194 0.312 367s 367s x1: [4, 3, 1, 1] 367s 0.121 -0.065 0.329 0.323 367s -0.983 -0.340 -0.395 -0.316 367s -0.790 0.383 -0.656 0.112 367s 367s y: [2, 3, 1, 1] 367s 0.381 0.075 367s -0.286 -0.992 367s 0.485 -0.119 367s testing: mul_mat, [4, 2, 3, 1] = [4, 4, 3, 1] * [4, 2, 3, 1] 367s x0: [4, 4, 3, 1] 367s 0.981 -0.595 -0.477 0.741 367s 0.210 0.835 0.095 -0.184 367s 0.275 0.213 0.754 0.077 367s -0.903 -0.764 0.074 0.745 367s 367s x1: [4, 2, 3, 1] 367s 0.354 0.606 0.440 -0.881 367s -0.062 0.802 -0.116 -0.519 367s 367s y: [4, 2, 3, 1] 367s -0.876 0.784 0.491 -1.408 367s -0.867 0.741 0.027 -0.952 367s testing: mul_mat, [4, 4, 3, 4] = [4, 4, 3, 4] * [4, 4, 3, 4] 367s x0: [4, 4, 3, 4] 367s -0.743 0.777 0.403 0.184 367s -0.787 -0.646 -0.465 0.679 367s 0.405 0.732 -0.292 0.132 367s 0.114 -0.141 0.696 -0.968 367s 367s x1: [4, 4, 3, 4] 367s 0.750 -0.792 -0.981 -0.296 367s -0.434 -0.689 0.291 -0.230 367s 0.516 0ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s .854 0.932 0.131 367s -0.149 0.893 -0.274 -0.748 367s 367s y: [4, 4, 3, 4] 367s -1.623 0.176 -0.028 -0.199 367s -0.137 0.495 -0.796 0.473 367s 0.679 -1.303 0.579 0.460 367s 0.556 -0.839 0.575 0.391 367s testing: mul_mat, [1, 4, 1, 1] = [4, 1, 1, 1] * [4, 4, 1, 1] 367s x0: [4, 1, 1, 1] 367s 0.113 -0.979 0.662 0.442 367s 367s x1: [4, 4, 1, 1] 367s 0.136 -0.856 0.470 0.334 367s -0.450 0.874 0.378 -0.725 367s 0.117 -0.257 -0.303 -0.833 367s 0.847 -0.845 -0.946 0.803 367s 367s y: [1, 4, 1, 1] 367s 1.311 367s -0.977 367s -0.304 367s 0.652 367s testing: mul_mat, [4, 4, 3, 1] = [1, 4, 3, 1] * [1, 4, 3, 1] 367s x0: [1, 4, 3, 1] 367s -0.285 367s 0.556 367s 0.585 367s -0.907 367s 367s x1: [1, 4, 3, 1] 367s 0.935 367s -0.213 367s 0.043 367s 0.844 367s 367s y: [4, 4, 3, 1] 367s -0.267 0.520 0.547 -0.848 367s 0.061 -0.118 -0.125 0.193 367s -0.012 0.024 0.025 -0.039 367s -0.241 0.469 0.493 -0.765 367s testing: mul_mat, [3, 1, 3, 4] = [4, 3, 3, 4] * [4, 1, 3, 4] 367s x0: [4, 3, 3, 4] 367s -0.377 -0.277 0.525 0.197 367s 0.824 0.325 0.662 -0.318 367s -0.426 0.776 0.508 0.922 367s 367s x1: [4, 1, 3, 4] 367s -0.514 -0.060 -0.963 -0.493 367s 367s y: [3, 1, 3, 4] 367s -0.393 -0.924 -0.771 367s test-mul-mat0: iter:356/500 367s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 367s x0: [1, 1, 1, 1] 367s 0.280 367s 367s x1: [1, 4, 1, 1] 367s 0.245 367s 0.852 367s -0.611 367s 0.630 367s 367s y: [1, 4, 1, 1] 367s 0.069 367s 0.238 367s -0.171 367s 0.176 367s testing: mul_mat, [1, 1, 2, 1] = [1, 1, 2, 1] * [1, 1, 2, 1] 367s x0: [1, 1, 2, 1] 367s -0.664 367s 367s x1: [1, 1, 2, 1] 367s 0.030 367s 367s y: [1, 1, 2, 1] 367s -0.020 367s testing: mul_mat, [1, 1, 2, 3] = [1, 1, 2, 3] * [1, 1, 2, 3] 367s x0: [1, 1, 2, 3] 367s -0.728 367s 367s x1: [1, 1, 2, 3] 367s 0.679 367s 367s y: [1, 1, 2, 3] 367s -0.495 367s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 367s x0: [1, 1, 1, 1] 367s -0.722 367s 367s x1: [1, 1, 1, 1] 367s 0.528 367s 367s y: [1, 1, 1, 1] 367s -0.381 367s testing: mul_mat, [1, 1, 2, 1] = [1, 1, 2, 1] * [1, 1, 2, 1] 367s x0: [1, 1, 2, 1] 367s -0.734 367s 367s x1: [1, 1, 2, 1] 367s -0.394 367s 367s y: [1, 1, 2, 1] 367s 0.289 367s testing: mul_mat, [1, 1, 2, 3] = [1, 1, 2, 3] * [1, 1, 2, 3] 367s x0: [1, 1, 2, 3] 367s 0.232 367s 367s x1: [1, 1, 2, 3] 367s -0.221 367s 367s y: [1, 1, 2, 3] 367s -0.051 367s test-mul-mat0: iter:357/500 367s testing: mul_mat, [1, 3, 1, 1] = [3, 1, 1, 1] * [3, 3, 1, 1] 367s x0: [3, 1, 1, 1] 367s 0.295 0.361 0.471 367s 367s x1: [3, 3, 1, 1] 367s 0.296 0.403 0.401 367s 0.071 -0.866 -0.844 367s -0.401 -0.230 -0.566 367s 367s y: [1, 3, 1, 1] 367s 0.422 367s -0.690 367s -0.468 367s testing: mul_mat, [4, 1, 2, 1] = [3, 4, 2, 1] * [3, 1, 2, 1] 367s x0: [3, 4, 2, 1] 367s 0.235 0.293 0.663 367s 0.237 -0.796 0.548 367s -0.577 -0.285 -0.427 367s 0.866 -0.920 0.869 367s 367s x1: [3, 1, 2, 1] 367s 0.065 -0.859 -0.874 367s 367s y: [4, 1, 2, 1] 367s -0.816 0.221 0.580 0.088 367s testing: mul_mat, [2, 4, 2, 3] = [3, 2, 2, 3] * [3, 4, 2, 3] 367s x0: [3, 2, 2, 3] 367s -0.387 -0.090 0.670 367s -0.455 0.933 -0.147 367s 367s x1: [3, 4, 2, 3] 367s -0.467 0.574 -0.659 367s -0.002 -0.815 -0.598 367s 0.894 0.421 0.695 367s 0.557 -0.342 0.899 367s 367s y: [2, 4, 2, 3] 367s -0.313 0.845 367s -0.327 -0.671 367s 0.082 -0.116 367s 0.417 -0.705 367s testing: mul_mat, [3, 2, 1, 1] = [3, 3, 1, 1] * [3, 2, 1, 1] 367s x0: [3, 3, 1, 1] 367s 0.530 -0.389 0.505 367s 0.064 0.553 -0.128 367s 0.202 0.336 0.323 367s 367s x1: [3, 2, 1, 1] 367s 0.614 -0.025 -0.136 367s 0.865 -0.054 -0.576 367s 367s y: [3, 2, 1, 1] 367s 0.267 0.043 0.072 367s 0.188 0.099 -0.030 367s testing: mul_mat, [3, 3, 2, 1] = [3, 3, 2, 1] * [3, 3, 2, 1] 367s x0: [3, 3, 2, 1] 367s 0.321 -0.148 0.463 367s 0.498 -0.438 -0.885 367s -0.395 0.807 0.143 367s 367s x1: [3, 3, 2, 1] 367s 0.104 -0.656 0.952 367s -0.516 0.693 0.905 367s -0.858 -0.662 0.422 367s 367s y: [3, 3, 2, 1] 367s 0.571 -0.503 -0.434 367s 0.150 -1.362 0.893 367s 0.017 -0.512 -0.134 367s testing: mul_mat, [2, 3, 2, 3] = [3, 2, 2, 3] * [3, 3, 2, 3] 367s x0: [3, 2, 2, 3] 367s 0.693 0.576 0.873 367s 0.905 0.140 0.973 367s 367s x1: [3, 3, 2, 3] 367s -0.302 0.363 0.744 367s 0.931 -0.040 0.594 367s 0.926 -0.640 0.804 367s 367s y: [2, 3, 2, 3] 367s 0.650 0.501 367s 1.140 1.415 367s 0.975 1.531 367s test-mul-mat0: iter:358/500 367s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 367s x0: [1, 2, 1, 1] 367s 0.142 367s -0.637 367s 367s x1: [1, 4, 1, 1] 367s -0.611 367s 0.097 367s 0.611 367s 0.920 367s 367s y: [2, 4, 1, 1] 367s -0.087 0.389 367s 0.014 -0.062 367s 0.087 -0.389 367s 0.131 -0.586 367s testing: mul_mat, [3, 2, 3, 1] = [1, 3, 3, 1] * [1, 2, 3, 1] 367s x0: [1, 3, 3, 1] 367s 0.704 367s -0.490 367s -0.721 367s 367s x1: [1, 2, 3, 1] 367s -0.527 367s -0.512 367s 367s y: [3, 2, 3, 1] 367s -0.371 0.258 0.380 367s -0.360 0.251 0.369 367s testing: mul_mat, [1, 3, 3, 3] = [1, 1, 3, 3] * [1, 3, 3, 3] 367s x0: [1, 1, 3, 3] 367s -0ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s .898 367s 367s x1: [1, 3, 3, 3] 367s -0.955 367s 0.515 367s 0.478 367s 367s y: [1, 3, 3, 3] 367s 0.858 367s -0.462 367s -0.429 367s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 367s x0: [1, 2, 1, 1] 367s 0.687 367s 0.564 367s 367s x1: [1, 1, 1, 1] 367s 0.040 367s 367s y: [2, 1, 1, 1] 367s 0.027 0.023 367s testing: mul_mat, [2, 1, 3, 1] = [2, 2, 3, 1] * [2, 1, 3, 1] 367s x0: [2, 2, 3, 1] 367s -0.728 0.984 367s 0.608 0.985 367s 367s x1: [2, 1, 3, 1] 367s -0.261 0.441 367s 367s y: [2, 1, 3, 1] 367s 0.624 0.276 367s testing: mul_mat, [2, 2, 3, 3] = [2, 2, 3, 3] * [2, 2, 3, 3] 367s x0: [2, 2, 3, 3] 367s -0.064 0.819 367s 0.377 0.951 367s 367s x1: [2, 2, 3, 3] 367s 0.081 -0.223 367s -0.410 0.064 367s 367s y: [2, 2, 3, 3] 367s -0.188 -0.182 367s 0.079 -0.094 367s test-mul-mat0: iter:359/500 367s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 367s x0: [2, 3, 1, 1] 367s -0.234 0.701 367s -0.693 -0.635 367s 0.669 0.884 367s 367s x1: [2, 1, 1, 1] 367s 0.432 0.713 367s 367s y: [3, 1, 1, 1] 367s 0.398 -0.752 0.919 367s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 367s x0: [2, 1, 1, 1] 367s 0.627 -0.889 367s 367s x1: [2, 3, 1, 1] 367s -0.034 0.826 367s 0.990 -0.093 367s -0.215 -0.527 367s 367s y: [1, 3, 1, 1] 367s -0.756 367s 0.703 367s 0.333 367s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 367s x0: [2, 2, 1, 1] 367s 0.405 -0.859 367s 0.440 0.898 367s 367s x1: [2, 1, 1, 1] 367s -0.269 0.416 367s 367s y: [2, 1, 1, 1] 367s -0.466 0.255 367s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 367s x0: [2, 4, 1, 1] 367s -0.589 0.396 367s -0.370 -0.971 367s -0.671 -0.359 367s -0.666 0.762 367s 367s x1: [2, 2, 1, 1] 367s -0.183 -0.106 367s 0.016 0.198 367s 367s y: [4, 2, 1, 1] 367s 0.066 0.170 0.160 0.041 367s 0.069 -0.198 -0.082 0.140 367s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 367s x0: [4, 4, 1, 1] 367s 0.937 0.186 -0.699 0.510 367s 0.099 -0.597 -0.689 -0.864 367s -0.002 0.494 0.521 0.729 367s 0.353 -0.374 0.318 -0.860 367s 367s x1: [4, 2, 1, 1] 367s 0.699 -0.475 -0.273 0.525 367s -0.486 0.635 -0.690 -0.012 367s 367s y: [4, 2, 1, 1] 367s 1.024 0.087 0.005 -0.114 367s 0.139 0.059 -0.054 -0.617 367s testing: mul_mat, [4, 4, 1, 1] = [4, 4, 1, 1] * [4, 4, 1, 1] 367s x0: [4, 4, 1, 1] 367s -0.075 -0.188 0.162 -0.324 367s 0.318 0.641 -0.053 -0.227 367s 0.015 0.629 -0.367 0.763 367s 0.328 0.123 0.298 -0.261 367s 367s x1: [4, 4, 1, 1] 367s -0.535 -0.937 0.536 -0.506 367s -0.296 0.298 -0.807 0.228 367s -0.975 0.718 0.743 0.660 367s -0.972 -0.270 0.927 0.965 367s 367s y: [4, 4, 1, 1] 367s 0.467 -0.684 -1.180 0.001 367s -0.238 0.088 0.654 -0.361 367s -0.156 -0.039 0.668 -0.183 367s -0.039 -0.750 0.212 -0.328 367s test-mul-mat0: iter:360/500 367s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 367s x0: [4, 3, 1, 1] 367s 0.987 0.472 -0.709 0.628 367s 0.101 0.414 -0.210 -0.952 367s -0.953 -0.912 -0.276 -0.180 367s 367s x1: [4, 3, 1, 1] 367s -0.549 -0.329 0.632 -0.831 367s -0.587 0.292 -0.803 0.144 367s 0.219 -0.838 -0.027 -0.856 367s 367s y: [3, 3, 1, 1] 367s -1.668 0.466 0.799 367s 0.218 0.093 0.488 367s -0.698 0.496 0.717 367s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 367s x0: [4, 4, 1, 1] 367s 0.133 -0.121 0.087 -0.894 367s 0.023 -0.433 -0.907 -0.505 367s -0.141 0.722 0.597 -0.727 367s -0.488 0.645 -0.680 -0.401 367s 367s x1: [4, 3, 1, 1] 367s 0.851 0.463 0.129 -0.892 367s -0.095 -0.264 -0.441 0.576 367s -0.632 -0.272 0.990 0.660 367s 367s y: [4, 3, 1, 1] 367s 0.866 0.153 0.939 0.153 367s -0.534 0.221 -0.859 -0.055 367s -0.555 -1.127 0.003 -0.804 367s testing: mul_mat, [2, 4, 1, 4] = [4, 2, 1, 4] * [4, 4, 1, 4] 367s x0: [4, 2, 1, 4] 367s -0.420 -0.489 -0.074 -0.811 367s -0.199 0.519 -0.889 -0.537 367s 367s x1: [4, 4, 1, 4] 367s -0.631 0.140 -0.550 0.832 367s -0.731 -0.442 -0.263 0.004 367s 0.117 -0.687 0.372 0.845 367s -0.698 0.032 -0.230 0.436 367s 367s y: [2, 4, 1, 4] 367s -0.437 0.241 367s 0.539 0.148 367s -0.426 -1.164 367s -0.059 0.126 367s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 367s x0: [4, 4, 1, 1] 367s 0.195 -0.928 0.090 -0.194 367s 0.842 0.056 -0.465 0.116 367s 0.859 -0.739 -0.711 0.612 367s -0.042 0.794 0.320 0.746 367s 367s x1: [4, 2, 1, 1] 367s -0.227 0.200 0.417 0.574 367s -0.281 0.528 -0.963 -0.324 367s 367s y: [4, 2, 1, 1] 367s -0.303 -0.308 -0.288 0.730 367s -0.569 0.203 -0.145 -0.118 367s testing: mul_mat, [1, 4, 1, 1] = [4, 1, 1, 1] * [4, 4, 1, 1] 367s x0: [4, 1, 1, 1] 367s -0.710 -0.051 0.219 0.346 367s 367s x1: [4, 4, 1, 1] 367s -0.849 -0.564 -0.607 -0.587 367s -0.290 0.622 0.185 0.910 367s 0.039 -0.241 -0.371 -0.432 367s -0.204 0.306 -0.569 0.991 367s 367s y: [1, 4, 1, 1] 367s 0.295 367s 0.529 367s -0.246 367s 0.347 367s testing: mul_mat, [2, 4, 1, 4] = [1, 2, 1, 4] * [1, 4ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s , 1, 4] 367s x0: [1, 2, 1, 4] 367s -0.576 367s 0.352 367s 367s x1: [1, 4, 1, 4] 367s 0.210 367s 0.013 367s -0.564 367s 0.746 367s 367s y: [2, 4, 1, 4] 367s -0.121 0.074 367s -0.008 0.005 367s 0.325 -0.199 367s -0.429 0.262 367s test-mul-mat0: iter:361/500 367s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 367s x0: [1, 2, 1, 1] 367s 0.873 367s 0.739 367s 367s x1: [1, 1, 1, 1] 367s -0.274 367s 367s y: [2, 1, 1, 1] 367s -0.239 -0.203 367s testing: mul_mat, [3, 2, 4, 1] = [1, 3, 4, 1] * [1, 2, 4, 1] 367s x0: [1, 3, 4, 1] 367s 0.349 367s 0.160 367s 0.010 367s 367s x1: [1, 2, 4, 1] 367s -0.578 367s 0.619 367s 367s y: [3, 2, 4, 1] 367s -0.202 -0.092 -0.006 367s 0.216 0.099 0.006 367s testing: mul_mat, [2, 3, 4, 1] = [1, 2, 4, 1] * [1, 3, 4, 1] 367s x0: [1, 2, 4, 1] 367s -0.681 367s 0.865 367s 367s x1: [1, 3, 4, 1] 367s 0.700 367s 0.094 367s 0.540 367s 367s y: [2, 3, 4, 1] 367s -0.476 0.605 367s -0.064 0.082 367s -0.368 0.467 367s testing: mul_mat, [1, 2, 1, 1] = [1, 1, 1, 1] * [1, 2, 1, 1] 367s x0: [1, 1, 1, 1] 367s -0.614 367s 367s x1: [1, 2, 1, 1] 367s 0.209 367s 0.075 367s 367s y: [1, 2, 1, 1] 367s -0.128 367s -0.046 367s testing: mul_mat, [1, 1, 4, 1] = [1, 1, 4, 1] * [1, 1, 4, 1] 367s x0: [1, 1, 4, 1] 367s -0.683 367s 367s x1: [1, 1, 4, 1] 367s 0.362 367s 367s y: [1, 1, 4, 1] 367s -0.247 367s testing: mul_mat, [2, 1, 4, 1] = [1, 2, 4, 1] * [1, 1, 4, 1] 367s x0: [1, 2, 4, 1] 367s -0.864 367s 0.210 367s 367s x1: [1, 1, 4, 1] 367s 0.608 367s 367s y: [2, 1, 4, 1] 367s -0.526 0.128 367s test-mul-mat0: iter:362/500 367s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 367s x0: [3, 2, 1, 1] 367s 0.884 0.059 -0.725 367s -0.626 0.762 -0.192 367s 367s x1: [3, 1, 1, 1] 367s 0.520 0.497 0.697 367s 367s y: [2, 1, 1, 1] 367s -0.016 -0.081 367s testing: mul_mat, [1, 2, 4, 1] = [3, 1, 4, 1] * [3, 2, 4, 1] 367s x0: [3, 1, 4, 1] 367s -0.798 -0.565 -0.422 367s 367s x1: [3, 2, 4, 1] 367s -0.309 -0.836 -0.564 367s -0.452 0.772 0.798 367s 367s y: [1, 2, 4, 1] 367s 0.957 367s -0.413 367s testing: mul_mat, [3, 1, 4, 4] = [3, 3, 4, 4] * [3, 1, 4, 4] 367s x0: [3, 3, 4, 4] 367s -0.842 -0.609 -0.316 367s 0.166 0.252 0.686 367s 0.559 0.379 0.049 367s 367s x1: [3, 1, 4, 4] 367s -0.576 0.446 -0.499 367s 367s y: [3, 1, 4, 4] 367s 0.371 -0.326 -0.177 367s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 367s x0: [3, 2, 1, 1] 367s 0.276 0.347 -0.157 367s 0.355 -0.771 0.689 367s 367s x1: [3, 3, 1, 1] 367s -0.816 -0.674 0.213 367s -0.730 -0.328 -0.732 367s 0.650 0.985 0.149 367s 367s y: [2, 3, 1, 1] 367s -0.493 0.376 367s -0.201 -0.511 367s 0.498 -0.426 367s testing: mul_mat, [3, 3, 4, 1] = [2, 3, 4, 1] * [2, 3, 4, 1] 367s x0: [2, 3, 4, 1] 367s 0.969 -0.802 367s 0.228 -0.929 367s -0.879 0.809 367s 367s x1: [2, 3, 4, 1] 367s -0.113 0.581 367s -0.993 -0.301 367s 0.326 0.773 367s 367s y: [3, 3, 4, 1] 367s -0.576 -0.566 0.570 367s -0.720 0.053 0.629 367s -0.304 -0.644 0.339 367s testing: mul_mat, [1, 2, 4, 4] = [3, 1, 4, 4] * [3, 2, 4, 4] 367s x0: [3, 1, 4, 4] 367s 0.585 0.220 -0.946 367s 367s x1: [3, 2, 4, 4] 367s -0.542 -0.552 -0.919 367s 0.268 0.141 0.954 367s 367s y: [1, 2, 4, 4] 367s 0.430 367s -0.714 367s test-mul-mat0: iter:363/500 367s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 367s x0: [1, 4, 1, 1] 367s -0.774 367s -0.203 367s -0.595 367s 0.195 367s 367s x1: [1, 1, 1, 1] 367s -0.838 367s 367s y: [4, 1, 1, 1] 367s 0.649 0.170 0.499 -0.163 367s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 367s x0: [1, 2, 1, 1] 367s -0.225 367s -0.801 367s 367s x1: [1, 4, 1, 1] 367s -0.671 367s -0.981 367s -0.625 367s 0.237 367s 367s y: [2, 4, 1, 1] 367s 0.151 0.537 367s 0.220 0.785 367s 0.140 0.501 367s -0.053 -0.190 367s testing: mul_mat, [1, 2, 1, 2] = [1, 1, 1, 2] * [1, 2, 1, 2] 367s x0: [1, 1, 1, 2] 367s -0.211 367s 367s x1: [1, 2, 1, 2] 367s 0.581 367s -0.890 367s 367s y: [1, 2, 1, 2] 367s -0.122 367s 0.188 367s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 367s x0: [1, 1, 1, 1] 367s -0.382 367s 367s x1: [1, 1, 1, 1] 367s 0.138 367s 367s y: [1, 1, 1, 1] 367s -0.053 367s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 367s x0: [1, 2, 1, 1] 367s 0.813 367s -0.285 367s 367s x1: [1, 1, 1, 1] 367s -0.349 367s 367s y: [2, 1, 1, 1] 367s -0.284 0.099 367s testing: mul_mat, [4, 1, 1, 2] = [2, 4, 1, 2] * [2, 1, 1, 2] 367s x0: [2, 4, 1, 2] 367s 0.407 -0.269 367s 0.536 0.206 367s -0.123 0.831 367s 0.812 0.106 367s 367s x1: [2, 1, 1, 2] 367s 0.328 0.918 367s 367s y: [4, 1, 1, 2] 367s -0.114 0.365 0.723 0.364 367s test-mul-mat0: iter:364/500 367s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 367s x0: [3, 2, 1, 1] 367s -0.659 -0.854 -0.327 367s 0.547 0.977 0.779 367s 367s x1: [3, 4, 1, 1] 367s -0.015 0.651 0.833 367s 0.768 0.464 -0.452 367s 0.096 0.383 -0.073 367s -0.594 0.463 -0.666 367s 367s y: [2, 4, 1, 1] 367s -0.819 1.276 367s -0.755 0.522 367s -0.367 0.370 367s 0.213 -0.390 367s testing: mul_mat, [4, 2, 4, 1] = [3, 4, 4, 1] * [3, 2, 4, 1] 367s x0: [3, 4, 4, 1] 367s 0.846 -0.337 -0.907 367s 0.393 -0.360 0ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s .872 367s 0.383 0.667 0.531 367s -0.984 0.326 0.181 367s 367s x1: [3, 2, 4, 1] 367s 0.990 0.027 0.659 367s -0.367 0.659 0.650 367s 367s y: [4, 2, 4, 1] 367s 0.230 0.954 0.748 -0.846 367s -1.122 0.186 0.644 0.694 367s testing: mul_mat, [4, 4, 4, 1] = [3, 4, 4, 1] * [3, 4, 4, 1] 367s x0: [3, 4, 4, 1] 367s -0.899 -0.513 0.780 367s -0.013 -0.077 0.033 367s -0.416 -0.042 0.253 367s -0.272 0.467 0.210 367s 367s x1: [3, 4, 4, 1] 367s -0.775 -0.285 -0.500 367s -0.317 0.667 0.733 367s -0.603 0.691 0.008 367s -0.707 0.193 0.788 367s 367s y: [4, 4, 4, 1] 367s 0.453 0.015 0.208 -0.027 367s 0.515 -0.023 0.289 0.552 367s 0.194 -0.046 0.223 0.489 367s 1.151 0.020 0.485 0.448 367s testing: mul_mat, [1, 4, 1, 1] = [3, 1, 1, 1] * [3, 4, 1, 1] 367s x0: [3, 1, 1, 1] 367s 0.367 -0.737 -0.504 367s 367s x1: [3, 4, 1, 1] 367s -0.088 0.186 -0.633 367s 0.162 -0.835 0.905 367s -0.365 -0.757 -0.742 367s 0.447 0.832 0.210 367s 367s y: [1, 4, 1, 1] 367s 0.150 367s 0.218 367s 0.798 367s -0.555 367s testing: mul_mat, [4, 3, 4, 1] = [1, 4, 4, 1] * [1, 3, 4, 1] 367s x0: [1, 4, 4, 1] 367s 0.945 367s -0.797 367s 0.045 367s 0.131 367s 367s x1: [1, 3, 4, 1] 367s -0.319 367s -0.697 367s 0.187 367s 367s y: [4, 3, 4, 1] 367s -0.302 0.254 -0.014 -0.042 367s -0.659 0.555 -0.031 -0.092 367s 0.177 -0.149 0.008 0.025 367s testing: mul_mat, [4, 1, 4, 1] = [4, 4, 4, 1] * [4, 1, 4, 1] 367s x0: [4, 4, 4, 1] 367s 0.697 -0.481 -0.261 -0.951 367s 0.222 -0.995 0.033 -0.831 367s 0.529 -0.255 0.430 -0.110 367s 0.904 -0.941 -0.317 0.561 367s 367s x1: [4, 1, 4, 1] 367s -0.793 0.512 0.419 -0.490 367s 367s y: [4, 1, 4, 1] 367s -0.443 -0.265 -0.316 -1.607 367s test-mul-mat0: iter:365/500 367s testing: mul_mat, [1, 4, 1, 1] = [3, 1, 1, 1] * [3, 4, 1, 1] 367s x0: [3, 1, 1, 1] 367s -0.943 -0.882 -0.119 367s 367s x1: [3, 4, 1, 1] 367s 0.353 -0.742 0.993 367s 0.019 -0.722 0.948 367s -0.266 -0.533 -0.208 367s -0.990 -0.756 0.036 367s 367s y: [1, 4, 1, 1] 367s 0.203 367s 0.506 367s 0.746 367s 1.596 367s testing: mul_mat, [3, 1, 4, 1] = [3, 3, 4, 1] * [3, 1, 4, 1] 367s x0: [3, 3, 4, 1] 367s -0.985 -0.278 0.303 367s -0.727 -0.285 -0.678 367s -0.449 -0.337 0.056 367s 367s x1: [3, 1, 4, 1] 367s -0.015 0.753 0.499 367s 367s y: [3, 1, 4, 1] 367s -0.043 -0.542 -0.219 367s testing: mul_mat, [3, 3, 4, 1] = [3, 3, 4, 1] * [3, 3, 4, 1] 367s x0: [3, 3, 4, 1] 367s -0.174 -0.733 -0.944 367s -0.386 -0.285 0.678 367s 0.895 0.468 0.664 367s 367s x1: [3, 3, 4, 1] 367s 0.053 -0.152 0.068 367s -0.891 0.866 -0.478 367s 0.175 0.129 0.014 367s 367s y: [3, 3, 4, 1] 367s 0.038 0.069 0.021 367s -0.029 -0.227 -0.709 367s -0.138 -0.095 0.226 367s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 367s x0: [3, 2, 1, 1] 367s 0.779 0.387 -0.022 367s 0.963 0.120 0.811 367s 367s x1: [3, 3, 1, 1] 367s 0.135 -0.968 0.709 367s -0.202 -0.799 0.781 367s -0.103 0.795 -0.293 367s 367s y: [2, 3, 1, 1] 367s -0.286 0.588 367s -0.484 0.343 367s 0.234 -0.242 367s testing: mul_mat, [3, 3, 4, 1] = [2, 3, 4, 1] * [2, 3, 4, 1] 367s x0: [2, 3, 4, 1] 367s 0.690 -0.190 367s 0.964 -0.059 367s 0.871 0.682 367s 367s x1: [2, 3, 4, 1] 367s 0.508 -0.518 367s -0.341 0.348 367s 0.527 -0.221 367s 367s y: [3, 3, 4, 1] 367s 0.449 0.520 0.089 367s -0.301 -0.349 -0.060 367s 0.406 0.521 0.308 367s testing: mul_mat, [1, 2, 4, 1] = [3, 1, 4, 1] * [3, 2, 4, 1] 367s x0: [3, 1, 4, 1] 367s -0.876 0.505 -0.471 367s 367s x1: [3, 2, 4, 1] 367s 0.240 -0.168 0.329 367s -0.366 -0.258 0.329 367s 367s y: [1, 2, 4, 1] 367s -0.450 367s 0.036 367s test-mul-mat0: iter:366/500 367s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 367s x0: [3, 1, 1, 1] 367s 0.015 -0.341 -0.144 367s 367s x1: [3, 2, 1, 1] 367s 0.470 -0.250 0.517 367s 0.030 -0.593 -0.472 367s 367s y: [1, 2, 1, 1] 367s 0.018 367s 0.271 367s testing: mul_mat, [3, 1, 1, 1] = [3, 3, 1, 1] * [3, 1, 1, 1] 367s x0: [3, 3, 1, 1] 367s 0.953 -0.941 0.003 367s -0.518 -0.075 0.225 367s 0.046 0.031 -0.721 367s 367s x1: [3, 1, 1, 1] 367s -0.204 0.946 0.935 367s 367s y: [3, 1, 1, 1] 367s -1.081 0.245 -0.655 367s testing: mul_mat, [1, 3, 1, 4] = [3, 1, 1, 4] * [3, 3, 1, 4] 367s x0: [3, 1, 1, 4] 367s -0.614 0.738 0.803 367s 367s x1: [3, 3, 1, 4] 367s -0.060 0.771 -0.925 367s -0.837 0.628 -0.697 367s -0.321 -0.526 0.773 367s 367s y: [1, 3, 1, 4] 367s -0.137 367s 0.418 367s 0.430 367s testing: mul_mat, [3, 1, 1, 1] = [3, 3, 1, 1] * [3, 1, 1, 1] 367s x0: [3, 3, 1, 1] 367s 0.625 0.183 0.787 367s 0.343 -0.822 -0.957 367s 0.442 -0.183 -0.531 367s 367s x1: [3, 1, 1, 1] 367s -0.852 -0.452 0.433 367s 367s y: [3, 1, 1, 1] 367s -0.274 -0.335 -0.524 367s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 367s x0: [3, 4, 1, 1] 367s -0.590 -0.698 0.538 367s -0.364 0.390 -0.370 367s -0.111 -0.918 0.318 367s -0.343 0.885 -0.168 367s 367s x1: [3, 3, 1, 1] 367s 0.332 0.680 -0.803 367s 0.039 -0.056 -0.773 367s 0.425 -0.318 -0.970 367s 367s y: [4, 3, 1,ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s 1] 367s -1.102 0.442 -0.916 0.623 367s -0.400 0.250 -0.199 0.067 367s -0.551 0.080 -0.063 -0.263 367s testing: mul_mat, [2, 3, 1, 4] = [4, 2, 1, 4] * [4, 3, 1, 4] 367s x0: [4, 2, 1, 4] 367s -0.683 -0.762 -0.357 -0.995 367s 0.388 0.466 0.190 0.981 367s 367s x1: [4, 3, 1, 4] 367s -0.745 -0.339 -0.726 0.437 367s -0.162 0.091 0.225 -0.118 367s 0.560 -0.443 -0.439 0.757 367s 367s y: [2, 3, 1, 4] 367s 0.592 -0.156 367s 0.079 -0.094 367s -0.641 0.670 367s test-mul-mat0: iter:367/500 367s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 367s x0: [3, 2, 1, 1] 367s 0.118 0.635 -0.329 367s -0.890 0.806 0.668 367s 367s x1: [3, 1, 1, 1] 367s -0.538 0.035 0.687 367s 367s y: [2, 1, 1, 1] 367s -0.267 0.966 367s testing: mul_mat, [2, 2, 2, 1] = [3, 2, 2, 1] * [3, 2, 2, 1] 367s x0: [3, 2, 2, 1] 367s -0.056 -0.655 0.371 367s -0.491 0.408 -0.357 367s 367s x1: [3, 2, 2, 1] 367s -0.715 0.636 -0.682 367s -0.266 -0.650 0.504 367s 367s y: [2, 2, 2, 1] 367s -0.629 0.853 367s 0.627 -0.314 367s testing: mul_mat, [4, 2, 2, 2] = [3, 4, 2, 2] * [3, 2, 2, 2] 367s x0: [3, 4, 2, 2] 367s 0.419 0.080 -0.815 367s 0.993 0.880 0.780 367s 0.686 -0.685 -0.954 367s 0.488 -0.444 0.979 367s 367s x1: [3, 2, 2, 2] 367s 0.693 -0.565 -0.734 367s 0.803 -0.759 0.934 367s 367s y: [4, 2, 2, 2] 367s 0.843 -0.382 1.562 -0.130 367s -0.486 0.857 0.179 1.644 367s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 367s x0: [3, 2, 1, 1] 367s -0.125 -0.050 0.610 367s 0.132 -0.395 -0.821 367s 367s x1: [3, 4, 1, 1] 367s 0.754 0.865 -0.421 367s 0.284 -0.169 -0.605 367s 0.028 -0.138 0.581 367s 0.331 0.329 -0.353 367s 367s y: [2, 4, 1, 1] 367s -0.395 0.103 367s -0.396 0.601 367s 0.358 -0.419 367s -0.273 0.203 367s testing: mul_mat, [4, 3, 2, 1] = [2, 4, 2, 1] * [2, 3, 2, 1] 367s x0: [2, 4, 2, 1] 367s -0.854 -0.479 367s -0.874 0.041 367s -0.988 0.839 367s -0.023 -0.898 367s 367s x1: [2, 3, 2, 1] 367s -0.885 0.773 367s 0.724 0.671 367s 0.425 0.825 367s 367s y: [4, 3, 2, 1] 367s 0.386 0.804 1.522 -0.674 367s -0.940 -0.605 -0.152 -0.620 367s -0.758 -0.338 0.273 -0.750 367s testing: mul_mat, [1, 2, 2, 2] = [4, 1, 2, 2] * [4, 2, 2, 2] 367s x0: [4, 1, 2, 2] 367s 0.205 -0.309 0.585 0.640 367s 367s x1: [4, 2, 2, 2] 367s 0.492 0.100 -0.237 0.265 367s -0.175 -0.565 -0.311 -0.351 367s 367s y: [1, 2, 2, 2] 367s 0.100 367s -0.268 367s test-mul-mat0: iter:368/500 367s testing: mul_mat, [3, 1, 1, 1] = [3, 3, 1, 1] * [3, 1, 1, 1] 367s x0: [3, 3, 1, 1] 367s -0.502 -0.011 -0.034 367s -0.842 0.431 -0.607 367s -0.081 -0.364 0.084 367s 367s x1: [3, 1, 1, 1] 367s -0.021 0.759 0.194 367s 367s y: [3, 1, 1, 1] 367s -0.004 0.227 -0.258 367s testing: mul_mat, [4, 3, 3, 1] = [3, 4, 3, 1] * [3, 3, 3, 1] 367s x0: [3, 4, 3, 1] 367s 0.614 -0.806 -0.625 367s -0.882 -0.530 -0.161 367s -0.143 -0.973 -0.975 367s -0.164 0.481 0.556 367s 367s x1: [3, 3, 3, 1] 367s -0.496 -0.724 -0.536 367s -0.261 0.557 0.186 367s 0.979 0.454 0.531 367s 367s y: [4, 3, 3, 1] 367s 0.614 0.907 1.298 -0.565 367s -0.726 -0.095 -0.686 0.414 367s -0.096 -1.189 -1.099 0.352 367s testing: mul_mat, [4, 4, 3, 1] = [3, 4, 3, 1] * [3, 4, 3, 1] 367s x0: [3, 4, 3, 1] 367s 0.134 0.581 0.689 367s -0.068 0.483 0.964 367s -0.570 -0.835 0.981 367s 0.573 -0.473 0.779 367s 367s x1: [3, 4, 3, 1] 367s 0.019 0.601 0.156 367s 0.044 -0.563 -0.362 367s -0.401 0.046 -0.188 367s 0.798 0.902 -0.725 367s 367s y: [4, 4, 3, 1] 367s 0.459 0.440 -0.359 -0.152 367s -0.571 -0.625 0.090 0.009 367s -0.157 -0.132 0.006 -0.398 367s 0.131 -0.318 -1.920 -0.535 367s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 367s x0: [3, 4, 1, 1] 367s 0.294 -0.807 0.532 367s -0.562 -0.841 -0.568 367s -0.115 0.532 0.219 367s -0.129 0.439 0.440 367s 367s x1: [3, 4, 1, 1] 367s 0.432 -0.723 0.137 367s 0.413 0.849 0.664 367s 0.192 0.345 -0.834 367s 0.086 0.672 0.156 367s 367s y: [4, 4, 1, 1] 367s 0.783 0.288 -0.404 -0.313 367s -0.211 -1.323 0.549 0.611 367s -0.666 0.076 -0.022 -0.240 367s -0.434 -0.702 0.381 0.352 367s testing: mul_mat, [1, 3, 3, 1] = [4, 1, 3, 1] * [4, 3, 3, 1] 367s x0: [4, 1, 3, 1] 367s -0.752 0.683 -0.208 -0.230 367s 367s x1: [4, 3, 3, 1] 367s -0.790 -0.866 -0.071 0.677 367s 0.980 0.901 0.109 -0.743 367s 0.039 -0.478 -0.894 -0.298 367s 367s y: [1, 3, 3, 1] 367s -0.139 367s 0.028 367s -0.101 367s testing: mul_mat, [2, 4, 3, 1] = [1, 2, 3, 1] * [1, 4, 3, 1] 367s x0: [1, 2, 3, 1] 367s 0.602 367s -0.643 367s 367s x1: [1, 4, 3, 1] 367s -0.808 367s -0.220 367s -0.152 367s 0.985 367s 367s y: [2, 4, 3, 1] 367s -0.487 0.520 367s -0.133 0.141 367s -0.092 0.098 367s 0.593 -0.633 367s test-mul-mat0: iter:369/500 367s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 367s x0: [3, 3, 1, 1] 367s 0.831 0.704 0.707 367s 0.334 0.640 -0.405 367s -0.069 0.059 -0.265 367s 367s x1: [3, 3, 1, 1] 367s 0.256 -0.766 -0.756 367s -0.504 -0.403 0.52ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s 8 367s -0.141 -0.017 0.719 367s 367s y: [3, 3, 1, 1] 367s -0.860 -0.098 0.137 367s -0.329 -0.640 -0.129 367s 0.379 -0.349 -0.182 367s testing: mul_mat, [1, 3, 3, 1] = [3, 1, 3, 1] * [3, 3, 3, 1] 367s x0: [3, 1, 3, 1] 367s -0.807 0.724 -0.262 367s 367s x1: [3, 3, 3, 1] 367s 0.841 0.401 -0.662 367s -0.802 0.596 0.079 367s 0.982 0.544 -0.930 367s 367s y: [1, 3, 3, 1] 367s -0.215 367s 1.057 367s -0.155 367s testing: mul_mat, [2, 1, 3, 3] = [3, 2, 3, 3] * [3, 1, 3, 3] 367s x0: [3, 2, 3, 3] 367s 0.930 -0.689 -0.289 367s -0.855 -0.424 -0.064 367s 367s x1: [3, 1, 3, 3] 367s -0.786 -0.735 -0.775 367s 367s y: [2, 1, 3, 3] 367s 0.000 1.034 367s testing: mul_mat, [3, 2, 1, 1] = [3, 3, 1, 1] * [3, 2, 1, 1] 367s x0: [3, 3, 1, 1] 367s 0.153 -0.906 0.970 367s -0.475 -0.704 0.876 367s -0.501 -0.837 -0.049 367s 367s x1: [3, 2, 1, 1] 367s 0.446 0.249 0.202 367s -0.367 -0.077 -0.102 367s 367s y: [3, 2, 1, 1] 367s 0.039 -0.210 -0.441 367s -0.086 0.139 0.253 367s testing: mul_mat, [3, 3, 3, 1] = [3, 3, 3, 1] * [3, 3, 3, 1] 367s x0: [3, 3, 3, 1] 367s -0.924 -0.087 0.738 367s -0.837 0.797 -0.062 367s -0.765 -0.835 0.449 367s 367s x1: [3, 3, 3, 1] 367s -0.163 0.634 0.931 367s -0.175 0.142 0.284 367s 0.664 -0.970 0.291 367s 367s y: [3, 3, 3, 1] 367s 0.782 0.584 0.013 367s 0.359 0.242 0.144 367s -0.314 -1.347 0.432 367s testing: mul_mat, [4, 3, 3, 3] = [3, 4, 3, 3] * [3, 3, 3, 3] 367s x0: [3, 4, 3, 3] 367s 0.040 0.701 -0.467 367s 0.661 -0.849 -0.036 367s -0.575 0.962 -0.985 367s -0.315 -0.611 -0.962 367s 367s x1: [3, 3, 3, 3] 367s -0.534 -0.803 -0.422 367s 0.749 -0.728 -0.259 367s 0.984 0.185 -0.462 367s 367s y: [4, 3, 3, 3] 367s -0.388 0.344 -0.051 1.065 367s -0.360 1.122 -0.876 0.458 367s 0.385 0.510 0.068 0.021 367s test-mul-mat0: iter:370/500 367s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 367s x0: [2, 2, 1, 1] 367s -0.421 -0.202 367s -0.673 0.877 367s 367s x1: [2, 4, 1, 1] 367s -0.174 -0.412 367s 0.854 0.935 367s -0.047 -0.590 367s 0.399 0.533 367s 367s y: [2, 4, 1, 1] 367s 0.157 -0.244 367s -0.549 0.245 367s 0.139 -0.485 367s -0.276 0.199 367s testing: mul_mat, [1, 2, 1, 1] = [2, 1, 1, 1] * [2, 2, 1, 1] 367s x0: [2, 1, 1, 1] 367s 0.130 0.205 367s 367s x1: [2, 2, 1, 1] 367s -0.122 0.923 367s -0.623 0.607 367s 367s y: [1, 2, 1, 1] 367s 0.174 367s 0.044 367s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 367s x0: [2, 3, 1, 1] 367s -0.094 0.323 367s 0.112 0.182 367s 0.565 -0.230 367s 367s x1: [2, 1, 1, 1] 367s 0.179 -0.855 367s 367s y: [3, 1, 1, 1] 367s -0.293 -0.135 0.298 367s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 367s x0: [2, 1, 1, 1] 367s 0.532 -0.071 367s 367s x1: [2, 3, 1, 1] 367s 0.118 0.268 367s 0.596 0.706 367s 0.122 0.530 367s 367s y: [1, 3, 1, 1] 367s 0.044 367s 0.267 367s 0.027 367s testing: mul_mat, [3, 2, 1, 1] = [1, 3, 1, 1] * [1, 2, 1, 1] 367s x0: [1, 3, 1, 1] 367s -0.011 367s 0.125 367s 0.385 367s 367s x1: [1, 2, 1, 1] 367s -0.808 367s -0.202 367s 367s y: [3, 2, 1, 1] 367s 0.009 -0.101 -0.311 367s 0.002 -0.025 -0.078 367s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 367s x0: [3, 1, 1, 1] 367s 0.584 -0.108 -0.320 367s 367s x1: [3, 1, 1, 1] 367s 0.867 0.049 0.762 367s 367s y: [1, 1, 1, 1] 367s 0.257 367s test-mul-mat0: iter:371/500 367s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 367s x0: [3, 3, 1, 1] 367s 0.597 -0.667 0.846 367s -0.474 -0.475 -0.355 367s -0.966 0.514 0.770 367s 367s x1: [3, 3, 1, 1] 367s -0.640 0.701 -0.150 367s 0.925 -0.530 0.968 367s 0.193 -0.934 0.674 367s 367s y: [3, 3, 1, 1] 367s -0.975 0.024 0.863 367s 1.725 -0.531 -0.421 367s 1.308 0.113 -0.147 367s testing: mul_mat, [4, 3, 3, 1] = [3, 4, 3, 1] * [3, 3, 3, 1] 367s x0: [3, 4, 3, 1] 367s -0.482 0.700 0.165 367s 0.937 0.081 0.983 367s 0.118 0.937 -0.614 367s 0.191 0.473 0.553 367s 367s x1: [3, 3, 3, 1] 367s 0.419 0.381 -0.181 367s 0.181 -0.144 -0.597 367s -0.927 0.536 0.166 367s 367s y: [4, 3, 3, 1] 367s 0.035 0.245 0.518 0.160 367s -0.286 -0.429 0.253 -0.364 367s 0.849 -0.661 0.291 0.169 367s testing: mul_mat, [4, 4, 3, 4] = [3, 4, 3, 4] * [3, 4, 3, 4] 367s x0: [3, 4, 3, 4] 367s 0.275 0.343 -0.753 367s 0.852 0.871 0.091 367s -0.994 0.524 -0.940 367s 0.887 -0.429 -0.551 367s 367s x1: [3, 4, 3, 4] 367s 0.952 -0.906 0.620 367s -0.661 0.285 0.093 367s 0.891 0.586 -0.310 367s -0.720 0.358 -0.887 367s 367s y: [4, 4, 3, 4] 367s -0.516 0.079 -2.003 0.892 367s -0.154 -0.306 0.720 -0.760 367s 0.679 1.242 -0.288 0.709 367s 0.593 -0.383 1.738 -0.304 367s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 367s x0: [3, 3, 1, 1] 367s -0.102 0.916 -0.600 367s -0.182 -0.398 0.442 367s 0.843 -0.975 -0.161 367s 367s x1: [3, 4, 1, 1] 367s -0.523 -0.912 0.716 367s -0.785 -0.791 -0.928 367s 0.357 -0.035 -0.639 367s -0.385 -0.764 -0.354 367s 367s y: [3, 4, 1, 1] 367s -1.212 0.775 0.333 367s -0.089 0.048 0.259 367s 0.315 -0ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s .334 0.439 367s -0.449 0.218 0.478 367s testing: mul_mat, [2, 3, 3, 1] = [3, 2, 3, 1] * [3, 3, 3, 1] 367s x0: [3, 2, 3, 1] 367s -0.848 -0.759 0.355 367s -0.447 -0.790 -0.216 367s 367s x1: [3, 3, 3, 1] 367s -0.941 -0.198 -0.457 367s -0.380 0.968 0.693 367s 0.440 0.440 -0.103 367s 367s y: [2, 3, 3, 1] 367s 0.787 0.676 367s -0.167 -0.745 367s -0.743 -0.522 367s testing: mul_mat, [3, 3, 3, 4] = [2, 3, 3, 4] * [2, 3, 3, 4] 367s x0: [2, 3, 3, 4] 367s 0.834 0.257 367s 0.758 0.657 367s 0.699 0.192 367s 367s x1: [2, 3, 3, 4] 367s 0.808 -0.273 367s 0.719 0.913 367s 0.664 0.410 367s 367s y: [3, 3, 3, 4] 367s 0.604 0.433 0.513 367s 0.834 1.145 0.678 367s 0.659 0.773 0.543 367s test-mul-mat0: iter:372/500 367s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 367s x0: [1, 1, 1, 1] 367s 0.609 367s 367s x1: [1, 1, 1, 1] 367s -0.720 367s 367s y: [1, 1, 1, 1] 367s -0.439 367s testing: mul_mat, [1, 1, 2, 1] = [1, 1, 2, 1] * [1, 1, 2, 1] 367s x0: [1, 1, 2, 1] 367s -0.876 367s 367s x1: [1, 1, 2, 1] 367s -0.735 367s 367s y: [1, 1, 2, 1] 367s 0.644 367s testing: mul_mat, [2, 1, 2, 2] = [1, 2, 2, 2] * [1, 1, 2, 2] 367s x0: [1, 2, 2, 2] 367s 0.280 367s 0.577 367s 367s x1: [1, 1, 2, 2] 367s 0.144 367s 367s y: [2, 1, 2, 2] 367s 0.040 0.083 367s testing: mul_mat, [4, 2, 1, 1] = [1, 4, 1, 1] * [1, 2, 1, 1] 367s x0: [1, 4, 1, 1] 367s -0.741 367s -0.833 367s 0.640 367s 0.955 367s 367s x1: [1, 2, 1, 1] 367s -0.492 367s -0.628 367s 367s y: [4, 2, 1, 1] 367s 0.365 0.410 -0.315 -0.470 367s 0.465 0.523 -0.402 -0.600 367s testing: mul_mat, [3, 1, 2, 1] = [4, 3, 2, 1] * [4, 1, 2, 1] 367s x0: [4, 3, 2, 1] 367s 0.213 0.799 -0.433 0.561 367s 0.889 0.154 -0.566 -0.477 367s 0.463 0.315 -0.108 0.046 367s 367s x1: [4, 1, 2, 1] 367s -0.455 0.883 0.251 -0.175 367s 367s y: [3, 1, 2, 1] 367s 0.402 -0.326 0.032 367s testing: mul_mat, [2, 4, 2, 2] = [3, 2, 2, 2] * [3, 4, 2, 2] 367s x0: [3, 2, 2, 2] 367s 0.651 0.776 -0.526 367s -0.248 0.990 0.139 367s 367s x1: [3, 4, 2, 2] 367s -0.444 0.820 0.506 367s 0.416 -0.091 0.169 367s 0.735 -0.878 0.058 367s 0.198 0.921 -0.788 367s 367s y: [2, 4, 2, 2] 367s 0.081 0.993 367s 0.111 -0.169 367s -0.234 -1.043 367s 1.259 0.753 367s test-mul-mat0: iter:373/500 367s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 367s x0: [1, 3, 1, 1] 367s -0.716 367s 0.204 367s -0.415 367s 367s x1: [1, 4, 1, 1] 367s 0.563 367s 0.359 367s -0.492 367s 0.214 367s 367s y: [3, 4, 1, 1] 367s -0.403 0.115 -0.233 367s -0.257 0.073 -0.149 367s 0.352 -0.100 0.204 367s -0.153 0.044 -0.089 367s testing: mul_mat, [3, 3, 4, 1] = [1, 3, 4, 1] * [1, 3, 4, 1] 367s x0: [1, 3, 4, 1] 367s -0.047 367s 0.402 367s -0.954 367s 367s x1: [1, 3, 4, 1] 367s 0.424 367s -0.322 367s -0.868 367s 367s y: [3, 3, 4, 1] 367s -0.020 0.171 -0.405 367s 0.015 -0.130 0.308 367s 0.041 -0.349 0.829 367s testing: mul_mat, [1, 3, 4, 1] = [1, 1, 4, 1] * [1, 3, 4, 1] 367s x0: [1, 1, 4, 1] 367s -0.488 367s 367s x1: [1, 3, 4, 1] 367s -0.041 367s 0.112 367s 0.926 367s 367s y: [1, 3, 4, 1] 367s 0.020 367s -0.055 367s -0.452 367s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 367s x0: [1, 4, 1, 1] 367s 0.685 367s 0.126 367s -0.785 367s -0.959 367s 367s x1: [1, 1, 1, 1] 367s 0.724 367s 367s y: [4, 1, 1, 1] 367s 0.496 0.091 -0.568 -0.694 367s testing: mul_mat, [2, 1, 4, 1] = [4, 2, 4, 1] * [4, 1, 4, 1] 367s x0: [4, 2, 4, 1] 367s 0.047 0.479 0.676 0.943 367s 0.165 -0.770 0.088 -0.592 367s 367s x1: [4, 1, 4, 1] 367s 0.097 0.430 -0.319 0.432 367s 367s y: [2, 1, 4, 1] 367s 0.402 -0.599 367s testing: mul_mat, [3, 4, 4, 1] = [2, 3, 4, 1] * [2, 4, 4, 1] 367s x0: [2, 3, 4, 1] 367s -0.141 0.456 367s 0.996 0.995 367s 0.570 -0.162 367s 367s x1: [2, 4, 4, 1] 367s 0.914 0.368 367s -0.345 0.590 367s -0.544 -0.403 367s 0.999 0.268 367s 367s y: [3, 4, 4, 1] 367s 0.039 1.277 0.461 367s 0.318 0.243 -0.293 367s -0.107 -0.943 -0.245 367s -0.019 1.261 0.526 367s test-mul-mat0: iter:374/500 367s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 367s x0: [1, 3, 1, 1] 367s -0.118 367s 0.240 367s -0.923 367s 367s x1: [1, 3, 1, 1] 367s 0.762 367s -0.115 367s 0.670 367s 367s y: [3, 3, 1, 1] 367s -0.090 0.183 -0.704 367s 0.014 -0.028 0.106 367s -0.079 0.161 -0.618 367s testing: mul_mat, [2, 3, 3, 1] = [1, 2, 3, 1] * [1, 3, 3, 1] 367s x0: [1, 2, 3, 1] 367s 0.396 367s 0.860 367s 367s x1: [1, 3, 3, 1] 367s -0.123 367s -0.922 367s -0.759 367s 367s y: [2, 3, 3, 1] 367s -0.049 -0.106 367s -0.365 -0.794 367s -0.301 -0.653 367s testing: mul_mat, [1, 2, 3, 1] = [1, 1, 3, 1] * [1, 2, 3, 1] 367s x0: [1, 1, 3, 1] 367s -0.615 367s 367s x1: [1, 2, 3, 1] 367s 0.136 367s 0.019 367s 367s y: [1, 2, 3, 1] 367s -0.084 367s -0.012 367s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 367s x0: [1, 2, 1, 1] 367s 0.368 367s 0.295 367s 367s x1: [1, 1, 1, 1] 367s -0.945 367s 367s y: [2, 1, 1, 1] 367s -0.348 -0.279 367s testing: mul_mat, [4, 1, 3, 1] = [2, 4, 3, 1] * [2, 1, 3, 1] 367s x0: [2, 4, 3, 1] 367s -0.513 0.294 367s 0.898 0.128 367s 0.268 -0.899 367s 0.121 -0.813 367s 367s x1: [2, 1, 3, ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s 1] 367s -0.847 -0.755 367s 367s y: [4, 1, 3, 1] 367s 0.213 -0.857 0.452 0.511 367s testing: mul_mat, [4, 2, 3, 1] = [4, 4, 3, 1] * [4, 2, 3, 1] 367s x0: [4, 4, 3, 1] 367s -0.866 -0.720 0.882 -0.529 367s -0.624 -0.269 -0.725 0.630 367s -0.047 0.915 0.786 0.067 367s 0.948 -0.572 -0.474 0.934 367s 367s x1: [4, 2, 3, 1] 367s 0.967 -0.851 0.151 -0.639 367s 0.358 -0.356 -0.805 0.844 367s 367s y: [4, 2, 3, 1] 367s 0.247 -0.886 -0.749 0.735 367s -1.209 0.988 -0.918 1.713 367s test-mul-mat0: iter:375/500 367s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 367s x0: [4, 4, 1, 1] 367s -0.315 -0.341 -0.665 -0.541 367s -0.567 -0.102 -0.036 -0.877 367s 0.889 0.351 -0.913 -0.334 367s -0.737 -0.478 0.072 -0.758 367s 367s x1: [4, 2, 1, 1] 367s 0.205 0.845 -0.920 0.514 367s 0.141 -0.855 0.865 -0.167 367s 367s y: [4, 2, 1, 1] 367s -0.019 -0.620 1.147 -1.010 367s -0.237 0.122 -0.908 0.493 367s testing: mul_mat, [2, 4, 2, 1] = [4, 2, 2, 1] * [4, 4, 2, 1] 367s x0: [4, 2, 2, 1] 367s 0.456 0.055 -0.121 -0.612 367s -0.955 0.460 0.399 -0.451 367s 367s x1: [4, 4, 2, 1] 367s -0.109 -0.926 0.124 0.785 367s -0.264 -0.068 -0.010 -0.419 367s 0.011 -0.497 0.722 0.157 367s -0.632 -0.445 0.209 0.053 367s 367s y: [2, 4, 2, 1] 367s -0.597 -0.627 367s 0.134 0.406 367s -0.206 -0.022 367s -0.371 0.458 367s testing: mul_mat, [4, 2, 2, 3] = [4, 4, 2, 3] * [4, 2, 2, 3] 367s x0: [4, 4, 2, 3] 367s 0.431 0.078 -0.766 -0.120 367s -0.397 0.047 -0.115 0.044 367s 0.596 -0.346 0.455 0.096 367s 0.732 0.348 0.195 -0.440 367s 367s x1: [4, 2, 2, 3] 367s 0.482 -0.402 0.751 0.958 367s -0.632 -0.918 0.785 -0.775 367s 367s y: [4, 2, 2, 3] 367s -0.514 -0.255 0.860 -0.063 367s -0.853 0.084 0.224 -0.288 367s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 367s x0: [4, 2, 1, 1] 367s -0.666 -0.258 -0.608 -0.911 367s -0.833 0.566 -0.349 -0.232 367s 367s x1: [4, 4, 1, 1] 367s -0.543 0.783 -0.623 -0.922 367s -0.741 0.453 -0.005 0.840 367s -0.863 -0.061 0.873 0.622 367s -0.530 0.559 0.570 0.059 367s 367s y: [2, 4, 1, 1] 367s 1.378 1.327 367s -0.386 0.681 367s -0.507 0.236 367s -0.192 0.546 367s testing: mul_mat, [3, 4, 2, 1] = [2, 3, 2, 1] * [2, 4, 2, 1] 367s x0: [2, 3, 2, 1] 367s -0.674 0.896 367s 0.325 -0.616 367s 0.098 -0.168 367s 367s x1: [2, 4, 2, 1] 367s -0.093 -0.173 367s -0.073 -0.616 367s 0.784 -0.263 367s -0.158 0.567 367s 367s y: [3, 4, 2, 1] 367s -0.092 0.076 0.020 367s -0.502 0.355 0.096 367s -0.764 0.417 0.121 367s 0.615 -0.401 -0.111 367s testing: mul_mat, [1, 2, 2, 3] = [3, 1, 2, 3] * [3, 2, 2, 3] 367s x0: [3, 1, 2, 3] 367s 0.098 -0.027 0.829 367s 367s x1: [3, 2, 2, 3] 367s 0.884 0.710 0.132 367s -0.289 -0.363 0.516 367s 367s y: [1, 2, 2, 3] 367s 0.177 367s 0.409 367s test-mul-mat0: iter:376/500 367s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 367s x0: [3, 2, 1, 1] 367s 0.998 -0.714 0.059 367s 0.596 0.757 -0.843 367s 367s x1: [3, 1, 1, 1] 367s -0.436 0.036 -0.668 367s 367s y: [2, 1, 1, 1] 367s -0.500 0.331 367s testing: mul_mat, [3, 2, 3, 1] = [3, 3, 3, 1] * [3, 2, 3, 1] 367s x0: [3, 3, 3, 1] 367s -0.674 -0.020 -0.348 367s 0.362 0.312 0.107 367s 0.360 0.598 -0.834 367s 367s x1: [3, 2, 3, 1] 367s -0.432 0.586 0.318 367s -0.971 -0.307 -0.632 367s 367s y: [3, 2, 3, 1] 367s 0.168 0.060 -0.070 367s 0.881 -0.515 -0.005 367s testing: mul_mat, [1, 3, 3, 1] = [3, 1, 3, 1] * [3, 3, 3, 1] 367s x0: [3, 1, 3, 1] 367s 0.783 0.116 -0.539 367s 367s x1: [3, 3, 3, 1] 367s 0.641 0.569 0.677 367s 0.728 0.895 -0.344 367s -0.620 0.257 0.968 367s 367s y: [1, 3, 3, 1] 367s 0.203 367s 0.859 367s -0.978 367s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 367s x0: [3, 4, 1, 1] 367s -0.546 0.247 -0.734 367s -0.673 -0.351 0.773 367s -0.328 0.123 -0.398 367s -0.974 0.108 0.891 367s 367s x1: [3, 1, 1, 1] 367s 0.532 0.837 -0.239 367s 367s y: [4, 1, 1, 1] 367s 0.091 -0.837 0.024 -0.641 367s testing: mul_mat, [4, 3, 3, 1] = [4, 4, 3, 1] * [4, 3, 3, 1] 367s x0: [4, 4, 3, 1] 367s 0.102 -0.748 -0.464 -0.744 367s 0.642 -0.606 0.149 0.945 367s 0.045 0.061 -0.534 -0.330 367s -0.495 0.440 0.440 -0.964 367s 367s x1: [4, 3, 3, 1] 367s -0.477 -0.278 -0.503 -0.803 367s -0.191 -0.249 0.980 0.925 367s 0.212 0.404 -0.391 0.349 367s 367s y: [4, 3, 3, 1] 367s 0.990 -0.971 0.495 0.666 367s -0.975 1.048 -0.852 -0.475 367s -0.359 0.163 0.128 -0.435 367s testing: mul_mat, [1, 4, 3, 1] = [4, 1, 3, 1] * [4, 4, 3, 1] 367s x0: [4, 1, 3, 1] 367s -0.258 0.925 0.888 0.167 367s 367s x1: [4, 4, 3, 1] 367s 0.892 0.100 -0.180 -0.376 367s -0.741 0.300 0.543 -0.997 367s -0.296 0.371 -0.171 0.230 367s -0.083 0.488 -0.142 -0.895 367s 367s y: [1, 4, 3, 1] 367s -0.360 367s 0.785 367s 0.306 367s 0.197 367s test-mul-mat0: iter:377/500 367s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 367s x0: [1, 1, 1, 1] 367s -0.653 367s 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s x1: [1, 1, 1, 1] 367s -0.506 367s 367s y: [1, 1, 1, 1] 367s 0.330 367s testing: mul_mat, [2, 1, 3, 1] = [1, 2, 3, 1] * [1, 1, 3, 1] 367s x0: [1, 2, 3, 1] 367s -0.632 367s -0.170 367s 367s x1: [1, 1, 3, 1] 367s 0.298 367s 367s y: [2, 1, 3, 1] 367s -0.189 -0.051 367s testing: mul_mat, [1, 2, 3, 2] = [1, 1, 3, 2] * [1, 2, 3, 2] 367s x0: [1, 1, 3, 2] 367s -0.600 367s 367s x1: [1, 2, 3, 2] 367s -0.557 367s 0.247 367s 367s y: [1, 2, 3, 2] 367s 0.334 367s -0.148 367s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 367s x0: [1, 1, 1, 1] 367s 0.695 367s 367s x1: [1, 1, 1, 1] 367s 0.859 367s 367s y: [1, 1, 1, 1] 367s 0.597 367s testing: mul_mat, [2, 1, 3, 1] = [1, 2, 3, 1] * [1, 1, 3, 1] 367s x0: [1, 2, 3, 1] 367s -0.016 367s 0.733 367s 367s x1: [1, 1, 3, 1] 367s -0.726 367s 367s y: [2, 1, 3, 1] 367s 0.012 -0.533 367s testing: mul_mat, [2, 1, 3, 2] = [2, 2, 3, 2] * [2, 1, 3, 2] 367s x0: [2, 2, 3, 2] 367s 0.299 -0.474 367s -0.878 -0.867 367s 367s x1: [2, 1, 3, 2] 367s 0.775 -0.999 367s 367s y: [2, 1, 3, 2] 367s 0.705 0.186 367s test-mul-mat0: iter:378/500 367s testing: mul_mat, [1, 4, 1, 1] = [4, 1, 1, 1] * [4, 4, 1, 1] 367s x0: [4, 1, 1, 1] 367s -0.741 0.105 0.118 -0.098 367s 367s x1: [4, 4, 1, 1] 367s 0.458 0.157 0.640 -0.243 367s 0.279 -0.834 -0.110 0.798 367s -0.449 -0.796 -0.988 0.209 367s 0.673 0.393 -0.491 -0.875 367s 367s y: [1, 4, 1, 1] 367s -0.223 367s -0.386 367s 0.112 367s -0.429 367s testing: mul_mat, [3, 1, 3, 1] = [4, 3, 3, 1] * [4, 1, 3, 1] 367s x0: [4, 3, 3, 1] 367s -0.202 0.563 0.140 0.688 367s 0.362 0.691 0.892 0.373 367s -0.100 0.565 -0.234 0.409 367s 367s x1: [4, 1, 3, 1] 367s 0.859 0.730 -0.194 0.524 367s 367s y: [3, 1, 3, 1] 367s 0.571 0.838 0.587 367s testing: mul_mat, [3, 3, 3, 2] = [4, 3, 3, 2] * [4, 3, 3, 2] 367s x0: [4, 3, 3, 2] 367s -0.340 0.659 0.335 0.391 367s 0.547 -0.109 -0.911 0.890 367s -0.703 0.475 0.381 -0.285 367s 367s x1: [4, 3, 3, 2] 367s -0.032 0.140 0.275 0.868 367s -0.295 -0.959 0.277 -0.605 367s 0.315 -0.055 0.190 0.707 367s 367s y: [3, 3, 3, 2] 367s 0.535 0.489 -0.053 367s -0.676 -0.848 0.029 367s 0.197 0.634 -0.376 367s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 367s x0: [4, 4, 1, 1] 367s 0.633 -0.007 -0.518 0.076 367s -0.138 -0.587 0.271 -0.936 367s -0.874 -0.188 -0.519 -0.774 367s -0.847 -0.579 0.723 -0.884 367s 367s x1: [4, 3, 1, 1] 367s 0.657 0.940 -0.709 0.269 367s -0.243 0.500 -0.845 0.847 367s -0.023 0.158 -0.409 -0.773 367s 367s y: [4, 3, 1, 1] 367s 0.798 -1.086 -0.590 -1.851 367s 0.345 -1.282 -0.098 -1.444 367s 0.138 0.523 0.801 0.316 367s testing: mul_mat, [3, 4, 3, 1] = [4, 3, 3, 1] * [4, 4, 3, 1] 367s x0: [4, 3, 3, 1] 367s 0.564 -0.234 0.239 -0.213 367s 0.109 0.848 0.067 -0.650 367s -0.362 -0.026 0.033 0.319 367s 367s x1: [4, 4, 3, 1] 367s 0.468 0.467 0.773 0.408 367s 0.758 0.041 -0.835 0.258 367s 0.196 -0.988 -0.765 -0.646 367s -0.397 -0.538 0.315 -0.764 367s 367s y: [3, 4, 3, 1] 367s 0.252 0.234 -0.026 367s 0.164 -0.106 -0.221 367s 0.297 -0.448 -0.277 367s 0.139 0.018 -0.076 367s testing: mul_mat, [1, 4, 3, 2] = [3, 1, 3, 2] * [3, 4, 3, 2] 367s x0: [3, 1, 3, 2] 367s -0.576 0.214 0.773 367s 367s x1: [3, 4, 3, 2] 367s 0.809 0.602 -0.364 367s -0.158 -0.610 -0.014 367s -0.839 -0.159 -0.912 367s 0.232 -0.271 0.349 367s 367s y: [1, 4, 3, 2] 367s -0.618 367s -0.050 367s -0.256 367s 0.078 367s test-mul-mat0: iter:379/500 367s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 367s x0: [3, 3, 1, 1] 367s -0.142 -0.315 -0.880 367s -0.057 -0.333 0.674 367s -0.434 0.975 -0.226 367s 367s x1: [3, 4, 1, 1] 367s -0.001 -0.961 -0.911 367s -0.837 -0.950 -0.770 367s -0.717 0.755 0.639 367s -0.293 -0.031 0.411 367s 367s y: [3, 4, 1, 1] 367s 1.104 -0.293 -0.730 367s 1.095 -0.154 -0.388 367s -0.698 0.219 0.903 367s -0.311 0.304 0.004 367s testing: mul_mat, [2, 3, 2, 1] = [3, 2, 2, 1] * [3, 3, 2, 1] 367s x0: [3, 2, 2, 1] 367s 0.966 0.203 0.193 367s -0.176 0.888 0.313 367s 367s x1: [3, 3, 2, 1] 367s 0.683 0.424 -0.238 367s 0.969 -0.337 0.346 367s -0.328 0.153 0.834 367s 367s y: [2, 3, 2, 1] 367s 0.700 0.182 367s 0.935 -0.361 367s -0.126 0.454 367s testing: mul_mat, [2, 2, 2, 3] = [3, 2, 2, 3] * [3, 2, 2, 3] 367s x0: [3, 2, 2, 3] 367s -0.255 -0.918 -0.553 367s 0.073 0.042 -0.928 367s 367s x1: [3, 2, 2, 3] 367s -0.985 0.953 -0.478 367s 0.985 -0.384 0.868 367s 367s y: [2, 2, 2, 3] 367s -0.360 0.412 367s -0.379 -0.750 367s testing: mul_mat, [3, 2, 1, 1] = [3, 3, 1, 1] * [3, 2, 1, 1] 367s x0: [3, 3, 1, 1] 367s -0.188 0.723 -0.758 367s -0.765 0.830 0.181 367s 0.071 0.923 0.990 367s 367s x1: [3, 2, 1, 1] 367s 0.609 -0.316 -0.813 367s 0.459 -0.804 -0.371 367s 367s y: [3, 2, 1, 1] 367s 0.273 -0.876 -1.054 367s -0.386 -1.086 -1.077 367s testing: mul_mat, [4, 3, 2, 1] = [3, 4, 2, 1] * [3, 3, 2, 1] 367s x0: [3, 4, 2, 1] 367s -0.498 0.023 -0.266 367s -0.212 -0.189 0.280 367s 0.740 0.037 -0.966 367s 0ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s .314 -0.147 -0.276 367s 367s x1: [3, 3, 2, 1] 367s 0.356 -0.379 0.800 367s -0.176 -0.836 0.307 367s 0.168 0.796 0.127 367s 367s y: [4, 3, 2, 1] 367s -0.399 0.220 -0.524 -0.053 367s -0.013 0.281 -0.459 -0.017 367s -0.099 -0.150 0.031 -0.099 367s testing: mul_mat, [1, 3, 2, 3] = [4, 1, 2, 3] * [4, 3, 2, 3] 367s x0: [4, 1, 2, 3] 367s 0.808 0.147 -0.175 0.770 367s 367s x1: [4, 3, 2, 3] 367s -0.321 -0.590 0.961 -0.919 367s -0.284 -0.880 -0.058 0.218 367s -0.092 -0.317 -0.468 0.931 367s 367s y: [1, 3, 2, 3] 367s -1.221 367s -0.182 367s 0.678 367s test-mul-mat0: iter:380/500 367s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 367s x0: [3, 4, 1, 1] 367s 0.413 0.015 0.917 367s 0.637 -0.277 0.268 367s -0.835 0.423 -0.928 367s 0.265 0.575 -0.974 367s 367s x1: [3, 3, 1, 1] 367s -0.014 0.322 -0.914 367s -0.206 -0.531 -0.089 367s -0.437 -0.818 -0.274 367s 367s y: [4, 3, 1, 1] 367s -0.840 -0.343 0.997 1.072 367s -0.174 -0.008 0.030 -0.273 367s -0.444 -0.125 0.273 -0.318 367s testing: mul_mat, [2, 4, 2, 1] = [3, 2, 2, 1] * [3, 4, 2, 1] 367s x0: [3, 2, 2, 1] 367s -0.440 -0.196 0.234 367s 0.825 -0.622 0.260 367s 367s x1: [3, 4, 2, 1] 367s 0.192 -0.686 0.235 367s -0.836 -0.141 -0.480 367s -0.480 0.371 0.920 367s 0.506 -0.307 -0.994 367s 367s y: [2, 4, 2, 1] 367s 0.105 0.646 367s 0.283 -0.726 367s 0.353 -0.388 367s -0.395 0.350 367s testing: mul_mat, [4, 2, 2, 4] = [3, 4, 2, 4] * [3, 2, 2, 4] 367s x0: [3, 4, 2, 4] 367s -0.372 -0.612 -0.316 367s 0.264 -0.953 0.405 367s 0.159 -0.278 -0.182 367s 0.549 -0.422 0.782 367s 367s x1: [3, 2, 2, 4] 367s 0.700 -0.078 -0.065 367s 0.206 0.615 -0.059 367s 367s y: [4, 2, 2, 4] 367s -0.192 0.233 0.145 0.367 367s -0.435 -0.555 -0.127 -0.192 367s testing: mul_mat, [1, 4, 1, 1] = [3, 1, 1, 1] * [3, 4, 1, 1] 367s x0: [3, 1, 1, 1] 367s 0.549 -0.117 0.887 367s 367s x1: [3, 4, 1, 1] 367s -0.460 0.252 -0.879 367s -0.650 0.383 -0.517 367s -0.229 -0.074 0.036 367s -0.243 -0.983 0.447 367s 367s y: [1, 4, 1, 1] 367s -1.062 367s -0.860 367s -0.085 367s 0.379 367s testing: mul_mat, [3, 3, 2, 1] = [1, 3, 2, 1] * [1, 3, 2, 1] 367s x0: [1, 3, 2, 1] 367s -0.335 367s 0.904 367s -0.077 367s 367s x1: [1, 3, 2, 1] 367s -0.312 367s -0.212 367s 0.325 367s 367s y: [3, 3, 2, 1] 367s 0.104 -0.282 0.024 367s 0.071 -0.192 0.016 367s -0.109 0.294 -0.025 367s testing: mul_mat, [1, 1, 2, 4] = [3, 1, 2, 4] * [3, 1, 2, 4] 367s x0: [3, 1, 2, 4] 367s -0.694 -0.869 0.345 367s 367s x1: [3, 1, 2, 4] 367s 0.396 0.875 0.255 367s 367s y: [1, 1, 2, 4] 367s -0.948 367s test-mul-mat0: iter:381/500 367s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 367s x0: [1, 3, 1, 1] 367s -0.903 367s 0.701 367s -0.160 367s 367s x1: [1, 3, 1, 1] 367s -0.213 367s 0.967 367s -0.644 367s 367s y: [3, 3, 1, 1] 367s 0.192 -0.149 0.034 367s -0.873 0.678 -0.155 367s 0.581 -0.451 0.103 367s testing: mul_mat, [2, 3, 2, 1] = [1, 2, 2, 1] * [1, 3, 2, 1] 367s x0: [1, 2, 2, 1] 367s 0.996 367s 0.592 367s 367s x1: [1, 3, 2, 1] 367s 0.936 367s 0.761 367s 0.981 367s 367s y: [2, 3, 2, 1] 367s 0.933 0.554 367s 0.758 0.450 367s 0.978 0.581 367s testing: mul_mat, [2, 2, 2, 3] = [1, 2, 2, 3] * [1, 2, 2, 3] 367s x0: [1, 2, 2, 3] 367s -0.634 367s -0.613 367s 367s x1: [1, 2, 2, 3] 367s -0.630 367s 0.252 367s 367s y: [2, 2, 2, 3] 367s 0.400 0.386 367s -0.160 -0.155 367s testing: mul_mat, [4, 2, 1, 1] = [1, 4, 1, 1] * [1, 2, 1, 1] 367s x0: [1, 4, 1, 1] 367s 0.644 367s -0.021 367s -0.999 367s -0.987 367s 367s x1: [1, 2, 1, 1] 367s -0.948 367s 0.431 367s 367s y: [4, 2, 1, 1] 367s -0.610 0.020 0.948 0.935 367s 0.277 -0.009 -0.430 -0.425 367s testing: mul_mat, [3, 1, 2, 1] = [4, 3, 2, 1] * [4, 1, 2, 1] 367s x0: [4, 3, 2, 1] 367s -0.839 -0.473 0.987 -0.714 367s 0.712 -0.900 -0.416 0.005 367s -0.766 -0.592 -0.717 0.918 367s 367s x1: [4, 1, 2, 1] 367s -0.769 0.723 -0.865 -0.849 367s 367s y: [3, 1, 2, 1] 367s 0.056 -0.843 0.001 367s testing: mul_mat, [4, 4, 2, 3] = [3, 4, 2, 3] * [3, 4, 2, 3] 367s x0: [3, 4, 2, 3] 367s -0.000 -0.184 -0.965 367s 0.092 -0.707 0.248 367s -0.470 0.414 0.499 367s 0.024 -0.258 -0.627 367s 367s x1: [3, 4, 2, 3] 367s -0.589 0.868 0.673 367s -0.057 -0.549 -0.035 367s 0.359 -0.388 -0.322 367s 0.593 0.139 -0.223 367s 367s y: [4, 4, 2, 3] 367s -0.809 -0.501 0.973 -0.660 367s 0.134 0.374 -0.218 0.162 367s 0.382 0.227 -0.490 0.311 367s 0.189 -0.099 -0.332 0.118 367s test-mul-mat0: iter:382/500 367s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 367s x0: [3, 3, 1, 1] 367s -0.489 -0.380 -0.423 367s 0.766 -0.508 0.611 367s 0.047 0.260 0.751 367s 367s x1: [3, 3, 1, 1] 367s 0.650 -0.910 -0.079 367s -0.619 -0.415 -0.771 367s -0.106 0.102 -0.355 367s 367s y: [3, 3, 1, 1] 367s 0.062 0.913 -0.266 367s 0.786 -0.734 -0.715 367s 0.163 -0.350 -0.245 367s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 367s x0: [3, 4, 1, 1] 367s -0.202 -0.104 0.605 367s -0.112 0.817 0.986 367s 0.472 -0.953 -0.120 367s -0.426 -0.30ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s 8 -0.564 367s 367s x1: [3, 3, 1, 1] 367s -0.118 0.948 0.812 367s 0.566 0.971 -0.464 367s 0.132 -0.423 0.422 367s 367s y: [4, 3, 1, 1] 367s 0.417 1.588 -1.057 -0.700 367s -0.496 0.272 -0.603 -0.278 367s 0.273 0.056 0.415 -0.164 367s testing: mul_mat, [1, 4, 1, 3] = [3, 1, 1, 3] * [3, 4, 1, 3] 367s x0: [3, 1, 1, 3] 367s 0.127 0.774 -0.649 367s 367s x1: [3, 4, 1, 3] 367s 0.086 0.312 0.013 367s -0.148 0.803 -0.376 367s 0.899 0.063 -0.626 367s -0.219 0.011 -0.814 367s 367s y: [1, 4, 1, 3] 367s 0.244 367s 0.847 367s 0.569 367s 0.509 367s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 367s x0: [3, 1, 1, 1] 367s 0.682 -0.556 -0.795 367s 367s x1: [3, 1, 1, 1] 367s 0.733 0.325 0.403 367s 367s y: [1, 1, 1, 1] 367s -0.002 367s testing: mul_mat, [4, 3, 1, 1] = [1, 4, 1, 1] * [1, 3, 1, 1] 367s x0: [1, 4, 1, 1] 367s -0.793 367s 0.134 367s -0.065 367s 0.803 367s 367s x1: [1, 3, 1, 1] 367s -0.073 367s 0.302 367s 0.229 367s 367s y: [4, 3, 1, 1] 367s 0.058 -0.010 0.005 -0.059 367s -0.240 0.040 -0.020 0.243 367s -0.182 0.031 -0.015 0.184 367s testing: mul_mat, [4, 1, 1, 3] = [4, 4, 1, 3] * [4, 1, 1, 3] 367s x0: [4, 4, 1, 3] 367s 0.829 0.072 -0.202 -0.838 367s 0.747 -0.646 -0.067 0.325 367s -0.049 -0.272 0.681 -0.693 367s 0.562 -0.246 -0.900 0.296 367s 367s x1: [4, 1, 1, 3] 367s 0.474 0.311 -0.289 -0.173 367s 367s y: [4, 1, 1, 3] 367s 0.619 0.117 -0.185 0.399 367s test-mul-mat0: iter:383/500 367s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 367s x0: [4, 4, 1, 1] 367s -0.009 0.194 -0.808 -0.538 367s -0.813 0.545 -0.135 -0.140 367s -0.002 0.431 -0.142 -0.242 367s 0.293 0.116 -0.372 0.926 367s 367s x1: [4, 3, 1, 1] 367s -0.769 0.938 -0.066 -0.367 367s 0.715 -0.628 -0.811 0.067 367s 0.791 0.438 -0.752 -0.043 367s 367s y: [4, 3, 1, 1] 367s 0.441 1.197 0.504 -0.433 367s 0.490 -0.824 -0.174 0.501 367s 0.709 -0.297 0.304 0.522 367s testing: mul_mat, [4, 4, 3, 1] = [4, 4, 3, 1] * [4, 4, 3, 1] 367s x0: [4, 4, 3, 1] 367s 0.853 0.484 0.318 -0.918 367s 0.098 -0.892 0.743 0.570 367s 0.657 -0.302 0.159 0.834 367s 0.578 0.690 0.330 -0.385 367s 367s x1: [4, 4, 3, 1] 367s -0.056 0.319 -0.843 -0.118 367s -0.747 -0.211 -0.403 -0.375 367s -0.021 0.664 -0.584 -0.583 367s 0.912 0.373 -0.945 -0.097 367s 367s y: [4, 4, 3, 1] 367s -0.053 -0.984 -0.366 -0.045 367s -0.523 -0.399 -0.804 -0.566 367s 0.653 -1.360 -0.793 0.478 367s 0.747 -1.001 0.255 0.510 367s testing: mul_mat, [3, 4, 3, 2] = [4, 3, 3, 2] * [4, 4, 3, 2] 367s x0: [4, 3, 3, 2] 367s -0.342 -0.813 0.269 0.988 367s 0.724 0.106 0.148 0.944 367s -0.620 -0.253 0.940 0.548 367s 367s x1: [4, 4, 3, 2] 367s -0.403 0.873 -0.645 -0.950 367s 0.120 0.143 -0.706 0.590 367s 0.516 -0.300 0.337 -0.623 367s 0.837 -0.391 0.091 -0.542 367s 367s y: [3, 4, 3, 2] 367s -1.683 -1.191 -1.098 367s 0.235 0.555 -0.451 367s -0.457 -0.197 -0.269 367s -0.479 0.066 -0.631 367s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 367s x0: [4, 2, 1, 1] 367s 0.427 0.913 0.894 0.883 367s 0.147 0.163 0.882 -0.598 367s 367s x1: [4, 3, 1, 1] 367s -0.600 -0.747 0.672 -0.224 367s 0.105 0.258 -0.499 0.518 367s -0.847 0.079 0.718 0.130 367s 367s y: [2, 3, 1, 1] 367s -0.535 0.516 367s 0.292 -0.693 367s 0.467 0.443 367s testing: mul_mat, [1, 4, 3, 1] = [2, 1, 3, 1] * [2, 4, 3, 1] 367s x0: [2, 1, 3, 1] 367s 0.100 -0.014 367s 367s x1: [2, 4, 3, 1] 367s -0.184 0.773 367s 0.939 -0.295 367s -0.444 0.612 367s -0.305 -0.822 367s 367s y: [1, 4, 3, 1] 367s -0.030 367s 0.098 367s -0.053 367s -0.019 367s testing: mul_mat, [2, 2, 3, 2] = [1, 2, 3, 2] * [1, 2, 3, 2] 367s x0: [1, 2, 3, 2] 367s 0.307 367s 0.756 367s 367s x1: [1, 2, 3, 2] 367s 0.799 367s -0.359 367s 367s y: [2, 2, 3, 2] 367s 0.245 0.603 367s -0.110 -0.271 367s test-mul-mat0: iter:384/500 367s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 367s x0: [1, 2, 1, 1] 367s 0.163 367s 0.145 367s 367s x1: [1, 1, 1, 1] 367s -0.496 367s 367s y: [2, 1, 1, 1] 367s -0.081 -0.072 367s testing: mul_mat, [1, 2, 4, 1] = [1, 1, 4, 1] * [1, 2, 4, 1] 367s x0: [1, 1, 4, 1] 367s -0.358 367s 367s x1: [1, 2, 4, 1] 367s -0.237 367s 0.667 367s 367s y: [1, 2, 4, 1] 367s 0.085 367s -0.239 367s testing: mul_mat, [3, 1, 4, 4] = [1, 3, 4, 4] * [1, 1, 4, 4] 367s x0: [1, 3, 4, 4] 367s 0.596 367s -0.788 367s -0.906 367s 367s x1: [1, 1, 4, 4] 367s 0.934 367s 367s y: [3, 1, 4, 4] 367s 0.556 -0.735 -0.846 367s testing: mul_mat, [1, 3, 1, 1] = [1, 1, 1, 1] * [1, 3, 1, 1] 367s x0: [1, 1, 1, 1] 367s -0.858 367s 367s x1: [1, 3, 1, 1] 367s -0.521 367s 0.945 367s 0.145 367s 367s y: [1, 3, 1, 1] 367s 0.447 367s -0.811 367s -0.124 367s testing: mul_mat, [4, 1, 4, 1] = [1, 4, 4, 1] * [1, 1, 4, 1] 367s x0: [1, 4, 4, 1] 367s 0.600 367s -0.707 367s -0.584 367s 0.388 367s 367s x1: [1, 1, 4, 1] 367s -0.253 367s 367s y: [4, 1, 4, 1] 367s -0.152 0.179 0.148 -0.098 367s testing: mul_mat, [3, 1, 4, 4] = [4, 3, 4, 4] * [4, 1, 4, 4] 367s x0: [4, 3, 4, 4] 367s -0.704 0.81ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s 2 -0.376 -0.032 367s -0.171 0.093 -0.051 -0.307 367s -0.702 -0.676 0.717 0.571 367s 367s x1: [4, 1, 4, 4] 367s 0.943 0.849 -0.501 0.208 367s 367s y: [3, 1, 4, 4] 367s 0.208 -0.121 -1.475 367s test-mul-mat0: iter:385/500 367s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 367s x0: [3, 2, 1, 1] 367s -0.738 0.460 0.109 367s -0.823 0.513 0.359 367s 367s x1: [3, 1, 1, 1] 367s -0.380 -0.304 0.457 367s 367s y: [2, 1, 1, 1] 367s 0.190 0.321 367s testing: mul_mat, [2, 2, 4, 1] = [3, 2, 4, 1] * [3, 2, 4, 1] 367s x0: [3, 2, 4, 1] 367s -0.979 0.987 0.050 367s -0.801 0.500 -0.591 367s 367s x1: [3, 2, 4, 1] 367s 0.111 0.084 -0.085 367s -0.045 0.474 -0.387 367s 367s y: [2, 2, 4, 1] 367s -0.029 0.004 367s 0.493 0.501 367s testing: mul_mat, [2, 2, 4, 4] = [3, 2, 4, 4] * [3, 2, 4, 4] 367s x0: [3, 2, 4, 4] 367s 0.479 -0.422 0.365 367s -0.747 0.161 -0.822 367s 367s x1: [3, 2, 4, 4] 367s 0.094 -0.189 0.857 367s -0.243 -0.430 -0.616 367s 367s y: [2, 2, 4, 4] 367s 0.437 -0.805 367s -0.159 0.618 367s testing: mul_mat, [3, 2, 1, 1] = [3, 3, 1, 1] * [3, 2, 1, 1] 367s x0: [3, 3, 1, 1] 367s -1.000 0.029 0.085 367s 0.499 -0.333 0.675 367s -0.281 0.946 -0.355 367s 367s x1: [3, 2, 1, 1] 367s 0.765 0.940 0.309 367s 0.361 0.774 -0.493 367s 367s y: [3, 2, 1, 1] 367s -0.712 0.277 0.565 367s -0.381 -0.411 0.806 367s testing: mul_mat, [2, 3, 4, 1] = [3, 2, 4, 1] * [3, 3, 4, 1] 367s x0: [3, 2, 4, 1] 367s 0.033 -0.699 -0.743 367s 0.912 -0.499 0.339 367s 367s x1: [3, 3, 4, 1] 367s 0.190 0.869 0.079 367s 0.062 -0.677 0.450 367s 0.703 -0.645 0.421 367s 367s y: [2, 3, 4, 1] 367s -0.660 -0.234 367s 0.141 0.547 367s 0.161 1.106 367s testing: mul_mat, [4, 3, 4, 4] = [2, 4, 4, 4] * [2, 3, 4, 4] 367s x0: [2, 4, 4, 4] 367s -0.566 0.267 367s -0.278 -0.244 367s -0.441 -0.064 367s 0.751 -0.787 367s 367s x1: [2, 3, 4, 4] 367s -0.183 0.968 367s -0.099 -0.753 367s 0.298 -0.543 367s 367s y: [4, 3, 4, 4] 367s 0.362 -0.185 0.018 -0.899 367s -0.145 0.211 0.092 0.518 367s -0.313 0.050 -0.096 0.650 367s test-mul-mat0: iter:386/500 367s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 367s x0: [1, 2, 1, 1] 367s -0.431 367s 0.104 367s 367s x1: [1, 1, 1, 1] 367s 0.769 367s 367s y: [2, 1, 1, 1] 367s -0.331 0.080 367s testing: mul_mat, [3, 2, 4, 1] = [1, 3, 4, 1] * [1, 2, 4, 1] 367s x0: [1, 3, 4, 1] 367s 0.375 367s -0.430 367s -0.613 367s 367s x1: [1, 2, 4, 1] 367s 0.998 367s -0.215 367s 367s y: [3, 2, 4, 1] 367s 0.375 -0.430 -0.612 367s -0.081 0.092 0.132 367s testing: mul_mat, [2, 3, 4, 2] = [1, 2, 4, 2] * [1, 3, 4, 2] 367s x0: [1, 2, 4, 2] 367s 0.589 367s 0.404 367s 367s x1: [1, 3, 4, 2] 367s 0.373 367s -0.130 367s 0.170 367s 367s y: [2, 3, 4, 2] 367s 0.220 0.151 367s -0.076 -0.052 367s 0.100 0.069 367s testing: mul_mat, [3, 2, 1, 1] = [1, 3, 1, 1] * [1, 2, 1, 1] 367s x0: [1, 3, 1, 1] 367s -0.315 367s 0.345 367s 0.205 367s 367s x1: [1, 2, 1, 1] 367s 0.875 367s -0.062 367s 367s y: [3, 2, 1, 1] 367s -0.276 0.302 0.180 367s 0.020 -0.021 -0.013 367s testing: mul_mat, [3, 1, 4, 1] = [3, 3, 4, 1] * [3, 1, 4, 1] 367s x0: [3, 3, 4, 1] 367s -0.372 -0.017 0.612 367s 0.485 0.935 -0.426 367s -0.166 -0.526 -0.071 367s 367s x1: [3, 1, 4, 1] 367s -0.654 -0.292 0.357 367s 367s y: [3, 1, 4, 1] 367s 0.467 -0.742 0.237 367s testing: mul_mat, [2, 3, 4, 2] = [3, 2, 4, 2] * [3, 3, 4, 2] 367s x0: [3, 2, 4, 2] 367s 0.260 0.099 0.703 367s -0.246 0.914 -0.613 367s 367s x1: [3, 3, 4, 2] 367s 0.149 0.710 -0.960 367s -0.921 -0.425 -0.991 367s 0.383 0.315 -0.084 367s 367s y: [2, 3, 4, 2] 367s -0.566 1.200 367s -0.979 0.446 367s 0.072 0.245 367s test-mul-mat0: iter:387/500 367s testing: mul_mat, [4, 2, 1, 1] = [1, 4, 1, 1] * [1, 2, 1, 1] 367s x0: [1, 4, 1, 1] 367s 0.966 367s 0.990 367s 0.784 367s -0.809 367s 367s x1: [1, 2, 1, 1] 367s -0.893 367s 0.869 367s 367s y: [4, 2, 1, 1] 367s -0.863 -0.884 -0.700 0.723 367s 0.839 0.860 0.681 -0.703 367s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 367s x0: [1, 1, 1, 1] 367s -0.307 367s 367s x1: [1, 4, 1, 1] 367s -0.153 367s -0.914 367s 0.892 367s -0.035 367s 367s y: [1, 4, 1, 1] 367s 0.047 367s 0.281 367s -0.274 367s 0.011 367s testing: mul_mat, [2, 1, 1, 2] = [1, 2, 1, 2] * [1, 1, 1, 2] 367s x0: [1, 2, 1, 2] 367s 0.326 367s -0.180 367s 367s x1: [1, 1, 1, 2] 367s -0.004 367s 367s y: [2, 1, 1, 2] 367s -0.001 0.001 367s testing: mul_mat, [2, 2, 1, 1] = [1, 2, 1, 1] * [1, 2, 1, 1] 367s x0: [1, 2, 1, 1] 367s 0.752 367s -0.763 367s 367s x1: [1, 2, 1, 1] 367s 0.705 367s 0.416 367s 367s y: [2, 2, 1, 1] 367s 0.530 -0.538 367s 0.313 -0.318 367s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 367s x0: [2, 2, 1, 1] 367s 0.833 -0.614 367s -0.787 0.940 367s 367s x1: [2, 1, 1, 1] 367s 0.590 0.084 367s 367s y: [2, 1, 1, 1] 367s 0.440 -0.385 367s testing: mul_mat, [3, 2, 1, 2] = [2, 3, 1, 2] * [2, 2, 1, 2] 367s x0: [2, 3, 1, 2] 367s 0.884 -0.120 367s -0.741 -0.840 367s -0.319 -0.439 367s 367s x1: [2, 2, 1, 2] 367s -0.918 0.248 367s 0.906 -0.928 367s 367s y: [3, 2, 1, 2] 367s -0.841 0.472 0.184 367s 0.912ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s 0.109 0.119 367s test-mul-mat0: iter:388/500 367s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 367s x0: [1, 2, 1, 1] 367s 0.250 367s 0.816 367s 367s x1: [1, 4, 1, 1] 367s 0.521 367s -0.891 367s 0.310 367s 0.734 367s 367s y: [2, 4, 1, 1] 367s 0.130 0.426 367s -0.222 -0.727 367s 0.077 0.253 367s 0.183 0.600 367s testing: mul_mat, [2, 2, 2, 1] = [1, 2, 2, 1] * [1, 2, 2, 1] 367s x0: [1, 2, 2, 1] 367s -0.193 367s 0.892 367s 367s x1: [1, 2, 2, 1] 367s 0.744 367s 0.156 367s 367s y: [2, 2, 2, 1] 367s -0.144 0.663 367s -0.030 0.139 367s testing: mul_mat, [1, 2, 2, 4] = [1, 1, 2, 4] * [1, 2, 2, 4] 367s x0: [1, 1, 2, 4] 367s -0.931 367s 367s x1: [1, 2, 2, 4] 367s -0.849 367s 0.922 367s 367s y: [1, 2, 2, 4] 367s 0.790 367s -0.858 367s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 367s x0: [1, 4, 1, 1] 367s -0.820 367s 0.223 367s -0.032 367s -0.129 367s 367s x1: [1, 1, 1, 1] 367s 0.332 367s 367s y: [4, 1, 1, 1] 367s -0.272 0.074 -0.011 -0.043 367s testing: mul_mat, [1, 1, 2, 1] = [4, 1, 2, 1] * [4, 1, 2, 1] 367s x0: [4, 1, 2, 1] 367s -0.048 -0.590 -0.103 0.365 367s 367s x1: [4, 1, 2, 1] 367s 0.374 -0.111 -0.558 0.685 367s 367s y: [1, 1, 2, 1] 367s 0.354 367s testing: mul_mat, [1, 4, 2, 4] = [1, 1, 2, 4] * [1, 4, 2, 4] 367s x0: [1, 1, 2, 4] 367s 0.599 367s 367s x1: [1, 4, 2, 4] 367s 0.726 367s -0.032 367s 0.750 367s 0.045 367s 367s y: [1, 4, 2, 4] 367s 0.435 367s -0.019 367s 0.449 367s 0.027 367s test-mul-mat0: iter:389/500 367s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 367s x0: [4, 3, 1, 1] 367s -0.867 -0.582 0.029 0.983 367s 0.852 -0.492 0.581 -0.668 367s -0.559 0.487 -0.667 -0.801 367s 367s x1: [4, 3, 1, 1] 367s -0.359 -0.434 0.964 -0.438 367s 0.043 0.503 0.458 -0.074 367s -0.342 0.305 0.221 -0.291 367s 367s y: [3, 3, 1, 1] 367s 0.161 0.760 -0.303 367s -0.389 0.104 -0.025 367s -0.160 -0.120 0.425 367s testing: mul_mat, [4, 3, 4, 1] = [4, 4, 4, 1] * [4, 3, 4, 1] 367s x0: [4, 4, 4, 1] 367s -0.418 -0.435 0.357 0.532 367s 0.479 0.071 -0.011 0.974 367s 0.117 -0.113 0.956 0.495 367s -0.016 -0.032 0.453 0.047 367s 367s x1: [4, 3, 4, 1] 367s 0.030 0.580 0.965 0.436 367s 0.042 -0.391 -0.923 0.608 367s -0.428 -0.361 -0.349 -0.924 367s 367s y: [4, 3, 4, 1] 367s 0.312 0.470 1.076 0.439 367s 0.146 0.595 -0.532 -0.378 367s -0.281 -1.127 -0.800 -0.184 367s testing: mul_mat, [3, 4, 4, 4] = [4, 3, 4, 4] * [4, 4, 4, 4] 367s x0: [4, 3, 4, 4] 367s 0.410 0.694 -0.258 0.495 367s -0.162 0.405 -0.200 -0.326 367s -0.890 -0.379 -0.315 -0.018 367s 367s x1: [4, 4, 4, 4] 367s 0.362 -0.663 -0.692 0.621 367s -0.533 0.119 -0.004 0.484 367s 0.887 0.867 0.485 0.623 367s -0.810 -0.467 -0.226 -0.032 367s 367s y: [3, 4, 4, 4] 367s 0.175 -0.391 0.136 367s 0.105 -0.023 0.422 367s 1.148 -0.093 -1.282 367s -0.613 -0.002 0.969 367s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 367s x0: [4, 2, 1, 1] 367s -0.067 -0.944 0.300 0.723 367s -0.811 0.032 -0.497 -0.092 367s 367s x1: [4, 3, 1, 1] 367s 0.127 -0.892 0.843 0.267 367s -0.777 -0.225 0.465 0.568 367s 0.049 -0.553 0.746 0.097 367s 367s y: [2, 3, 1, 1] 367s 1.280 -0.575 367s 0.815 0.340 367s 0.813 -0.437 367s testing: mul_mat, [2, 4, 4, 1] = [2, 2, 4, 1] * [2, 4, 4, 1] 367s x0: [2, 2, 4, 1] 367s 0.285 -0.251 367s 0.788 0.992 367s 367s x1: [2, 4, 4, 1] 367s -0.060 0.195 367s -0.610 -0.075 367s -0.004 -0.293 367s 0.852 -0.531 367s 367s y: [2, 4, 4, 1] 367s -0.066 0.146 367s -0.155 -0.556 367s 0.072 -0.294 367s 0.376 0.144 367s testing: mul_mat, [4, 2, 4, 4] = [2, 4, 4, 4] * [2, 2, 4, 4] 367s x0: [2, 4, 4, 4] 367s 0.814 -0.377 367s -0.376 -0.703 367s -0.589 -0.961 367s -0.008 -0.889 367s 367s x1: [2, 2, 4, 4] 367s -0.301 -0.221 367s -0.900 0.196 367s 367s y: [4, 2, 4, 4] 367s -0.162 0.268 0.390 0.199 367s -0.806 0.201 0.342 -0.167 367s test-mul-mat0: iter:390/500 367s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 367s x0: [3, 1, 1, 1] 367s 0.809 0.204 -0.024 367s 367s x1: [3, 1, 1, 1] 367s 0.417 0.205 -0.163 367s 367s y: [1, 1, 1, 1] 367s 0.383 367s testing: mul_mat, [4, 1, 4, 1] = [3, 4, 4, 1] * [3, 1, 4, 1] 367s x0: [3, 4, 4, 1] 367s -0.589 -0.104 -0.455 367s 0.285 0.205 -0.702 367s -0.869 -0.983 -0.736 367s -0.997 0.187 0.681 367s 367s x1: [3, 1, 4, 1] 367s -0.740 0.349 -0.035 367s 367s y: [4, 1, 4, 1] 367s 0.415 -0.115 0.326 0.779 367s testing: mul_mat, [4, 4, 4, 3] = [3, 4, 4, 3] * [3, 4, 4, 3] 367s x0: [3, 4, 4, 3] 367s 0.665 0.655 -0.936 367s -0.656 -0.970 -0.651 367s 0.612 0.768 -0.899 367s 0.362 -0.641 -0.054 367s 367s x1: [3, 4, 4, 3] 367s 0.474 -0.926 0.088 367s 0.003 0.253 -0.376 367s -0.587 -0.755 0.505 367s 0.623 -0.075 -0.753 367s 367s y: [4, 4, 4, 3] 367s -0.373 0.530 -0.500 0.761 367s 0.520 -0.002 0.534 -0.141 367s -1.358 0.789 -1.393 0.245 367s 1.070 0.154 1.000 0.314 367s testing: mul_mat, [1, 4, 1, 1] = [3, 1, 1, 1] * [3, 4, 1, 1] 367s x0: [3, 1, 1, 1] 367s 0.491 0.419 -0.646 367s 367s x1: [3, 4, 1, 1] 367s -0.158 0.019 0.886 367s 0.462 0.425 -0.658 367s -0.994 0.927 -0.867 367s 0.272 -0.011 0.702 367s 367s y: [1, 4, 1, 1] 367s -0.642 367s 0.830 367s 0.460 367s -0.325 367s testing: mul_mat, [1, 3, 4, 1] = [1, 1, 4, 1] * [1, 3, 4, 1] 367s x0: [1, 1, 4, 1] 367s 0.029 367s 367s x1: [1, 3, 4, 1] 367s 0.086 367s 0.192 367s -0.216 367s 367s y: [1, 3, 4, 1] 367s 0.002 367s 0.006 367s -0.006 367s testing: mul_mat, [3, 1, 4, 3] = [1, 3, 4, 3] * [1, 1, 4, 3] 367s x0: [1, 3, 4, 3] 367s -0.747 367s -0.034 367s 0.603 367s 367s x1: [1, 1, 4, 3] 367s -0.184 367s 367s y: [3, 1, 4, 3] 367s 0.137 0.006 -0.111 367s test-mul-mat0: iter:391/500 367s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 367s x0: [1, 4, 1, 1] 367s 0.333 367s -0.710 367s 0.908 367s 0.470 367s 367s x1: [1, 1, 1, 1] 367s 0.386 367s 367s y: [4, 1, 1, 1] 367s 0.129 -0.274 0.351 0.181 367s testing: mul_mat, [2, 4, 2, 1] = [1, 2, 2, 1] * [1, 4, 2, 1] 367s x0: [1, 2, 2, 1] 367s 0.680 367s -0.986 367s 367s x1: [1, 4, 2, 1] 367s -0.876 367s 0.616 367s 0.655 367s -0.226 367s 367s y: [2, 4, 2, 1] 367s -0.596 0.864 367s 0.419 -0.607 367s 0.445 -0.645 367s -0.154 0.223 367s testing: mul_mat, [4, 2, 2, 2] = [1, 4, 2, 2] * [1, 2, 2, 2] 367s x0: [1, 4, 2, 2] 367s -0.289 367s 0.244 367s -0.850 367s -0.903 367s 367s x1: [1, 2, 2, 2] 367s 0.898 367s 0.140 367s 367s y: [4, 2, 2, 2] 367s -0.259 0.219 -0.763 -0.811 367s -0.040 0.034 -0.119 -0.126 367s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 367s x0: [1, 2, 1, 1] 367s 0.316 367s 0.436 367s 367s x1: [1, 4, 1, 1] 367s 0.524 367s -0.712 367s 0.545 367s 0.538 367s 367s y: [2, 4, 1, 1] 367s 0.166 0.228 367s -0.225 -0.310 367s 0.172 0.237 367s 0.170 0.234 367s testing: mul_mat, [3, 1, 2, 1] = [2, 3, 2, 1] * [2, 1, 2, 1] 367s x0: [2, 3, 2, 1] 367s -0.323 0.388 367s 0.655 -0.100 367s 0.897 -0.953 367s 367s x1: [2, 1, 2, 1] 367s 0.241 0.591 367s 367s y: [3, 1, 2, 1] 367s 0.151 0.099 -0.347 367s testing: mul_mat, [1, 2, 2, 2] = [3, 1, 2, 2] * [3, 2, 2, 2] 367s x0: [3, 1, 2, 2] 367s -0.713 0.084 0.430 367s 367s x1: [3, 2, 2, 2] 367s -0.619 0.279 0.092 367s 0.665 -0.227 0.012 367s 367s y: [1, 2, 2, 2] 367s 0.504 367s -0.488 367s test-mul-mat0: iter:392/500 367s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 367s x0: [2, 3, 1, 1] 367s 0.974 0.361 367s 0.180 -0.287 367s -0.325 0.134 367s 367s x1: [2, 1, 1, 1] 367s -0.295 -0.281 367s 367s y: [3, 1, 1, 1] 367s -0.389 0.028 0.058 367s testing: mul_mat, [4, 3, 2, 1] = [2, 4, 2, 1] * [2, 3, 2, 1] 367s x0: [2, 4, 2, 1] 367s -0.076 0.234 367s -0.879 0.499 367s 0.252 0.855 367s -0.631 -0.430 367s 367s x1: [2, 3, 2, 1] 367s -0.871 0.645 367s -0.015 -0.068 367s 0.514 -0.116 367s 367s y: [4, 3, 2, 1] 367s 0.217 1.087 0.331 0.272 367s -0.015 -0.021 -0.062 0.039 367s -0.066 -0.510 0.030 -0.274 367s testing: mul_mat, [1, 4, 2, 1] = [2, 1, 2, 1] * [2, 4, 2, 1] 367s x0: [2, 1, 2, 1] 367s 0.458 -0.374 367s 367s x1: [2, 4, 2, 1] 367s 0.324 -0.826 367s 0.891 -0.031 367s 0.159 -0.178 367s -0.517 -0.958 367s 367s y: [1, 4, 2, 1] 367s 0.458 367s 0.419 367s 0.139 367s 0.122 367s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 367s x0: [2, 2, 1, 1] 367s -0.930 0.835 367s -0.071 0.118 367s 367s x1: [2, 1, 1, 1] 367s -0.005 0.640 367s 367s y: [2, 1, 1, 1] 367s 0.539 0.076 367s testing: mul_mat, [1, 2, 2, 1] = [2, 1, 2, 1] * [2, 2, 2, 1] 367s x0: [2, 1, 2, 1] 367s -0.645 -0.750 367s 367s x1: [2, 2, 2, 1] 367s 0.578 -0.125 367s -0.120 -0.097 367s 367s y: [1, 2, 2, 1] 367s -0.279 367s 0.151 367s testing: mul_mat, [3, 2, 2, 1] = [1, 3, 2, 1] * [1, 2, 2, 1] 367s x0: [1, 3, 2, 1] 367s -0.232 367s 0.006 367s 0.848 367s 367s x1: [1, 2, 2, 1] 367s -0.394 367s 0.054 367s 367s y: [3, 2, 2, 1] 367s 0.092 -0.002 -0.334 367s -0.013 0.000 0.046 367s test-mul-mat0: iter:393/500 367s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 367s x0: [2, 2, 1, 1] 367s -0.004 0.103 367s -0.180 0.352 367s 367s x1: [2, 4, 1, 1] 367s -0.356 -0.029 367s 0.577 -0.778 367s 0.847 -0.543 367s 0.125 -0.104 367s 367s y: [2, 4, 1, 1] 367s -0.002 0.054 367s -0.082 -0.377 367s -0.059 -0.344 367s -0.011 -0.059 367s testing: mul_mat, [2, 2, 3, 1] = [2, 2, 3, 1] * [2, 2, 3, 1] 367s x0: [2, 2, 3, 1] 367s -0.092 -0.021 367s 0.946 -0.956 367s 367s x1: [2, 2, 3, 1] 367s 0.353 -0.546 367s -0.212 0.959 367s 367s y: [2, 2, 3, 1] 367s -0.021 0.856 367s -0.000 -1.117 367s testing: mul_mat, [2, 2, 3, 1] = [2, 2, 3, 1] * [2, 2, 3, 1] 367s x0: [2, 2, 3, 1] 367s 0.380 -0.302 367s 0.477 0.127 367s 367s x1: [2, 2, 3, 1] 367s 0.758 -0.302 367s -0.067 -0.139 367s 367s y: [2, 2, 3, 1] 367s 0.379 0.323 367s 0.017 -0.049 367s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 367s x0: [2, 3, 1, 1] 367s -0.007 0.927 367s 0.714 -0.424 367s -0.110 -0.592 367s 367s x1: [2, 2, 1, 1] 367s 0.094 -0.280 367s 0.056 0.956 367s 367s y: [3, 2, 1, 1] 367s -0.260 0.186 0.155 367s 0.886 -0.366 -0.572 367s testing: mul_mat, [3, 2, 3, 1] = [3, 3, 3, 1] * [3, 2, 3, 1] 367s x0: [3, 3, 3, 1] 367s 0.370 -0.574 0.419 367s 0.577 0.534 0.248 367s 0.465 0.656 -0.454 367s 367s x1: [3, 2, 3, 1] 367s -0.788 -0.209 -0.619 367s 0.285 -0.035 0.862 367s 367s y: [3, 2, 3, 1] 367s -0.431 -0.720 -0.223 367s 0.487 0.359 -0.282 367s testing: mul_mat, [1, 3, 3, 1] = [3, 1, 3, 1] * [3, 3, 3, 1] 367s x0: [3, 1, 3, 1] 367s 0.820 -0.324 0.288 367s 367s x1: [3, 3, 3, 1] 367s -0.074 -0.769 0.696 367s 0.671 0.601 0.273 367s 0.136 -0.973 -0.194 367s 367s y: [1, 3, 3, 1] 367s 0.389 367s 0.434 367s 0.371 367s test-mul-mat0: iter:394/500 367s testing: mul_mat, [3, 4, 1, 1] = [4, 3, 1, 1] * [4, 4, 1, 1] 367s x0: [4, 3, 1, 1] 367s 0.775 -0.111 0.746 0.063 367s 0.635 -0.347 -0.953 -0.948 367s 0.160 -0.697 0.604 0.693 367s 367s x1: [4, 4, 1, 1] 367s 0.980 0.117 0.683 -0.228 367s -0.005 0.977 0.122 0.488 367s 0.606 -0.686 0.560 -0.567 367s 0.135 0.499 0.699 0.069 367s 367s y: [3, 4, 1, 1] 367s 1.242 0.147 0.330 367s 0.010 -0.921 -0.270 367s 0.928 0.627 0.520 367s 0.575 -0.820 0.144 367s testing: mul_mat, [2, 3, 4, 1] = [4, 2, 4, 1] * [4, 3, 4, 1] 367s x0: [4, 2, 4, 1] 367s -0.528 -0.214 0.989 0.314 367s -0.029 -0.403 -0.191 0.203 367s 367s x1: [4, 3, 4, 1] 367s 0.366 0.949 0.673 -0.517 367s 0.632 -0.554 0.478 0.609 367s 0.567 -0.035 0.215 0.881 367s 367s y: [2, 3, 4, 1] 367s 0.107 -0.626 367s 0.448 0.237 367s 0.197 0.135 367s testing: mul_mat, [2, 2, 4, 2] = [4, 2, 4, 2] * [4, 2, 4, 2] 367s x0: [4, 2, 4, 2] 367s 0.746 -0.121 -0.587 0.670 367s 0.243 0.210 0.983 0.165 367s 367s x1: [4, 2, 4, 2] 367s -0.523 -0.821 0.223 0.448 367s -0.224 -0.968 -0.349 -0.965 367s 367s y: [2, 2, 4, 2] 367s -0.122 -0.006 367s -0.492 -0.759 367s testing: mul_mat, [2, 2, 1, 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 367s x0: [4, 2, 1, 1] 367s -0.224 -0.093 0.693 -0.398 367s 0.670 0.865 -0.190 0.703 367s 367s x1: [4, 2, 1, 1] 367s 0.567 -0.572 -0.973 0.580 367s -0.610 -0.427 -0.096 -0.325 367s 367s y: [2, 2, 1, 1] 367s -0.980 0.478 367s 0.239 -0.988 367s testing: mul_mat, [2, 4, 4, 1] = [2, 2, 4, 1] * [2, 4, 4, 1] 367s x0: [2, 2, 4, 1] 367s -0.585 -0.727 367s 0.370 -0.135 367s 367s x1: [2, 4, 4, 1] 367s -0.148 0.428 367s 0.357 -0.663 367s 0.966 -0.550 367s 0.912 0.213 367s 367s y: [2, 4, 4, 1] 367s -0.225 -0.112 367s 0.273 0.222 367s -0.165 0.432 367s -0.688 0.309 367s testing: mul_mat, [2, 2, 4, 2] = [2, 2, 4, 2] * [2, 2, 4, 2] 367s x0: [2, 2, 4, 2] 367s -0.236 0.484 367s -0.313 0.974 367s 367s x1: [2, 2, 4, 2] 367s -0.927 -0.075 367s -0.203 -0.935 367s 367s y: [2, 2, 4, 2] 367s 0.182 0.217 367s -0.404 -0.848 367s test-mul-mat0: iter:395/500 367s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 367s x0: [3, 4, 1, 1] 367s -0.593 -0.521 -0.413 367s 0.859 -0.912 0.727 367s 0.745 -0.197 -0.172 367s 0.269 0.900 0.721 367s 367s x1: [3, 2, 1, 1] 367s 0.725 -0.927 -0.343 367s -0.087 0.903 -0.709 367s 367s y: [4, 2, 1, 1] 367s 0.194 1.219 0.781 -0.886 367s -0.126 -1.415 -0.121 0.278 367s testing: mul_mat, [2, 4, 3, 1] = [3, 2, 3, 1] * [3, 4, 3, 1] 367s x0: [3, 2, 3, 1] 367s 0.986 0.150 0.321 367s 0.367 -0.926 -0.541 367s 367s x1: [3, 4, 3, 1] 367s -0.998 0.805 0.162 367s 0.904 0.677 0.225 367s 0.900 -0.687 0.975 367s 0.381 -0.076 0.138 367s 367s y: [2, 4, 3, 1] 367s -0.812 -1.199 367s 1.066 -0.417 367s 1.098 0.439 367s 0.408 0.135 367s testing: mul_mat, [4, 2, 3, 4] = [3, 4, 3, 4] * [3, 2, 3, 4] 367s x0: [3, 4, 3, 4] 367s -0.295 -0.026 0.382 367s 0.675 0.400 0.098 367s -0.764 -0.056 -0.557 367s 0.459 0.862 0.907 367s 367s x1: [3, 2, 3, 4] 367s 0.676 0.356 0.013 367s -0.086 0.536 -0.631 367s 367s y: [4, 2, 3, 4] 367s -0.203 0.600 -0.544 0.629 367s -0.229 0.094 0.388 -0.150 367s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 367s x0: [3, 2, 1, 1] 367s -0.696 -0.017 0.722 367s 0.592 -0.065 0.517 367s 367s x1: [3, 4, 1, 1] 367s 0.202 0.131 0.885 367s 0.847 -0.564 0.187 367s 0.774 0.481 0.505 367s -0.044 0.544 -0.453 367s 367s y: [2, 4, 1, 1] 367s 0.496 0.569 367s -0.445 0.635 367s -0.182 0.688 367s -0.306 -0.296 367s testing: mul_mat, [1, 3, 3, 1] = [2, 1, 3, 1] * [2, 3, 3, 1] 367s x0: [2, 1, 3, 1] 367s -0.571 -0.251 367s 367s x1: [2, 3, 3, 1] 367s -0.452 0.284 367s 0.492 -0.898 367s 0.206 -0.876 367s 367s y: [1, 3, 3, 1] 367s 0.187 367s -0.055 367s 0.102 367s testing: mul_mat, [4, 2, 3, 4] = [1, 4, 3, 4] * [1, 2, 3, 4] 367s x0: [1, 4, 3, 4] 367s -0.546 367s 0.879 367s -0.391 367s -0.573 367s 367s x1: [1, 2, 3, 4] 367s -0.722 367s -0.112 367s 367s y: [4, 2, 3, 4] 367s 0.395 -0.635 0.283 0.414 367s 0.061 -0.098 0.044 0.064 367s test-mul-mat0: iter:396/500 367s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 367s x0: [3, 3, 1, 1] 367s -0.880 0.703 -0.010 367s -0.780 0.274 0.841 367s -0.762 0.520 -0.763 367s 367s x1: [3, 4, 1, 1] 367s -0.992 -0.251 -0.913 367s 0.810 0.240 0.410 367s -0.749 0.755 0.862 367s -0.162 -0.276 -0.805 367s 367s y: [3, 4, 1, 1] 367s 0.706 -0.063 1.322 367s -0.548 -0.221 -0.805 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s 1.181 1.516 0.305 367s -0.044 -0.627 0.594 367s testing: mul_mat, [1, 3, 3, 1] = [3, 1, 3, 1] * [3, 3, 3, 1] 367s x0: [3, 1, 3, 1] 367s 0.198 0.063 -0.710 367s 367s x1: [3, 3, 3, 1] 367s 0.933 -0.666 0.322 367s 0.168 -0.183 -0.638 367s -0.720 -0.848 0.012 367s 367s y: [1, 3, 3, 1] 367s -0.086 367s 0.475 367s -0.204 367s testing: mul_mat, [4, 1, 3, 1] = [3, 4, 3, 1] * [3, 1, 3, 1] 367s x0: [3, 4, 3, 1] 367s 0.303 -0.036 -0.499 367s 0.922 0.530 0.722 367s -0.338 0.799 0.934 367s 0.621 0.342 0.986 367s 367s x1: [3, 1, 3, 1] 367s -0.421 -0.634 -0.015 367s 367s y: [4, 1, 3, 1] 367s -0.097 -0.735 -0.378 -0.493 367s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 367s x0: [3, 2, 1, 1] 367s 0.580 -0.861 0.761 367s 0.380 -0.592 -0.015 367s 367s x1: [3, 4, 1, 1] 367s -0.804 -0.059 0.817 367s -0.871 -0.437 0.159 367s -0.884 0.381 0.563 367s -0.608 0.331 0.365 367s 367s y: [2, 4, 1, 1] 367s 0.205 -0.283 367s -0.008 -0.074 367s -0.413 -0.570 367s -0.361 -0.433 367s testing: mul_mat, [4, 3, 3, 1] = [2, 4, 3, 1] * [2, 3, 3, 1] 367s x0: [2, 4, 3, 1] 367s 0.985 0.845 367s -0.519 -0.679 367s -0.958 -0.103 367s 0.316 0.225 367s 367s x1: [2, 3, 3, 1] 367s -0.897 0.408 367s 0.915 -0.237 367s -0.537 0.438 367s 367s y: [4, 3, 3, 1] 367s -0.538 0.188 0.816 -0.192 367s 0.701 -0.314 -0.851 0.236 367s -0.159 -0.018 0.469 -0.071 367s testing: mul_mat, [2, 2, 3, 1] = [4, 2, 3, 1] * [4, 2, 3, 1] 367s x0: [4, 2, 3, 1] 367s 0.414 0.857 0.584 0.055 367s -0.193 0.532 -0.478 -0.597 367s 367s x1: [4, 2, 3, 1] 367s 0.523 -0.181 -0.576 0.058 367s 0.423 0.341 -0.291 0.408 367s 367s y: [2, 2, 3, 1] 367s -0.272 0.044 367s 0.320 -0.004 367s test-mul-mat0: iter:397/500 367s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 367s x0: [1, 1, 1, 1] 367s -0.374 367s 367s x1: [1, 1, 1, 1] 367s 0.349 367s 367s y: [1, 1, 1, 1] 367s -0.130 367s testing: mul_mat, [1, 1, 2, 1] = [1, 1, 2, 1] * [1, 1, 2, 1] 367s x0: [1, 1, 2, 1] 367s -0.705 367s 367s x1: [1, 1, 2, 1] 367s -0.237 367s 367s y: [1, 1, 2, 1] 367s 0.167 367s testing: mul_mat, [3, 1, 2, 2] = [1, 3, 2, 2] * [1, 1, 2, 2] 367s x0: [1, 3, 2, 2] 367s -0.431 367s -0.764 367s -0.717 367s 367s x1: [1, 1, 2, 2] 367s 0.005 367s 367s y: [3, 1, 2, 2] 367s -0.002 -0.004 -0.004 367s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 367s x0: [1, 3, 1, 1] 367s -0.847 367s 0.322 367s -0.956 367s 367s x1: [1, 3, 1, 1] 367s 0.852 367s 0.334 367s 0.830 367s 367s y: [3, 3, 1, 1] 367s -0.721 0.274 -0.815 367s -0.283 0.108 -0.320 367s -0.703 0.267 -0.794 367s testing: mul_mat, [3, 1, 2, 1] = [3, 3, 2, 1] * [3, 1, 2, 1] 367s x0: [3, 3, 2, 1] 367s 0.437 0.387 0.623 367s -0.548 0.915 -0.802 367s 0.940 -0.491 -0.686 367s 367s x1: [3, 1, 2, 1] 367s -0.898 0.948 -0.193 367s 367s y: [3, 1, 2, 1] 367s -0.146 1.514 -1.177 367s testing: mul_mat, [2, 3, 2, 2] = [3, 2, 2, 2] * [3, 3, 2, 2] 367s x0: [3, 2, 2, 2] 367s 0.175 -0.149 -0.149 367s -0.910 -0.503 -0.229 367s 367s x1: [3, 3, 2, 2] 367s -0.790 0.059 0.968 367s -0.637 -0.619 -0.989 367s -0.535 -0.671 -0.182 367s 367s y: [2, 3, 2, 2] 367s -0.292 0.468 367s 0.128 1.117 367s 0.033 0.866 367s test-mul-mat0: iter:398/500 367s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 367s x0: [4, 2, 1, 1] 367s 0.628 -0.830 -0.915 0.470 367s -0.914 0.811 -0.513 -0.459 367s 367s x1: [4, 3, 1, 1] 367s 0.476 0.005 0.974 -0.348 367s 0.095 -0.176 0.149 0.946 367s 0.596 -0.835 -0.005 -0.261 367s 367s y: [2, 3, 1, 1] 367s -0.760 -0.771 367s 0.513 -0.740 367s 0.949 -1.099 367s testing: mul_mat, [2, 2, 2, 1] = [4, 2, 2, 1] * [4, 2, 2, 1] 367s x0: [4, 2, 2, 1] 367s 0.243 0.017 0.392 -0.762 367s 0.756 0.033 0.866 0.927 367s 367s x1: [4, 2, 2, 1] 367s -0.104 -0.234 -0.914 -0.238 367s 0.184 0.952 -0.318 0.950 367s 367s y: [2, 2, 2, 1] 367s -0.206 -1.097 367s -0.788 0.775 367s testing: mul_mat, [2, 2, 2, 3] = [4, 2, 2, 3] * [4, 2, 2, 3] 367s x0: [4, 2, 2, 3] 367s -0.222 -0.487 0.426 0.413 367s 0.124 0.421 0.384 0.748 367s 367s x1: [4, 2, 2, 3] 367s -0.360 -0.412 0.773 -0.408 367s 0.270 0.722 0.789 0.893 367s 367s y: [2, 2, 2, 3] 367s 0.441 -0.227 367s 0.294 1.308 367s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 367s x0: [4, 1, 1, 1] 367s 0.384 0.692 -0.610 -0.355 367s 367s x1: [4, 2, 1, 1] 367s 0.880 -0.360 -0.189 0.420 367s 0.255 -0.651 0.951 -0.878 367s 367s y: [1, 2, 1, 1] 367s 0.054 367s -0.621 367s testing: mul_mat, [3, 4, 2, 1] = [1, 3, 2, 1] * [1, 4, 2, 1] 367s x0: [1, 3, 2, 1] 367s 0.632 367s 0.787 367s 0.535 367s 367s x1: [1, 4, 2, 1] 367s 0.811 367s -0.807 367s -0.581 367s 0.355 367s 367s y: [3, 4, 2, 1] 367s 0.513 0.638 0.434 367s -0.510 -0.635 -0.432 367s -0.367 -0.457 -0.311 367s 0.225 0.280 0.190 367s testing: mul_mat, [1, 1, 2, 3] = [3, 1, 2, 3] * [3, 1, 2, 3] 367s x0: [3, 1, 2, 3] 367s 0.048 0.829 0.894 367s 367s x1: [3, 1, 2, 3] 367s -0.123 -0.827 0.311 367s 367s y: [1, 1, 2, 3] 367s -0.413 367s test-mul-mat0: iter:399/500 367s testiggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ng: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 367s x0: [3, 2, 1, 1] 367s 0.207 -0.350 0.566 367s 0.626 0.227 -0.387 367s 367s x1: [3, 1, 1, 1] 367s 0.992 -0.019 -0.507 367s 367s y: [2, 1, 1, 1] 367s -0.075 0.813 367s testing: mul_mat, [4, 2, 4, 1] = [3, 4, 4, 1] * [3, 2, 4, 1] 367s x0: [3, 4, 4, 1] 367s -0.826 0.587 0.558 367s 0.800 -0.187 -0.829 367s 0.255 0.934 -0.846 367s -0.451 -0.086 0.137 367s 367s x1: [3, 2, 4, 1] 367s 0.455 0.121 0.982 367s 0.293 -0.021 -0.017 367s 367s y: [4, 2, 4, 1] 367s 0.243 -0.472 -0.602 -0.081 367s -0.264 0.253 0.070 -0.133 367s testing: mul_mat, [4, 4, 4, 4] = [3, 4, 4, 4] * [3, 4, 4, 4] 367s x0: [3, 4, 4, 4] 367s 0.482 -0.937 0.937 367s 0.523 -0.117 0.738 367s -0.098 -0.534 -0.616 367s -0.741 -0.531 -0.592 367s 367s x1: [3, 4, 4, 4] 367s 0.568 -0.032 0.057 367s 0.279 0.020 -0.036 367s -0.547 0.200 -0.144 367s -0.929 0.499 0.715 367s 367s y: [4, 4, 4, 4] 367s 0.357 0.343 -0.074 -0.438 367s 0.082 0.117 -0.016 -0.196 367s -0.586 -0.416 0.036 0.385 367s -0.246 -0.017 -0.616 0.000 367s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 367s x0: [3, 2, 1, 1] 367s 0.276 0.701 0.647 367s 0.020 0.265 0.494 367s 367s x1: [3, 4, 1, 1] 367s -0.198 0.217 0.329 367s -0.546 0.522 0.885 367s 0.771 0.708 0.249 367s -0.947 0.551 -0.295 367s 367s y: [2, 4, 1, 1] 367s 0.310 0.216 367s 0.788 0.565 367s 0.870 0.326 367s -0.066 -0.019 367s testing: mul_mat, [4, 3, 4, 1] = [2, 4, 4, 1] * [2, 3, 4, 1] 367s x0: [2, 4, 4, 1] 367s -0.308 -0.521 367s -0.168 0.233 367s 0.739 -0.686 367s 0.957 -0.243 367s 367s x1: [2, 3, 4, 1] 367s -0.643 -0.721 367s -0.914 -0.306 367s 0.004 -0.248 367s 367s y: [4, 3, 4, 1] 367s 0.574 -0.060 0.020 -0.440 367s 0.441 0.083 -0.466 -0.801 367s 0.128 -0.058 0.173 0.064 367s testing: mul_mat, [3, 2, 4, 4] = [4, 3, 4, 4] * [4, 2, 4, 4] 367s x0: [4, 3, 4, 4] 367s 0.601 0.559 -0.083 -0.871 367s 0.980 -0.091 -0.342 -0.905 367s 0.225 0.602 -0.394 -0.889 367s 367s x1: [4, 2, 4, 4] 367s 0.050 0.549 0.012 0.528 367s -0.218 0.326 -0.715 -0.899 367s 367s y: [3, 2, 4, 4] 367s -0.124 -0.484 -0.132 367s 0.893 0.815 1.228 367s test-mul-mat0: iter:400/500 367s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 367s x0: [1, 3, 1, 1] 367s 0.595 367s 0.317 367s -0.969 367s 367s x1: [1, 3, 1, 1] 367s -0.271 367s 0.259 367s -0.567 367s 367s y: [3, 3, 1, 1] 367s -0.161 -0.086 0.262 367s 0.154 0.082 -0.251 367s -0.337 -0.180 0.549 367s testing: mul_mat, [4, 3, 3, 1] = [1, 4, 3, 1] * [1, 3, 3, 1] 367s x0: [1, 4, 3, 1] 367s 0.697 367s 0.547 367s 0.354 367s 0.032 367s 367s x1: [1, 3, 3, 1] 367s 0.397 367s 0.128 367s 0.301 367s 367s y: [4, 3, 3, 1] 367s 0.277 0.217 0.140 0.013 367s 0.089 0.070 0.045 0.004 367s 0.210 0.165 0.106 0.010 367s testing: mul_mat, [4, 4, 3, 3] = [1, 4, 3, 3] * [1, 4, 3, 3] 367s x0: [1, 4, 3, 3] 367s 0.112 367s 0.966 367s 0.690 367s -0.483 367s 367s x1: [1, 4, 3, 3] 367s 0.155 367s 0.413 367s -0.384 367s -0.587 367s 367s y: [4, 4, 3, 3] 367s 0.017 0.149 0.107 -0.075 367s 0.046 0.399 0.285 -0.200 367s -0.043 -0.371 -0.265 0.186 367s -0.066 -0.567 -0.405 0.284 367s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 367s x0: [1, 1, 1, 1] 367s 0.372 367s 367s x1: [1, 4, 1, 1] 367s 0.300 367s 0.059 367s 0.981 367s -0.095 367s 367s y: [1, 4, 1, 1] 367s 0.111 367s 0.022 367s 0.364 367s -0.035 367s testing: mul_mat, [2, 1, 3, 1] = [1, 2, 3, 1] * [1, 1, 3, 1] 367s x0: [1, 2, 3, 1] 367s 0.365 367s -0.975 367s 367s x1: [1, 1, 3, 1] 367s -0.529 367s 367s y: [2, 1, 3, 1] 367s -0.193 0.515 367s testing: mul_mat, [1, 1, 3, 3] = [2, 1, 3, 3] * [2, 1, 3, 3] 367s x0: [2, 1, 3, 3] 367s -0.521 0.680 367s 367s x1: [2, 1, 3, 3] 367s 0.242 0.414 367s 367s y: [1, 1, 3, 3] 367s 0.155 367s test-mul-mat0: iter:401/500 367s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 367s x0: [4, 4, 1, 1] 367s -0.282 -0.080 0.071 0.525 367s -0.025 0.598 0.697 0.056 367s -0.574 0.852 0.088 -0.462 367s 0.250 -0.466 -0.542 0.676 367s 367s x1: [4, 2, 1, 1] 367s 0.028 -0.395 0.974 0.104 367s -0.068 0.579 -0.761 0.240 367s 367s y: [4, 2, 1, 1] 367s 0.148 0.447 -0.315 -0.266 367s 0.045 -0.169 0.354 0.288 367s testing: mul_mat, [1, 4, 3, 1] = [4, 1, 3, 1] * [4, 4, 3, 1] 367s x0: [4, 1, 3, 1] 367s -0.123 0.126 0.084 -0.430 367s 367s x1: [4, 4, 3, 1] 367s 0.547 0.669 -0.941 0.383 367s -0.548 0.727 -0.589 0.056 367s 0.701 0.514 0.989 0.279 367s 0.754 0.229 0.369 -0.529 367s 367s y: [1, 4, 3, 1] 367s -0.227 367s 0.085 367s -0.059 367s 0.194 367s testing: mul_mat, [4, 1, 3, 4] = [4, 4, 3, 4] * [4, 1, 3, 4] 367s x0: [4, 4, 3, 4] 367s 0.977 0.836 -0.982 -0.916 367s -0.179 -0.718 0.738 0.920 367s 0.119 0.941 0.930 -0.671 367s -0.847 0.322 0.260 0.501 367s 367s x1: [4, 1, 3, 4] 367s 0.383 0.721 0.137 0.914 367s 367s y: [4, 1, 3, 4] 367s 0.005 0.355 0.238 0.402 367s testing: mul_mat, [4, 4, 1, 1] = [4, 4, 1, 1] * [4, 4, 1, 1] 367s x0: [4, 4, 1, 1] 367s -0.649 0.114 0.127 -0.056 367s -0.956 -0.537 -0.093 -0.546 367s 0.556 0.900 -0.434 -0.830 367s -0.820 -0.900 0.770 0.280 367s 367s x1: [4, 4, 1, 1] 367s -0.601 -0.664 0.611 -0.554 367s 0.250 -0.622 0.802 0.928 367s 0.003 0.035 0.928 -0.319 367s -0.125 0.931 -0.075 0.806 367s 367s y: [4, 4, 1, 1] 367s 0.422 1.176 -0.737 1.406 367s -0.184 -0.485 -1.540 1.233 367s 0.138 0.066 -0.105 0.592 367s 0.133 -0.814 0.132 -0.568 367s testing: mul_mat, [3, 4, 3, 1] = [4, 3, 3, 1] * [4, 4, 3, 1] 367s x0: [4, 3, 3, 1] 367s -0.471 -0.988 -0.599 0.734 367s 0.989 0.381 0.339 0.751 367s 0.336 -0.033 -0.631 0.941 367s 367s x1: [4, 4, 3, 1] 367s -0.935 -0.384 -0.470 -0.557 367s -0.582 -0.543 0.446 0.453 367s -0.614 -0.873 -0.672 -0.683 367s 0.052 -0.866 -0.514 0.403 367s 367s y: [3, 4, 3, 1] 367s 0.693 -1.649 -0.529 367s 0.875 -0.292 -0.033 367s 1.053 -1.681 -0.397 367s 1.435 -0.150 0.749 367s testing: mul_mat, [4, 4, 3, 4] = [3, 4, 3, 4] * [3, 4, 3, 4] 367s x0: [3, 4, 3, 4] 367s 0.613 -0.013 -0.545 367s 0.155 0.279 -0.421 367s 0.868 0.564 -0.572 367s 0.506 0.190 0.909 367s 367s x1: [3, 4, 3, 4] 367s 0.300 0.440 0.102 367s 0.670 0.174 -0.147 367s 0.611 0.521 0.081 367s 0.739 0.011 -0.772 367s 367s y: [4, 4, 3, 4] 367s 0.123 0.126 0.450 0.328 367s 0.488 0.214 0.764 0.238 367s 0.324 0.206 0.778 0.481 367s 0.874 0.442 1.089 -0.325 367s test-mul-mat0: iter:402/500 367s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 367s x0: [2, 2, 1, 1] 367s 0.809 0.613 367s -0.756 0.594 367s 367s x1: [2, 4, 1, 1] 367s 0.125 0.496 367s -0.211 -0.669 367s 0.174 -0.715 367s 0.404 0.289 367s 367s y: [2, 4, 1, 1] 367s 0.405 0.201 367s -0.581 -0.239 367s -0.298 -0.556 367s 0.504 -0.133 367s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 367s x0: [2, 3, 1, 1] 367s -0.407 0.369 367s 0.091 0.450 367s -0.360 0.678 367s 367s x1: [2, 2, 1, 1] 367s 0.104 -0.869 367s 0.845 -0.515 367s 367s y: [3, 2, 1, 1] 367s -0.363 -0.381 -0.627 367s -0.533 -0.155 -0.654 367s testing: mul_mat, [1, 3, 1, 2] = [2, 1, 1, 2] * [2, 3, 1, 2] 367s x0: [2, 1, 1, 2] 367s -0.770 -0.273 367s 367s x1: [2, 3, 1, 2] 367s 0.223 0.746 367s -0.340 -0.216 367s 0.371 -0.232 367s 367s y: [1, 3, 1, 2] 367s -0.375 367s 0.321 367s -0.222 367s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 367s x0: [2, 3, 1, 1] 367s 0.444 0.930 367s -0.707 0.246 367s 0.479 -0.927 367s 367s x1: [2, 1, 1, 1] 367s -0.660 -0.838 367s 367s y: [3, 1, 1, 1] 367s -1.072 0.260 0.461 367s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 367s x0: [3, 2, 1, 1] 367s -0.325 0.055 -0.922 367s 0.530 0.443 0.551 367s 367s x1: [3, 2, 1, 1] 367s 0.298 -0.663 0.522 367s 0.938 -0.985 -0.254 367s 367s y: [2, 2, 1, 1] 367s -0.615 0.152 367s -0.125 -0.079 367s testing: mul_mat, [1, 3, 1, 2] = [2, 1, 1, 2] * [2, 3, 1, 2] 367s x0: [2, 1, 1, 2] 367s 0.412 0.760 367s 367s x1: [2, 3, 1, 2] 367s 0.707 0.416 367s -0.623 0.145 367s 0.039 -0.392 367s 367s y: [1, 3, 1, 2] 367s 0.608 367s -0.146 367s -0.282 367s test-mul-mat0: iter:403/500 367s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 367s x0: [3, 4, 1, 1] 367s 0.981 -0.837 -0.901 367s 0.853 0.120 -0.255 367s -0.934 -0.761 0.026 367s 0.723 0.651 -0.214 367s 367s x1: [3, 4, 1, 1] 367s -0.177 0.011 -0.899 367s -0.492 0.686 0.631 367s 0.563 0.129 0.709 367s 0.114 -0.164 0.124 367s 367s y: [4, 4, 1, 1] 367s 0.628 0.079 0.133 0.071 367s -1.626 -0.499 -0.046 -0.044 367s -0.194 0.315 -0.605 0.340 367s 0.137 0.046 0.022 -0.051 367s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 367s x0: [3, 2, 1, 1] 367s 0.130 0.414 -0.797 367s 0.966 -0.461 0.694 367s 367s x1: [3, 4, 1, 1] 367s 0.310 -0.691 -0.381 367s -0.804 0.305 0.974 367s 0.019 -0.684 -0.925 367s 0.527 -0.999 0.705 367s 367s y: [2, 4, 1, 1] 367s 0.058 0.354 367s -0.754 -0.241 367s 0.456 -0.308 367s -0.907 1.459 367s testing: mul_mat, [4, 2, 1, 3] = [3, 4, 1, 3] * [3, 2, 1, 3] 367s x0: [3, 4, 1, 3] 367s 0.641 0.019 0.642 367s 0.642 -0.276 -0.269 367s -0.228 -0.861 -0.066 367s -0.262 -0.323 -0.371 367s 367s x1: [3, 2, 1, 3] 367s 0.947 -0.298 0.793 367s 0.800 0.821 -0.462 367s 367s y: [4, 2, 1, 3] 367s 1.110 0.477 -0.011 -0.446 367s 0.232 0.412 -0.859 -0.303 367s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 367s x0: [3, 4, 1, 1] 367s -0.369 -0.843 -0.387 367s 0.187 -0.232 0.353 367s 0.893 0.415 0.464 367s -0.427 0.148 0.402 367s 367s x1: [3, 4, 1, 1] 367s -0.575 -0.629 -0.329 367s 0.359 0.109 0.348 367s 0.988 0.794 -0.273 367s -0.591 -0.721 0.927 367s 367s y: [4, 4, 1, 1] 367s 0.869 -0.077 -0.927 0.020 367s -0.359 0.165 0.528 0.003 367s -0.928 -0.096 1.084 -0.414 367s 0.466 0.384 -0.396 0.518 367s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 367s x0: [4, 2, 1, 1] 367s -0.7ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s 06 0.280 -0.584 0.205 367s -0.511 -0.426 -0.351 0.603 367s 367s x1: [4, 3, 1, 1] 367s 0.405 0.421 0.210 0.099 367s 0.020 0.743 0.524 0.391 367s -0.586 -0.117 -0.500 0.762 367s 367s y: [2, 3, 1, 1] 367s -0.270 -0.400 367s -0.032 -0.275 367s 0.829 0.984 367s testing: mul_mat, [2, 4, 1, 3] = [2, 2, 1, 3] * [2, 4, 1, 3] 367s x0: [2, 2, 1, 3] 367s 0.609 0.357 367s 0.574 -0.743 367s 367s x1: [2, 4, 1, 3] 367s -0.459 0.778 367s 0.760 0.309 367s 0.193 -0.092 367s 0.921 -0.455 367s 367s y: [2, 4, 1, 3] 367s -0.001 -0.841 367s 0.573 0.207 367s 0.084 0.179 367s 0.398 0.867 367s test-mul-mat0: iter:404/500 367s testing: mul_mat, [2, 3, 1, 1] = [1, 2, 1, 1] * [1, 3, 1, 1] 367s x0: [1, 2, 1, 1] 367s -0.392 367s -0.839 367s 367s x1: [1, 3, 1, 1] 367s -0.755 367s 0.794 367s 0.885 367s 367s y: [2, 3, 1, 1] 367s 0.296 0.634 367s -0.311 -0.666 367s -0.347 -0.743 367s testing: mul_mat, [2, 2, 1, 1] = [1, 2, 1, 1] * [1, 2, 1, 1] 367s x0: [1, 2, 1, 1] 367s 0.325 367s 0.520 367s 367s x1: [1, 2, 1, 1] 367s 0.868 367s -0.186 367s 367s y: [2, 2, 1, 1] 367s 0.282 0.451 367s -0.060 -0.097 367s testing: mul_mat, [1, 2, 1, 2] = [1, 1, 1, 2] * [1, 2, 1, 2] 367s x0: [1, 1, 1, 2] 367s -0.906 367s 367s x1: [1, 2, 1, 2] 367s 0.302 367s 0.653 367s 367s y: [1, 2, 1, 2] 367s -0.273 367s -0.591 367s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 367s x0: [1, 1, 1, 1] 367s -0.440 367s 367s x1: [1, 1, 1, 1] 367s 0.519 367s 367s y: [1, 1, 1, 1] 367s -0.228 367s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 367s x0: [1, 1, 1, 1] 367s 0.280 367s 367s x1: [1, 1, 1, 1] 367s -0.682 367s 367s y: [1, 1, 1, 1] 367s -0.191 367s testing: mul_mat, [2, 1, 1, 2] = [1, 2, 1, 2] * [1, 1, 1, 2] 367s x0: [1, 2, 1, 2] 367s -0.545 367s 0.473 367s 367s x1: [1, 1, 1, 2] 367s -0.575 367s 367s y: [2, 1, 1, 2] 367s 0.313 -0.272 367s test-mul-mat0: iter:405/500 367s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 367s x0: [2, 4, 1, 1] 367s -0.875 -0.297 367s -0.795 0.645 367s -0.995 0.857 367s -0.835 -0.752 367s 367s x1: [2, 1, 1, 1] 367s 0.311 0.853 367s 367s y: [4, 1, 1, 1] 367s -0.525 0.302 0.422 -0.901 367s testing: mul_mat, [2, 4, 3, 1] = [2, 2, 3, 1] * [2, 4, 3, 1] 367s x0: [2, 2, 3, 1] 367s 0.029 0.287 367s 0.896 -0.114 367s 367s x1: [2, 4, 3, 1] 367s 0.119 -0.741 367s 0.848 -0.362 367s 0.132 -0.592 367s -0.045 -0.439 367s 367s y: [2, 4, 3, 1] 367s -0.209 0.191 367s -0.080 0.801 367s -0.166 0.185 367s -0.127 0.010 367s testing: mul_mat, [3, 2, 3, 4] = [2, 3, 3, 4] * [2, 2, 3, 4] 367s x0: [2, 3, 3, 4] 367s -0.906 0.409 367s -0.052 -0.143 367s 0.381 0.239 367s 367s x1: [2, 2, 3, 4] 367s 0.598 0.403 367s -0.913 -0.022 367s 367s y: [3, 2, 3, 4] 367s -0.376 -0.089 0.324 367s 0.818 0.051 -0.353 367s testing: mul_mat, [4, 3, 1, 1] = [2, 4, 1, 1] * [2, 3, 1, 1] 367s x0: [2, 4, 1, 1] 367s -0.489 0.185 367s -0.901 0.579 367s 0.530 -0.024 367s -0.504 -0.421 367s 367s x1: [2, 3, 1, 1] 367s 0.143 0.181 367s -0.340 -0.452 367s 0.988 -0.325 367s 367s y: [4, 3, 1, 1] 367s -0.036 -0.024 0.072 -0.148 367s 0.082 0.045 -0.170 0.362 367s -0.543 -1.078 0.532 -0.361 367s testing: mul_mat, [4, 2, 3, 1] = [4, 4, 3, 1] * [4, 2, 3, 1] 367s x0: [4, 4, 3, 1] 367s -0.439 0.370 -0.317 0.279 367s -0.606 0.847 0.113 0.501 367s 0.268 0.877 -0.866 0.957 367s -0.861 0.588 -0.253 -0.333 367s 367s x1: [4, 2, 3, 1] 367s 0.738 -0.782 -0.164 0.236 367s -0.455 -0.936 -0.833 0.367 367s 367s y: [4, 2, 3, 1] 367s -0.496 -1.010 -0.120 -1.132 367s 0.220 -0.428 0.131 -0.071 367s testing: mul_mat, [1, 4, 3, 4] = [4, 1, 3, 4] * [4, 4, 3, 4] 367s x0: [4, 1, 3, 4] 367s -0.999 -0.855 -0.493 -0.452 367s 367s x1: [4, 4, 3, 4] 367s 0.327 -0.611 -0.508 0.016 367s -0.635 0.197 -0.895 0.650 367s 0.592 -0.421 -0.082 0.018 367s -0.048 -0.731 0.004 0.719 367s 367s y: [1, 4, 3, 4] 367s 0.440 367s 0.613 367s -0.199 367s 0.346 367s test-mul-mat0: iter:406/500 367s testing: mul_mat, [1, 2, 1, 1] = [1, 1, 1, 1] * [1, 2, 1, 1] 367s x0: [1, 1, 1, 1] 367s -0.624 367s 367s x1: [1, 2, 1, 1] 367s -0.737 367s 0.198 367s 367s y: [1, 2, 1, 1] 367s 0.460 367s -0.123 367s testing: mul_mat, [4, 1, 2, 1] = [1, 4, 2, 1] * [1, 1, 2, 1] 367s x0: [1, 4, 2, 1] 367s -0.344 367s -0.879 367s 0.863 367s -0.610 367s 367s x1: [1, 1, 2, 1] 367s -0.078 367s 367s y: [4, 1, 2, 1] 367s 0.027 0.068 -0.067 0.047 367s testing: mul_mat, [4, 4, 2, 3] = [1, 4, 2, 3] * [1, 4, 2, 3] 367s x0: [1, 4, 2, 3] 367s -0.162 367s 0.064 367s 0.232 367s 0.106 367s 367s x1: [1, 4, 2, 3] 367s 0.218 367s -0.894 367s 0.692 367s -0.881 367s 367s y: [4, 4, 2, 3] 367s -0.035 0.014 0.050 0.023 367s 0.144 -0.058 -0.207 -0.095 367s -0.112 0.045 0.160 0.073 367s 0.142 -0.057 -0.204 -0.093 367s testing: mul_mat, [4, 4, 1, 1] = [1, 4, 1, 1] * [1, 4, 1, 1] 367s x0: [1, 4, 1, 1] 367s 0.702 367s -0.220 367s -0.925 367s -0.233 367s 367s x1: [1, 4, 1, 1] 367s -0.711 367s -0.740 367s 0.028 367s 0.474 367s 367s y: [4, 4, 1, 1] 367s -0.500 0.157 0.658 0.166 367s -0.519 0.163 0.684 0.173 367s 0.020 -0.006 -0.026 -0.007 367s 0.333 -0.104 -0.438 -0.111 367s testing: mul_mat, [1, 1, 2, 1] = [4, 1, 2, 1] * [4, 1, 2, 1] 367s x0: [4, 1, 2, 1] 367s -0.189 -0.473 -0.253 -0.682 367s 367s x1: [4, 1, 2, 1] 367s -0.988 0.180 -0.497 0.952 367s 367s y: [1, 1, 2, 1] 367s -0.423 367s testing: mul_mat, [4, 4, 2, 3] = [1, 4, 2, 3] * [1, 4, 2, 3] 367s x0: [1, 4, 2, 3] 367s -0.573 367s -0.782 367s 0.548 367s 0.417 367s 367s x1: [1, 4, 2, 3] 367s 0.605 367s -0.755 367s -0.002 367s -0.145 367s 367s y: [4, 4, 2, 3] 367s -0.346 -0.473 0.332 0.252 367s 0.432 0.590 -0.414 -0.315 367s 0.001 0.001 -0.001 -0.001 367s 0.083 0.114 -0.080 -0.061 367s test-mul-mat0: iter:407/500 367s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 367s x0: [1, 4, 1, 1] 367s 0.128 367s -0.971 367s 0.834 367s -0.324 367s 367s x1: [1, 1, 1, 1] 367s 0.601 367s 367s y: [4, 1, 1, 1] 367s 0.077 -0.584 0.502 -0.195 367s testing: mul_mat, [2, 4, 4, 1] = [1, 2, 4, 1] * [1, 4, 4, 1] 367s x0: [1, 2, 4, 1] 367s 0.627 367s 0.825 367s 367s x1: [1, 4, 4, 1] 367s 0.446 367s -0.615 367s 0.659 367s -0.532 367s 367s y: [2, 4, 4, 1] 367s 0.279 0.368 367s -0.386 -0.507 367s 0.413 0.544 367s -0.334 -0.439 367s testing: mul_mat, [1, 2, 4, 3] = [1, 1, 4, 3] * [1, 2, 4, 3] 367s x0: [1, 1, 4, 3] 367s 0.858 367s 367s x1: [1, 2, 4, 3] 367s -0.213 367s 0.583 367s 367s y: [1, 2, 4, 3] 367s -0.183 367s 0.500 367s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 367s x0: [1, 4, 1, 1] 367s 0.490 367s 0.635 367s -0.058 367s -0.551 367s 367s x1: [1, 1, 1, 1] 367s -0.094 367s 367s y: [4, 1, 1, 1] 367s -0.046 -0.059 0.005 0.052 367s testing: mul_mat, [1, 1, 4, 1] = [4, 1, 4, 1] * [4, 1, 4, 1] 367s x0: [4, 1, 4, 1] 367s 0.711 -0.684 0.903 -0.549 367s 367s x1: [4, 1, 4, 1] 367s -0.291 0.154 -0.684 0.119 367s 367s y: [1, 1, 4, 1] 367s -0.995 367s testing: mul_mat, [4, 4, 4, 3] = [1, 4, 4, 3] * [1, 4, 4, 3] 367s x0: [1, 4, 4, 3] 367s 0.003 367s 0.903 367s 0.372 367s -0.500 367s 367s x1: [1, 4, 4, 3] 367s 0.255 367s -0.083 367s 0.589 367s -0.625 367s 367s y: [4, 4, 4, 3] 367s 0.001 0.230 0.095 -0.128 367s -0.000 -0.075 -0.031 0.042 367s 0.002 0.532 0.219 -0.295 367s -0.002 -0.564 -0.232 0.313 367s test-mul-mat0: iter:408/500 367s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 367s x0: [1, 4, 1, 1] 367s -0.123 367s 0.307 367s 0.110 367s 0.879 367s 367s x1: [1, 1, 1, 1] 367s -0.899 367s 367s y: [4, 1, 1, 1] 367s 0.111 -0.276 -0.099 -0.790 367s testing: mul_mat, [3, 4, 4, 1] = [1, 3, 4, 1] * [1, 4, 4, 1] 367s x0: [1, 3, 4, 1] 367s -0.446 367s -0.904 367s -0.212 367s 367s x1: [1, 4, 4, 1] 367s -0.351 367s -0.235 367s -0.130 367s -0.528 367s 367s y: [3, 4, 4, 1] 367s 0.156 0.317 0.074 367s 0.105 0.213 0.050 367s 0.058 0.118 0.028 367s 0.236 0.478 0.112 367s testing: mul_mat, [3, 3, 4, 2] = [1, 3, 4, 2] * [1, 3, 4, 2] 367s x0: [1, 3, 4, 2] 367s -0.202 367s -0.412 367s -0.019 367s 367s x1: [1, 3, 4, 2] 367s 0.731 367s -0.560 367s 0.438 367s 367s y: [3, 3, 4, 2] 367s -0.148 -0.301 -0.014 367s 0.113 0.231 0.011 367s -0.088 -0.180 -0.008 367s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 367s x0: [1, 3, 1, 1] 367s -0.542 367s -0.551 367s -0.233 367s 367s x1: [1, 3, 1, 1] 367s 0.150 367s -0.288 367s 0.546 367s 367s y: [3, 3, 1, 1] 367s -0.081 -0.083 -0.035 367s 0.156 0.159 0.067 367s -0.295 -0.301 -0.127 367s testing: mul_mat, [2, 1, 4, 1] = [3, 2, 4, 1] * [3, 1, 4, 1] 367s x0: [3, 2, 4, 1] 367s -0.354 0.107 0.383 367s -0.465 0.163 0.972 367s 367s x1: [3, 1, 4, 1] 367s 0.257 0.037 0.748 367s 367s y: [2, 1, 4, 1] 367s 0.199 0.614 367s testing: mul_mat, [3, 3, 4, 2] = [2, 3, 4, 2] * [2, 3, 4, 2] 367s x0: [2, 3, 4, 2] 367s 0.807 0.215 367s -0.195 -0.809 367s 0.759 -0.203 367s 367s x1: [2, 3, 4, 2] 367s -0.130 0.574 367s -0.033 -0.655 367s -0.752 -0.191 367s 367s y: [3, 3, 4, 2] 367s 0.019 -0.439 -0.215 367s -0.168 0.537 0.108 367s -0.648 0.301 -0.532 367s test-mul-mat0: iter:409/500 367s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 367s x0: [4, 1, 1, 1] 367s -0.134 -0.378 0.868 -0.851 367s 367s x1: [4, 3, 1, 1] 367s 0.928 -0.998 0.344 0.110 367s -0.946 -0.183 0.249 -0.286 367s -0.182 -0.316 0.182 0.625 367s 367s y: [1, 3, 1, 1] 367s 0.458 367s 0.656 367s -0.230 367s testing: mul_mat, [2, 1, 4, 1] = [4, 2, 4, 1] * [4, 1, 4, 1] 367s x0: [4, 2, 4, 1] 367s 0.618 -0.823 -0.754 -0.132 367s -0.109 0.064 0.552 -0.927 367s 367s x1: [4, 1, 4, 1] 367s 0.562 0.305 -0.857 0.240 367s 367s y: [2, 1, 4, 1] 367s 0.711 -0.738 367s testing: mul_mat, [4, 2, 4, 2] = [4, 4, 4, 2] * [4, 2, 4, 2] 367s x0: [4, 4, 4, 2] 367s -0.023 -0.504 0.064 0.655 367s 0.633 -0.538 0.911 -0.978 367s 0.779 -0.168 -0.463 0.617 367s 0.649 0.293 0.215 0.792 367s 367s x1: [4, 2, 4, 2] 367s 0.674 -0.813 0.637 -0.435 367s 0.251 0.189 -0.361 0.941 367s 367s y: [4, 2, 4, 2] 367s 0.150 1.870 0.099 -0.008 367s 0.492 -1.192 0.911 0.886 367s testing: mul_mat, [1, 4, 1, 1] = [4, 1, 1, 1] * [4, 4, 1, 1] 367s x0: [4, 1, 1, 1] 367s -0.886 -0.263 -0.139 0.489 367s 367s x1: [4, 4, 1, 1] 367s -0.566 -0.955 -0.207 -0.349 367s -0.554 0.621 -0.631 -0.175 367s 0.384 0.294 0.924 -0.953 367s -0.140 -0.819 -0.100 0.585 367s 367s y: [1, 4, 1, 1] 367s 0.611 367s 0.329 367s -1.013 367s 0.640 367s testing: mul_mat, [3, 4, 4, 1] = [1, 3, 4, 1] * [1, 4, 4, 1] 367s x0: [1, 3, 4, 1] 367s 0.232 367s -0.203 367s 0.093 367s 367s x1: [1, 4, 4, 1] 367s 0.794 367s 0.665 367s 0.149 367s 0.388 367s 367s y: [3, 4, 4, 1] 367s 0.184 -0.161 0.074 367s 0.154 -0.135 0.062 367s 0.035 -0.030 0.014 367s 0.090 -0.079 0.036 367s testing: mul_mat, [3, 1, 4, 2] = [3, 3, 4, 2] * [3, 1, 4, 2] 367s x0: [3, 3, 4, 2] 367s -0.668 0.169 -0.132 367s 0.323 0.181 -0.471 367s 0.345 0.621 0.690 367s 367s x1: [3, 1, 4, 2] 367s 0.698 0.347 -0.930 367s 367s y: [3, 1, 4, 2] 367s -0.285 0.727 -0.186 367s test-mul-mat0: iter:410/500 367s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 367s x0: [2, 2, 1, 1] 367s -0.583 -0.659 367s 0.119 0.698 367s 367s x1: [2, 3, 1, 1] 367s -0.003 0.007 367s 0.632 0.589 367s 0.773 -0.209 367s 367s y: [2, 3, 1, 1] 367s -0.003 0.005 367s -0.756 0.486 367s -0.313 -0.054 367s testing: mul_mat, [2, 2, 4, 1] = [2, 2, 4, 1] * [2, 2, 4, 1] 367s x0: [2, 2, 4, 1] 367s -0.856 0.691 367s 0.767 0.140 367s 367s x1: [2, 2, 4, 1] 367s -0.881 -0.272 367s 0.468 0.121 367s 367s y: [2, 2, 4, 1] 367s 0.566 -0.713 367s -0.318 0.376 367s testing: mul_mat, [2, 2, 4, 4] = [2, 2, 4, 4] * [2, 2, 4, 4] 367s x0: [2, 2, 4, 4] 367s 0.640 -0.987 367s -0.549 0.226 367s 367s x1: [2, 2, 4, 4] 367s -0.945 0.771 367s 0.617 0.590 367s 367s y: [2, 2, 4, 4] 367s -1.365 0.693 367s -0.187 -0.206 367s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 367s x0: [2, 3, 1, 1] 367s -0.027 0.702 367s 0.585 -0.075 367s 0.415 -0.818 367s 367s x1: [2, 2, 1, 1] 367s -0.655 -0.016 367s -0.086 -0.347 367s 367s y: [3, 2, 1, 1] 367s 0.007 -0.382 -0.259 367s -0.241 -0.024 0.248 367s testing: mul_mat, [4, 2, 4, 1] = [3, 4, 4, 1] * [3, 2, 4, 1] 367s x0: [3, 4, 4, 1] 367s -0.461 0.641 0.833 367s -0.285 -0.393 -0.914 367s -0.576 -0.376 0.900 367s -0.759 0.505 0.100 367s 367s x1: [3, 2, 4, 1] 367s -0.617 0.865 0.226 367s 0.461 -0.606 0.267 367s 367s y: [4, 2, 4, 1] 367s 1.027 -0.371 0.234 0.928 367s -0.378 -0.137 0.202 -0.630 367s testing: mul_mat, [3, 3, 4, 4] = [4, 3, 4, 4] * [4, 3, 4, 4] 367s x0: [4, 3, 4, 4] 367s 0.368 -0.426 -0.303 -0.655 367s -0.414 0.685 -0.908 -0.801 367s -0.301 -0.283 0.929 -0.451 367s 367s x1: [4, 3, 4, 4] 367s -0.511 -0.940 0.144 -0.047 367s -0.886 0.787 0.549 -0.187 367s 0.083 -0.378 0.141 0.945 367s 367s y: [3, 3, 4, 4] 367s 0.199 -0.525 0.575 367s -0.705 0.556 0.638 367s -0.471 -1.179 -0.213 367s test-mul-mat0: iter:411/500 367s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 367s x0: [2, 3, 1, 1] 367s 0.168 -0.284 367s 0.089 -0.401 367s 0.312 0.190 367s 367s x1: [2, 2, 1, 1] 367s 0.404 0.187 367s 0.377 -0.520 367s 367s y: [3, 2, 1, 1] 367s 0.015 -0.039 0.162 367s 0.211 0.242 0.019 367s testing: mul_mat, [3, 3, 3, 1] = [2, 3, 3, 1] * [2, 3, 3, 1] 367s x0: [2, 3, 3, 1] 367s -0.246 -0.548 367s 0.251 -0.869 367s -0.067 -0.504 367s 367s x1: [2, 3, 3, 1] 367s 0.729 0.663 367s -0.547 0.489 367s -0.621 0.819 367s 367s y: [3, 3, 3, 1] 367s -0.543 -0.394 -0.384 367s -0.134 -0.562 -0.210 367s -0.296 -0.867 -0.371 367s testing: mul_mat, [2, 3, 3, 3] = [2, 2, 3, 3] * [2, 3, 3, 3] 367s x0: [2, 2, 3, 3] 367s 0.051 0.227 367s -0.380 -0.295 367s 367s x1: [2, 3, 3, 3] 367s 0.388 -0.955 367s -0.891 -0.204 367s -0.896 0.488 367s 367s y: [2, 3, 3, 3] 367s -0.197 0.134 367s -0.092 0.399 367s 0.065 0.196 367s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 367s x0: [2, 2, 1, 1] 367s -0.160 0.084 367s -0.948 0.159 367s 367s x1: [2, 2, 1, 1] 367s -0.445 -0.650 367s 0.285 -0.573 367s 367s y: [2, 2, 1, 1] 367s 0.017 0.318 367s -0.094 -0.362 367s testing: mul_mat, [4, 2, 3, 1] = [2, 4, 3, 1] * [2, 2, 3, 1] 367s x0: [2, 4, 3, 1] 367s 0.778 -0.242 367s -0.482 -0.338 367s 0.975 0.821 367s 0.490 0.702 367s 367s x1: [2, 2, 3, 1] 367s -0.272 -0.165 367s -0.501 0.215 367s 367s y: [4, 2, 3, 1] 367s -0.172 0.187 -0.401 -0.249 367s -0.442 0.169 -0.312 -0.095 367s testing: mul_mat, [4, 2, 3, 3] = [4, 4, 3, 3] * [4, 2, 3, 3] 367s x0: [4, 4, 3, 3] 367s -0.634 -0.844 0.172 -0.630 367s -0.718 0.893 0.944 -0.508 367s 0.746 -0.911 0.394 0.278 367s 0.103 -0.569 -0.561 0.906 367s 367s x1: [4, 2, 3, 3] 367s 0.827 -0.222 0.889 0.641 367s -0.137 0.219 -0.844 -0.360 367s 367s y: [4, 2, 3, 3] 367s -0.588 -0.278 1.347 0.293 367s -0.016 -0.320 -0.734 0.009 367s test-mul-mat0: iter:412/500 367s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 367s x0: [1, 1, 1, 1] 367s 0.112 367s 367s x1: [1, 1, 1, 1] 367s 0.148 367s 367s y: [1, 1, 1, 1] 367s 0.017 367s testing: mul_mat, [1, 1, 3, 1] = [1, 1, 3, 1] * [1, 1, 3, 1] 367s x0: [1, 1, 3, 1] 367s -0.855 367s 367s x1: [1, 1, 3, 1] 367s -0.167 367s 367s y: [1, 1, 3, 1] 367s 0.143 367s testing: mul_mat, [1, 1, 3, 2] = [1, 1, 3, 2] * [1, 1, 3, 2] 367s xggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 367s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s 0: [1, 1, 3, 2] 368s 0.562 368s 368s x1: [1, 1, 3, 2] 368s -0.249 368s 368s y: [1, 1, 3, 2] 368s -0.140 368s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 368s x0: [1, 2, 1, 1] 368s -0.210 368s -0.026 368s 368s x1: [1, 1, 1, 1] 368s 0.351 368s 368s y: [2, 1, 1, 1] 368s -0.074 -0.009 368s testing: mul_mat, [2, 1, 3, 1] = [2, 2, 3, 1] * [2, 1, 3, 1] 368s x0: [2, 2, 3, 1] 368s -0.380 0.018 368s 0.542 -0.172 368s 368s x1: [2, 1, 3, 1] 368s 0.543 -0.361 368s 368s y: [2, 1, 3, 1] 368s -0.213 0.356 368s testing: mul_mat, [4, 2, 3, 2] = [2, 4, 3, 2] * [2, 2, 3, 2] 368s x0: [2, 4, 3, 2] 368s 0.316 -0.576 368s -0.306 -0.115 368s -0.804 0.397 368s -0.063 0.458 368s 368s x1: [2, 2, 3, 2] 368s -0.540 0.034 368s -0.748 -0.898 368s 368s y: [4, 2, 3, 2] 368s -0.190 0.162 0.447 0.049 368s 0.281 0.332 0.245 -0.365 368s test-mul-mat0: iter:413/500 368s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 368s x0: [2, 2, 1, 1] 368s -0.702 0.020 368s -0.295 0.493 368s 368s x1: [2, 4, 1, 1] 368s -0.700 0.055 368s 0.099 -0.127 368s 0.451 0.232 368s 0.300 0.578 368s 368s y: [2, 4, 1, 1] 368s 0.493 0.233 368s -0.072 -0.092 368s -0.312 -0.019 368s -0.199 0.196 368s testing: mul_mat, [2, 2, 3, 1] = [2, 2, 3, 1] * [2, 2, 3, 1] 368s x0: [2, 2, 3, 1] 368s 0.914 0.207 368s -0.549 0.474 368s 368s x1: [2, 2, 3, 1] 368s 0.399 -0.787 368s 0.572 0.176 368s 368s y: [2, 2, 3, 1] 368s 0.203 -0.592 368s 0.560 -0.231 368s testing: mul_mat, [1, 2, 3, 1] = [2, 1, 3, 1] * [2, 2, 3, 1] 368s x0: [2, 1, 3, 1] 368s -0.268 -0.224 368s 368s x1: [2, 2, 3, 1] 368s 0.557 -0.307 368s 0.222 -0.423 368s 368s y: [1, 2, 3, 1] 368s -0.080 368s 0.035 368s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 368s x0: [2, 4, 1, 1] 368s 0.689 -0.227 368s -0.065 0.532 368s 0.908 -0.803 368s -0.782 0.845 368s 368s x1: [2, 1, 1, 1] 368s -0.784 -0.805 368s 368s y: [4, 1, 1, 1] 368s -0.358 -0.377 -0.066 -0.067 368s testing: mul_mat, [1, 2, 3, 1] = [4, 1, 3, 1] * [4, 2, 3, 1] 368s x0: [4, 1, 3, 1] 368s 0.655 0.494 0.425 0.187 368s 368s x1: [4, 2, 3, 1] 368s -0.544 0.755 -0.462 0.678 368s -0.668 0.936 -0.607 -0.692 368s 368s y: [1, 2, 3, 1] 368s -0.054 368s -0.362 368s testing: mul_mat, [1, 4, 3, 1] = [1, 1, 3, 1] * [1, 4, 3, 1] 368s x0: [1, 1, 3, 1] 368s 0.086 368s 368s x1: [1, 4, 3, 1] 368s 0.714 368s -0.914 368s 0.290 368s -0.606 368s 368s y: [1, 4, 3, 1] 368s 0.061 368s -0.078 368s 0.025 368s -0.052 368s test-mul-mat0: iter:414/500 368s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 368s x0: [2, 1, 1, 1] 368s 0.341 -0.181 368s 368s x1: [2, 4, 1, 1] 368s -0.574 -1.000 368s 0.674 -0.882 368s 0.270 0.316 368s 0.564 0.077 368s 368s y: [1, 4, 1, 1] 368s -0.015 368s 0.389 368s 0.035 368s 0.178 368s testing: mul_mat, [4, 1, 2, 1] = [2, 4, 2, 1] * [2, 1, 2, 1] 368s x0: [2, 4, 2, 1] 368s 0.737 -0.104 368s 0.587 -0.565 368s 0.582 -0.917 368s -0.496 0.789 368s 368s x1: [2, 1, 2, 1] 368s -0.650 0.427 368s 368s y: [4, 1, 2, 1] 368s -0.523 -0.622 -0.770 0.659 368s testing: mul_mat, [2, 4, 2, 2] = [2, 2, 2, 2] * [2, 4, 2, 2] 368s x0: [2, 2, 2, 2] 368s -0.353 0.325 368s 0.913 0.846 368s 368s x1: [2, 4, 2, 2] 368s -0.712 -0.837 368s 0.636 0.558 368s 0.479 0.200 368s -0.365 -0.884 368s 368s y: [2, 4, 2, 2] 368s -0.021 -1.358 368s -0.043 1.053 368s -0.104 0.607 368s -0.158 -1.082 368s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 368s x0: [2, 2, 1, 1] 368s -0.782 -0.830 368s -0.346 0.999 368s 368s x1: [2, 2, 1, 1] 368s -0.986 0.502 368s -0.563 -0.746 368s 368s y: [2, 2, 1, 1] 368s 0.355 0.843 368s 1.059 -0.550 368s testing: mul_mat, [2, 2, 2, 1] = [2, 2, 2, 1] * [2, 2, 2, 1] 368s x0: [2, 2, 2, 1] 368s 0.650 -0.355 368s -0.299 -0.963 368s 368s x1: [2, 2, 2, 1] 368s -0.854 0.649 368s -0.731 0.157 368s 368s y: [2, 2, 2, 1] 368s -0.786 -0.370 368s -0.531 0.067 368s testing: mul_mat, [4, 2, 2, 2] = [2, 4, 2, 2] * [2, 2, 2, 2] 368s x0: [2, 4, 2, 2] 368s 0.597 0.119 368s -0.622 0.758 368s 0.142 0.051 368s 0.401 0.820 368s 368s x1: [2, 2, 2, 2] 368s -0.153 0.690 368s 0.371 -0.340 368s 368s y: [4, 2, 2, 2] 368s -0.009 0.618 0.013 0.505 368s 0.181 -0.488 0.036 -0.130 368s test-mul-mat0: iter:415/500 368s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 368s x0: [2, 2, 1, 1] 368s 0.882 -0.732 368s -0.225 -0.090 368s 368s x1: [2, 2, 1, 1] 368s -0.678 -0.794 368s 0.946 -0.276 368s 368s y: [2, 2, 1, 1] 368s -0.017 0.224 368s 1.036 -0.189 368s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 368s x0: [2, 3, 1, 1] 368s 0.094 -0.079 368s 0.272 -0.481 368s 0.276 -0.916 368s 368s x1: [2, 2, 1, 1] 368s -0.342 0.945 368s 0.309 0.708 368s 368s y: [3, 2, 1, 1] 368s -0.107 -0.548 -0.960 368s -0.027 -0.257 -0.563 368s testing: mul_mat, [4, 3, 1, 1] = [2, 4, 1, 1] * [2, 3, 1, 1] 368s x0: [2, 4, 1, 1] 368s -0.421 0.050 368s 0.305 -0.295 368s 0.372 0.511 368s -0.349 -0.904 368s 368s x1: [2, 3, 1, 1] 368s -0.763 -0.056 368s 0.066 -0.605 368s 0.573 -0.513 368s 368s y: [4, 3, 1, 1] 368s 0.318 -0.216 -0.313 0.317 368s -0.058 0.198 -0.285 0.524 368s -0.267 0.326 -ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s 0.049 0.264 368s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 368s x0: [2, 2, 1, 1] 368s -0.844 0.650 368s 0.847 0.429 368s 368s x1: [2, 4, 1, 1] 368s -0.195 -0.467 368s -0.636 0.579 368s 0.443 0.022 368s 0.524 -0.248 368s 368s y: [2, 4, 1, 1] 368s -0.139 -0.366 368s 0.913 -0.290 368s -0.360 0.384 368s -0.603 0.338 368s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 368s x0: [2, 4, 1, 1] 368s -0.422 0.146 368s -0.004 -0.978 368s 0.443 -0.885 368s 0.065 -0.549 368s 368s x1: [2, 2, 1, 1] 368s -0.635 -0.074 368s 0.513 -0.398 368s 368s y: [4, 2, 1, 1] 368s 0.257 0.074 -0.216 -0.001 368s -0.274 0.388 0.579 0.252 368s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 368s x0: [4, 4, 1, 1] 368s 0.355 0.698 0.455 -0.079 368s -0.031 -0.974 -0.681 -0.855 368s 0.870 0.954 -0.725 0.546 368s -0.893 0.349 -0.032 0.918 368s 368s x1: [4, 2, 1, 1] 368s -0.273 0.487 0.961 0.379 368s 0.583 -0.234 0.912 0.947 368s 368s y: [4, 2, 1, 1] 368s 0.651 -1.444 -0.263 0.730 368s 0.384 -1.221 0.140 0.238 368s test-mul-mat0: iter:416/500 368s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 368s x0: [3, 2, 1, 1] 368s 0.476 0.715 0.797 368s -0.758 0.627 0.745 368s 368s x1: [3, 1, 1, 1] 368s -0.273 0.514 -0.663 368s 368s y: [2, 1, 1, 1] 368s -0.291 0.036 368s testing: mul_mat, [4, 2, 3, 1] = [3, 4, 3, 1] * [3, 2, 3, 1] 368s x0: [3, 4, 3, 1] 368s 0.265 0.377 0.789 368s -0.220 0.714 -0.997 368s -0.744 0.429 0.801 368s -0.503 0.057 0.545 368s 368s x1: [3, 2, 3, 1] 368s -0.413 -0.018 -0.287 368s -0.544 0.090 -0.588 368s 368s y: [4, 2, 3, 1] 368s -0.343 0.364 0.070 0.050 368s -0.575 0.770 -0.028 -0.043 368s testing: mul_mat, [4, 4, 3, 1] = [3, 4, 3, 1] * [3, 4, 3, 1] 368s x0: [3, 4, 3, 1] 368s 0.825 0.101 -0.337 368s 0.603 -0.454 0.822 368s -0.949 -0.987 0.152 368s -0.069 0.442 -0.983 368s 368s x1: [3, 4, 3, 1] 368s -0.827 -0.678 -0.065 368s 0.973 -0.181 0.992 368s 0.519 0.903 -0.969 368s 0.778 0.444 0.159 368s 368s y: [4, 4, 3, 1] 368s -0.730 -0.244 1.444 -0.179 368s 0.450 1.484 -0.594 -1.121 368s 0.847 -0.894 -1.531 1.316 368s 0.633 0.398 -1.152 -0.013 368s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 368s x0: [3, 4, 1, 1] 368s 0.773 -0.947 -0.696 368s -0.445 -0.530 -0.544 368s 0.015 0.233 -0.980 368s -0.350 0.884 0.335 368s 368s x1: [3, 4, 1, 1] 368s -0.660 0.616 -0.308 368s 0.492 -0.453 -0.866 368s 0.509 -0.175 0.870 368s -0.789 0.095 0.217 368s 368s y: [4, 4, 1, 1] 368s -0.880 0.135 0.435 0.672 368s 1.412 0.492 0.750 -0.862 368s -0.046 -0.607 -0.885 -0.041 368s -0.851 0.183 -0.202 0.433 368s testing: mul_mat, [3, 3, 3, 1] = [4, 3, 3, 1] * [4, 3, 3, 1] 368s x0: [4, 3, 3, 1] 368s -0.945 -0.284 -0.114 0.268 368s 0.265 -0.025 0.511 0.141 368s 0.576 -0.451 0.392 0.452 368s 368s x1: [4, 3, 3, 1] 368s -0.688 0.982 0.468 -0.170 368s 0.547 -0.853 0.170 0.163 368s -0.161 -0.339 0.711 -0.027 368s 368s y: [3, 3, 3, 1] 368s 0.273 0.009 -0.733 368s -0.250 0.275 0.840 368s 0.161 0.325 0.326 368s testing: mul_mat, [1, 4, 3, 1] = [3, 1, 3, 1] * [3, 4, 3, 1] 368s x0: [3, 1, 3, 1] 368s -0.512 -0.235 0.997 368s 368s x1: [3, 4, 3, 1] 368s -0.874 -0.773 -0.735 368s 0.518 0.494 0.406 368s -0.030 -0.239 0.951 368s 0.037 0.041 -0.450 368s 368s y: [1, 4, 3, 1] 368s -0.104 368s 0.024 368s 1.020 368s -0.477 368s test-mul-mat0: iter:417/500 368s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 368s x0: [3, 2, 1, 1] 368s -0.941 -0.938 -0.536 368s -0.657 -0.563 -0.536 368s 368s x1: [3, 2, 1, 1] 368s 0.999 -0.203 -0.402 368s -0.706 0.116 -0.559 368s 368s y: [2, 2, 1, 1] 368s -0.534 -0.327 368s 0.855 0.698 368s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 368s x0: [3, 4, 1, 1] 368s 0.120 0.994 0.430 368s 0.666 0.290 -0.959 368s -0.352 0.643 0.010 368s 0.695 -0.087 0.009 368s 368s x1: [3, 2, 1, 1] 368s -0.147 0.077 0.982 368s -0.142 0.123 -0.530 368s 368s y: [4, 2, 1, 1] 368s 0.481 -1.017 0.111 -0.100 368s -0.123 0.449 0.124 -0.115 368s testing: mul_mat, [3, 4, 1, 2] = [3, 3, 1, 2] * [3, 4, 1, 2] 368s x0: [3, 3, 1, 2] 368s -0.916 -0.797 0.825 368s 0.779 0.116 -0.166 368s -0.729 -0.373 -0.862 368s 368s x1: [3, 4, 1, 2] 368s -0.508 0.511 0.304 368s 0.608 0.952 0.894 368s 0.667 -0.987 -0.641 368s -0.990 -0.550 -0.177 368s 368s y: [3, 4, 1, 2] 368s 0.309 -0.387 -0.082 368s -0.577 0.436 -1.569 368s -0.354 0.511 0.434 368s 1.199 -0.806 1.080 368s testing: mul_mat, [1, 3, 1, 1] = [3, 1, 1, 1] * [3, 3, 1, 1] 368s x0: [3, 1, 1, 1] 368s -0.575 0.124 0.069 368s 368s x1: [3, 3, 1, 1] 368s -0.582 -0.430 -0.981 368s -0.862 0.220 0.295 368s 0.481 -0.011 -0.436 368s 368s y: [1, 3, 1, 1] 368s 0.214 368s 0.544 368s -0.308 368s testing: mul_mat, [1, 3, 1, 1] = [1, 1, 1, 1] * [1, 3, 1, 1] 368s x0: [1, 1, 1, 1] 368s -0.556 368s 368s x1: [1, 3, 1, 1] 368s -0.492 368s 0.328 368s -0.107 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s 368s y: [1, 3, 1, 1] 368s 0.273 368s -0.182 368s 0.059 368s testing: mul_mat, [4, 1, 1, 2] = [1, 4, 1, 2] * [1, 1, 1, 2] 368s x0: [1, 4, 1, 2] 368s 0.865 368s -0.563 368s 0.651 368s 0.897 368s 368s x1: [1, 1, 1, 2] 368s 0.727 368s 368s y: [4, 1, 1, 2] 368s 0.629 -0.410 0.473 0.653 368s test-mul-mat0: iter:418/500 368s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 368s x0: [3, 1, 1, 1] 368s 0.828 -0.510 -0.495 368s 368s x1: [3, 2, 1, 1] 368s 0.521 -0.908 0.076 368s -0.184 0.574 -0.935 368s 368s y: [1, 2, 1, 1] 368s 0.857 368s 0.018 368s testing: mul_mat, [2, 1, 3, 1] = [3, 2, 3, 1] * [3, 1, 3, 1] 368s x0: [3, 2, 3, 1] 368s -0.719 -0.001 0.281 368s -0.737 0.242 0.569 368s 368s x1: [3, 1, 3, 1] 368s -0.103 -0.002 0.833 368s 368s y: [2, 1, 3, 1] 368s 0.308 0.549 368s testing: mul_mat, [3, 2, 3, 2] = [3, 3, 3, 2] * [3, 2, 3, 2] 368s x0: [3, 3, 3, 2] 368s -0.163 -0.789 0.371 368s -0.708 -0.647 -0.878 368s 0.129 -0.224 0.872 368s 368s x1: [3, 2, 3, 2] 368s 0.249 -0.986 0.294 368s -0.854 0.012 0.126 368s 368s y: [3, 2, 3, 2] 368s 0.846 0.204 0.509 368s 0.176 0.486 -0.003 368s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 368s x0: [3, 3, 1, 1] 368s 0.660 0.315 -0.157 368s 0.147 -0.159 0.783 368s 0.521 -0.786 -0.632 368s 368s x1: [3, 3, 1, 1] 368s 0.252 0.771 -0.126 368s -0.310 0.965 -0.690 368s -0.958 -0.873 -0.417 368s 368s y: [3, 3, 1, 1] 368s 0.429 -0.184 -0.395 368s 0.208 -0.739 -0.484 368s -0.841 -0.330 0.451 368s testing: mul_mat, [4, 3, 3, 1] = [3, 4, 3, 1] * [3, 3, 3, 1] 368s x0: [3, 4, 3, 1] 368s 0.944 -0.146 -0.294 368s 0.194 0.950 -0.735 368s 0.481 0.190 0.930 368s 0.179 -0.916 -0.978 368s 368s x1: [3, 3, 3, 1] 368s 0.235 0.660 -0.531 368s -0.989 0.230 -0.244 368s 0.075 0.845 0.316 368s 368s y: [4, 3, 3, 1] 368s 0.282 1.063 -0.255 -0.043 368s -0.896 0.206 -0.658 -0.150 368s -0.146 0.585 0.491 -1.070 368s testing: mul_mat, [3, 3, 3, 2] = [4, 3, 3, 2] * [4, 3, 3, 2] 368s x0: [4, 3, 3, 2] 368s 0.304 -0.618 -0.021 0.239 368s 0.683 0.018 0.903 0.376 368s -0.517 0.367 -0.378 0.124 368s 368s x1: [4, 3, 3, 2] 368s 0.419 0.557 -0.418 -0.875 368s 0.823 -0.488 -0.853 0.877 368s -0.295 0.825 -0.028 -0.065 368s 368s y: [3, 3, 3, 2] 368s -0.417 -0.411 0.037 368s 0.779 0.114 -0.173 368s -0.615 -0.237 0.458 368s test-mul-mat0: iter:419/500 368s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 368s x0: [1, 1, 1, 1] 368s 0.214 368s 368s x1: [1, 1, 1, 1] 368s 0.563 368s 368s y: [1, 1, 1, 1] 368s 0.121 368s testing: mul_mat, [1, 1, 4, 1] = [1, 1, 4, 1] * [1, 1, 4, 1] 368s x0: [1, 1, 4, 1] 368s -0.736 368s 368s x1: [1, 1, 4, 1] 368s 0.338 368s 368s y: [1, 1, 4, 1] 368s -0.249 368s testing: mul_mat, [2, 1, 4, 1] = [1, 2, 4, 1] * [1, 1, 4, 1] 368s x0: [1, 2, 4, 1] 368s 0.745 368s 0.568 368s 368s x1: [1, 1, 4, 1] 368s 0.513 368s 368s y: [2, 1, 4, 1] 368s 0.383 0.291 368s testing: mul_mat, [2, 2, 1, 1] = [1, 2, 1, 1] * [1, 2, 1, 1] 368s x0: [1, 2, 1, 1] 368s -0.839 368s -0.727 368s 368s x1: [1, 2, 1, 1] 368s 0.070 368s -0.303 368s 368s y: [2, 2, 1, 1] 368s -0.058 -0.051 368s 0.254 0.220 368s testing: mul_mat, [2, 1, 4, 1] = [2, 2, 4, 1] * [2, 1, 4, 1] 368s x0: [2, 2, 4, 1] 368s 0.702 -0.281 368s 0.345 0.295 368s 368s x1: [2, 1, 4, 1] 368s 0.282 0.724 368s 368s y: [2, 1, 4, 1] 368s -0.006 0.311 368s testing: mul_mat, [1, 2, 4, 1] = [2, 1, 4, 1] * [2, 2, 4, 1] 368s x0: [2, 1, 4, 1] 368s -0.205 -0.296 368s 368s x1: [2, 2, 4, 1] 368s -0.846 -0.751 368s 0.137 -0.509 368s 368s y: [1, 2, 4, 1] 368s 0.396 368s 0.123 368s test-mul-mat0: iter:420/500 368s testing: mul_mat, [4, 3, 1, 1] = [2, 4, 1, 1] * [2, 3, 1, 1] 368s x0: [2, 4, 1, 1] 368s -0.722 -0.199 368s -0.098 -0.092 368s 0.070 -0.901 368s -0.536 -0.223 368s 368s x1: [2, 3, 1, 1] 368s -0.892 0.731 368s -0.462 -0.643 368s -0.132 0.029 368s 368s y: [4, 3, 1, 1] 368s 0.500 0.020 -0.721 0.316 368s 0.461 0.104 0.547 0.391 368s 0.090 0.010 -0.035 0.065 368s testing: mul_mat, [2, 4, 2, 1] = [2, 2, 2, 1] * [2, 4, 2, 1] 368s x0: [2, 2, 2, 1] 368s 0.905 -0.571 368s -0.602 -0.227 368s 368s x1: [2, 4, 2, 1] 368s 0.338 -0.461 368s -0.814 0.921 368s 0.334 -0.111 368s 0.362 0.670 368s 368s y: [2, 4, 2, 1] 368s 0.569 -0.098 368s -1.263 0.281 368s 0.366 -0.176 368s -0.055 -0.370 368s testing: mul_mat, [2, 2, 2, 1] = [2, 2, 2, 1] * [2, 2, 2, 1] 368s x0: [2, 2, 2, 1] 368s 0.425 -0.705 368s 0.182 -0.213 368s 368s x1: [2, 2, 2, 1] 368s -0.932 0.959 368s 0.330 -0.833 368s 368s y: [2, 2, 2, 1] 368s -1.072 -0.373 368s 0.727 0.237 368s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 368s x0: [2, 3, 1, 1] 368s -0.054 0.345 368s 0.352 -0.875 368s 0.315 -0.227 368s 368s x1: [2, 2, 1, 1] 368s 0.272 -0.005 368s -0.095 0.447 368s 368s y: [3, 2, 1, 1] 368s -0.016 0.100 0.087 368s 0.159 -0.424 -0.132 368s testing: mul_mat, [2, 2, 2, 1] = [3, 2, 2, 1] * [3, 2, 2, 1] 368s x0: [3, 2, 2, 1] 368s -0.187 -0.537 -0.669 368s 0.942 -0.222 0.951 368s 368s x1: [3, 2, 2, 1] 368s -0.489 0.176 0.032 368s -0ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s .421 0.134 -0.638 368s 368s y: [2, 2, 2, 1] 368s -0.024 -0.470 368s 0.434 -1.034 368s testing: mul_mat, [1, 3, 2, 1] = [2, 1, 2, 1] * [2, 3, 2, 1] 368s x0: [2, 1, 2, 1] 368s -0.606 0.781 368s 368s x1: [2, 3, 2, 1] 368s 0.600 -0.598 368s 0.147 -0.055 368s -0.473 0.919 368s 368s y: [1, 3, 2, 1] 368s -0.832 368s -0.132 368s 1.005 368s test-mul-mat0: iter:421/500 368s testing: mul_mat, [3, 1, 1, 1] = [3, 3, 1, 1] * [3, 1, 1, 1] 368s x0: [3, 3, 1, 1] 368s -0.748 -0.582 -0.931 368s -0.094 0.467 0.901 368s -0.493 0.869 0.048 368s 368s x1: [3, 1, 1, 1] 368s 0.171 -0.497 -0.185 368s 368s y: [3, 1, 1, 1] 368s 0.334 -0.415 -0.525 368s testing: mul_mat, [1, 3, 4, 1] = [3, 1, 4, 1] * [3, 3, 4, 1] 368s x0: [3, 1, 4, 1] 368s 0.434 -0.963 -0.108 368s 368s x1: [3, 3, 4, 1] 368s 0.452 -0.604 -0.033 368s -0.093 -0.902 0.919 368s -0.058 -0.065 0.232 368s 368s y: [1, 3, 4, 1] 368s 0.781 368s 0.729 368s 0.012 368s testing: mul_mat, [4, 1, 4, 1] = [3, 4, 4, 1] * [3, 1, 4, 1] 368s x0: [3, 4, 4, 1] 368s -0.898 0.977 0.834 368s -0.406 0.079 0.495 368s -0.971 0.116 -0.613 368s 0.186 -0.517 -0.708 368s 368s x1: [3, 1, 4, 1] 368s -0.992 -0.961 -0.646 368s 368s y: [4, 1, 4, 1] 368s -0.588 0.007 1.248 0.770 368s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 368s x0: [3, 2, 1, 1] 368s 0.800 0.805 -0.141 368s -0.199 -0.121 0.430 368s 368s x1: [3, 4, 1, 1] 368s -0.743 0.322 -0.327 368s 0.863 0.567 0.132 368s 0.094 0.402 0.976 368s -0.310 -0.445 0.337 368s 368s y: [2, 4, 1, 1] 368s -0.289 -0.031 368s 1.128 -0.184 368s 0.261 0.352 368s -0.654 0.260 368s testing: mul_mat, [2, 3, 4, 1] = [2, 2, 4, 1] * [2, 3, 4, 1] 368s x0: [2, 2, 4, 1] 368s -0.922 0.639 368s 0.600 -0.043 368s 368s x1: [2, 3, 4, 1] 368s 0.453 -0.054 368s -0.829 -0.586 368s -0.781 0.394 368s 368s y: [2, 3, 4, 1] 368s -0.452 0.274 368s 0.390 -0.472 368s 0.972 -0.486 368s testing: mul_mat, [1, 2, 4, 1] = [2, 1, 4, 1] * [2, 2, 4, 1] 368s x0: [2, 1, 4, 1] 368s -0.177 0.753 368s 368s x1: [2, 2, 4, 1] 368s 0.644 0.022 368s -0.906 -0.603 368s 368s y: [1, 2, 4, 1] 368s -0.098 368s -0.294 368s test-mul-mat0: iter:422/500 368s testing: mul_mat, [4, 3, 1, 1] = [2, 4, 1, 1] * [2, 3, 1, 1] 368s x0: [2, 4, 1, 1] 368s 0.624 0.185 368s -0.634 -0.603 368s 0.625 -0.310 368s 0.746 -0.656 368s 368s x1: [2, 3, 1, 1] 368s 0.106 -0.358 368s -0.931 -0.872 368s -0.263 -0.533 368s 368s y: [4, 3, 1, 1] 368s 0.000 0.148 0.178 0.314 368s -0.741 1.116 -0.311 -0.122 368s -0.263 0.489 0.001 0.153 368s testing: mul_mat, [1, 4, 4, 1] = [2, 1, 4, 1] * [2, 4, 4, 1] 368s x0: [2, 1, 4, 1] 368s 0.772 -0.317 368s 368s x1: [2, 4, 4, 1] 368s 0.812 -0.070 368s 0.140 -0.465 368s 0.753 -0.106 368s 0.718 0.645 368s 368s y: [1, 4, 4, 1] 368s 0.650 368s 0.256 368s 0.615 368s 0.350 368s testing: mul_mat, [1, 1, 4, 4] = [2, 1, 4, 4] * [2, 1, 4, 4] 368s x0: [2, 1, 4, 4] 368s -0.841 -0.502 368s 368s x1: [2, 1, 4, 4] 368s 0.772 0.766 368s 368s y: [1, 1, 4, 4] 368s -1.034 368s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 368s x0: [2, 2, 1, 1] 368s 0.436 0.010 368s -0.082 0.636 368s 368s x1: [2, 1, 1, 1] 368s -0.025 -0.986 368s 368s y: [2, 1, 1, 1] 368s -0.021 -0.625 368s testing: mul_mat, [3, 2, 4, 1] = [2, 3, 4, 1] * [2, 2, 4, 1] 368s x0: [2, 3, 4, 1] 368s 0.596 -0.378 368s 0.287 0.764 368s -0.247 0.315 368s 368s x1: [2, 2, 4, 1] 368s -0.824 0.165 368s -0.146 -0.689 368s 368s y: [3, 2, 4, 1] 368s -0.554 -0.110 0.256 368s 0.173 -0.569 -0.181 368s testing: mul_mat, [4, 2, 4, 4] = [3, 4, 4, 4] * [3, 2, 4, 4] 368s x0: [3, 4, 4, 4] 368s -0.432 0.111 0.372 368s 0.853 0.147 -0.832 368s 0.923 -0.419 -0.080 368s -0.219 0.626 0.580 368s 368s x1: [3, 2, 4, 4] 368s -0.124 -0.334 -0.263 368s 0.231 -0.828 0.731 368s 368s y: [4, 2, 4, 4] 368s -0.081 0.064 0.047 -0.334 368s 0.080 -0.533 0.502 -0.145 368s test-mul-mat0: iter:423/500 368s testing: mul_mat, [3, 1, 1, 1] = [1, 3, 1, 1] * [1, 1, 1, 1] 368s x0: [1, 3, 1, 1] 368s -0.430 368s 0.796 368s -0.779 368s 368s x1: [1, 1, 1, 1] 368s -0.130 368s 368s y: [3, 1, 1, 1] 368s 0.056 -0.103 0.101 368s testing: mul_mat, [4, 3, 2, 1] = [1, 4, 2, 1] * [1, 3, 2, 1] 368s x0: [1, 4, 2, 1] 368s -0.477 368s 0.997 368s 0.600 368s 0.440 368s 368s x1: [1, 3, 2, 1] 368s -0.897 368s -0.347 368s 0.662 368s 368s y: [4, 3, 2, 1] 368s 0.428 -0.895 -0.538 -0.395 368s 0.165 -0.346 -0.208 -0.153 368s -0.316 0.661 0.398 0.292 368s testing: mul_mat, [4, 4, 2, 2] = [1, 4, 2, 2] * [1, 4, 2, 2] 368s x0: [1, 4, 2, 2] 368s 0.933 368s -0.598 368s -0.470 368s 0.569 368s 368s x1: [1, 4, 2, 2] 368s -0.264 368s 0.470 368s -0.087 368s -0.564 368s 368s y: [4, 4, 2, 2] 368s -0.246 0.158 0.124 -0.150 368s 0.439 -0.281 -0.221 0.268 368s -0.081 0.052 0.041 -0.049 368s -0.526 0.337 0.265 -0.321 368s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 368s x0: [1, 3, 1, 1] 368s -0.997 368s 0.798 368s 0.323 368s 368s x1: [1, 4, 1, 1] 368s 0.048 368s -0.869 368s -0.351 368s 0.185 368s 368s y: [3, 4, 1, 1] 368s -0.048 0.038 0.015 368s 0.866 -0.694 -0.281 368s 0.350 -0.280 -0.113 368s -ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s 0.185 0.148 0.060 368s testing: mul_mat, [1, 1, 2, 1] = [3, 1, 2, 1] * [3, 1, 2, 1] 368s x0: [3, 1, 2, 1] 368s -0.507 -0.018 0.549 368s 368s x1: [3, 1, 2, 1] 368s -0.406 0.281 0.841 368s 368s y: [1, 1, 2, 1] 368s 0.662 368s testing: mul_mat, [3, 3, 2, 2] = [1, 3, 2, 2] * [1, 3, 2, 2] 368s x0: [1, 3, 2, 2] 368s -0.153 368s -0.395 368s 0.187 368s 368s x1: [1, 3, 2, 2] 368s -0.845 368s 0.946 368s -0.547 368s 368s y: [3, 3, 2, 2] 368s 0.129 0.333 -0.158 368s -0.144 -0.373 0.177 368s 0.083 0.216 -0.102 368s test-mul-mat0: iter:424/500 368s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 368s x0: [3, 4, 1, 1] 368s 0.514 -0.019 0.348 368s -0.880 -0.832 0.993 368s -0.952 0.950 0.920 368s 0.818 0.487 -0.592 368s 368s x1: [3, 4, 1, 1] 368s 0.797 0.306 0.198 368s 0.744 0.760 -0.114 368s -0.776 0.171 0.262 368s -0.162 -0.723 0.797 368s 368s y: [4, 4, 1, 1] 368s 0.473 -0.760 -0.286 0.685 368s 0.328 -1.400 -0.091 1.046 368s -0.311 0.801 1.142 -0.707 368s 0.208 1.535 0.201 -0.956 368s testing: mul_mat, [2, 4, 4, 1] = [3, 2, 4, 1] * [3, 4, 4, 1] 368s x0: [3, 2, 4, 1] 368s -0.856 -0.756 -0.856 368s -0.904 -0.459 -0.133 368s 368s x1: [3, 4, 4, 1] 368s -0.831 -0.957 0.389 368s 0.501 -0.852 -0.568 368s 0.298 0.454 0.629 368s 0.042 0.214 -0.485 368s 368s y: [2, 4, 4, 1] 368s 1.102 1.140 368s 0.702 0.014 368s -1.137 -0.561 368s 0.218 -0.072 368s testing: mul_mat, [3, 2, 4, 3] = [3, 3, 4, 3] * [3, 2, 4, 3] 368s x0: [3, 3, 4, 3] 368s -0.992 0.482 -0.317 368s -0.528 -0.681 0.675 368s -0.791 -0.087 -0.472 368s 368s x1: [3, 2, 4, 3] 368s 0.899 0.952 -0.513 368s 0.914 0.492 -0.446 368s 368s y: [3, 2, 4, 3] 368s -0.270 -1.469 -0.551 368s -0.528 -1.119 -0.556 368s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 368s x0: [3, 3, 1, 1] 368s 0.875 0.766 0.622 368s 0.577 0.595 -0.848 368s 0.424 0.345 -0.698 368s 368s x1: [3, 3, 1, 1] 368s -0.741 -0.481 -0.813 368s -0.018 0.487 -0.040 368s -0.828 0.823 -0.822 368s 368s y: [3, 3, 1, 1] 368s -1.523 -0.025 0.086 368s 0.332 0.313 0.188 368s -0.606 0.709 0.507 368s testing: mul_mat, [1, 3, 4, 1] = [3, 1, 4, 1] * [3, 3, 4, 1] 368s x0: [3, 1, 4, 1] 368s 0.111 -0.953 0.137 368s 368s x1: [3, 3, 4, 1] 368s -0.954 0.487 -0.730 368s 0.845 -0.249 0.752 368s -0.572 -0.733 -0.665 368s 368s y: [1, 3, 4, 1] 368s -0.670 368s 0.434 368s 0.544 368s testing: mul_mat, [1, 3, 4, 3] = [1, 1, 4, 3] * [1, 3, 4, 3] 368s x0: [1, 1, 4, 3] 368s 0.935 368s 368s x1: [1, 3, 4, 3] 368s -0.750 368s 0.204 368s 0.363 368s 368s y: [1, 3, 4, 3] 368s -0.701 368s 0.191 368s 0.339 368s test-mul-mat0: iter:425/500 368s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 368s x0: [1, 1, 1, 1] 368s -0.371 368s 368s x1: [1, 4, 1, 1] 368s 0.583 368s -0.578 368s -0.844 368s 0.440 368s 368s y: [1, 4, 1, 1] 368s -0.216 368s 0.214 368s 0.313 368s -0.163 368s testing: mul_mat, [3, 1, 4, 1] = [1, 3, 4, 1] * [1, 1, 4, 1] 368s x0: [1, 3, 4, 1] 368s 0.233 368s 0.808 368s 0.099 368s 368s x1: [1, 1, 4, 1] 368s -0.923 368s 368s y: [3, 1, 4, 1] 368s -0.215 -0.746 -0.092 368s testing: mul_mat, [4, 3, 4, 3] = [1, 4, 4, 3] * [1, 3, 4, 3] 368s x0: [1, 4, 4, 3] 368s -0.411 368s 0.328 368s -0.506 368s -0.750 368s 368s x1: [1, 3, 4, 3] 368s -0.277 368s -0.654 368s 0.435 368s 368s y: [4, 3, 4, 3] 368s 0.114 -0.091 0.140 0.208 368s 0.269 -0.215 0.331 0.490 368s -0.179 0.143 -0.220 -0.326 368s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 368s x0: [1, 3, 1, 1] 368s 0.090 368s -0.425 368s 0.277 368s 368s x1: [1, 4, 1, 1] 368s -0.786 368s -0.844 368s 0.547 368s 0.095 368s 368s y: [3, 4, 1, 1] 368s -0.071 0.334 -0.217 368s -0.076 0.359 -0.234 368s 0.049 -0.233 0.151 368s 0.009 -0.041 0.026 368s testing: mul_mat, [3, 1, 4, 1] = [3, 3, 4, 1] * [3, 1, 4, 1] 368s x0: [3, 3, 4, 1] 368s 0.228 -0.705 0.277 368s 0.452 0.164 0.999 368s -0.625 -0.649 -0.201 368s 368s x1: [3, 1, 4, 1] 368s -0.527 -0.001 0.556 368s 368s y: [3, 1, 4, 1] 368s 0.034 0.318 0.218 368s testing: mul_mat, [2, 3, 4, 3] = [3, 2, 4, 3] * [3, 3, 4, 3] 368s x0: [3, 2, 4, 3] 368s 0.674 0.300 0.906 368s 0.802 0.217 -0.188 368s 368s x1: [3, 3, 4, 3] 368s -0.275 0.730 -0.379 368s 0.524 -0.871 -0.166 368s -0.522 -0.924 -0.857 368s 368s y: [2, 3, 4, 3] 368s -0.310 0.009 368s -0.058 0.263 368s -1.405 -0.458 368s test-mul-mat0: iter:426/500 368s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 368s x0: [1, 3, 1, 1] 368s -0.879 368s 0.959 368s 0.122 368s 368s x1: [1, 4, 1, 1] 368s 0.887 368s 0.279 368s -0.659 368s -0.240 368s 368s y: [3, 4, 1, 1] 368s -0.779 0.851 0.108 368s -0.246 0.268 0.034 368s 0.579 -0.632 -0.081 368s 0.211 -0.230 -0.029 368s testing: mul_mat, [3, 3, 2, 1] = [1, 3, 2, 1] * [1, 3, 2, 1] 368s x0: [1, 3, 2, 1] 368s 0.795 368s 0.374 368s 0.789 368s 368s x1: [1, 3, 2, 1] 368s -0.509 368s -0.599 368s -0.718 368s 368s y: [3, 3, 2, 1] 368s -0.405 -0.190 -0.402 368s -0.477 -0.224 -0.473 368s -0.571 -0.269 -0.567 368s testing: mul_mat, [2, 3, 2, 3] = [1, 2, 2, 3] * [1, 3, 2, 3] 368s x0: [1, 2, 2, 3]ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s 368s -0.002 368s -0.519 368s 368s x1: [1, 3, 2, 3] 368s 0.661 368s 0.256 368s 0.080 368s 368s y: [2, 3, 2, 3] 368s -0.001 -0.343 368s -0.000 -0.133 368s -0.000 -0.042 368s testing: mul_mat, [3, 2, 1, 1] = [1, 3, 1, 1] * [1, 2, 1, 1] 368s x0: [1, 3, 1, 1] 368s -0.191 368s -0.759 368s -0.826 368s 368s x1: [1, 2, 1, 1] 368s 0.637 368s 0.678 368s 368s y: [3, 2, 1, 1] 368s -0.122 -0.483 -0.526 368s -0.130 -0.515 -0.560 368s testing: mul_mat, [4, 1, 2, 1] = [3, 4, 2, 1] * [3, 1, 2, 1] 368s x0: [3, 4, 2, 1] 368s -0.252 0.348 0.666 368s 0.149 -0.904 -0.455 368s -0.653 -0.656 0.879 368s 0.345 0.829 -0.736 368s 368s x1: [3, 1, 2, 1] 368s 0.154 -0.768 -0.517 368s 368s y: [4, 1, 2, 1] 368s -0.650 0.952 -0.050 -0.203 368s testing: mul_mat, [1, 3, 2, 3] = [4, 1, 2, 3] * [4, 3, 2, 3] 368s x0: [4, 1, 2, 3] 368s 0.123 0.773 -0.581 0.422 368s 368s x1: [4, 3, 2, 3] 368s -0.882 0.876 0.806 -0.932 368s 0.986 0.878 0.156 -0.266 368s 0.027 0.502 -0.920 -0.625 368s 368s y: [1, 3, 2, 3] 368s -0.293 368s 0.597 368s 0.663 368s test-mul-mat0: iter:427/500 368s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 368s x0: [2, 1, 1, 1] 368s -0.423 0.023 368s 368s x1: [2, 4, 1, 1] 368s -0.772 0.609 368s -0.976 0.351 368s 0.382 -0.557 368s -0.227 -0.990 368s 368s y: [1, 4, 1, 1] 368s 0.340 368s 0.421 368s -0.174 368s 0.073 368s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 368s x0: [2, 3, 1, 1] 368s 0.591 -0.409 368s 0.280 0.672 368s -0.581 -0.247 368s 368s x1: [2, 1, 1, 1] 368s 0.456 -0.853 368s 368s y: [3, 1, 1, 1] 368s 0.619 -0.446 -0.054 368s testing: mul_mat, [2, 3, 1, 2] = [2, 2, 1, 2] * [2, 3, 1, 2] 368s x0: [2, 2, 1, 2] 368s -0.871 -0.306 368s -0.954 -0.428 368s 368s x1: [2, 3, 1, 2] 368s -0.151 0.787 368s -0.974 -0.611 368s -0.276 0.916 368s 368s y: [2, 3, 1, 2] 368s -0.109 -0.193 368s 1.035 1.191 368s -0.040 -0.129 368s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 368s x0: [2, 2, 1, 1] 368s -0.218 -0.626 368s -0.042 0.201 368s 368s x1: [2, 2, 1, 1] 368s 0.079 0.112 368s -0.809 0.678 368s 368s y: [2, 2, 1, 1] 368s -0.087 0.019 368s -0.248 0.170 368s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 368s x0: [2, 4, 1, 1] 368s 0.712 -0.232 368s 0.653 -0.704 368s 0.241 0.064 368s -0.041 -0.889 368s 368s x1: [2, 2, 1, 1] 368s 0.711 0.224 368s -0.012 0.737 368s 368s y: [4, 2, 1, 1] 368s 0.454 0.307 0.185 -0.228 368s -0.179 -0.527 0.044 -0.655 368s testing: mul_mat, [2, 2, 1, 2] = [4, 2, 1, 2] * [4, 2, 1, 2] 368s x0: [4, 2, 1, 2] 368s -0.509 -0.169 -0.776 0.191 368s -0.513 0.479 -0.556 0.877 368s 368s x1: [4, 2, 1, 2] 368s -0.040 0.192 -0.195 0.005 368s 0.764 -0.727 0.061 0.420 368s 368s y: [2, 2, 1, 2] 368s 0.140 0.226 368s -0.234 -0.406 368s test-mul-mat0: iter:428/500 368s testing: mul_mat, [4, 4, 1, 1] = [1, 4, 1, 1] * [1, 4, 1, 1] 368s x0: [1, 4, 1, 1] 368s -0.342 368s -0.036 368s 0.355 368s 0.948 368s 368s x1: [1, 4, 1, 1] 368s 0.007 368s 0.590 368s -0.850 368s 0.146 368s 368s y: [4, 4, 1, 1] 368s -0.002 -0.000 0.002 0.006 368s -0.202 -0.021 0.209 0.560 368s 0.290 0.030 -0.301 -0.806 368s -0.050 -0.005 0.052 0.139 368s testing: mul_mat, [2, 4, 2, 1] = [1, 2, 2, 1] * [1, 4, 2, 1] 368s x0: [1, 2, 2, 1] 368s -0.671 368s 0.923 368s 368s x1: [1, 4, 2, 1] 368s 0.741 368s -0.038 368s -0.561 368s -0.772 368s 368s y: [2, 4, 2, 1] 368s -0.497 0.684 368s 0.025 -0.035 368s 0.376 -0.518 368s 0.518 -0.713 368s testing: mul_mat, [2, 2, 2, 2] = [1, 2, 2, 2] * [1, 2, 2, 2] 368s x0: [1, 2, 2, 2] 368s 0.796 368s -0.359 368s 368s x1: [1, 2, 2, 2] 368s 0.904 368s -0.526 368s 368s y: [2, 2, 2, 2] 368s 0.720 -0.325 368s -0.419 0.189 368s testing: mul_mat, [3, 2, 1, 1] = [1, 3, 1, 1] * [1, 2, 1, 1] 368s x0: [1, 3, 1, 1] 368s -0.681 368s 0.876 368s -0.127 368s 368s x1: [1, 2, 1, 1] 368s 0.880 368s 0.648 368s 368s y: [3, 2, 1, 1] 368s -0.599 0.771 -0.112 368s -0.441 0.568 -0.082 368s testing: mul_mat, [1, 1, 2, 1] = [3, 1, 2, 1] * [3, 1, 2, 1] 368s x0: [3, 1, 2, 1] 368s 0.631 -0.062 0.080 368s 368s x1: [3, 1, 2, 1] 368s -0.763 -0.672 0.111 368s 368s y: [1, 1, 2, 1] 368s -0.431 368s testing: mul_mat, [1, 3, 2, 2] = [1, 1, 2, 2] * [1, 3, 2, 2] 368s x0: [1, 1, 2, 2] 368s -0.134 368s 368s x1: [1, 3, 2, 2] 368s -0.349 368s -0.428 368s 0.010 368s 368s y: [1, 3, 2, 2] 368s 0.047 368s 0.058 368s -0.001 368s test-mul-mat0: iter:429/500 368s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 368s x0: [4, 2, 1, 1] 368s 0.600 0.485 0.171 -0.182 368s -0.984 -0.968 0.683 0.626 368s 368s x1: [4, 4, 1, 1] 368s -0.236 0.727 -0.371 0.370 368s 0.358 0.567 -0.550 -0.282 368s 0.349 -0.590 0.369 0.921 368s 0.420 0.143 -0.711 0.072 368s 368s y: [2, 4, 1, 1] 368s 0.080 -0.493 368s 0.447 -1.453 368s -0.182 1.057 368s 0.187 -0.992 368s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 368s x0: [4, 1, 1, 1] 368s -0.237 -0.421 0.900 0.213 368s 368s x1: [4, 2, 1, 1] 368s 0.212 -0.575 -0.890 -0.371 368s 0.495 0.704 0.393 0.221 368s 368s y: [1, 2, 1, 1] 368s -0.689 368s -0.012 368s testing: mul_mat, [3, 1, 1, 2] = [4, 3, 1, 2] * [4, 1, 1, 2] 368s x0: [4, 3, 1, 2] 368s 1.000 0.060 0.601 0.171 368s 0.877 0.617 0.203 0.561 368s 0.243 -0.585 0.985 0.353 368s 368s x1: [4, 1, 1, 2] 368s 0.297 0.250 0.623 -0.334 368s 368s y: [3, 1, 1, 2] 368s 0.629 0.354 0.422 368s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 368s x0: [4, 1, 1, 1] 368s 0.426 0.669 0.657 0.841 368s 368s x1: [4, 3, 1, 1] 368s -0.963 0.244 0.712 0.080 368s 0.053 0.172 0.196 0.052 368s -0.769 -0.204 -0.777 -0.891 368s 368s y: [1, 3, 1, 1] 368s 0.288 368s 0.310 368s -1.723 368s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 368s x0: [1, 2, 1, 1] 368s 0.322 368s 0.836 368s 368s x1: [1, 4, 1, 1] 368s 0.655 368s 0.010 368s -0.115 368s 0.135 368s 368s y: [2, 4, 1, 1] 368s 0.211 0.547 368s 0.003 0.008 368s -0.037 -0.097 368s 0.043 0.113 368s testing: mul_mat, [4, 1, 1, 2] = [2, 4, 1, 2] * [2, 1, 1, 2] 368s x0: [2, 4, 1, 2] 368s 0.695 0.002 368s 0.291 -0.947 368s -0.028 0.991 368s -0.061 -0.826 368s 368s x1: [2, 1, 1, 2] 368s 0.459 -0.478 368s 368s y: [4, 1, 1, 2] 368s 0.317 0.586 -0.487 0.367 368s test-mul-mat0: iter:430/500 368s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 368s x0: [3, 1, 1, 1] 368s 0.261 0.926 0.401 368s 368s x1: [3, 2, 1, 1] 368s 0.125 0.418 -0.027 368s 0.193 -0.261 -0.191 368s 368s y: [1, 2, 1, 1] 368s 0.409 368s -0.268 368s testing: mul_mat, [3, 1, 4, 1] = [3, 3, 4, 1] * [3, 1, 4, 1] 368s x0: [3, 3, 4, 1] 368s -0.140 -0.211 -0.028 368s 0.553 -0.028 0.087 368s -0.914 -0.189 -0.788 368s 368s x1: [3, 1, 4, 1] 368s -0.804 0.621 -0.308 368s 368s y: [3, 1, 4, 1] 368s -0.010 -0.488 0.859 368s testing: mul_mat, [4, 3, 4, 3] = [3, 4, 4, 3] * [3, 3, 4, 3] 368s x0: [3, 4, 4, 3] 368s -0.932 -0.447 -0.680 368s 0.294 -0.641 0.063 368s -0.742 0.763 0.624 368s 0.844 -0.143 -0.264 368s 368s x1: [3, 3, 4, 3] 368s 0.132 0.343 -0.441 368s 0.344 -0.154 -0.657 368s -0.251 0.089 -0.064 368s 368s y: [4, 3, 4, 3] 368s 0.023 -0.209 -0.111 0.179 368s 0.195 0.158 -0.782 0.486 368s 0.238 -0.135 0.214 -0.208 368s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 368s x0: [3, 4, 1, 1] 368s -0.995 -0.480 -0.574 368s -0.758 -0.355 0.825 368s -0.233 0.533 -0.121 368s 0.988 0.001 0.229 368s 368s x1: [3, 4, 1, 1] 368s 0.386 -0.298 0.630 368s -0.337 0.816 0.308 368s 0.979 0.097 -0.675 368s -0.937 0.569 0.329 368s 368s y: [4, 4, 1, 1] 368s -0.602 0.333 -0.325 0.525 368s -0.232 0.220 0.476 -0.263 368s -0.633 -1.333 -0.095 0.813 368s 0.470 0.780 0.482 -0.850 368s testing: mul_mat, [3, 3, 4, 1] = [4, 3, 4, 1] * [4, 3, 4, 1] 368s x0: [4, 3, 4, 1] 368s 0.891 -0.055 -0.356 -0.230 368s -0.082 0.699 -0.988 -0.850 368s 0.856 -0.379 -0.614 0.030 368s 368s x1: [4, 3, 4, 1] 368s -0.897 0.448 -0.218 -0.238 368s 0.603 0.410 -0.852 -0.695 368s 0.041 -0.189 -0.879 -0.652 368s 368s y: [3, 3, 4, 1] 368s -0.692 0.804 -0.811 368s 0.977 1.670 0.863 368s 0.509 1.287 0.627 368s testing: mul_mat, [2, 4, 4, 3] = [3, 2, 4, 3] * [3, 4, 4, 3] 368s x0: [3, 2, 4, 3] 368s 0.420 0.304 0.151 368s -0.322 -0.888 -0.352 368s 368s x1: [3, 4, 4, 3] 368s -0.615 0.311 0.515 368s 0.201 0.708 -0.976 368s -0.862 0.040 0.285 368s -0.586 0.982 -0.850 368s 368s y: [2, 4, 4, 3] 368s -0.086 -0.260 368s 0.152 -0.351 368s -0.307 0.141 368s -0.076 -0.385 368s test-mul-mat0: iter:431/500 368s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 368s x0: [3, 1, 1, 1] 368s 0.042 -0.146 -0.560 368s 368s x1: [3, 1, 1, 1] 368s 0.879 0.136 0.131 368s 368s y: [1, 1, 1, 1] 368s -0.056 368s testing: mul_mat, [2, 1, 4, 1] = [3, 2, 4, 1] * [3, 1, 4, 1] 368s x0: [3, 2, 4, 1] 368s -0.677 0.768 -0.248 368s 0.316 -0.079 0.154 368s 368s x1: [3, 1, 4, 1] 368s 0.028 0.244 0.067 368s 368s y: [2, 1, 4, 1] 368s 0.151 0.000 368s testing: mul_mat, [3, 2, 4, 2] = [3, 3, 4, 2] * [3, 2, 4, 2] 368s x0: [3, 3, 4, 2] 368s -0.659 -0.420 0.859 368s 0.459 0.407 0.865 368s 0.487 0.445 0.958 368s 368s x1: [3, 2, 4, 2] 368s -0.076 0.670 0.469 368s -0.793 0.392 0.302 368s 368s y: [3, 2, 4, 2] 368s 0.172 0.643 0.710 368s 0.617 0.057 0.077 368s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 368s x0: [3, 3, 1, 1] 368s -0.089 0.099 -0.354 368s -0.986 -0.386 0.057 368s 0.146 0.084 -0.070 368s 368s x1: [3, 3, 1, 1] 368s 0.992 0.175 0.257 368s 0.788 -0.628 -0.447 368s -0.663 0.649 0.732 368s 368s y: [3, 3, 1, 1] 368s -0.162 -1.031 0.142 368s 0.026 -0.560 0.094 368s -0.136 0.445 -0.094 368s testing: mul_mat, [3, 3, 4, 1] = [3, 3, 4, 1] * [3, 3, 4, 1] 368s x0: [3, 3, 4, 1] 368s -0.301 0.650 -0.406 368s 0.270 -0.801 -0.697 368s 0.338 -0.141 0.637 368s 368s x1: [3, 3, 4, 1] 368s 0.490 -0.961 0.026 368s -0.789 0.420 -0.170 368s -0.783 0.064 0.831 368s 368s y: [3, 3, 4, 1] 368s -0.783 0.884 0.318 368s 0.579 -0.431 -0.434 368s -0.060 -0.841 0.255 368s testing: mul_mat, [3, 3, 4, 2] = [3, 3, 4, 2] * [3ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s , 3, 4, 2] 368s x0: [3, 3, 4, 2] 368s 0.795 0.175 0.411 368s 0.980 0.723 -0.168 368s 0.768 -0.938 -0.820 368s 368s x1: [3, 3, 4, 2] 368s 0.280 -0.452 -0.690 368s -0.083 0.874 0.089 368s 0.885 -0.731 0.992 368s 368s y: [3, 3, 4, 2] 368s -0.140 0.063 1.205 368s 0.123 0.535 -0.956 368s 0.983 0.173 0.551 368s test-mul-mat0: iter:432/500 368s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 368s x0: [1, 3, 1, 1] 368s -0.621 368s -0.093 368s 0.225 368s 368s x1: [1, 4, 1, 1] 368s 0.574 368s -0.396 368s 0.721 368s 0.407 368s 368s y: [3, 4, 1, 1] 368s -0.356 -0.053 0.129 368s 0.246 0.037 -0.089 368s -0.447 -0.067 0.162 368s -0.252 -0.038 0.091 368s testing: mul_mat, [2, 3, 1, 1] = [1, 2, 1, 1] * [1, 3, 1, 1] 368s x0: [1, 2, 1, 1] 368s -0.396 368s 0.321 368s 368s x1: [1, 3, 1, 1] 368s 0.494 368s -0.747 368s 0.549 368s 368s y: [2, 3, 1, 1] 368s -0.196 0.159 368s 0.296 -0.240 368s -0.218 0.176 368s testing: mul_mat, [4, 2, 1, 2] = [1, 4, 1, 2] * [1, 2, 1, 2] 368s x0: [1, 4, 1, 2] 368s -0.820 368s 0.311 368s 0.706 368s 0.096 368s 368s x1: [1, 2, 1, 2] 368s -0.572 368s -0.981 368s 368s y: [4, 2, 1, 2] 368s 0.469 -0.178 -0.404 -0.055 368s 0.805 -0.305 -0.692 -0.094 368s testing: mul_mat, [4, 4, 1, 1] = [1, 4, 1, 1] * [1, 4, 1, 1] 368s x0: [1, 4, 1, 1] 368s -0.862 368s -0.678 368s 0.180 368s 0.774 368s 368s x1: [1, 4, 1, 1] 368s -0.399 368s -0.800 368s 0.534 368s -0.398 368s 368s y: [4, 4, 1, 1] 368s 0.344 0.271 -0.072 -0.309 368s 0.689 0.542 -0.144 -0.620 368s -0.460 -0.362 0.096 0.413 368s 0.343 0.270 -0.072 -0.309 368s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 368s x0: [4, 4, 1, 1] 368s -0.452 -0.338 0.442 -0.913 368s -0.766 -0.037 -0.009 -0.177 368s -0.056 -0.188 -0.566 0.680 368s 0.870 0.123 0.147 0.958 368s 368s x1: [4, 1, 1, 1] 368s -0.298 -0.912 -0.001 -0.804 368s 368s y: [4, 1, 1, 1] 368s 1.177 0.405 -0.357 -1.142 368s testing: mul_mat, [3, 4, 1, 2] = [4, 3, 1, 2] * [4, 4, 1, 2] 368s x0: [4, 3, 1, 2] 368s 0.015 -0.053 -0.941 0.259 368s 0.200 0.016 -0.393 0.197 368s -0.122 -0.918 0.988 -0.910 368s 368s x1: [4, 4, 1, 2] 368s -0.856 -0.720 -0.842 0.678 368s -0.118 0.932 0.816 0.204 368s 0.112 0.591 0.906 0.200 368s -0.410 -0.898 0.540 0.138 368s 368s y: [3, 4, 1, 2] 368s 0.993 0.282 -0.684 368s -0.766 -0.290 -0.220 368s -0.830 -0.285 0.158 368s -0.431 -0.281 1.282 368s test-mul-mat0: iter:433/500 368s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 368s x0: [3, 2, 1, 1] 368s 0.015 -0.594 0.806 368s 0.350 -0.168 0.089 368s 368s x1: [3, 4, 1, 1] 368s 0.726 -0.004 0.367 368s -0.259 -0.804 -0.755 368s 0.688 0.211 -0.673 368s 0.747 0.818 -0.685 368s 368s y: [2, 4, 1, 1] 368s 0.309 0.288 368s -0.134 -0.023 368s -0.658 0.145 368s -1.027 0.063 368s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 368s x0: [3, 2, 1, 1] 368s 0.896 0.094 -0.304 368s 0.371 -0.536 -0.577 368s 368s x1: [3, 2, 1, 1] 368s -0.308 -0.745 0.820 368s 0.854 0.799 0.439 368s 368s y: [2, 2, 1, 1] 368s -0.595 -0.188 368s 0.707 -0.364 368s testing: mul_mat, [2, 2, 1, 2] = [3, 2, 1, 2] * [3, 2, 1, 2] 368s x0: [3, 2, 1, 2] 368s 0.125 0.412 0.958 368s 0.956 -0.499 -0.350 368s 368s x1: [3, 2, 1, 2] 368s -0.633 0.831 0.164 368s -0.437 -0.924 -0.148 368s 368s y: [2, 2, 1, 2] 368s 0.421 -1.077 368s -0.577 0.095 368s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 368s x0: [3, 1, 1, 1] 368s 0.149 -0.387 -0.792 368s 368s x1: [3, 2, 1, 1] 368s 0.906 -0.147 0.026 368s 0.278 0.318 0.449 368s 368s y: [1, 2, 1, 1] 368s 0.171 368s -0.437 368s testing: mul_mat, [2, 3, 1, 1] = [1, 2, 1, 1] * [1, 3, 1, 1] 368s x0: [1, 2, 1, 1] 368s -0.935 368s 0.575 368s 368s x1: [1, 3, 1, 1] 368s 0.225 368s 0.465 368s 0.982 368s 368s y: [2, 3, 1, 1] 368s -0.211 0.129 368s -0.435 0.267 368s -0.919 0.564 368s testing: mul_mat, [3, 1, 1, 2] = [2, 3, 1, 2] * [2, 1, 1, 2] 368s x0: [2, 3, 1, 2] 368s -0.860 -0.211 368s 0.428 0.639 368s 0.584 0.905 368s 368s x1: [2, 1, 1, 2] 368s -0.653 0.671 368s 368s y: [3, 1, 1, 2] 368s 0.420 0.149 0.226 368s test-mul-mat0: iter:434/500 368s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 368s x0: [4, 1, 1, 1] 368s -0.758 -0.815 0.912 -0.576 368s 368s x1: [4, 2, 1, 1] 368s -0.522 -0.016 -0.289 -0.315 368s -0.791 -0.824 -0.333 0.838 368s 368s y: [1, 2, 1, 1] 368s 0.327 368s 0.485 368s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 368s x0: [4, 2, 1, 1] 368s 0.353 -0.276 -0.518 -0.354 368s -0.627 -0.854 -0.731 0.928 368s 368s x1: [4, 1, 1, 1] 368s 0.657 -0.006 -0.436 0.085 368s 368s y: [2, 1, 1, 1] 368s 0.429 -0.010 368s testing: mul_mat, [2, 2, 1, 4] = [4, 2, 1, 4] * [4, 2, 1, 4] 368s x0: [4, 2, 1, 4] 368s -0.760 0.421 -0.974 0.479 368s -0.097 0.754 0.403 -0.405 368s 368s x1: [4, 2, 1, 4] 368s -0.336 -0.081 -0.239 -0.501 368s 0.017 0.239 0.483 0.729 368s 368s y: [2, 2, 1, 4] 368s 0.214 0.078 368s -0.034 0.078 368s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s [4, 2, 1, 1] 368s x0: [4, 4, 1, 1] 368s -0.121 0.106 0.807 -0.567 368s 0.243 0.901 0.590 0.222 368s 0.817 0.926 -0.435 0.786 368s -0.282 -0.979 -0.114 -0.173 368s 368s x1: [4, 2, 1, 1] 368s 0.257 -0.976 0.593 -0.840 368s 0.778 -0.004 -0.245 0.438 368s 368s y: [4, 2, 1, 1] 368s 0.820 -0.653 -1.612 0.960 368s -0.540 0.138 1.083 -0.264 368s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 368s x0: [4, 2, 1, 1] 368s -0.789 -0.636 0.668 0.919 368s 0.742 -0.888 -0.615 0.258 368s 368s x1: [4, 4, 1, 1] 368s -0.864 -0.225 -0.299 0.972 368s -0.226 -0.185 0.229 -0.202 368s -0.592 0.389 -0.424 0.404 368s -0.855 -0.986 -0.606 0.024 368s 368s y: [2, 4, 1, 1] 368s 1.518 -0.007 368s 0.263 -0.197 368s 0.308 -0.420 368s 0.919 0.619 368s testing: mul_mat, [3, 4, 1, 4] = [2, 3, 1, 4] * [2, 4, 1, 4] 368s x0: [2, 3, 1, 4] 368s -0.573 -0.054 368s -0.341 -0.032 368s 0.878 0.245 368s 368s x1: [2, 4, 1, 4] 368s -0.049 -0.195 368s 0.691 -0.828 368s -0.409 -0.481 368s -0.692 0.366 368s 368s y: [3, 4, 1, 4] 368s 0.039 0.023 -0.091 368s -0.351 -0.209 0.405 368s 0.261 0.155 -0.477 368s 0.376 0.224 -0.518 368s test-mul-mat0: iter:435/500 368s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 368s x0: [3, 2, 1, 1] 368s -0.799 -0.978 -0.471 368s -0.042 -0.655 0.172 368s 368s x1: [3, 3, 1, 1] 368s -0.209 0.326 0.514 368s 0.218 0.985 0.392 368s -0.836 -0.047 -0.363 368s 368s y: [2, 3, 1, 1] 368s -0.394 -0.116 368s -1.322 -0.587 368s 0.886 0.004 368s testing: mul_mat, [2, 2, 3, 1] = [3, 2, 3, 1] * [3, 2, 3, 1] 368s x0: [3, 2, 3, 1] 368s 0.154 -0.552 -0.109 368s -0.893 0.085 0.220 368s 368s x1: [3, 2, 3, 1] 368s -0.385 -0.817 0.661 368s 0.316 0.549 -0.610 368s 368s y: [2, 2, 3, 1] 368s 0.320 0.420 368s -0.188 -0.370 368s testing: mul_mat, [1, 2, 3, 2] = [3, 1, 3, 2] * [3, 2, 3, 2] 368s x0: [3, 1, 3, 2] 368s -0.298 0.869 0.032 368s 368s x1: [3, 2, 3, 2] 368s 0.282 -0.116 -0.457 368s 0.944 0.889 0.858 368s 368s y: [1, 2, 3, 2] 368s -0.199 368s 0.519 368s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 368s x0: [3, 4, 1, 1] 368s 0.633 0.809 0.894 368s 0.561 -0.739 -0.526 368s -0.513 0.772 0.814 368s 0.730 0.945 -0.074 368s 368s x1: [3, 1, 1, 1] 368s -0.003 -0.183 -0.802 368s 368s y: [4, 1, 1, 1] 368s -0.867 0.555 -0.793 -0.115 368s testing: mul_mat, [1, 3, 3, 1] = [4, 1, 3, 1] * [4, 3, 3, 1] 368s x0: [4, 1, 3, 1] 368s -0.377 -0.604 -0.227 0.859 368s 368s x1: [4, 3, 3, 1] 368s 0.346 -0.415 -0.587 0.115 368s 0.412 -0.197 -0.328 -0.597 368s 0.180 0.236 0.865 -0.349 368s 368s y: [1, 3, 3, 1] 368s 0.352 368s -0.474 368s -0.706 368s testing: mul_mat, [1, 4, 3, 2] = [1, 1, 3, 2] * [1, 4, 3, 2] 368s x0: [1, 1, 3, 2] 368s -0.847 368s 368s x1: [1, 4, 3, 2] 368s -0.224 368s -0.281 368s 0.387 368s -0.927 368s 368s y: [1, 4, 3, 2] 368s 0.190 368s 0.238 368s -0.328 368s 0.786 368s test-mul-mat0: iter:436/500 368s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 368s x0: [4, 1, 1, 1] 368s 0.681 0.692 0.711 0.140 368s 368s x1: [4, 1, 1, 1] 368s -0.757 -0.895 -0.401 0.223 368s 368s y: [1, 1, 1, 1] 368s -1.389 368s testing: mul_mat, [2, 1, 2, 1] = [4, 2, 2, 1] * [4, 1, 2, 1] 368s x0: [4, 2, 2, 1] 368s 0.872 -0.731 0.888 -0.975 368s -0.375 -0.978 -0.855 -0.814 368s 368s x1: [4, 1, 2, 1] 368s -0.002 -0.908 0.389 -0.746 368s 368s y: [2, 1, 2, 1] 368s 1.734 1.163 368s testing: mul_mat, [1, 2, 2, 3] = [4, 1, 2, 3] * [4, 2, 2, 3] 368s x0: [4, 1, 2, 3] 368s 0.225 -0.024 -0.209 -0.723 368s 368s x1: [4, 2, 2, 3] 368s 0.932 0.612 -0.711 0.624 368s 0.322 0.429 -0.379 0.415 368s 368s y: [1, 2, 2, 3] 368s -0.107 368s -0.158 368s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 368s x0: [4, 1, 1, 1] 368s 0.016 0.329 -0.594 0.993 368s 368s x1: [4, 1, 1, 1] 368s 0.152 -0.137 -0.946 -0.653 368s 368s y: [1, 1, 1, 1] 368s -0.129 368s testing: mul_mat, [3, 4, 2, 1] = [1, 3, 2, 1] * [1, 4, 2, 1] 368s x0: [1, 3, 2, 1] 368s -0.464 368s -0.292 368s -0.253 368s 368s x1: [1, 4, 2, 1] 368s -0.880 368s -0.316 368s 0.169 368s 0.836 368s 368s y: [3, 4, 2, 1] 368s 0.409 0.257 0.223 368s 0.147 0.093 0.080 368s -0.079 -0.049 -0.043 368s -0.388 -0.244 -0.212 368s testing: mul_mat, [2, 1, 2, 3] = [3, 2, 2, 3] * [3, 1, 2, 3] 368s x0: [3, 2, 2, 3] 368s -0.861 -0.963 -0.664 368s -0.189 -0.787 -0.737 368s 368s x1: [3, 1, 2, 3] 368s -0.713 0.498 0.197 368s 368s y: [2, 1, 2, 3] 368s 0.003 -0.403 368s test-mul-mat0: iter:437/500 368s testing: mul_mat, [3, 1, 1, 1] = [3, 3, 1, 1] * [3, 1, 1, 1] 368s x0: [3, 3, 1, 1] 368s 0.568 0.489 0.360 368s -0.647 -0.484 -0.665 368s 0.644 0.029 -0.841 368s 368s x1: [3, 1, 1, 1] 368s 0.311 -0.864 0.086 368s 368s y: [3, 1, 1, 1] 368s -0.215 0.160 0.103 368s testing: mul_mat, [3, 3, 2, 1] = [3, 3, 2, 1] * [3, 3, 2, 1] 368s x0: [3, 3, 2, 1] 368s -0.200 -0.593 -0.283 368s -0.633 0.896 -0.923 368s -0.280 -0.588 -0.588 368s 368s x1: [3, 3, 2, 1] 368s 0.789 -0.790 0.846 368s -0.884 -0.535 0.391 368s -0.254 -0ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s .522 0.963 368s 368s y: [3, 3, 2, 1] 368s 0.070 -1.988 -0.254 368s 0.383 -0.282 0.331 368s 0.088 -1.197 -0.188 368s testing: mul_mat, [2, 3, 2, 4] = [3, 2, 2, 4] * [3, 3, 2, 4] 368s x0: [3, 2, 2, 4] 368s 0.709 -0.749 -0.622 368s 0.015 -0.924 0.278 368s 368s x1: [3, 3, 2, 4] 368s 0.016 -0.405 0.771 368s -0.469 0.895 -0.658 368s 0.028 -0.645 0.783 368s 368s y: [2, 3, 2, 4] 368s -0.165 0.589 368s -0.593 -1.017 368s 0.015 0.814 368s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 368s x0: [3, 2, 1, 1] 368s 0.783 0.612 0.353 368s -0.697 0.327 0.864 368s 368s x1: [3, 2, 1, 1] 368s 0.630 0.120 0.259 368s 0.390 0.006 0.168 368s 368s y: [2, 2, 1, 1] 368s 0.658 -0.176 368s 0.369 -0.125 368s testing: mul_mat, [4, 3, 2, 1] = [2, 4, 2, 1] * [2, 3, 2, 1] 368s x0: [2, 4, 2, 1] 368s -0.014 0.478 368s 0.809 0.444 368s 0.815 -0.290 368s 0.589 -0.264 368s 368s x1: [2, 3, 2, 1] 368s -0.466 -0.889 368s -0.634 -0.880 368s 0.885 -0.054 368s 368s y: [4, 3, 2, 1] 368s -0.418 -0.772 -0.121 -0.040 368s -0.411 -0.903 -0.261 -0.142 368s -0.039 0.691 0.736 0.536 368s testing: mul_mat, [2, 2, 2, 4] = [4, 2, 2, 4] * [4, 2, 2, 4] 368s x0: [4, 2, 2, 4] 368s -0.712 0.179 -0.020 -0.121 368s -0.915 0.112 0.906 0.804 368s 368s x1: [4, 2, 2, 4] 368s 0.560 0.605 0.360 0.671 368s 0.971 0.480 0.555 -0.083 368s 368s y: [2, 2, 2, 4] 368s -0.379 0.421 368s -0.607 -0.398 368s test-mul-mat0: iter:438/500 368s testing: mul_mat, [3, 1, 1, 1] = [3, 3, 1, 1] * [3, 1, 1, 1] 368s x0: [3, 3, 1, 1] 368s 0.212 0.180 -0.951 368s -0.566 -0.532 -0.936 368s -0.747 0.663 -0.595 368s 368s x1: [3, 1, 1, 1] 368s -0.650 -0.040 0.294 368s 368s y: [3, 1, 1, 1] 368s -0.424 0.115 0.285 368s testing: mul_mat, [2, 3, 4, 1] = [3, 2, 4, 1] * [3, 3, 4, 1] 368s x0: [3, 2, 4, 1] 368s -0.925 0.476 -0.795 368s -0.282 0.980 0.841 368s 368s x1: [3, 3, 4, 1] 368s -0.776 0.804 0.178 368s 0.731 -0.411 0.090 368s -0.802 -0.879 -0.729 368s 368s y: [2, 3, 4, 1] 368s 0.959 1.156 368s -0.943 -0.533 368s 0.903 -1.249 368s testing: mul_mat, [4, 2, 4, 4] = [3, 4, 4, 4] * [3, 2, 4, 4] 368s x0: [3, 4, 4, 4] 368s 0.376 -0.719 -0.660 368s -0.289 -0.106 0.581 368s -0.880 0.876 0.711 368s -0.786 0.913 -0.650 368s 368s x1: [3, 2, 4, 4] 368s -0.409 0.868 -0.928 368s 0.436 0.746 0.427 368s 368s y: [4, 2, 4, 4] 368s -0.165 -0.513 0.460 1.717 368s -0.653 0.043 0.572 0.061 368s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 368s x0: [3, 3, 1, 1] 368s -0.385 0.018 -0.087 368s -0.658 0.485 -0.216 368s -0.057 -0.027 0.595 368s 368s x1: [3, 4, 1, 1] 368s 0.581 0.299 0.776 368s 0.516 -0.671 -0.482 368s 0.979 -0.079 0.831 368s 0.417 -0.866 0.615 368s 368s y: [3, 4, 1, 1] 368s -0.286 -0.405 0.421 368s -0.169 -0.560 -0.298 368s -0.451 -0.862 0.441 368s -0.230 -0.827 0.366 368s testing: mul_mat, [2, 3, 4, 1] = [3, 2, 4, 1] * [3, 3, 4, 1] 368s x0: [3, 2, 4, 1] 368s -0.104 0.294 0.848 368s 0.414 -0.876 0.249 368s 368s x1: [3, 3, 4, 1] 368s 0.730 0.542 -0.204 368s 0.155 -0.794 -0.930 368s 0.646 0.136 -0.820 368s 368s y: [2, 3, 4, 1] 368s -0.089 -0.224 368s -1.039 0.527 368s -0.723 -0.056 368s testing: mul_mat, [3, 3, 4, 4] = [2, 3, 4, 4] * [2, 3, 4, 4] 368s x0: [2, 3, 4, 4] 368s -0.464 -0.889 368s 0.756 -0.787 368s -0.659 0.469 368s 368s x1: [2, 3, 4, 4] 368s 0.045 -0.537 368s -0.446 -0.910 368s 0.713 -0.168 368s 368s y: [3, 3, 4, 4] 368s 0.457 0.457 -0.282 368s 1.016 0.379 -0.133 368s -0.181 0.671 -0.549 368s test-mul-mat0: iter:439/500 368s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 368s x0: [4, 1, 1, 1] 368s -0.662 0.068 -0.623 0.193 368s 368s x1: [4, 1, 1, 1] 368s -0.613 -0.858 0.200 0.018 368s 368s y: [1, 1, 1, 1] 368s 0.226 368s testing: mul_mat, [1, 1, 2, 1] = [4, 1, 2, 1] * [4, 1, 2, 1] 368s x0: [4, 1, 2, 1] 368s -0.933 -0.768 0.420 -0.890 368s 368s x1: [4, 1, 2, 1] 368s 0.034 0.329 -0.606 -0.340 368s 368s y: [1, 1, 2, 1] 368s -0.236 368s testing: mul_mat, [1, 1, 2, 4] = [4, 1, 2, 4] * [4, 1, 2, 4] 368s x0: [4, 1, 2, 4] 368s -0.137 -0.203 0.775 0.930 368s 368s x1: [4, 1, 2, 4] 368s 0.215 0.257 -0.319 -0.872 368s 368s y: [1, 1, 2, 4] 368s -1.141 368s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 368s x0: [4, 1, 1, 1] 368s -0.193 -0.569 -0.718 0.978 368s 368s x1: [4, 1, 1, 1] 368s 0.457 0.944 -0.209 -0.476 368s 368s y: [1, 1, 1, 1] 368s -0.940 368s testing: mul_mat, [3, 4, 2, 1] = [1, 3, 2, 1] * [1, 4, 2, 1] 368s x0: [1, 3, 2, 1] 368s -0.622 368s 0.403 368s 0.349 368s 368s x1: [1, 4, 2, 1] 368s 0.555 368s -0.624 368s 0.336 368s 0.225 368s 368s y: [3, 4, 2, 1] 368s -0.345 0.224 0.194 368s 0.388 -0.252 -0.218 368s -0.209 0.136 0.118 368s -0.140 0.091 0.079 368s testing: mul_mat, [4, 1, 2, 4] = [3, 4, 2, 4] * [3, 1, 2, 4] 368s x0: [3, 4, 2, 4] 368s 0.333 0.074 -0.322 368s 0.134 -0.791 -0.801 368s 0.084 0.306 0.597 368s -0.589 0.005 0.316 368s 368s x1: [3, 1, 2, 4] 368s -0.105 0.931 -0.531 368s 368s y: [4, 1, 2, 4]ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s 368s 0.205 -0.325 -0.041 -0.101 368s test-mul-mat0: iter:440/500 368s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 368s x0: [4, 2, 1, 1] 368s 0.934 -0.863 0.299 -0.884 368s -0.180 -0.360 0.209 -0.336 368s 368s x1: [4, 4, 1, 1] 368s 0.728 -0.661 0.008 0.346 368s -0.978 0.478 -0.016 -0.327 368s -0.840 -0.138 -0.360 -0.098 368s -0.803 0.001 -0.120 -0.232 368s 368s y: [2, 4, 1, 1] 368s 0.947 -0.008 368s -1.042 0.111 368s -0.686 0.159 368s -0.583 0.197 368s testing: mul_mat, [1, 2, 4, 1] = [4, 1, 4, 1] * [4, 2, 4, 1] 368s x0: [4, 1, 4, 1] 368s -0.409 -0.812 0.760 -0.223 368s 368s x1: [4, 2, 4, 1] 368s -0.332 -0.920 0.930 0.155 368s 0.808 -0.815 -0.117 -0.852 368s 368s y: [1, 2, 4, 1] 368s 1.555 368s 0.433 368s testing: mul_mat, [4, 1, 4, 4] = [4, 4, 4, 4] * [4, 1, 4, 4] 368s x0: [4, 4, 4, 4] 368s 0.732 0.453 0.330 0.667 368s -0.386 -0.067 -0.322 0.177 368s 0.435 0.423 -0.823 -0.210 368s -0.504 0.799 -0.244 -0.328 368s 368s x1: [4, 1, 4, 4] 368s -0.140 -0.112 0.818 0.900 368s 368s y: [4, 1, 4, 4] 368s 0.718 -0.042 -0.971 -0.514 368s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 368s x0: [4, 2, 1, 1] 368s 0.519 0.834 -0.642 0.414 368s 0.935 0.834 -0.859 0.896 368s 368s x1: [4, 4, 1, 1] 368s -0.688 -0.333 -0.150 -0.741 368s -0.566 0.692 0.645 0.585 368s 0.114 0.724 0.612 -0.074 368s 0.297 0.584 0.021 0.599 368s 368s y: [2, 4, 1, 1] 368s -0.846 -1.457 368s 0.111 0.018 368s 0.239 0.118 368s 0.876 1.284 368s testing: mul_mat, [3, 4, 4, 1] = [2, 3, 4, 1] * [2, 4, 4, 1] 368s x0: [2, 3, 4, 1] 368s -0.938 -0.608 368s -0.690 0.981 368s 0.290 -0.812 368s 368s x1: [2, 4, 4, 1] 368s -0.095 -0.640 368s 0.680 0.359 368s -0.769 -0.670 368s 0.671 -0.102 368s 368s y: [3, 4, 4, 1] 368s 0.478 -0.562 0.492 368s -0.856 -0.117 -0.095 368s 1.129 -0.127 0.322 368s -0.567 -0.563 0.277 368s testing: mul_mat, [3, 2, 4, 4] = [3, 3, 4, 4] * [3, 2, 4, 4] 368s x0: [3, 3, 4, 4] 368s 0.053 0.185 -0.107 368s 0.701 -0.469 -0.356 368s 0.048 0.364 0.263 368s 368s x1: [3, 2, 4, 4] 368s -0.992 -0.813 0.098 368s -0.293 -0.430 -0.951 368s 368s y: [3, 2, 4, 4] 368s -0.214 -0.349 -0.318 368s 0.007 0.335 -0.420 368s test-mul-mat0: iter:441/500 368s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 368s x0: [2, 3, 1, 1] 368s -0.206 0.292 368s -0.755 0.201 368s -0.947 0.805 368s 368s x1: [2, 1, 1, 1] 368s -0.806 -0.308 368s 368s y: [3, 1, 1, 1] 368s 0.076 0.547 0.516 368s testing: mul_mat, [1, 3, 4, 1] = [2, 1, 4, 1] * [2, 3, 4, 1] 368s x0: [2, 1, 4, 1] 368s 0.375 -0.738 368s 368s x1: [2, 3, 4, 1] 368s -0.528 -0.251 368s 0.659 -0.565 368s 0.371 -0.113 368s 368s y: [1, 3, 4, 1] 368s -0.012 368s 0.664 368s 0.223 368s testing: mul_mat, [2, 1, 4, 1] = [2, 2, 4, 1] * [2, 1, 4, 1] 368s x0: [2, 2, 4, 1] 368s -0.944 0.749 368s 0.025 -0.771 368s 368s x1: [2, 1, 4, 1] 368s -0.043 0.483 368s 368s y: [2, 1, 4, 1] 368s 0.402 -0.374 368s testing: mul_mat, [1, 2, 1, 1] = [2, 1, 1, 1] * [2, 2, 1, 1] 368s x0: [2, 1, 1, 1] 368s 0.590 0.819 368s 368s x1: [2, 2, 1, 1] 368s 0.563 -0.410 368s -0.473 -0.138 368s 368s y: [1, 2, 1, 1] 368s -0.003 368s -0.392 368s testing: mul_mat, [3, 2, 4, 1] = [1, 3, 4, 1] * [1, 2, 4, 1] 368s x0: [1, 3, 4, 1] 368s 0.096 368s -0.105 368s -0.196 368s 368s x1: [1, 2, 4, 1] 368s 0.121 368s -0.975 368s 368s y: [3, 2, 4, 1] 368s 0.012 -0.013 -0.024 368s -0.094 0.103 0.191 368s testing: mul_mat, [4, 1, 4, 1] = [3, 4, 4, 1] * [3, 1, 4, 1] 368s x0: [3, 4, 4, 1] 368s -0.827 -0.973 0.596 368s -0.842 -0.208 0.901 368s -0.783 -0.384 0.488 368s 0.520 -0.078 0.105 368s 368s x1: [3, 1, 4, 1] 368s 0.790 0.279 -0.346 368s 368s y: [4, 1, 4, 1] 368s -1.131 -1.035 -0.894 0.353 368s test-mul-mat0: iter:442/500 368s testing: mul_mat, [4, 4, 1, 1] = [4, 4, 1, 1] * [4, 4, 1, 1] 368s x0: [4, 4, 1, 1] 368s 0.380 -0.759 -0.581 0.417 368s 0.564 -0.912 0.568 -0.518 368s -0.721 0.096 0.437 -0.218 368s 0.280 0.083 -0.050 0.989 368s 368s x1: [4, 4, 1, 1] 368s -0.832 -0.291 0.519 -0.953 368s -0.260 -0.837 0.961 0.399 368s 0.605 0.573 -0.769 0.204 368s 0.402 -0.417 -0.169 -0.976 368s 368s y: [4, 4, 1, 1] 368s -0.795 0.584 1.007 -1.225 368s 0.144 0.955 0.440 0.204 368s 0.327 -0.724 -0.762 0.458 368s 0.160 1.017 -0.191 -0.879 368s testing: mul_mat, [4, 4, 3, 1] = [4, 4, 3, 1] * [4, 4, 3, 1] 368s x0: [4, 4, 3, 1] 368s 0.474 0.350 0.698 -0.755 368s 0.595 -0.023 -0.888 0.277 368s 0.038 0.259 -0.958 -0.951 368s -0.606 0.410 -0.944 -0.741 368s 368s x1: [4, 4, 3, 1] 368s -0.398 -0.003 -0.271 -0.234 368s -0.042 -0.872 -0.629 -0.469 368s -0.641 0.575 0.934 -0.058 368s -0.594 0.957 -0.507 0.786 368s 368s y: [4, 4, 3, 1] 368s -0.202 -0.060 0.466 0.669 368s -0.410 0.424 0.821 0.609 368s 0.593 -1.239 -0.715 -0.214 368s -0.894 0.293 -0.037 0.649 368s testing: mul_mat, [4, 4, 3, 4] = [4, 4, 3, 4] * [4, 4, 3, 4] 368s x0: [4, 4, 3, 4] 368s 0.649 0.507 0.782 0.914 368s -0.463 0.968 -0.388 0.608 368s 0.256 -0.186 0.626 -0.261 368s 0.221 -0.004 -0.897 0.613 368s 368s x1: [4, 4, 3, 4] 368s 0.223 -0.124 -0.812 0.742 368s 0.479 -0.685 0.691 0.376 368s -0.073 0.028 0.059 0.809 368s 0.951 0.441 -0.836 -0.776 368s 368s y: [4, 4, 3, 4] 368s 0.125 0.542 -0.622 1.233 368s 0.847 -0.923 0.585 -0.281 368s 0.752 0.529 -0.198 0.426 368s -0.522 -0.161 -0.159 0.482 368s testing: mul_mat, [3, 4, 1, 1] = [4, 3, 1, 1] * [4, 4, 1, 1] 368s x0: [4, 3, 1, 1] 368s 0.357 -0.255 -0.352 0.193 368s 0.222 -0.984 0.573 -0.041 368s -0.969 0.659 -0.460 0.491 368s 368s x1: [4, 4, 1, 1] 368s 0.973 -0.715 -0.769 -0.317 368s -0.775 -0.066 -0.192 0.760 368s 0.359 0.066 -0.761 -0.234 368s -0.721 0.680 0.434 0.928 368s 368s y: [3, 4, 1, 1] 368s 0.740 0.492 -1.215 368s -0.045 -0.248 1.168 368s 0.334 -0.413 -0.069 368s -0.405 -0.619 1.402 368s testing: mul_mat, [3, 4, 3, 1] = [3, 3, 3, 1] * [3, 4, 3, 1] 368s x0: [3, 3, 3, 1] 368s -0.559 -0.305 0.188 368s -0.525 -0.975 -0.659 368s 0.998 -0.980 -0.678 368s 368s x1: [3, 4, 3, 1] 368s 0.984 0.026 0.464 368s -0.731 0.257 -0.853 368s -0.505 -0.809 -0.044 368s -0.745 0.550 -0.978 368s 368s y: [3, 4, 3, 1] 368s -0.471 -0.848 0.642 368s 0.170 0.695 -0.403 368s 0.521 1.083 0.319 368s 0.065 0.499 -0.620 368s testing: mul_mat, [1, 3, 3, 4] = [3, 1, 3, 4] * [3, 3, 3, 4] 368s x0: [3, 1, 3, 4] 368s -0.328 -0.791 0.388 368s 368s x1: [3, 3, 3, 4] 368s -0.515 -0.137 0.827 368s 0.528 0.303 -0.698 368s 0.526 0.998 -0.673 368s 368s y: [1, 3, 3, 4] 368s 0.598 368s -0.684 368s -1.224 368s test-mul-mat0: iter:443/500 368s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 368s x0: [3, 2, 1, 1] 368s 0.554 -0.132 -0.233 368s 0.534 -0.865 0.051 368s 368s x1: [3, 1, 1, 1] 368s -0.710 -0.213 -0.652 368s 368s y: [2, 1, 1, 1] 368s -0.213 -0.228 368s testing: mul_mat, [2, 2, 3, 1] = [3, 2, 3, 1] * [3, 2, 3, 1] 368s x0: [3, 2, 3, 1] 368s 0.457 0.012 0.701 368s -0.756 0.360 -0.167 368s 368s x1: [3, 2, 3, 1] 368s 0.579 -0.055 0.775 368s 0.424 0.015 -0.780 368s 368s y: [2, 2, 3, 1] 368s 0.808 -0.587 368s -0.353 -0.184 368s testing: mul_mat, [3, 2, 3, 3] = [3, 3, 3, 3] * [3, 2, 3, 3] 368s x0: [3, 3, 3, 3] 368s -0.072 -0.242 0.660 368s -0.851 0.148 -0.281 368s -0.985 -0.258 0.755 368s 368s x1: [3, 2, 3, 3] 368s 0.204 0.977 0.669 368s 0.533 -0.495 -0.730 368s 368s y: [3, 2, 3, 3] 368s 0.191 -0.217 0.052 368s -0.400 -0.322 -0.948 368s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 368s x0: [3, 3, 1, 1] 368s -0.552 -0.601 -0.078 368s 0.521 -0.160 -0.918 368s -0.014 -0.850 -0.579 368s 368s x1: [3, 3, 1, 1] 368s 0.430 -0.840 -0.106 368s -0.335 -0.940 -0.160 368s 0.781 0.921 0.408 368s 368s y: [3, 3, 1, 1] 368s 0.276 0.457 0.769 368s 0.763 0.123 0.897 368s -1.017 -0.115 -1.030 368s testing: mul_mat, [4, 3, 3, 1] = [3, 4, 3, 1] * [3, 3, 3, 1] 368s x0: [3, 4, 3, 1] 368s 0.291 -0.095 0.050 368s 0.902 -0.241 -0.198 368s 0.410 0.827 -0.479 368s -0.951 0.803 0.294 368s 368s x1: [3, 3, 3, 1] 368s -0.242 0.003 0.350 368s 0.778 -0.102 -0.709 368s -0.025 -0.282 -0.756 368s 368s y: [4, 3, 3, 1] 368s -0.053 -0.288 -0.264 0.335 368s 0.200 0.867 0.574 -1.031 368s -0.018 0.195 0.119 -0.425 368s testing: mul_mat, [1, 3, 3, 3] = [4, 1, 3, 3] * [4, 3, 3, 3] 368s x0: [4, 1, 3, 3] 368s -0.109 0.644 -0.743 0.649 368s 368s x1: [4, 3, 3, 3] 368s -0.693 -0.928 0.744 0.357 368s -0.126 -0.735 -0.350 -0.260 368s -0.833 -0.738 0.100 0.611 368s 368s y: [1, 3, 3, 3] 368s -0.844 368s -0.369 368s -0.063 368s test-mul-mat0: iter:444/500 368s testing: mul_mat, [1, 3, 1, 1] = [3, 1, 1, 1] * [3, 3, 1, 1] 368s x0: [3, 1, 1, 1] 368s 0.290 0.264 0.439 368s 368s x1: [3, 3, 1, 1] 368s 0.286 0.224 -0.879 368s -0.419 -0.543 0.485 368s 0.721 -0.853 -0.192 368s 368s y: [1, 3, 1, 1] 368s -0.244 368s -0.052 368s -0.101 368s testing: mul_mat, [2, 1, 3, 1] = [3, 2, 3, 1] * [3, 1, 3, 1] 368s x0: [3, 2, 3, 1] 368s -0.073 0.578 0.671 368s -0.558 -0.467 -0.443 368s 368s x1: [3, 1, 3, 1] 368s 0.675 0.924 0.573 368s 368s y: [2, 1, 3, 1] 368s 0.869 -1.061 368s testing: mul_mat, [3, 2, 3, 3] = [3, 3, 3, 3] * [3, 2, 3, 3] 368s x0: [3, 3, 3, 3] 368s 0.209 0.252 0.737 368s 0.680 -0.786 -0.408 368s 0.845 -0.765 -0.070 368s 368s x1: [3, 2, 3, 3] 368s 0.059 0.482 0.133 368s -0.265 0.406 -0.294 368s 368s y: [3, 2, 3, 3] 368s 0.232 -0.393 -0.328 368s -0.170 -0.380 -0.514 368s testing: mul_mat, [1, 3, 1, 1] = [3, 1, 1, 1] * [3, 3, 1, 1] 368s x0: [3, 1, 1, 1] 368s -0.153 0.231 0.700 368s 368s x1: [3, 3, 1, 1] 368s -0.027 0.580 0.304 368s 0.115 -0.008 -0.908 368s 0.641 0.395 -0.632 368s 368s y: [1, 3, 1, 1] 368s 0.351 368s -0.654 368s -0.449 368s testing: mul_mat, [3, 3, 3, 1] = [1, 3, 3, 1] * [1, 3, 3, 1] 368s x0: [1, 3, 3, 1] 368s -0.061 368s 0.965 368s -0.375 368s 368s x1: [1, 3, 3, 1] 368s -0.398 368s -0.582 368s -0.877 368s 368s y: [3, 3, 3, 1] 368s 0.024 -0.384 0.149 368s 0.036 -0.561 0.218 368s 0.054 -0.846 0.329 368s testing: mul_mat, [1, 1, 3, 3] = [3, 1, 3, 3] * [3, 1, 3, 3] 368s x0: [3, 1, 3, 3] 368s -0.660 -0.464 0.932 368s 368s x1: [3, 1, 3, 3] 368s 0.362 0.405 -0.054 368s 368s y: [1, 1, 3, 3] 368s -0.477 368s test-mul-mat0: iter:445/500 368s testing: mul_mat, [2, 2, 1, 1] = [1, 2, 1, 1] * [1, 2, 1, 1] 368s x0: [1, 2, 1, 1] 368s -0.497 368s 0.310 368s 368s x1: [1, 2, 1, 1] 368s -0.199 368s 0.369 368s 368s y: [2, 2, 1, 1] 368s 0.099 -0.062 368s -0.184 0.115 368s testing: mul_mat, [2, 2, 1, 1] = [1, 2, 1, 1] * [1, 2, 1, 1] 368s x0: [1, 2, 1, 1] 368s 0.868 368s -0.635 368s 368s x1: [1, 2, 1, 1] 368s -0.597 368s -0.440 368s 368s y: [2, 2, 1, 1] 368s -0.518 0.379 368s -0.383 0.280 368s testing: mul_mat, [3, 2, 1, 2] = [1, 3, 1, 2] * [1, 2, 1, 2] 368s x0: [1, 3, 1, 2] 368s -0.634 368s -0.631 368s 0.600 368s 368s x1: [1, 2, 1, 2] 368s -0.947 368s 0.353 368s 368s y: [3, 2, 1, 2] 368s 0.601 0.598 -0.569 368s -0.224 -0.223 0.212 368s testing: mul_mat, [4, 3, 1, 1] = [1, 4, 1, 1] * [1, 3, 1, 1] 368s x0: [1, 4, 1, 1] 368s -0.318 368s 0.365 368s 0.406 368s 0.143 368s 368s x1: [1, 3, 1, 1] 368s 0.666 368s -0.222 368s -0.034 368s 368s y: [4, 3, 1, 1] 368s -0.212 0.243 0.271 0.095 368s 0.071 -0.081 -0.090 -0.032 368s 0.011 -0.012 -0.014 -0.005 368s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 368s x0: [4, 2, 1, 1] 368s -0.553 -0.071 0.627 0.372 368s -0.122 0.007 -0.203 0.680 368s 368s x1: [4, 1, 1, 1] 368s -0.802 -0.000 0.944 0.568 368s 368s y: [2, 1, 1, 1] 368s 1.246 0.292 368s testing: mul_mat, [4, 4, 1, 2] = [2, 4, 1, 2] * [2, 4, 1, 2] 368s x0: [2, 4, 1, 2] 368s 0.425 0.437 368s 0.869 0.050 368s -0.475 0.817 368s 0.369 -0.685 368s 368s x1: [2, 4, 1, 2] 368s -0.850 -0.862 368s -0.867 -0.294 368s -0.496 -0.498 368s -0.694 -0.461 368s 368s y: [4, 4, 1, 2] 368s -0.738 -0.782 -0.300 0.277 368s -0.497 -0.768 0.171 -0.118 368s -0.428 -0.456 -0.171 0.158 368s -0.497 -0.626 -0.047 0.060 368s test-mul-mat0: iter:446/500 368s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 368s x0: [2, 3, 1, 1] 368s 0.276 0.580 368s -0.319 0.582 368s 0.698 -0.036 368s 368s x1: [2, 1, 1, 1] 368s 0.937 0.376 368s 368s y: [3, 1, 1, 1] 368s 0.476 -0.080 0.641 368s testing: mul_mat, [4, 3, 2, 1] = [2, 4, 2, 1] * [2, 3, 2, 1] 368s x0: [2, 4, 2, 1] 368s 0.553 -0.407 368s -0.564 -0.522 368s -0.074 -0.809 368s -0.891 0.763 368s 368s x1: [2, 3, 2, 1] 368s 0.252 0.224 368s -0.611 0.121 368s 0.748 0.759 368s 368s y: [4, 3, 2, 1] 368s 0.048 -0.259 -0.200 -0.054 368s -0.387 0.281 -0.053 0.636 368s 0.105 -0.818 -0.669 -0.088 368s testing: mul_mat, [1, 4, 2, 2] = [2, 1, 2, 2] * [2, 4, 2, 2] 368s x0: [2, 1, 2, 2] 368s 0.531 -0.509 368s 368s x1: [2, 4, 2, 2] 368s -0.583 0.174 368s -0.844 0.640 368s 0.564 0.277 368s 0.389 0.322 368s 368s y: [1, 4, 2, 2] 368s -0.399 368s -0.774 368s 0.158 368s 0.042 368s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 368s x0: [2, 4, 1, 1] 368s -0.410 -0.170 368s -0.128 -0.458 368s 0.547 0.295 368s -0.520 -0.022 368s 368s x1: [2, 1, 1, 1] 368s 0.294 0.350 368s 368s y: [4, 1, 1, 1] 368s -0.180 -0.198 0.264 -0.160 368s testing: mul_mat, [1, 2, 2, 1] = [4, 1, 2, 1] * [4, 2, 2, 1] 368s x0: [4, 1, 2, 1] 368s 0.015 0.626 0.313 -0.692 368s 368s x1: [4, 2, 2, 1] 368s 0.646 0.873 0.575 -0.204 368s 0.590 0.701 0.268 -0.976 368s 368s y: [1, 2, 2, 1] 368s 0.877 368s 1.205 368s testing: mul_mat, [3, 4, 2, 2] = [1, 3, 2, 2] * [1, 4, 2, 2] 368s x0: [1, 3, 2, 2] 368s -0.486 368s -0.040 368s -0.598 368s 368s x1: [1, 4, 2, 2] 368s 0.891 368s 0.378 368s -0.322 368s -0.567 368s 368s y: [3, 4, 2, 2] 368s -0.433 -0.036 -0.533 368s -0.184 -0.015 -0.226 368s 0.157 0.013 0.193 368s 0.276 0.023 0.339 368s test-mul-mat0: iter:447/500 368s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 368s x0: [4, 1, 1, 1] 368s 0.952 0.957 -0.419 -0.912 368s 368s x1: [4, 1, 1, 1] 368s 0.070 0.628 0.406 -0.274 368s 368s y: [1, 1, 1, 1] 368s 0.747 368s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 368s x0: [4, 2, 1, 1] 368s 0.284 -0.630 -0.532 0.686 368s 0.503 -0.590 0.334 0.650 368s 368s x1: [4, 1, 1, 1] 368s -0.112 -0.277 -0.752 0.856 368s 368s y: [2, 1, 1, 1] 368s 1.129 0.413 368s testing: mul_mat, [2, 2, 1, 3] = [4, 2, 1, 3] * [4, 2, 1, 3] 368s x0: [4, 2, 1, 3] 368s 0.409 -0.142 0.282 0.819 368s -0.809 -0.067 0.797 -0.514 368s 368s x1: [4, 2, 1, 3] 368s 0.978 -0.706 -0.894 -0.479 368s 0.598 0.461 -0.087 -0.207 368s 368s y: [2, 2, 1, 3] 368s -0.144 -1.209 368s -0.015 -0.478 368s testing: mul_mat, [3, 2, 1, 1] = [4, 3, 1, 1] * [4, 2, 1, 1] 368s x0: [4, 3, 1, 1] 368s -0.388 0.545 -0.416 0.084 368s 0.735 0.532 0.851 0.082 368s 0.596 -0.918 -0.834 0.044 368s 368s x1: [4, 2, 1, 1] 368s -0.860 0.566 0.375 0.550 368s -0.493 -0.453 0.330 0.916 368s 368s y: [3, 2, 1, 1] 368s 0.533 0.033 -1.321 368s -0.116 -0.248 -0.113 368s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 368s x0: [3, 3, 1, 1] 368s -0.505 0.825 -0.563 368s 0.976 0.892 0.627 368s 0.325 -0.270 -0.674 368s 368s x1: [3, 4, 1, 1] 368s 0.778 -0.704 -0.658 368s -0.928 -0.782 0.768 368s 0.918 -0.195 0.903 368s 0.404 -0.055 0.469 368s 368s y: [3, 4, 1, 1] 368s -0.603 -0.281 0.886 368s -0.609 -1.121 -0.609 368s -1.133 1.289 -0.257 368s -0.513 0.640 -0.170 368s testing: mul_mat, [3, 3, 1, 3] = [3, 3, 1, 3] * [3, 3, 1, 3] 368s x0: [3, 3, 1, 3] 368s 0.114 0.095 0.661 368s -0.403 0.757 -0.233 368s 0.983 -0.609 -0.034 368s 368s x1: [3, 3, 1, 3] 368s 0.982 0.160 -0.592 368s -0.434 0.010 -0.425 368s 0.650 -0.908 0.618 368s 368s y: [3, 3, 1, 3] 368s -0.264 -0.137 0.888 368s -0.330 0.282 -0.419 368s 0.396 -1.093 1.170 368s test-mul-mat0: iter:448/500 368s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 368s x0: [2, 3, 1, 1] 368s 0.041 0.021 368s -0.824 -0.993 368s -0.667 0.035 368s 368s x1: [2, 2, 1, 1] 368s -0.735 0.652 368s -0.351 0.360 368s 368s y: [3, 2, 1, 1] 368s -0.016 -0.041 0.514 368s -0.007 -0.069 0.247 368s testing: mul_mat, [3, 3, 2, 1] = [2, 3, 2, 1] * [2, 3, 2, 1] 368s x0: [2, 3, 2, 1] 368s 0.535 0.500 368s -0.591 -0.456 368s -0.349 0.464 368s 368s x1: [2, 3, 2, 1] 368s -0.409 0.072 368s -0.717 0.136 368s 0.312 -0.862 368s 368s y: [3, 3, 2, 1] 368s -0.183 0.209 0.176 368s -0.315 0.361 0.313 368s -0.264 0.209 -0.509 368s testing: mul_mat, [2, 3, 2, 4] = [2, 2, 2, 4] * [2, 3, 2, 4] 368s x0: [2, 2, 2, 4] 368s -0.378 -0.035 368s -0.962 -0.282 368s 368s x1: [2, 3, 2, 4] 368s -0.099 0.297 368s -0.300 -0.093 368s 0.630 0.735 368s 368s y: [2, 3, 2, 4] 368s 0.027 0.012 368s 0.117 0.315 368s -0.264 -0.813 368s testing: mul_mat, [1, 2, 1, 1] = [2, 1, 1, 1] * [2, 2, 1, 1] 368s x0: [2, 1, 1, 1] 368s 0.226 -0.234 368s 368s x1: [2, 2, 1, 1] 368s -0.924 -0.494 368s -0.052 0.557 368s 368s y: [1, 2, 1, 1] 368s -0.093 368s -0.142 368s testing: mul_mat, [1, 2, 2, 1] = [1, 1, 2, 1] * [1, 2, 2, 1] 368s x0: [1, 1, 2, 1] 368s -0.155 368s 368s x1: [1, 2, 2, 1] 368s 0.707 368s -0.821 368s 368s y: [1, 2, 2, 1] 368s -0.110 368s 0.127 368s testing: mul_mat, [3, 1, 2, 4] = [1, 3, 2, 4] * [1, 1, 2, 4] 368s x0: [1, 3, 2, 4] 368s -0.443 368s -0.134 368s 0.743 368s 368s x1: [1, 1, 2, 4] 368s -0.615 368s 368s y: [3, 1, 2, 4] 368s 0.272 0.083 -0.457 368s test-mul-mat0: iter:449/500 368s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 368s x0: [4, 4, 1, 1] 368s -0.759 -0.115 0.809 0.095 368s -0.575 -0.488 -0.976 -0.530 368s -0.837 -0.490 0.795 0.040 368s 0.763 -0.173 -0.903 -0.423 368s 368s x1: [4, 1, 1, 1] 368s 0.256 -0.530 -0.502 0.813 368s 368s y: [4, 1, 1, 1] 368s -0.462 0.171 -0.321 0.397 368s testing: mul_mat, [1, 4, 3, 1] = [4, 1, 3, 1] * [4, 4, 3, 1] 368s x0: [4, 1, 3, 1] 368s 0.003 -0.946 0.187 0.191 368s 368s x1: [4, 4, 3, 1] 368s 0.920 0.486 -0.763 -0.347 368s 0.260 0.419 0.459 0.106 368s 0.696 -0.669 -0.639 -0.835 368s -0.171 -0.826 0.501 0.070 368s 368s y: [1, 4, 3, 1] 368s -0.667 368s -0.290 368s 0.356 368s 0.888 368s testing: mul_mat, [4, 1, 3, 3] = [4, 4, 3, 3] * [4, 1, 3, 3] 368s x0: [4, 4, 3, 3] 368s 0.982 0.721 -0.903 0.735 368s 0.244 -0.798 -0.956 -0.466 368s 0.817 -0.590 -0.337 -0.998 368s -0.857 0.793 -0.760 0.062 368s 368s x1: [4, 1, 3, 3] 368s 0.391 -0.172 0.927 -0.333 368s 368s y: [4, 1, 3, 3] 368s -0.822 -0.498 0.441 -1.196 368s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 368s x0: [4, 2, 1, 1] 368s -0.095 -0.138 0.984 0.453 368s -0.392 0.969 -0.272 0.993 368s 368s x1: [4, 4, 1, 1] 368s -0.433 0.113 -0.586 -0.217 368s -0.828 -0.304 -0.395 0.033 368s 0.128 0.198 -0.331 0.873 368s -0.435 -0.942 -0.725 -0.201 368s 368s y: [2, 4, 1, 1] 368s -0.649 0.223 368s -0.253 0.170 368s 0.031 1.098 368s -0.633 -0.745 368s testing: mul_mat, [1, 4, 3, 1] = [2, 1, 3, 1] * [2, 4, 3, 1] 368s x0: [2, 1, 3, 1] 368s 0.265 0.719 368s 368s x1: [2, 4, 3, 1] 368s 0.487 -0.820 368s -0.107 0.400 368s -0.027 -0.541 368s 0.968 -0.915 368s 368s y: [1, 4, 3, 1] 368s -0.460 368s 0.260 368s -0.396 368s -0.401 368s testing: mul_mat, [3, 2, 3, 3] = [1, 3, 3, 3] * [1, 2, 3, 3] 368s x0: [1, 3, 3, 3] 368s 0.932 368s -0.595 368s -0.453 368s 368s x1: [1, 2, 3, 3] 368s 0.480 368s 0.352 368s 368s y: [3, 2, 3, 3] 368s 0.447 -0.285 -0.217 368s 0.328 -0.209 -0.159 368s test-mul-mat0: iter:450/500 368s testing: mul_mat, [4, 4, 1, 1] = [4, 4, 1, 1] * [4, 4, 1, 1] 368s x0: [4, 4, 1, 1] 368s 0.593 -0.045 -0.361 0.655 368s 0.356 -0.564 -0.988 -0.289 368s -0.811 0.361 0.577 0.857 368s -0.729 0.071 0.457 0.586 368s 368s x1: [4, 4, 1, 1] 368s 0.600 0.315 -0.959 0.935 368s 0.470 0.114 0.536 0.344 368s -0.304 0.129 -0.611 -0.220 368s -0.798 -0.214 -0.741 -0.846 368s 368s y: [4, 4, 1, 1] 368s 1.301 0.713 -0.124 -0.306 368s 0.305 -0.526 0.264 0.112 368s -0.110 0.486 -0.248 -0.177 368s -0.751 0.813 -0.581 -0.267 368s testing: mul_mat, [3, 4, 1, 1] = [4, 3, 1, 1] * [4, 4, 1, 1] 368s x0: [4, 3, 1, 1] 368s 0.722 0.208 -0.276 -0.842 368s 0.220 0.436 -0.652 -0.419 368s 0.012 -0.795 -0.149 -0.917 368s 368s x1: [4, 4, 1, 1] 368s 0.112 0.393 0.056 -0.775 368s -0.071 -0.601 -0.079 -0.943 368s -0.211 0.701 -0.741 0.575 368s 0.960 -0.587 0.083 0.554 368s 368s y: [3, 4, 1, 1] 368s 0.799 0.484 0.391 368s 0.638 0.169 1.353 368s -0.286 0.501 -0.977 368s 0.082 -0.331 -0.042 368s testing: mul_mat, [1, 3, 1, 3] = [4, 1, 1, 3] * [4, 3, 1, 3] 368s x0: [4, 1, 1, 3] 368s -0.527 0.238 -0.707 -0.486 368s 368s x1: [4, 3, 1, 3] 368s 0.662 -0.563 0.195 0.055 368s 0.493 0.420 0.984 0.892 368s -0.659 -0.959 -0.319 -0.958 368s 368s y: [1, 3, 1, 3] 368s -0.647 368s -1.289 368s 0.810 368s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 368s x0: [4, 1, 1, 1] 368s -0.356 -0.343 0.361 0.210 368s 368s x1: [4, 1, 1, 1] 368s 0.609 -0.426 -0.341 -0.584 368s 368s y: [1, 1, 1, 1] 368s -0.317 368s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 368s x0: [1, 3, 1, 1] 368s -0.520 368s 0.779 368s 0.057 368s 368s x1: [1, 4, 1, 1] 368s 0.911 368s 0.133 368s -0.374 368s 0.248 368s 368s y: [3, 4, 1, 1] 368s -0.474 0.710 0.052 368s -0.069 0.104 0.008 368s 0.194 -0.291 -0.021 368s -0.129 0.193 0.014 368s testing: mul_mat, [1, 1, 1, 3] = [3, 1, 1, 3] * [3, 1, 1, 3] 368s x0: [3, 1, 1, 3] 368s -0.303 -0.615 -0.330 368s 368s x1: [3, 1, 1, 3] 368s 0.826 0.731 0.530 368s 368s y: [1, 1, 1, 3] 368s -0.875 368s test-mul-mat0: iter:451/500 368s testing: mul_mat, [1, 4, 1, 1] = [3, 1, 1, 1] * [3, 4, 1, 1] 368s x0: [3, 1, 1, 1] 368s -0.269 -0.918 -0.880 368s 368s x1: [3, 4, 1, 1] 368s -0.022 0.533 0.730 368s -0.774 -0.899 -0.790 368s -0.994 0.158 -0.964 368s 0.737 -0.313 0.100 368s 368s y: [1, 4, 1, 1] 368s -1.126 368s 1.729 368s 0.972 368s 0.001 368s testing: mul_mat, [4, 1, 2, 1] = [3, 4, 2, 1] * [3, 1, 2, 1] 368s x0: [3, 4, 2, 1] 368s 0.154 -0.342 -0.624 368s -0.807 -0.448 0.443 368s -0.407 0.656 -0.578 368s -0.874 0.386 -0.352 368s 368s x1: [3, 1, 2, 1] 368s 0.588 0.239 -0.183 368s 368s y: [4, 1, 2, 1] 368s 0.123 -0.663 0.023 -0.357 368s testing: mul_mat, [1, 4, 2, 4] = [3, 1, 2, 4] * [3, 4, 2, 4] 368s x0: [3, 1, 2, 4] 368s 0.302 0.413 0.640 368s 368s x1: [3, 4, 2, 4] 368s -0.609 -0.947 0.088 368s -0.636 0.963 -0.504 368s -0.669 0.117 0.154 368s -0.293 0.309 0.706 368s 368s y: [1, 4, 2, 4] 368s -0.519 368s -0.118 368s -0.056 368s 0.491 368s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 368s x0: [3, 2, 1, 1] 368s 0.837 -0.269 0.047 368s 0.633 0.139 -0.629 368s 368s x1: [3, 1, 1, 1] 368s 0.219 0.444 0.750 368s 368s y: [2, 1, 1, 1] 368s 0.098 -0.272 368s testing: mul_mat, [1, 3, 2, 1] = [2, 1, 2, 1] * [2, 3, 2, 1] 368s x0: [2, 1, 2, 1] 368s 0.944 0.624 368s 368s x1: [2, 3, 2, 1] 368s -0.993 0.412 368s -0.893 -0.597 368s 0.860 -0.603 368s 368s y: [1, 3, 2, 1] 368s -0.681 368s -1.216 368s 0.436 368s testing: mul_mat, [3, 2, 2, 4] = [1, 3, 2, 4] * [1, 2, 2, 4] 368s x0: [1, 3, 2, 4] 368s -0.616 368s -0.557 368s 0.731 368s 368s x1: [1, 2, 2, 4] 368s 0.614 368s -0.972 368s 368s y: [3, 2, 2, 4] 368s -0.378 -0.342 0.449 368s 0.599 0.542 -0.711 368s test-mul-mat0: iter:452/500 368s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 368s x0: [3, 1, 1, 1] 368s -0.919 0.762 -0.718 368s 368s x1: [3, 2, 1, 1] 368s 0.458 -0.685 -0.969 368s 0.843 -0.242 0.763 368s 368s y: [1, 2, 1, 1] 368s -0.247 368s -1.506 368s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 368s x0: [3, 1, 1, 1] 368s 0.324 0.153 -0.787 368s 368s x1: [3, 1, 1, 1] 368s 0.118 0.731 0.942 368s 368s y: [1, 1, 1, 1] 368s -0.591 368s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 368s x0: [3, 1, 1, 1] 368s -0.123 -0.450 -0.084 368s 368s x1: [3, 1, 1, 1] 368s -0.469 -0.608 0.883 368s 368s y: [1, 1, 1, 1] 368s 0.257 368s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 368s x0: [3, 2, 1, 1] 368s 0.530 0.656 0.526 368s 0.211 -0.011 0.687 368s 368s x1: [3, 1, 1, 1] 368s 0.490 -0.947 0.478 368s 368s y: [2, 1, 1, 1] 368s -0.110 0.443 368s testing: mul_mat, [3, 3, 1, 1] = [2, 3, 1, 1] * [2, 3, 1, 1] 368s x0: [2, 3, 1, 1] 368s 0.483 0.581 368s -0.057 -0.733 368s 0.477 -0.370 368s 368s x1: [2, 3, 1, 1] 368s -0.169 -0.716 368s 0.450 0.254 368s -0.635 0.212 368s 368s y: [3, 3, 1, 1] 368s -0.498 0.534 0.184 368s 0.365 -0.212 0.121 368s -0.184 -0.119 -0.381 368s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 368s x0: [3, 1, 1, 1] 368s -0.807 0.389 -0.719 368s 368s x1: [3, 2, 1, 1] 368s 0.795 -0.202 0.022 368s 0.678 0.277 0.900 368s 368s y: [1, 2, 1, 1] 368s -0.736 368s -1.086 368s test-mul-mat0: iter:453/500 368s testing: mul_mat, [2, 2, 1, 1] = [1, 2, 1, 1] * [1, 2, 1, 1] 368s x0: [1, 2, 1, 1] 368s 0.691 368s 0.631 368s 368s x1: [1, 2, 1, 1] 368s -0.996 368s -0.200 368s 368s y: [2, 2, 1, 1] 368s -0.688 -0.629 368s -0.138 -0.126 368s testing: mul_mat, [1, 2, 3, 1] = [1, 1, 3, 1] * [1, 2, 3, 1] 368s x0: [1, 1, 3, 1] 368s 0.295 368s 368s x1: [1, 2, 3, 1] 368s -0.308 368s 0.141 368s 368s y: [1, 2, 3, 1] 368s -0ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s .091 368s 0.042 368s testing: mul_mat, [4, 1, 3, 1] = [1, 4, 3, 1] * [1, 1, 3, 1] 368s x0: [1, 4, 3, 1] 368s 0.550 368s 0.594 368s 0.637 368s 0.449 368s 368s x1: [1, 1, 3, 1] 368s 0.562 368s 368s y: [4, 1, 3, 1] 368s 0.309 0.334 0.358 0.253 368s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 368s x0: [1, 2, 1, 1] 368s 0.482 368s -0.978 368s 368s x1: [1, 4, 1, 1] 368s 0.190 368s -0.286 368s -0.210 368s -0.119 368s 368s y: [2, 4, 1, 1] 368s 0.092 -0.186 368s -0.138 0.280 368s -0.101 0.206 368s -0.057 0.116 368s testing: mul_mat, [2, 1, 3, 1] = [2, 2, 3, 1] * [2, 1, 3, 1] 368s x0: [2, 2, 3, 1] 368s 0.201 -0.925 368s 0.231 0.117 368s 368s x1: [2, 1, 3, 1] 368s 0.230 -0.461 368s 368s y: [2, 1, 3, 1] 368s 0.473 -0.001 368s testing: mul_mat, [4, 2, 3, 1] = [2, 4, 3, 1] * [2, 2, 3, 1] 368s x0: [2, 4, 3, 1] 368s 0.751 0.236 368s 0.567 -0.963 368s -0.510 -0.969 368s -0.170 -0.532 368s 368s x1: [2, 2, 3, 1] 368s -0.363 -0.441 368s 0.100 -0.449 368s 368s y: [4, 2, 3, 1] 368s -0.377 0.219 0.613 0.296 368s -0.031 0.489 0.384 0.222 368s test-mul-mat0: iter:454/500 368s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 368s x0: [2, 3, 1, 1] 368s -0.459 -0.953 368s 0.786 0.372 368s 0.284 0.823 368s 368s x1: [2, 1, 1, 1] 368s -0.210 -0.520 368s 368s y: [3, 1, 1, 1] 368s 0.591 -0.358 -0.487 368s testing: mul_mat, [4, 3, 4, 1] = [2, 4, 4, 1] * [2, 3, 4, 1] 368s x0: [2, 4, 4, 1] 368s 0.108 -0.031 368s 0.805 -0.520 368s -0.747 0.628 368s 0.883 -0.996 368s 368s x1: [2, 3, 4, 1] 368s 0.403 0.751 368s 0.936 -0.448 368s 0.811 0.674 368s 368s y: [4, 3, 4, 1] 368s 0.020 -0.066 0.171 -0.393 368s 0.115 0.987 -0.981 1.273 368s 0.067 0.302 -0.183 0.045 368s testing: mul_mat, [1, 4, 4, 4] = [2, 1, 4, 4] * [2, 4, 4, 4] 368s x0: [2, 1, 4, 4] 368s -0.403 -0.690 368s 368s x1: [2, 4, 4, 4] 368s 0.463 0.721 368s -0.158 0.715 368s 0.350 -0.276 368s 0.719 -0.086 368s 368s y: [1, 4, 4, 4] 368s -0.684 368s -0.429 368s 0.049 368s -0.231 368s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 368s x0: [2, 3, 1, 1] 368s 0.832 -0.181 368s -0.680 -0.048 368s 0.860 0.931 368s 368s x1: [2, 1, 1, 1] 368s -0.391 0.547 368s 368s y: [3, 1, 1, 1] 368s -0.424 0.239 0.173 368s testing: mul_mat, [1, 2, 4, 1] = [3, 1, 4, 1] * [3, 2, 4, 1] 368s x0: [3, 1, 4, 1] 368s 0.024 0.198 -0.918 368s 368s x1: [3, 2, 4, 1] 368s -0.901 0.209 0.961 368s 0.401 0.522 -0.663 368s 368s y: [1, 2, 4, 1] 368s -0.862 368s 0.721 368s testing: mul_mat, [4, 3, 4, 4] = [1, 4, 4, 4] * [1, 3, 4, 4] 368s x0: [1, 4, 4, 4] 368s -0.564 368s -0.711 368s -0.572 368s -0.159 368s 368s x1: [1, 3, 4, 4] 368s 0.426 368s 0.790 368s 0.319 368s 368s y: [4, 3, 4, 4] 368s -0.240 -0.303 -0.244 -0.068 368s -0.445 -0.562 -0.452 -0.126 368s -0.180 -0.227 -0.183 -0.051 368s test-mul-mat0: iter:455/500 368s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 368s x0: [3, 2, 1, 1] 368s -0.174 0.907 -0.687 368s -0.113 0.547 0.282 368s 368s x1: [3, 2, 1, 1] 368s 0.509 0.891 0.478 368s 0.413 -0.858 -0.494 368s 368s y: [2, 2, 1, 1] 368s 0.392 0.565 368s -0.511 -0.656 368s testing: mul_mat, [4, 2, 2, 1] = [3, 4, 2, 1] * [3, 2, 2, 1] 368s x0: [3, 4, 2, 1] 368s -0.792 0.941 -0.090 368s 0.020 0.153 -0.580 368s -0.090 -0.369 0.833 368s 0.052 0.137 0.506 368s 368s x1: [3, 2, 2, 1] 368s -0.908 -0.079 0.966 368s -0.116 0.433 0.773 368s 368s y: [4, 2, 2, 1] 368s 0.559 -0.590 0.915 0.431 368s 0.430 -0.384 0.494 0.445 368s testing: mul_mat, [2, 4, 2, 4] = [3, 2, 2, 4] * [3, 4, 2, 4] 368s x0: [3, 2, 2, 4] 368s 0.451 -0.546 -0.514 368s -0.345 -0.105 -0.452 368s 368s x1: [3, 4, 2, 4] 368s 0.021 0.926 -0.692 368s -0.589 0.068 -0.643 368s -0.935 0.276 -0.703 368s -0.025 -0.705 0.451 368s 368s y: [2, 4, 2, 4] 368s -0.141 0.208 368s 0.028 0.487 368s -0.212 0.611 368s 0.142 -0.121 368s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 368s x0: [3, 4, 1, 1] 368s -0.729 -0.258 -0.992 368s 0.454 0.788 -0.315 368s -0.978 -0.910 -0.124 368s 0.983 0.482 0.122 368s 368s x1: [3, 2, 1, 1] 368s 0.057 -0.401 0.055 368s -0.588 0.863 0.996 368s 368s y: [4, 2, 1, 1] 368s 0.007 -0.307 0.302 -0.130 368s -0.782 0.099 -0.333 -0.041 368s testing: mul_mat, [3, 3, 2, 1] = [4, 3, 2, 1] * [4, 3, 2, 1] 368s x0: [4, 3, 2, 1] 368s 0.128 0.813 0.640 -0.413 368s -0.222 0.654 0.661 -0.298 368s -0.381 0.741 0.288 0.042 368s 368s x1: [4, 3, 2, 1] 368s 0.827 -0.993 0.547 -0.053 368s 0.040 0.754 -0.933 0.483 368s 0.492 -0.461 -0.678 -0.952 368s 368s y: [3, 3, 2, 1] 368s -0.329 -0.456 -0.896 368s -0.179 -0.277 0.295 368s -0.352 -0.576 -0.764 368s testing: mul_mat, [2, 4, 2, 4] = [3, 2, 2, 4] * [3, 4, 2, 4] 368s x0: [3, 2, 2, 4] 368s -0.847 -0.779 0.809 368s -0.197 -0.705 -0.960 368s 368s x1: [3, 4, 2, 4] 368s 0.599 0.266 0.341 368s -0.577 -0.696 -0.362 368s 0.034 0.432 0.416 368s 0.653 0.244 0.070 368s 368s y: [2, 4, 2, 4] 368s -0.439 -0.633 368s 0.738 0.953 368s -0.029 -0.711 368s -0.687 -0.369 368s test-mul-mat0: iter:456ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s /500 368s testing: mul_mat, [4, 2, 1, 1] = [1, 4, 1, 1] * [1, 2, 1, 1] 368s x0: [1, 4, 1, 1] 368s 0.097 368s 0.748 368s 0.942 368s 0.477 368s 368s x1: [1, 2, 1, 1] 368s -0.514 368s -0.137 368s 368s y: [4, 2, 1, 1] 368s -0.050 -0.384 -0.484 -0.245 368s -0.013 -0.102 -0.129 -0.065 368s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 368s x0: [1, 1, 1, 1] 368s -0.512 368s 368s x1: [1, 4, 1, 1] 368s 0.411 368s -0.895 368s 0.872 368s 0.227 368s 368s y: [1, 4, 1, 1] 368s -0.211 368s 0.458 368s -0.447 368s -0.116 368s testing: mul_mat, [1, 1, 1, 4] = [1, 1, 1, 4] * [1, 1, 1, 4] 368s x0: [1, 1, 1, 4] 368s -0.336 368s 368s x1: [1, 1, 1, 4] 368s 0.338 368s 368s y: [1, 1, 1, 4] 368s -0.113 368s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 368s x0: [1, 2, 1, 1] 368s 0.772 368s -0.291 368s 368s x1: [1, 1, 1, 1] 368s -0.080 368s 368s y: [2, 1, 1, 1] 368s -0.061 0.023 368s testing: mul_mat, [1, 1, 1, 1] = [2, 1, 1, 1] * [2, 1, 1, 1] 368s x0: [2, 1, 1, 1] 368s 0.868 0.006 368s 368s x1: [2, 1, 1, 1] 368s 0.150 -0.118 368s 368s y: [1, 1, 1, 1] 368s 0.130 368s testing: mul_mat, [1, 2, 1, 4] = [1, 1, 1, 4] * [1, 2, 1, 4] 368s x0: [1, 1, 1, 4] 368s 0.964 368s 368s x1: [1, 2, 1, 4] 368s 0.517 368s -0.268 368s 368s y: [1, 2, 1, 4] 368s 0.499 368s -0.259 368s test-mul-mat0: iter:457/500 368s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 368s x0: [3, 3, 1, 1] 368s -0.521 -0.035 0.799 368s 0.485 -0.518 -0.469 368s -0.226 0.097 0.010 368s 368s x1: [3, 3, 1, 1] 368s 0.467 -0.740 -0.695 368s 0.934 0.825 0.225 368s -0.219 0.596 0.934 368s 368s y: [3, 3, 1, 1] 368s -0.773 0.936 -0.184 368s -0.336 -0.080 -0.129 368s 0.839 -0.854 0.117 368s testing: mul_mat, [4, 3, 4, 1] = [3, 4, 4, 1] * [3, 3, 4, 1] 368s x0: [3, 4, 4, 1] 368s -0.286 -0.220 -0.148 368s -0.284 0.979 0.681 368s -0.815 0.439 -0.186 368s 0.588 -0.278 0.173 368s 368s x1: [3, 3, 4, 1] 368s -0.994 0.188 -0.098 368s 0.342 0.152 -0.980 368s 0.749 0.604 -0.524 368s 368s y: [4, 3, 4, 1] 368s 0.257 0.399 0.911 -0.654 368s 0.014 -0.615 -0.030 -0.010 368s -0.270 0.023 -0.248 0.182 368s testing: mul_mat, [3, 4, 4, 2] = [3, 3, 4, 2] * [3, 4, 4, 2] 368s x0: [3, 3, 4, 2] 368s 0.719 0.626 -0.584 368s 0.461 -0.189 -0.826 368s 0.195 0.822 -0.045 368s 368s x1: [3, 4, 4, 2] 368s 0.136 -0.328 0.782 368s 0.596 -0.084 0.204 368s -0.109 -0.386 -0.751 368s -0.790 -0.332 -0.741 368s 368s y: [3, 4, 4, 2] 368s -0.564 -0.521 -0.278 368s 0.257 0.122 0.038 368s 0.119 0.644 -0.305 368s -0.342 0.311 -0.393 368s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 368s x0: [3, 4, 1, 1] 368s -0.988 0.155 -0.380 368s 0.554 -0.989 -0.396 368s 0.875 -0.433 0.396 368s -0.628 0.443 0.420 368s 368s x1: [3, 3, 1, 1] 368s 0.628 0.442 0.344 368s -0.934 -0.856 0.939 368s 0.535 0.777 -0.006 368s 368s y: [4, 3, 1, 1] 368s -0.683 -0.225 0.494 -0.054 368s 0.433 -0.042 -0.075 0.602 368s -0.406 -0.470 0.130 0.006 368s testing: mul_mat, [4, 3, 4, 1] = [4, 4, 4, 1] * [4, 3, 4, 1] 368s x0: [4, 4, 4, 1] 368s 0.709 0.088 -0.566 0.576 368s -0.870 -0.337 0.732 -0.019 368s -0.664 0.307 -0.257 -0.804 368s 0.503 -0.387 0.601 0.295 368s 368s x1: [4, 3, 4, 1] 368s -0.891 0.429 0.874 0.893 368s 0.715 0.175 0.777 0.211 368s 0.341 0.405 -0.347 -0.315 368s 368s y: [4, 3, 4, 1] 368s -0.574 1.254 -0.219 0.174 368s 0.204 -0.117 -0.791 0.821 368s 0.292 -0.682 0.240 -0.287 368s testing: mul_mat, [1, 4, 4, 2] = [4, 1, 4, 2] * [4, 4, 4, 2] 368s x0: [4, 1, 4, 2] 368s -0.416 0.617 -0.610 0.259 368s 368s x1: [4, 4, 4, 2] 368s 0.307 0.493 -0.310 -0.643 368s -0.082 0.238 0.840 -0.926 368s -0.471 -0.076 -0.193 -0.200 368s -0.796 -0.165 -0.815 0.991 368s 368s y: [1, 4, 4, 2] 368s 0.199 368s -0.571 368s 0.215 368s 0.984 368s test-mul-mat0: iter:458/500 368s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 368s x0: [2, 1, 1, 1] 368s 0.171 -0.474 368s 368s x1: [2, 4, 1, 1] 368s 0.169 -0.888 368s 0.273 -0.242 368s 0.021 -0.333 368s 0.243 0.738 368s 368s y: [1, 4, 1, 1] 368s 0.450 368s 0.161 368s 0.162 368s -0.309 368s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 368s x0: [2, 3, 1, 1] 368s -0.893 0.383 368s -0.600 0.221 368s 0.160 -0.749 368s 368s x1: [2, 1, 1, 1] 368s -0.734 -0.053 368s 368s y: [3, 1, 1, 1] 368s 0.636 0.429 -0.078 368s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 368s x0: [2, 2, 1, 1] 368s 0.648 0.663 368s -0.951 0.585 368s 368s x1: [2, 3, 1, 1] 368s 0.798 0.604 368s 0.826 0.740 368s 0.817 0.550 368s 368s y: [2, 3, 1, 1] 368s 0.918 -0.405 368s 1.026 -0.352 368s 0.894 -0.455 368s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 368s x0: [2, 3, 1, 1] 368s 0.525 -0.627 368s -0.167 -0.996 368s -0.080 0.446 368s 368s x1: [2, 2, 1, 1] 368s -0.168 -0.839 368s -0.142 0.590 368s 368s y: [3, 2, 1, 1] 368s 0.438 0.864 -0.361 368s -0.445 -0.564 0.275 368s testing: mul_mat, [3, 2, 1, 1] = [3, 3, 1, 1] * [3, 2, 1, 1] 368s x0: [3, 3, 1, 1] 368s 0.494 0.098ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s -0.085 368s 0.962 0.788 0.338 368s 0.764 0.504 0.769 368s 368s x1: [3, 2, 1, 1] 368s -0.361 -0.049 0.311 368s -0.255 -0.666 0.711 368s 368s y: [3, 2, 1, 1] 368s -0.210 -0.281 -0.062 368s -0.251 -0.529 0.017 368s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 368s x0: [3, 3, 1, 1] 368s -0.743 -0.739 0.212 368s -0.918 0.527 -0.161 368s -0.096 0.543 -0.712 368s 368s x1: [3, 3, 1, 1] 368s -0.437 0.001 0.818 368s -0.852 0.834 0.979 368s 0.006 0.424 0.161 368s 368s y: [3, 3, 1, 1] 368s 0.497 0.270 -0.540 368s 0.225 1.064 -0.163 368s -0.283 0.192 0.115 368s test-mul-mat0: iter:459/500 368s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 368s x0: [4, 3, 1, 1] 368s 0.088 -0.229 0.897 -0.751 368s 0.302 -0.846 -0.670 -0.794 368s -0.584 0.858 0.748 0.628 368s 368s x1: [4, 3, 1, 1] 368s -0.488 -0.982 -0.861 -0.700 368s 0.522 0.054 0.638 0.291 368s 0.617 -0.361 0.109 0.765 368s 368s y: [3, 3, 1, 1] 368s -0.064 1.816 -1.641 368s 0.387 -0.546 0.401 368s -0.341 -0.189 -0.109 368s testing: mul_mat, [1, 3, 2, 1] = [4, 1, 2, 1] * [4, 3, 2, 1] 368s x0: [4, 1, 2, 1] 368s 0.897 0.069 0.273 0.754 368s 368s x1: [4, 3, 2, 1] 368s -0.304 -0.964 0.175 0.246 368s -0.709 -0.785 0.758 -0.691 368s -0.646 -0.942 0.831 0.408 368s 368s y: [1, 3, 2, 1] 368s -0.105 368s -1.005 368s -0.109 368s testing: mul_mat, [3, 1, 2, 3] = [4, 3, 2, 3] * [4, 1, 2, 3] 368s x0: [4, 3, 2, 3] 368s 0.185 0.170 -0.819 -0.914 368s 0.621 0.035 -0.838 0.318 368s -0.820 -0.547 -0.227 0.634 368s 368s x1: [4, 1, 2, 3] 368s -0.924 0.697 0.145 -0.709 368s 368s y: [3, 1, 2, 3] 368s 0.477 -0.895 -0.106 368s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 368s x0: [4, 1, 1, 1] 368s -0.716 0.583 0.292 -0.954 368s 368s x1: [4, 3, 1, 1] 368s -0.059 0.937 0.217 -0.047 368s 0.074 0.703 -0.541 0.534 368s -0.580 0.000 0.953 0.236 368s 368s y: [1, 3, 1, 1] 368s 0.696 368s -0.310 368s 0.468 368s testing: mul_mat, [4, 4, 2, 1] = [1, 4, 2, 1] * [1, 4, 2, 1] 368s x0: [1, 4, 2, 1] 368s -0.023 368s 0.769 368s -0.248 368s -0.305 368s 368s x1: [1, 4, 2, 1] 368s 0.755 368s -0.995 368s -0.346 368s 0.115 368s 368s y: [4, 4, 2, 1] 368s -0.017 0.581 -0.187 -0.230 368s 0.023 -0.765 0.247 0.303 368s 0.008 -0.266 0.086 0.106 368s -0.003 0.088 -0.028 -0.035 368s testing: mul_mat, [1, 1, 2, 3] = [4, 1, 2, 3] * [4, 1, 2, 3] 368s x0: [4, 1, 2, 3] 368s 0.745 -0.802 -0.874 0.439 368s 368s x1: [4, 1, 2, 3] 368s -0.353 0.706 -0.427 0.106 368s 368s y: [1, 1, 2, 3] 368s -0.411 368s test-mul-mat0: iter:460/500 368s testing: mul_mat, [4, 2, 1, 1] = [1, 4, 1, 1] * [1, 2, 1, 1] 368s x0: [1, 4, 1, 1] 368s -0.655 368s 0.078 368s -0.130 368s 0.784 368s 368s x1: [1, 2, 1, 1] 368s -0.400 368s -0.119 368s 368s y: [4, 2, 1, 1] 368s 0.262 -0.031 0.052 -0.313 368s 0.078 -0.009 0.015 -0.093 368s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 368s x0: [1, 1, 1, 1] 368s 0.061 368s 368s x1: [1, 4, 1, 1] 368s 0.909 368s 0.866 368s 0.870 368s -0.555 368s 368s y: [1, 4, 1, 1] 368s 0.055 368s 0.053 368s 0.053 368s -0.034 368s testing: mul_mat, [3, 1, 1, 4] = [1, 3, 1, 4] * [1, 1, 1, 4] 368s x0: [1, 3, 1, 4] 368s -0.311 368s 0.641 368s 0.394 368s 368s x1: [1, 1, 1, 4] 368s -0.778 368s 368s y: [3, 1, 1, 4] 368s 0.242 -0.499 -0.307 368s testing: mul_mat, [2, 3, 1, 1] = [1, 2, 1, 1] * [1, 3, 1, 1] 368s x0: [1, 2, 1, 1] 368s -0.681 368s 0.757 368s 368s x1: [1, 3, 1, 1] 368s -0.830 368s 0.974 368s -0.321 368s 368s y: [2, 3, 1, 1] 368s 0.565 -0.628 368s -0.663 0.737 368s 0.219 -0.243 368s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 368s x0: [2, 4, 1, 1] 368s 0.651 0.034 368s 0.973 -0.668 368s -0.889 0.086 368s 0.087 0.392 368s 368s x1: [2, 1, 1, 1] 368s 0.785 -0.897 368s 368s y: [4, 1, 1, 1] 368s 0.481 1.363 -0.775 -0.283 368s testing: mul_mat, [2, 2, 1, 4] = [4, 2, 1, 4] * [4, 2, 1, 4] 368s x0: [4, 2, 1, 4] 368s -0.201 0.495 0.070 0.979 368s -0.711 -0.776 -0.668 0.617 368s 368s x1: [4, 2, 1, 4] 368s 0.525 0.876 -0.919 0.166 368s 0.271 0.425 -0.219 -0.163 368s 368s y: [2, 2, 1, 4] 368s 0.427 -0.337 368s -0.019 -0.477 368s test-mul-mat0: iter:461/500 368s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 368s x0: [4, 4, 1, 1] 368s 0.520 -0.219 -0.610 -0.045 368s 0.156 -0.416 -0.397 0.019 368s 0.300 -0.430 -0.962 -0.809 368s -0.171 -0.676 -0.657 -0.588 368s 368s x1: [4, 1, 1, 1] 368s -0.448 0.303 0.982 -0.860 368s 368s y: [4, 1, 1, 1] 368s -0.859 -0.602 -0.512 -0.267 368s testing: mul_mat, [1, 4, 3, 1] = [4, 1, 3, 1] * [4, 4, 3, 1] 368s x0: [4, 1, 3, 1] 368s 0.151 -0.120 0.325 0.082 368s 368s x1: [4, 4, 3, 1] 368s -0.793 -0.156 -0.287 0.213 368s 0.569 0.812 -0.493 0.788 368s 0.136 -0.129 -0.660 -0.562 368s -0.147 -0.520 0.373 -0.627 368s 368s y: [1, 4, 3, 1] 368s -0.177 368s -0.108 368s -0.225 368s 0.110 368s testing: mul_mat, [1, 1, 3, 2] = [4, 1, 3, 2] * [4, 1, 3, 2] 368s x0: [4, 1, 3, 2] 368s -0.779 -0.639 0.925 0.135 368s 368s x1: [4, 1, 3, 2] 368s -0.344 0.594 -0.986 0.112 368s 368s y: [1, 1, 3, 2] 368s -1.010 368s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 368s x0: [4, 1, 1, 1] 368s 0.337 0.613 0.429 0.698 368s 368s x1: [4, 1, 1, 1] 368s -0.648 -0.430 -0.232 -0.503 368s 368s y: [1, 1, 1, 1] 368s -0.933 368s testing: mul_mat, [3, 4, 3, 1] = [1, 3, 3, 1] * [1, 4, 3, 1] 368s x0: [1, 3, 3, 1] 368s -0.392 368s -0.530 368s 0.667 368s 368s x1: [1, 4, 3, 1] 368s 0.538 368s -0.436 368s 0.648 368s -0.547 368s 368s y: [3, 4, 3, 1] 368s -0.211 -0.285 0.359 368s 0.171 0.231 -0.291 368s -0.254 -0.343 0.432 368s 0.214 0.290 -0.365 368s testing: mul_mat, [1, 1, 3, 2] = [3, 1, 3, 2] * [3, 1, 3, 2] 368s x0: [3, 1, 3, 2] 368s 0.023 0.959 -0.982 368s 368s x1: [3, 1, 3, 2] 368s -0.814 -0.335 -0.910 368s 368s y: [1, 1, 3, 2] 368s 0.554 368s test-mul-mat0: iter:462/500 368s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 368s x0: [3, 4, 1, 1] 368s -0.284 0.294 -0.963 368s -0.325 0.312 0.668 368s 0.105 -0.003 0.599 368s -0.385 0.490 0.364 368s 368s x1: [3, 2, 1, 1] 368s 0.311 0.233 0.046 368s 0.533 0.200 -0.287 368s 368s y: [4, 2, 1, 1] 368s -0.065 0.003 0.060 0.011 368s 0.184 -0.303 -0.117 -0.212 368s testing: mul_mat, [4, 4, 2, 1] = [3, 4, 2, 1] * [3, 4, 2, 1] 368s x0: [3, 4, 2, 1] 368s 0.933 -0.002 0.239 368s -0.452 -0.513 -0.397 368s -0.233 0.792 0.918 368s 0.256 0.003 0.865 368s 368s x1: [3, 4, 2, 1] 368s -0.781 0.305 0.315 368s -0.511 0.211 0.947 368s -0.368 -0.456 -0.125 368s -0.923 0.003 0.636 368s 368s y: [4, 4, 2, 1] 368s -0.654 0.072 0.712 0.073 368s -0.251 -0.253 1.155 0.689 368s -0.372 0.450 -0.390 -0.204 368s -0.710 0.163 0.801 0.314 368s testing: mul_mat, [2, 4, 2, 3] = [3, 2, 2, 3] * [3, 4, 2, 3] 368s x0: [3, 2, 2, 3] 368s -0.245 0.269 -0.100 368s -0.698 -0.042 0.997 368s 368s x1: [3, 4, 2, 3] 368s -0.792 -0.623 0.143 368s -0.820 0.205 0.144 368s 0.819 0.138 -0.859 368s 0.058 0.685 -0.372 368s 368s y: [2, 4, 2, 3] 368s 0.012 0.721 368s 0.242 0.707 368s -0.078 -1.434 368s 0.208 -0.440 368s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 368s x0: [3, 1, 1, 1] 368s -0.921 -0.622 0.996 368s 368s x1: [3, 2, 1, 1] 368s -0.739 -0.808 0.695 368s -0.050 0.405 0.100 368s 368s y: [1, 2, 1, 1] 368s 1.876 368s -0.106 368s testing: mul_mat, [4, 3, 2, 1] = [1, 4, 2, 1] * [1, 3, 2, 1] 368s x0: [1, 4, 2, 1] 368s -0.419 368s -0.843 368s -0.499 368s -0.183 368s 368s x1: [1, 3, 2, 1] 368s -0.478 368s 0.147 368s -0.551 368s 368s y: [4, 3, 2, 1] 368s 0.200 0.403 0.239 0.088 368s -0.061 -0.124 -0.073 -0.027 368s 0.231 0.464 0.275 0.101 368s testing: mul_mat, [2, 1, 2, 3] = [4, 2, 2, 3] * [4, 1, 2, 3] 368s x0: [4, 2, 2, 3] 368s 0.408 -0.293 0.885 0.479 368s 0.204 0.762 -0.537 0.184 368s 368s x1: [4, 1, 2, 3] 368s 0.717 0.300 -0.597 0.981 368s 368s y: [2, 1, 2, 3] 368s 0.147 0.876 368s test-mul-mat0: iter:463/500 368s testing: mul_mat, [4, 4, 1, 1] = [4, 4, 1, 1] * [4, 4, 1, 1] 368s x0: [4, 4, 1, 1] 368s 0.058 0.873 0.376 0.825 368s 0.871 0.729 -0.590 -0.550 368s -0.024 -0.370 -0.587 -0.802 368s -0.621 -0.056 -0.524 -0.922 368s 368s x1: [4, 4, 1, 1] 368s -0.723 0.698 0.428 -0.140 368s 0.332 -0.869 0.615 0.874 368s 0.221 -0.299 0.824 0.515 368s -0.506 0.535 0.694 -0.250 368s 368s y: [4, 4, 1, 1] 368s 0.613 -0.295 -0.379 0.315 368s 0.213 -1.188 -0.748 -1.286 368s 0.487 -0.795 -0.791 -1.028 368s 0.493 -0.322 -0.393 0.151 368s testing: mul_mat, [1, 4, 1, 1] = [4, 1, 1, 1] * [4, 4, 1, 1] 368s x0: [4, 1, 1, 1] 368s -0.255 0.816 -0.360 -0.526 368s 368s x1: [4, 4, 1, 1] 368s -0.628 0.336 0.410 -0.497 368s -0.049 0.284 0.723 0.652 368s 0.108 0.239 -0.854 -0.357 368s -0.067 -0.104 0.369 0.990 368s 368s y: [1, 4, 1, 1] 368s 0.548 368s -0.359 368s 0.662 368s -0.721 368s testing: mul_mat, [1, 1, 1, 2] = [4, 1, 1, 2] * [4, 1, 1, 2] 368s x0: [4, 1, 1, 2] 368s 0.313 -0.180 0.460 -0.277 368s 368s x1: [4, 1, 1, 2] 368s -0.774 0.090 0.450 -0.144 368s 368s y: [1, 1, 1, 2] 368s -0.012 368s testing: mul_mat, [3, 1, 1, 1] = [4, 3, 1, 1] * [4, 1, 1, 1] 368s x0: [4, 3, 1, 1] 368s 0.217 0.208 0.024 0.250 368s 0.103 0.872 -0.487 0.603 368s 0.127 0.872 -0.654 0.796 368s 368s x1: [4, 1, 1, 1] 368s -0.938 0.115 -0.715 -0.792 368s 368s y: [3, 1, 1, 1] 368s -0.396 -0.126 -0.182 368s testing: mul_mat, [1, 4, 1, 1] = [3, 1, 1, 1] * [3, 4, 1, 1] 368s x0: [3, 1, 1, 1] 368s -0.258 0.500 -0.500 368s 368s x1: [3, 4, 1, 1] 368s -0.894 0.106 0.444 368s 0.342 0.554 0.568 368s -0.345 -0.626 0.028 368s 0.378 0.696 0.438 368s 368s y: [1, 4, 1, 1] 368s 0.061 368s -0.096 368s -0.238 368s 0.031 368s testing: mul_mat, [1, 3, 1, 2] = [1, 1, 1, 2] * [1, 3, 1, 2] 368s x0: [1, 1, 1, 2] 368s -0.316 368s 368s x1: [1, 3, 1, 2] 368s 0.026 368s 0.708 368s -0.742 368s 368s y: [1, 3, 1, 2] 368s -0.008 368s -0.224 368s 0.235 368s test-mul-mat0: iter:464/500 368s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 368s x0: [4, 4, 1, 1] 368s 0.174 0.459 -0.588 0.829 368s 0.832 0.440 0.207 0.528 368s -0.122 -0.408 -0.730 -0.622 368s 0.092 0.297 -0.914 0.350 368s 368s x1: [4, 1, 1, 1] 368s -0.201 0.399 0.832 0.905 368s 368s y: [4, 1, 1, 1] 368s 0.409 0.658 -1.308 -0.345 368s testing: mul_mat, [3, 4, 4, 1] = [4, 3, 4, 1] * [4, 4, 4, 1] 368s x0: [4, 3, 4, 1] 368s 0.590 0.321 -0.422 0.457 368s 0.605 0.166 -0.482 0.465 368s -0.254 -0.394 -0.196 -0.839 368s 368s x1: [4, 4, 4, 1] 368s 0.541 -0.103 -0.266 -0.008 368s 0.581 0.990 0.468 -0.223 368s 0.593 0.194 0.575 -0.008 368s 0.027 0.480 0.835 -0.799 368s 368s y: [3, 4, 4, 1] 368s 0.395 0.435 -0.038 368s 0.361 0.186 -0.443 368s 0.165 0.110 -0.333 368s -0.548 -0.678 0.311 368s testing: mul_mat, [4, 3, 4, 3] = [4, 4, 4, 3] * [4, 3, 4, 3] 368s x0: [4, 4, 4, 3] 368s 0.499 -0.984 -0.091 0.844 368s -0.704 0.607 0.296 -0.205 368s -0.969 0.373 0.659 -0.029 368s -0.689 0.400 -0.357 0.993 368s 368s x1: [4, 3, 4, 3] 368s -0.731 0.418 0.536 0.356 368s 0.573 -0.066 0.301 -0.966 368s -0.885 0.134 0.073 -0.512 368s 368s y: [4, 3, 4, 3] 368s -0.525 0.854 1.208 0.834 368s -0.492 -0.156 -0.353 -1.489 368s -1.012 0.831 0.971 0.129 368s testing: mul_mat, [3, 4, 1, 1] = [4, 3, 1, 1] * [4, 4, 1, 1] 368s x0: [4, 3, 1, 1] 368s 0.202 -0.977 0.556 0.112 368s -0.457 0.235 0.313 -0.505 368s -0.043 0.941 0.355 0.017 368s 368s x1: [4, 4, 1, 1] 368s 0.062 0.563 0.876 -0.882 368s -0.232 -0.788 0.088 0.130 368s 0.598 -0.446 -0.623 -0.676 368s -0.932 0.139 -0.965 0.237 368s 368s y: [3, 4, 1, 1] 368s -0.150 0.824 0.823 368s 0.786 -0.117 -0.698 368s 0.134 -0.231 -0.678 368s -0.833 0.037 -0.168 368s testing: mul_mat, [2, 4, 4, 1] = [3, 2, 4, 1] * [3, 4, 4, 1] 368s x0: [3, 2, 4, 1] 368s 0.207 0.589 0.229 368s -0.826 -0.657 -0.489 368s 368s x1: [3, 4, 4, 1] 368s -0.811 0.784 -0.921 368s 0.752 0.660 -0.803 368s -0.480 0.873 0.285 368s 0.650 0.470 0.839 368s 368s y: [2, 4, 4, 1] 368s 0.083 0.605 368s 0.360 -0.663 368s 0.480 -0.317 368s 0.603 -1.257 368s testing: mul_mat, [3, 3, 4, 3] = [2, 3, 4, 3] * [2, 3, 4, 3] 368s x0: [2, 3, 4, 3] 368s 0.952 0.990 368s 0.896 0.442 368s 0.796 0.700 368s 368s x1: [2, 3, 4, 3] 368s -0.584 0.022 368s 0.035 0.173 368s -0.950 -0.823 368s 368s y: [3, 3, 4, 3] 368s -0.533 -0.513 -0.449 368s 0.205 0.108 0.149 368s -1.719 -1.215 -1.332 368s test-mul-mat0: iter:465/500 368s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 368s x0: [1, 2, 1, 1] 368s -0.301 368s -0.189 368s 368s x1: [1, 4, 1, 1] 368s 0.075 368s -0.255 368s 0.534 368s -0.344 368s 368s y: [2, 4, 1, 1] 368s -0.023 -0.014 368s 0.077 0.048 368s -0.161 -0.101 368s 0.104 0.065 368s testing: mul_mat, [2, 2, 4, 1] = [1, 2, 4, 1] * [1, 2, 4, 1] 368s x0: [1, 2, 4, 1] 368s 0.705 368s -0.986 368s 368s x1: [1, 2, 4, 1] 368s 0.022 368s -0.242 368s 368s y: [2, 2, 4, 1] 368s 0.016 -0.022 368s -0.171 0.239 368s testing: mul_mat, [4, 2, 4, 2] = [1, 4, 4, 2] * [1, 2, 4, 2] 368s x0: [1, 4, 4, 2] 368s 0.671 368s -0.044 368s 0.044 368s -0.948 368s 368s x1: [1, 2, 4, 2] 368s -0.665 368s 0.441 368s 368s y: [4, 2, 4, 2] 368s -0.446 0.029 -0.029 0.631 368s 0.296 -0.019 0.019 -0.418 368s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 368s x0: [1, 2, 1, 1] 368s 0.831 368s 0.797 368s 368s x1: [1, 4, 1, 1] 368s -0.674 368s 0.713 368s -0.926 368s 0.055 368s 368s y: [2, 4, 1, 1] 368s -0.560 -0.537 368s 0.593 0.568 368s -0.770 -0.738 368s 0.046 0.044 368s testing: mul_mat, [3, 1, 4, 1] = [2, 3, 4, 1] * [2, 1, 4, 1] 368s x0: [2, 3, 4, 1] 368s 0.355 -0.916 368s -0.523 -0.838 368s 0.077 -0.408 368s 368s x1: [2, 1, 4, 1] 368s 0.997 0.099 368s 368s y: [3, 1, 4, 1] 368s 0.262 -0.604 0.036 368s testing: mul_mat, [3, 2, 4, 2] = [3, 3, 4, 2] * [3, 2, 4, 2] 368s x0: [3, 3, 4, 2] 368s -0.789 -0.210 -0.170 368s -0.042 -0.472 0.577 368s 0.386 0.599 0.677 368s 368s x1: [3, 2, 4, 2] 368s -0.466 0.987 -0.744 368s -0.853 -0.426 -0.675 368s 368s y: [3, 2, 4, 2] 368s 0.287 -0.876 -0.093 368s 0.878 -0.152 -1.041 368s test-mul-mat0: iter:466/500 368s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 368s x0: [3, 2, 1, 1] 368s -0.938 0.814 0.353 368s 0.268 0.022 0.658 368s 368s x1: [3, 3, 1, 1] 368s 0.484 -0.099 -0.021 368s 0.212 0.321 0.896 368s 0.418 0.018 0.111 368s 368s y: [2, 3, 1, 1] 368s -0.542 0.113 368s 0.378 0.653 368s -0.339 0.185 368s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 368s x0: [3, 4, 1, 1] 368s -0.856 -0.546 0.450 368s -0.590 0.214 0.252 368s 0.060 0.198 -0.264 368s 0.961 -0.823 0.948 368s 368s x1: [3, 2, 1, 1] 368s -0.670 -0.054 -0.584 368s -0.364 -0.623 -0.844 368s 368s y: [4, 2, 1, 1] 368s 0.340 0.236 0.103 -1.153 368s 0.271 -0.132 0.077 -0.637 368s testing: mul_mat, [2, 4, 1, 2] = [3, 2, 1, 2] * [3, 4, 1, 2] 368s x0: [3, 2, 1, 2] 368s -0.044 -0.640 0.819 368s -0.083 -0.463 0.768 368s 368s x1: [3, 4, 1, 2] 368s 0.282 -0.927 0.366 368s -0.700ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s 0.184 0.910 368s -0.638 -0.003 0.263 368s 0.630 -0.981 -0.079 368s 368s y: [2, 4, 1, 2] 368s 0.881 0.687 368s 0.659 0.672 368s 0.246 0.256 368s 0.536 0.342 368s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 368s x0: [3, 4, 1, 1] 368s 0.774 -0.939 0.260 368s -0.433 0.429 -0.503 368s 0.049 -0.398 0.171 368s 0.116 -0.991 -0.657 368s 368s x1: [3, 2, 1, 1] 368s -0.139 -0.210 -0.692 368s -0.509 -0.191 0.229 368s 368s y: [4, 2, 1, 1] 368s -0.091 0.318 -0.042 0.646 368s -0.155 0.023 0.090 -0.020 368s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 368s x0: [4, 4, 1, 1] 368s 0.262 0.646 0.923 0.563 368s -0.583 -0.125 -0.038 -0.718 368s -0.577 0.846 -0.727 -0.387 368s -0.929 -0.788 0.353 -0.939 368s 368s x1: [4, 3, 1, 1] 368s 0.454 0.531 -0.837 -0.629 368s -0.932 0.931 -0.429 -0.322 368s 0.065 0.070 -0.528 -0.891 368s 368s y: [4, 3, 1, 1] 368s -0.666 0.153 1.040 -0.544 368s -0.221 0.675 1.761 0.283 368s -0.928 0.614 0.751 0.535 368s testing: mul_mat, [1, 4, 1, 2] = [4, 1, 1, 2] * [4, 4, 1, 2] 368s x0: [4, 1, 1, 2] 368s 0.481 0.488 -0.280 0.367 368s 368s x1: [4, 4, 1, 2] 368s 0.452 -0.044 0.515 -0.017 368s 0.119 0.886 0.052 0.050 368s -0.544 0.730 -0.886 0.526 368s -0.799 -0.776 0.458 0.464 368s 368s y: [1, 4, 1, 2] 368s 0.046 368s 0.493 368s 0.536 368s -0.722 368s test-mul-mat0: iter:467/500 368s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 368s x0: [3, 3, 1, 1] 368s 0.192 -0.698 -0.845 368s 0.098 0.354 -0.201 368s 0.065 0.859 -0.601 368s 368s x1: [3, 3, 1, 1] 368s -0.193 0.935 -0.253 368s -0.312 0.470 0.033 368s 0.444 -0.150 0.531 368s 368s y: [3, 3, 1, 1] 368s -0.476 0.363 0.943 368s -0.416 0.129 0.364 368s -0.259 -0.116 -0.419 368s testing: mul_mat, [2, 3, 3, 1] = [3, 2, 3, 1] * [3, 3, 3, 1] 368s x0: [3, 2, 3, 1] 368s -0.761 0.707 0.499 368s -0.216 0.054 -0.381 368s 368s x1: [3, 3, 3, 1] 368s -0.454 0.347 0.119 368s 0.912 0.887 -0.112 368s 0.735 -0.843 -0.228 368s 368s y: [2, 3, 3, 1] 368s 0.651 0.072 368s -0.123 -0.106 368s -1.269 -0.117 368s testing: mul_mat, [1, 2, 3, 2] = [3, 1, 3, 2] * [3, 2, 3, 2] 368s x0: [3, 1, 3, 2] 368s 0.683 0.614 -0.386 368s 368s x1: [3, 2, 3, 2] 368s 0.986 -0.676 0.445 368s -0.990 0.208 0.636 368s 368s y: [1, 2, 3, 2] 368s 0.087 368s -0.794 368s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 368s x0: [3, 4, 1, 1] 368s -0.311 0.896 -0.573 368s -0.225 0.942 0.417 368s 0.093 0.734 0.840 368s -0.793 -0.774 0.042 368s 368s x1: [3, 1, 1, 1] 368s 0.070 0.158 0.977 368s 368s y: [4, 1, 1, 1] 368s -0.440 0.541 0.943 -0.137 368s testing: mul_mat, [3, 3, 3, 1] = [4, 3, 3, 1] * [4, 3, 3, 1] 368s x0: [4, 3, 3, 1] 368s -0.591 -0.156 -0.112 -0.998 368s 0.314 -0.333 0.054 -0.401 368s 0.767 0.379 -0.538 0.522 368s 368s x1: [4, 3, 3, 1] 368s 0.557 0.911 0.633 0.254 368s -0.321 0.907 -0.993 -0.073 368s -0.080 -0.566 0.353 0.612 368s 368s y: [3, 3, 3, 1] 368s -0.795 -0.195 0.564 368s 0.232 -0.428 0.594 368s -0.515 -0.063 -0.146 368s testing: mul_mat, [2, 4, 3, 2] = [3, 2, 3, 2] * [3, 4, 3, 2] 368s x0: [3, 2, 3, 2] 368s -0.283 0.671 -0.769 368s 0.966 0.509 -0.888 368s 368s x1: [3, 4, 3, 2] 368s 0.043 -0.265 0.974 368s 0.505 -0.263 -0.427 368s 0.028 0.542 0.407 368s -0.615 -0.757 -0.318 368s 368s y: [2, 4, 3, 2] 368s -0.939 -0.957 368s 0.009 0.733 368s 0.043 -0.059 368s -0.090 -0.697 368s test-mul-mat0: iter:468/500 368s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 368s x0: [1, 2, 1, 1] 368s -0.588 368s 0.451 368s 368s x1: [1, 4, 1, 1] 368s -0.036 368s 0.402 368s 0.738 368s 0.075 368s 368s y: [2, 4, 1, 1] 368s 0.021 -0.016 368s -0.236 0.181 368s -0.434 0.333 368s -0.044 0.034 368s testing: mul_mat, [4, 2, 2, 1] = [1, 4, 2, 1] * [1, 2, 2, 1] 368s x0: [1, 4, 2, 1] 368s -0.009 368s 0.583 368s -0.568 368s 0.992 368s 368s x1: [1, 2, 2, 1] 368s -0.526 368s -0.068 368s 368s y: [4, 2, 2, 1] 368s 0.005 -0.307 0.299 -0.522 368s 0.001 -0.040 0.039 -0.068 368s testing: mul_mat, [4, 4, 2, 2] = [1, 4, 2, 2] * [1, 4, 2, 2] 368s x0: [1, 4, 2, 2] 368s 0.912 368s 0.235 368s -0.434 368s -0.182 368s 368s x1: [1, 4, 2, 2] 368s -0.893 368s -0.698 368s 0.593 368s -0.979 368s 368s y: [4, 4, 2, 2] 368s -0.814 -0.210 0.388 0.163 368s -0.637 -0.164 0.303 0.127 368s 0.541 0.140 -0.258 -0.108 368s -0.893 -0.230 0.425 0.178 368s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 368s x0: [1, 2, 1, 1] 368s 0.262 368s -0.223 368s 368s x1: [1, 4, 1, 1] 368s -0.914 368s 0.936 368s -0.622 368s -0.644 368s 368s y: [2, 4, 1, 1] 368s -0.240 0.204 368s 0.246 -0.209 368s -0.163 0.139 368s -0.169 0.144 368s testing: mul_mat, [4, 1, 2, 1] = [2, 4, 2, 1] * [2, 1, 2, 1] 368s x0: [2, 4, 2, 1] 368s -0.376 0.839 368s -0.396 -0.834 368s 0.601 0.354 368s -0.464 -0.654 368s 368s x1: [2, 1, 2, 1] 368s -0.729 -0.254 368s 368s y: [4, 1, 2, 1] 368s 0.060 0.501 -0.528 0.504 368s testing: mul_mat, [2, 2, 2, 2] = [4, 2, 2, 2] * [4, ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s 2, 2, 2] 368s x0: [4, 2, 2, 2] 368s 0.865 0.172 -0.844 -0.432 368s -0.282 -0.415 0.967 -0.628 368s 368s x1: [4, 2, 2, 2] 368s -0.003 -0.243 -0.657 0.083 368s -0.307 -0.279 0.439 -0.204 368s 368s y: [2, 2, 2, 2] 368s 0.474 -0.586 368s -0.595 0.755 368s test-mul-mat0: iter:469/500 368s testing: mul_mat, [1, 4, 1, 1] = [3, 1, 1, 1] * [3, 4, 1, 1] 368s x0: [3, 1, 1, 1] 368s -0.868 -0.855 -0.441 368s 368s x1: [3, 4, 1, 1] 368s -0.185 0.803 -0.841 368s 0.850 0.100 -0.090 368s 0.793 0.092 -0.336 368s 0.248 0.381 0.958 368s 368s y: [1, 4, 1, 1] 368s -0.155 368s -0.783 368s -0.619 368s -0.963 368s testing: mul_mat, [1, 1, 4, 1] = [3, 1, 4, 1] * [3, 1, 4, 1] 368s x0: [3, 1, 4, 1] 368s -0.678 -0.196 -0.254 368s 368s x1: [3, 1, 4, 1] 368s -0.270 -0.611 0.198 368s 368s y: [1, 1, 4, 1] 368s 0.253 368s testing: mul_mat, [1, 1, 4, 1] = [3, 1, 4, 1] * [3, 1, 4, 1] 368s x0: [3, 1, 4, 1] 368s 0.117 0.311 0.257 368s 368s x1: [3, 1, 4, 1] 368s 0.698 -0.506 0.692 368s 368s y: [1, 1, 4, 1] 368s 0.102 368s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 368s x0: [3, 4, 1, 1] 368s -0.499 -0.754 -0.718 368s -0.445 0.722 -0.337 368s 0.892 0.635 0.945 368s -0.005 -0.503 0.050 368s 368s x1: [3, 1, 1, 1] 368s 0.876 0.965 -0.082 368s 368s y: [4, 1, 1, 1] 368s -1.106 0.335 1.317 -0.494 368s testing: mul_mat, [1, 3, 4, 1] = [4, 1, 4, 1] * [4, 3, 4, 1] 368s x0: [4, 1, 4, 1] 368s 0.622 -0.886 -0.382 0.425 368s 368s x1: [4, 3, 4, 1] 368s 0.344 0.869 0.224 -0.539 368s 0.179 -0.520 -0.335 0.507 368s 0.668 0.190 -0.362 0.602 368s 368s y: [1, 3, 4, 1] 368s -0.871 368s 0.915 368s 0.640 368s testing: mul_mat, [2, 4, 4, 1] = [1, 2, 4, 1] * [1, 4, 4, 1] 368s x0: [1, 2, 4, 1] 368s 0.920 368s -0.376 368s 368s x1: [1, 4, 4, 1] 368s -0.239 368s -0.385 368s -0.739 368s 0.722 368s 368s y: [2, 4, 4, 1] 368s -0.220 0.090 368s -0.354 0.145 368s -0.679 0.278 368s 0.664 -0.272 368s test-mul-mat0: iter:470/500 368s testing: mul_mat, [3, 1, 1, 1] = [3, 3, 1, 1] * [3, 1, 1, 1] 368s x0: [3, 3, 1, 1] 368s 0.910 -0.784 0.672 368s -0.599 -0.910 -0.377 368s -0.275 0.957 0.792 368s 368s x1: [3, 1, 1, 1] 368s 0.593 0.648 -0.506 368s 368s y: [3, 1, 1, 1] 368s -0.309 -0.754 0.056 368s testing: mul_mat, [3, 3, 3, 1] = [3, 3, 3, 1] * [3, 3, 3, 1] 368s x0: [3, 3, 3, 1] 368s 0.898 -0.021 -0.962 368s -0.089 -0.487 0.915 368s 0.226 0.246 0.989 368s 368s x1: [3, 3, 3, 1] 368s 0.013 0.534 0.876 368s -0.685 -0.267 -0.926 368s -0.765 0.357 -0.421 368s 368s y: [3, 3, 3, 1] 368s -0.842 0.540 1.001 368s 0.281 -0.656 -1.136 368s -0.290 -0.490 -0.501 368s testing: mul_mat, [1, 3, 3, 2] = [3, 1, 3, 2] * [3, 3, 3, 2] 368s x0: [3, 1, 3, 2] 368s -0.234 -0.065 0.944 368s 368s x1: [3, 3, 3, 2] 368s 0.384 -0.186 0.171 368s 0.631 -0.288 -0.850 368s 0.669 0.623 -0.337 368s 368s y: [1, 3, 3, 2] 368s 0.084 368s -0.931 368s -0.515 368s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 368s x0: [3, 4, 1, 1] 368s 0.721 -0.619 -0.997 368s -0.994 0.785 -0.600 368s -0.580 0.633 -0.935 368s 0.574 0.131 0.947 368s 368s x1: [3, 1, 1, 1] 368s 0.421 0.796 0.940 368s 368s y: [4, 1, 1, 1] 368s -1.126 -0.358 -0.619 1.236 368s testing: mul_mat, [4, 3, 3, 1] = [4, 4, 3, 1] * [4, 3, 3, 1] 368s x0: [4, 4, 3, 1] 368s -0.942 -0.622 0.678 -0.804 368s -0.745 0.372 -0.836 -0.906 368s 0.333 -0.970 -0.557 -0.823 368s 0.316 -0.487 0.924 -0.352 368s 368s x1: [4, 3, 3, 1] 368s 0.090 -0.055 0.258 0.852 368s 0.155 -0.249 0.858 -0.675 368s 0.137 0.258 -0.877 -0.961 368s 368s y: [4, 3, 3, 1] 368s -0.561 -1.075 -0.763 -0.007 368s 1.134 -0.314 0.371 1.202 368s -0.111 1.597 1.075 -0.554 368s testing: mul_mat, [3, 4, 3, 2] = [4, 3, 3, 2] * [4, 4, 3, 2] 368s x0: [4, 3, 3, 2] 368s 0.593 0.244 -0.290 0.411 368s 0.090 -0.507 0.337 0.145 368s -0.303 -0.145 -0.412 -0.238 368s 368s x1: [4, 4, 3, 2] 368s -0.756 0.278 -0.715 -0.182 368s -0.598 0.719 0.221 0.713 368s 0.502 -0.602 -0.531 0.484 368s 0.505 0.420 0.524 -0.133 368s 368s y: [3, 4, 3, 2] 368s -0.248 -0.476 0.527 368s 0.050 -0.241 -0.183 368s 0.504 0.242 0.039 368s 0.196 -0.011 -0.399 368s test-mul-mat0: iter:471/500 368s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 368s x0: [3, 4, 1, 1] 368s 0.072 -0.808 -0.172 368s 0.148 0.693 0.463 368s 0.256 0.236 -0.147 368s 0.584 -0.890 -0.359 368s 368s x1: [3, 1, 1, 1] 368s 0.467 -0.191 -0.882 368s 368s y: [4, 1, 1, 1] 368s 0.340 -0.472 0.204 0.760 368s testing: mul_mat, [1, 4, 1, 1] = [3, 1, 1, 1] * [3, 4, 1, 1] 368s x0: [3, 1, 1, 1] 368s -0.677 0.073 0.721 368s 368s x1: [3, 4, 1, 1] 368s -0.260 0.907 0.916 368s 0.419 0.872 0.171 368s -0.523 -0.284 0.743 368s -0.499 0.848 -0.222 368s 368s y: [1, 4, 1, 1] 368s 0.903 368s -0.096 368s 0.870 368s 0.240 368s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 368s x0: [3, 4, 1, 1] 368s 0.383 -0.157 0.353 368s 0.076 -0.693 -0.391 368s -0.687 0.160 -0.807 368s -0.577 0.801 -0.067 368s 368s x1ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s : [3, 1, 1, 1] 368s 0.132 0.191 -0.984 368s 368s y: [4, 1, 1, 1] 368s -0.327 0.262 0.734 0.143 368s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 368s x0: [3, 2, 1, 1] 368s -0.518 -0.779 0.322 368s 0.306 0.674 -0.574 368s 368s x1: [3, 4, 1, 1] 368s -0.670 0.716 -0.648 368s -0.798 -0.113 -0.171 368s -0.082 -0.369 0.330 368s -0.234 0.409 0.585 368s 368s y: [2, 4, 1, 1] 368s -0.419 0.649 368s 0.446 -0.222 368s 0.436 -0.464 368s -0.009 -0.132 368s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 368s x0: [2, 2, 1, 1] 368s 0.632 -0.132 368s -0.419 0.433 368s 368s x1: [2, 3, 1, 1] 368s 0.057 -0.834 368s 0.779 -0.867 368s -0.528 -0.612 368s 368s y: [2, 3, 1, 1] 368s 0.146 -0.385 368s 0.606 -0.701 368s -0.253 -0.044 368s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 368s x0: [2, 4, 1, 1] 368s -0.964 -0.974 368s -0.305 0.085 368s -0.682 -0.924 368s -0.333 0.611 368s 368s x1: [2, 2, 1, 1] 368s 0.514 0.198 368s 0.149 0.866 368s 368s y: [4, 2, 1, 1] 368s -0.689 -0.140 -0.534 -0.050 368s -0.988 0.028 -0.902 0.479 368s test-mul-mat0: iter:472/500 368s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 368s x0: [3, 4, 1, 1] 368s -0.123 -0.012 0.867 368s -0.490 0.568 -0.264 368s 0.942 0.083 0.934 368s 0.092 -0.051 0.334 368s 368s x1: [3, 2, 1, 1] 368s -0.769 -0.761 -0.179 368s 0.288 -0.595 -0.400 368s 368s y: [4, 2, 1, 1] 368s -0.051 -0.008 -0.954 -0.091 368s -0.375 -0.374 -0.151 -0.077 368s testing: mul_mat, [4, 4, 3, 1] = [3, 4, 3, 1] * [3, 4, 3, 1] 368s x0: [3, 4, 3, 1] 368s -0.944 0.910 -0.457 368s -0.515 -0.892 0.851 368s 0.145 -0.455 0.979 368s 0.093 0.387 -0.292 368s 368s x1: [3, 4, 3, 1] 368s 0.128 0.644 0.653 368s 0.795 0.670 -0.263 368s 0.871 0.280 -0.088 368s -0.572 -0.802 0.307 368s 368s y: [4, 4, 3, 1] 368s 0.167 -0.085 0.365 0.070 368s -0.021 -1.230 -0.447 0.409 368s -0.526 -0.774 -0.088 0.215 368s -0.331 1.272 0.584 -0.453 368s testing: mul_mat, [2, 4, 3, 1] = [3, 2, 3, 1] * [3, 4, 3, 1] 368s x0: [3, 2, 3, 1] 368s 0.216 0.183 0.772 368s -0.930 0.470 0.144 368s 368s x1: [3, 4, 3, 1] 368s 0.742 0.031 0.545 368s 0.722 -0.876 -0.069 368s -0.570 -0.413 -0.963 368s -0.146 -0.713 0.372 368s 368s y: [2, 4, 3, 1] 368s 0.586 -0.598 368s -0.058 -1.093 368s -0.941 0.197 368s 0.125 -0.146 368s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 368s x0: [3, 1, 1, 1] 368s 0.234 -0.899 0.376 368s 368s x1: [3, 2, 1, 1] 368s 0.056 -0.152 -0.813 368s 0.314 -0.830 0.629 368s 368s y: [1, 2, 1, 1] 368s -0.156 368s 1.056 368s testing: mul_mat, [2, 3, 3, 1] = [1, 2, 3, 1] * [1, 3, 3, 1] 368s x0: [1, 2, 3, 1] 368s -0.218 368s -0.951 368s 368s x1: [1, 3, 3, 1] 368s -0.981 368s -0.303 368s 0.110 368s 368s y: [2, 3, 3, 1] 368s 0.214 0.933 368s 0.066 0.288 368s -0.024 -0.105 368s testing: mul_mat, [3, 1, 3, 1] = [2, 3, 3, 1] * [2, 1, 3, 1] 368s x0: [2, 3, 3, 1] 368s 0.017 -0.355 368s -0.425 0.148 368s 0.530 -0.237 368s 368s x1: [2, 1, 3, 1] 368s 0.093 -0.802 368s 368s y: [3, 1, 3, 1] 368s 0.286 -0.158 0.239 368s test-mul-mat0: iter:473/500 368s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 368s x0: [2, 2, 1, 1] 368s -0.647 0.028 368s 0.707 -0.002 368s 368s x1: [2, 4, 1, 1] 368s -0.691 -0.013 368s -0.695 -0.493 368s 0.192 -0.183 368s 0.336 -0.547 368s 368s y: [2, 4, 1, 1] 368s 0.447 -0.488 368s 0.436 -0.490 368s -0.129 0.136 368s -0.233 0.239 368s testing: mul_mat, [4, 2, 4, 1] = [2, 4, 4, 1] * [2, 2, 4, 1] 368s x0: [2, 4, 4, 1] 368s 0.858 -0.714 368s 0.309 -0.836 368s -0.208 -0.499 368s -0.019 -0.872 368s 368s x1: [2, 2, 4, 1] 368s -0.825 -0.530 368s 0.745 -0.302 368s 368s y: [4, 2, 4, 1] 368s -0.330 0.188 0.436 0.478 368s 0.855 0.482 -0.004 0.249 368s testing: mul_mat, [2, 4, 4, 2] = [2, 2, 4, 2] * [2, 4, 4, 2] 368s x0: [2, 2, 4, 2] 368s 0.172 0.725 368s -0.601 -0.665 368s 368s x1: [2, 4, 4, 2] 368s 0.681 0.932 368s -0.216 -0.527 368s -0.568 0.765 368s -0.398 0.385 368s 368s y: [2, 4, 4, 2] 368s 0.792 -1.029 368s -0.419 0.480 368s 0.457 -0.167 368s 0.211 -0.017 368s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 368s x0: [2, 3, 1, 1] 368s 0.948 0.340 368s 0.892 0.933 368s 0.387 -0.216 368s 368s x1: [2, 2, 1, 1] 368s 0.015 -0.500 368s 0.367 0.427 368s 368s y: [3, 2, 1, 1] 368s -0.156 -0.453 0.114 368s 0.493 0.725 0.050 368s testing: mul_mat, [1, 2, 4, 1] = [3, 1, 4, 1] * [3, 2, 4, 1] 368s x0: [3, 1, 4, 1] 368s -0.782 0.805 0.659 368s 368s x1: [3, 2, 4, 1] 368s 0.328 0.925 -0.101 368s -0.884 0.429 0.299 368s 368s y: [1, 2, 4, 1] 368s 0.421 368s 1.234 368s testing: mul_mat, [4, 3, 4, 2] = [1, 4, 4, 2] * [1, 3, 4, 2] 368s x0: [1, 4, 4, 2] 368s 0.606 368s 0.213 368s -0.219 368s -0.392 368s 368s x1: [1, 3, 4, 2] 368s 0.834 368s 0.129 368s -0.360 368s 368s y: [4, 3, 4, 2] 368s 0.506 0.178 -0.183 -0.327 368s 0.078 0.028 -0.028 -0.051 368s -0.218 -0.077 0.079 0.141 368s test-mul-mat0: iter:474/500 368s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 368s x0: [4, 4, 1, 1] 368s 0.433 0.874 0.734 0.194 368s -0.968 0.413 0.048 0.256 368s -0.962 -0.091 0.817 0.827 368s -0.417 0.375 -0.664 0.808 368s 368s x1: [4, 2, 1, 1] 368s 0.200 0.165 -0.823 0.263 368s -0.837 0.502 0.281 0.508 368s 368s y: [4, 2, 1, 1] 368s -0.322 -0.098 -0.663 0.738 368s 0.380 1.161 1.409 0.762 368s testing: mul_mat, [1, 4, 4, 1] = [4, 1, 4, 1] * [4, 4, 4, 1] 368s x0: [4, 1, 4, 1] 368s 0.172 -0.666 0.787 0.407 368s 368s x1: [4, 4, 4, 1] 368s -0.629 -0.549 -0.753 -0.851 368s 0.174 -0.010 0.350 -0.661 368s 0.166 -0.387 -0.498 -0.332 368s 0.894 -0.990 0.399 0.327 368s 368s y: [1, 4, 4, 1] 368s -0.682 368s 0.043 368s -0.241 368s 1.261 368s testing: mul_mat, [3, 1, 4, 3] = [4, 3, 4, 3] * [4, 1, 4, 3] 368s x0: [4, 3, 4, 3] 368s 0.098 -0.434 -0.683 -0.233 368s -0.621 -0.444 0.432 0.672 368s 0.135 0.163 0.555 -0.474 368s 368s x1: [4, 1, 4, 3] 368s 0.612 0.832 0.813 0.809 368s 368s y: [3, 1, 4, 3] 368s -1.045 0.146 0.286 368s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 368s x0: [4, 4, 1, 1] 368s 0.914 -0.768 -0.877 0.531 368s -0.764 -0.858 0.439 0.967 368s 0.949 0.529 0.105 -0.732 368s 0.227 0.128 -0.385 -0.256 368s 368s x1: [4, 3, 1, 1] 368s -0.844 0.346 0.760 -0.357 368s 0.080 -0.951 0.743 -0.706 368s 0.044 -0.119 0.179 -0.167 368s 368s y: [4, 3, 1, 1] 368s -1.893 0.336 -0.277 -0.348 368s -0.223 0.399 0.166 -0.209 368s -0.114 -0.014 0.120 -0.031 368s testing: mul_mat, [1, 4, 4, 1] = [4, 1, 4, 1] * [4, 4, 4, 1] 368s x0: [4, 1, 4, 1] 368s 0.363 -0.639 0.333 0.444 368s 368s x1: [4, 4, 4, 1] 368s -0.454 -0.684 -0.100 0.892 368s -0.924 0.543 -0.028 -0.875 368s 0.286 0.267 0.169 -0.833 368s -0.554 -0.997 -0.431 -0.640 368s 368s y: [1, 4, 4, 1] 368s 0.635 368s -1.080 368s -0.380 368s 0.008 368s testing: mul_mat, [3, 4, 4, 3] = [1, 3, 4, 3] * [1, 4, 4, 3] 368s x0: [1, 3, 4, 3] 368s -0.089 368s 0.771 368s 0.046 368s 368s x1: [1, 4, 4, 3] 368s -0.089 368s -0.802 368s 0.338 368s -0.425 368s 368s y: [3, 4, 4, 3] 368s 0.008 -0.068 -0.004 368s 0.071 -0.618 -0.037 368s -0.030 0.260 0.016 368s 0.038 -0.328 -0.020 368s test-mul-mat0: iter:475/500 368s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 368s x0: [3, 1, 1, 1] 368s -0.913 -0.332 -0.679 368s 368s x1: [3, 2, 1, 1] 368s -0.222 -0.581 -0.862 368s -0.266 -0.307 -0.485 368s 368s y: [1, 2, 1, 1] 368s 0.981 368s 0.675 368s testing: mul_mat, [2, 1, 3, 1] = [3, 2, 3, 1] * [3, 1, 3, 1] 368s x0: [3, 2, 3, 1] 368s 0.961 -0.878 0.011 368s 0.254 -0.257 -0.691 368s 368s x1: [3, 1, 3, 1] 368s 0.544 -0.296 -0.044 368s 368s y: [2, 1, 3, 1] 368s 0.782 0.245 368s testing: mul_mat, [2, 2, 3, 3] = [3, 2, 3, 3] * [3, 2, 3, 3] 368s x0: [3, 2, 3, 3] 368s 0.279 -0.843 0.176 368s -0.205 -0.118 -0.415 368s 368s x1: [3, 2, 3, 3] 368s 0.743 -0.700 -0.735 368s 0.287 0.004 0.221 368s 368s y: [2, 2, 3, 3] 368s 0.668 0.234 368s 0.116 -0.151 368s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 368s x0: [3, 4, 1, 1] 368s -0.910 -0.128 0.446 368s 0.722 -0.186 0.941 368s 0.532 -0.067 0.659 368s -0.781 -0.724 0.183 368s 368s x1: [3, 2, 1, 1] 368s 0.739 0.585 -0.710 368s -0.108 0.138 0.366 368s 368s y: [4, 2, 1, 1] 368s -1.064 -0.245 -0.114 -1.131 368s 0.244 0.241 0.174 0.052 368s testing: mul_mat, [1, 3, 3, 1] = [4, 1, 3, 1] * [4, 3, 3, 1] 368s x0: [4, 1, 3, 1] 368s 0.482 0.484 -0.530 0.619 368s 368s x1: [4, 3, 3, 1] 368s 0.956 0.965 -0.890 0.745 368s 0.511 0.451 -0.433 -0.467 368s -0.967 -0.863 0.306 0.291 368s 368s y: [1, 3, 3, 1] 368s 1.861 368s 0.405 368s -0.866 368s testing: mul_mat, [3, 4, 3, 3] = [1, 3, 3, 3] * [1, 4, 3, 3] 368s x0: [1, 3, 3, 3] 368s 0.193 368s 0.283 368s 0.450 368s 368s x1: [1, 4, 3, 3] 368s -0.837 368s 0.830 368s -0.045 368s 0.596 368s 368s y: [3, 4, 3, 3] 368s -0.162 -0.237 -0.377 368s 0.161 0.235 0.373 368s -0.009 -0.013 -0.020 368s 0.115 0.168 0.268 368s test-mul-mat0: iter:476/500 368s testing: mul_mat, [2, 2, 1, 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 368s x0: [4, 2, 1, 1] 368s -0.830 0.712 0.383 -0.467 368s -0.576 0.681 0.197 -0.027 368s 368s x1: [4, 2, 1, 1] 368s 0.774 0.525 0.051 -0.498 368s 0.460 -0.714 0.638 0.218 368s 368s y: [2, 2, 1, 1] 368s -0.017 -0.065 368s -0.747 -0.631 368s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 368s x0: [4, 4, 1, 1] 368s -0.565 -0.668 -0.863 -0.324 368s 0.933 0.912 -0.799 -0.016 368s -0.587 0.661 0.270 -0.949 368s -0.121 -0.984 -0.779 -0.409 368s 368s x1: [4, 2, 1, 1] 368s -0.571 0.923 -0.147 0.310 368s -0.948 -0.673 -0.965 0.870 368s 368s y: [4, 2, 1, 1] 368s -0.268 0.421 0.611 -0.852 368s 1.537 -0.742 -0.974 1.172 368s testing: mul_mat, [4, 4, 1, 3] = [4, 4, 1, 3] * [4, 4, 1, 3] 368s x0: [4, 4, 1, 3] 368s 0.252 0.947 -0.333 0.420 368s -0.009 0.853 0.248 0.305 368s 0.090 0.956 0.635 -0.451 368s 0.254 0.364 -0.239 0.567 368s 368s x1: [4, 4, 1, 3] 368s 0.400 -0.246 ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s 0.015 0.081 368s 0.951 0.987 0.510 0.874 368s -0.159 -0.180 0.926 0.168 368s -0.144 0.796 -0.276 0.290 368s 368s y: [4, 4, 1, 3] 368s -0.103 -0.185 -0.226 0.054 368s 1.372 1.226 0.959 0.975 368s -0.448 0.129 0.326 -0.232 368s 0.931 0.700 0.441 0.484 368s testing: mul_mat, [4, 4, 1, 1] = [4, 4, 1, 1] * [4, 4, 1, 1] 368s x0: [4, 4, 1, 1] 368s -0.580 -0.938 -0.678 0.712 368s 0.250 0.921 -0.902 -0.694 368s -0.932 -0.950 -0.680 0.570 368s 0.257 -0.491 0.484 0.193 368s 368s x1: [4, 4, 1, 1] 368s -0.914 0.481 -0.520 0.048 368s -0.512 -0.885 0.973 0.805 368s 0.745 -0.344 -0.847 -0.338 368s 0.226 -0.604 -0.169 0.743 368s 368s y: [4, 4, 1, 1] 368s 0.466 0.650 0.776 -0.714 368s 1.040 -2.379 1.115 0.929 368s 0.223 0.868 0.015 -0.114 368s 1.078 -0.863 0.901 0.416 368s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 368s x0: [4, 2, 1, 1] 368s 0.409 0.529 0.867 0.652 368s 0.816 0.330 -0.962 0.965 368s 368s x1: [4, 4, 1, 1] 368s -0.214 -0.382 0.681 -0.099 368s -0.409 0.486 -0.354 0.247 368s 0.639 0.308 -0.527 -0.965 368s -0.861 -0.784 0.341 -0.441 368s 368s y: [2, 4, 1, 1] 368s 0.236 -1.052 368s -0.056 0.405 368s -0.661 0.200 368s -0.759 -1.715 368s testing: mul_mat, [1, 4, 1, 3] = [2, 1, 1, 3] * [2, 4, 1, 3] 368s x0: [2, 1, 1, 3] 368s -0.153 0.791 368s 368s x1: [2, 4, 1, 3] 368s -0.642 0.136 368s 0.676 -0.337 368s -0.293 -0.131 368s 0.449 0.325 368s 368s y: [1, 4, 1, 3] 368s 0.206 368s -0.370 368s -0.059 368s 0.189 368s test-mul-mat0: iter:477/500 368s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 368s x0: [2, 3, 1, 1] 368s 0.705 -0.472 368s 0.782 -0.299 368s 0.691 0.457 368s 368s x1: [2, 2, 1, 1] 368s 0.287 0.196 368s 0.355 -0.388 368s 368s y: [3, 2, 1, 1] 368s 0.110 0.166 0.288 368s 0.434 0.394 0.068 368s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 368s x0: [2, 1, 1, 1] 368s -0.502 -0.319 368s 368s x1: [2, 3, 1, 1] 368s 0.006 0.327 368s 0.168 0.449 368s 0.179 -0.781 368s 368s y: [1, 3, 1, 1] 368s -0.107 368s -0.228 368s 0.159 368s testing: mul_mat, [1, 1, 1, 1] = [2, 1, 1, 1] * [2, 1, 1, 1] 368s x0: [2, 1, 1, 1] 368s 0.956 -0.770 368s 368s x1: [2, 1, 1, 1] 368s 0.270 0.345 368s 368s y: [1, 1, 1, 1] 368s -0.007 368s testing: mul_mat, [1, 1, 1, 1] = [2, 1, 1, 1] * [2, 1, 1, 1] 368s x0: [2, 1, 1, 1] 368s -0.504 -0.068 368s 368s x1: [2, 1, 1, 1] 368s 0.916 -0.394 368s 368s y: [1, 1, 1, 1] 368s -0.435 368s testing: mul_mat, [2, 2, 1, 1] = [1, 2, 1, 1] * [1, 2, 1, 1] 368s x0: [1, 2, 1, 1] 368s -0.266 368s -0.605 368s 368s x1: [1, 2, 1, 1] 368s 0.379 368s 0.783 368s 368s y: [2, 2, 1, 1] 368s -0.101 -0.229 368s -0.208 -0.474 368s testing: mul_mat, [1, 1, 1, 1] = [2, 1, 1, 1] * [2, 1, 1, 1] 368s x0: [2, 1, 1, 1] 368s 0.656 0.141 368s 368s x1: [2, 1, 1, 1] 368s 0.874 -0.561 368s 368s y: [1, 1, 1, 1] 368s 0.495 368s test-mul-mat0: iter:478/500 368s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 368s x0: [4, 1, 1, 1] 368s 0.653 0.273 0.193 0.569 368s 368s x1: [4, 2, 1, 1] 368s -0.719 0.595 0.121 -0.499 368s -0.953 0.619 0.181 0.317 368s 368s y: [1, 2, 1, 1] 368s -0.568 368s -0.238 368s testing: mul_mat, [3, 1, 3, 1] = [4, 3, 3, 1] * [4, 1, 3, 1] 368s x0: [4, 3, 3, 1] 368s -0.641 0.781 0.583 -0.495 368s 0.722 0.864 -0.899 -0.157 368s -0.635 -0.852 -0.539 0.546 368s 368s x1: [4, 1, 3, 1] 368s 0.879 -0.334 -0.935 -0.189 368s 368s y: [3, 1, 3, 1] 368s -1.276 1.216 0.127 368s testing: mul_mat, [3, 3, 3, 2] = [4, 3, 3, 2] * [4, 3, 3, 2] 368s x0: [4, 3, 3, 2] 368s 0.367 0.730 0.267 0.161 368s 0.416 0.338 -0.491 0.594 368s -0.538 0.519 0.188 0.820 368s 368s x1: [4, 3, 3, 2] 368s 0.098 -0.710 -0.564 0.463 368s -0.563 -0.102 0.009 0.902 368s -0.678 -0.792 -0.360 -0.059 368s 368s y: [3, 3, 3, 2] 368s -0.558 0.353 -0.147 368s -0.133 0.262 0.991 368s -0.932 -0.408 -0.162 368s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 368s x0: [4, 1, 1, 1] 368s -0.890 0.211 -0.200 0.279 368s 368s x1: [4, 3, 1, 1] 368s -0.869 -0.376 0.489 0.431 368s -0.150 0.789 0.501 -0.496 368s 0.795 0.793 -0.830 -0.819 368s 368s y: [1, 3, 1, 1] 368s 0.717 368s 0.062 368s -0.602 368s testing: mul_mat, [4, 4, 3, 1] = [1, 4, 3, 1] * [1, 4, 3, 1] 368s x0: [1, 4, 3, 1] 368s 0.382 368s 0.145 368s 0.783 368s -0.129 368s 368s x1: [1, 4, 3, 1] 368s -0.364 368s -0.521 368s 0.532 368s -0.665 368s 368s y: [4, 4, 3, 1] 368s -0.139 -0.053 -0.285 0.047 368s -0.199 -0.076 -0.408 0.067 368s 0.203 0.077 0.417 -0.069 368s -0.254 -0.097 -0.521 0.086 368s testing: mul_mat, [1, 1, 3, 2] = [4, 1, 3, 2] * [4, 1, 3, 2] 368s x0: [4, 1, 3, 2] 368s -0.928 0.536 -0.196 -0.473 368s 368s x1: [4, 1, 3, 2] 368s -0.364 0.041 0.417 0.847 368s 368s y: [1, 1, 3, 2] 368s -0.123 368s test-mul-mat0: iter:479/500 368s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 368s x0: [2, 4, 1, 1] 368s -0.477 0.469 368s 0.193 0.534 368s 0.756 0.416 368s 0.027 -0.636 368s 368s x1: [2, 4, 1, 1] 368s -0.238 -0.920 368s -0.250 -0.166 368s 0.616 0.554 368s 0.360 0.022 368s 368s y: [4, 4, 1, 1] 368s -0.318 -0.537 -0.563 0.579 368s 0.041 -0.137 -0.258 0.099 368s -0.033 0.414 0.696 -0.336 368s -0.162 0.081 0.282 -0.004 368s testing: mul_mat, [3, 4, 4, 1] = [2, 3, 4, 1] * [2, 4, 4, 1] 368s x0: [2, 3, 4, 1] 368s -0.318 0.569 368s -0.901 0.199 368s -0.283 -0.058 368s 368s x1: [2, 4, 4, 1] 368s 0.868 0.338 368s -0.133 0.147 368s -0.955 -0.483 368s 0.149 -0.157 368s 368s y: [3, 4, 4, 1] 368s -0.083 -0.715 -0.265 368s 0.126 0.149 0.029 368s 0.029 0.765 0.298 368s -0.137 -0.165 -0.033 368s testing: mul_mat, [1, 3, 4, 3] = [2, 1, 4, 3] * [2, 3, 4, 3] 368s x0: [2, 1, 4, 3] 368s -0.242 0.546 368s 368s x1: [2, 3, 4, 3] 368s -0.614 0.277 368s -0.545 0.102 368s 0.092 -0.123 368s 368s y: [1, 3, 4, 3] 368s 0.300 368s 0.188 368s -0.089 368s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 368s x0: [2, 4, 1, 1] 368s 0.167 0.425 368s -0.333 0.308 368s 0.762 -0.232 368s 0.437 0.749 368s 368s x1: [2, 1, 1, 1] 368s -0.541 -0.293 368s 368s y: [4, 1, 1, 1] 368s -0.215 0.090 -0.344 -0.455 368s testing: mul_mat, [2, 2, 4, 1] = [4, 2, 4, 1] * [4, 2, 4, 1] 368s x0: [4, 2, 4, 1] 368s 0.048 -0.794 -0.653 -0.993 368s -0.146 -0.632 0.778 0.285 368s 368s x1: [4, 2, 4, 1] 368s -0.538 0.859 0.520 0.066 368s -0.956 0.320 0.494 0.571 368s 368s y: [2, 2, 4, 1] 368s -1.112 -0.041 368s -1.189 0.484 368s testing: mul_mat, [2, 4, 4, 3] = [2, 2, 4, 3] * [2, 4, 4, 3] 368s x0: [2, 2, 4, 3] 368s 0.872 -0.968 368s -0.231 0.634 368s 368s x1: [2, 4, 4, 3] 368s 0.454 -0.060 368s -0.020 0.801 368s -0.282 -0.013 368s 0.087 0.723 368s 368s y: [2, 4, 4, 3] 368s 0.453 -0.143 368s -0.793 0.513 368s -0.233 0.057 368s -0.624 0.439 368s test-mul-mat0: iter:480/500 368s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 368s x0: [1, 2, 1, 1] 368s 0.296 368s 0.143 368s 368s x1: [1, 4, 1, 1] 368s 0.884 368s 0.439 368s 0.941 368s 0.364 368s 368s y: [2, 4, 1, 1] 368s 0.262 0.126 368s 0.130 0.063 368s 0.279 0.134 368s 0.108 0.052 368s testing: mul_mat, [1, 2, 3, 1] = [1, 1, 3, 1] * [1, 2, 3, 1] 368s x0: [1, 1, 3, 1] 368s 0.335 368s 368s x1: [1, 2, 3, 1] 368s -0.910 368s 0.297 368s 368s y: [1, 2, 3, 1] 368s -0.305 368s 0.100 368s testing: mul_mat, [2, 1, 3, 4] = [1, 2, 3, 4] * [1, 1, 3, 4] 368s x0: [1, 2, 3, 4] 368s -0.633 368s -0.636 368s 368s x1: [1, 1, 3, 4] 368s 0.762 368s 368s y: [2, 1, 3, 4] 368s -0.482 -0.485 368s testing: mul_mat, [1, 2, 1, 1] = [1, 1, 1, 1] * [1, 2, 1, 1] 368s x0: [1, 1, 1, 1] 368s -0.292 368s 368s x1: [1, 2, 1, 1] 368s -0.837 368s -0.930 368s 368s y: [1, 2, 1, 1] 368s 0.245 368s 0.272 368s testing: mul_mat, [4, 1, 3, 1] = [1, 4, 3, 1] * [1, 1, 3, 1] 368s x0: [1, 4, 3, 1] 368s 0.984 368s -0.723 368s -0.513 368s 0.711 368s 368s x1: [1, 1, 3, 1] 368s 0.812 368s 368s y: [4, 1, 3, 1] 368s 0.798 -0.587 -0.416 0.577 368s testing: mul_mat, [1, 1, 3, 4] = [4, 1, 3, 4] * [4, 1, 3, 4] 368s x0: [4, 1, 3, 4] 368s -0.133 0.859 0.268 -0.842 368s 368s x1: [4, 1, 3, 4] 368s 0.756 -0.683 -0.737 -0.406 368s 368s y: [1, 1, 3, 4] 368s -0.543 368s test-mul-mat0: iter:481/500 368s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 368s x0: [4, 2, 1, 1] 368s 0.546 0.631 0.721 -0.663 368s -0.351 -0.933 0.043 -0.028 368s 368s x1: [4, 3, 1, 1] 368s -0.374 -0.186 -0.493 0.462 368s -0.024 0.332 0.305 0.201 368s 0.495 -0.853 -0.705 0.517 368s 368s y: [2, 3, 1, 1] 368s -0.983 0.270 368s 0.283 -0.293 368s -1.119 0.576 368s testing: mul_mat, [2, 2, 1, 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 368s x0: [4, 2, 1, 1] 368s 0.034 -0.994 0.658 0.541 368s 0.468 -0.366 -0.127 -0.226 368s 368s x1: [4, 2, 1, 1] 368s 0.920 0.999 -0.244 -0.784 368s 0.803 -0.248 -0.447 0.380 368s 368s y: [2, 2, 1, 1] 368s -1.546 0.273 368s 0.186 0.438 368s testing: mul_mat, [2, 2, 1, 2] = [4, 2, 1, 2] * [4, 2, 1, 2] 368s x0: [4, 2, 1, 2] 368s 0.103 -0.141 -0.355 0.657 368s -0.761 0.489 -0.309 -0.755 368s 368s x1: [4, 2, 1, 2] 368s 0.835 -0.632 -0.079 -0.870 368s 0.885 0.179 0.676 0.516 368s 368s y: [2, 2, 1, 2] 368s -0.368 -0.264 368s 0.165 -1.185 368s testing: mul_mat, [2, 2, 1, 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 368s x0: [4, 2, 1, 1] 368s -0.546 -0.823 0.548 -0.557 368s -0.508 0.340 -0.929 -0.593 368s 368s x1: [4, 2, 1, 1] 368s 0.409 0.747 0.359 -0.413 368s 0.423 -0.125 0.487 0.436 368s 368s y: [2, 2, 1, 1] 368s -0.412 -0.042 368s -0.104 -0.968 368s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 368s x0: [2, 2, 1, 1] 368s 0.134 0.092 368s -0.872 -0.248 368s 368s x1: [2, 4, 1, 1] 368s -0.284 -0.900 368s -0.354 -0.795 368s -0.209 -0.109 368s 0.353 0.023 368s 368s y: [2, 4, 1, 1] 368s -0.121 0.471 368s -0.121 0.506 368s -0.038 0.209 368s 0.050 -0.313 368s testing: mul_mat, [4, 2, 1, 2] = [2, 4, 1, 2] * [2, 2, 1, 2] 368s x0: [2, 4, 1, 2] 368s -0.125 -0.632 368s 0.028 0.398 368s 0.850 0.123 368s 0.052 -0.188 368s 368s x1: [2, 2, 1, 2] 368s 0.601 -0.500 368s -0.501 -0.040 368s 368s y: [4, 2, 1, 2] 368s 0.240 -0.ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s 182 0.450 0.125 368s 0.088 -0.030 -0.431 -0.019 368s test-mul-mat0: iter:482/500 368s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 368s x0: [4, 3, 1, 1] 368s -0.090 0.176 -0.403 -0.693 368s -0.701 0.409 -0.888 -0.862 368s -0.679 -0.437 -0.818 -0.977 368s 368s x1: [4, 3, 1, 1] 368s 0.038 0.283 -0.545 0.538 368s 0.781 0.415 -0.374 0.703 368s 0.250 0.201 0.060 -0.876 368s 368s y: [3, 3, 1, 1] 368s -0.107 0.108 -0.229 368s -0.333 -0.652 -1.092 368s 0.595 0.609 0.549 368s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 368s x0: [4, 1, 1, 1] 368s -0.124 -0.278 -0.746 0.936 368s 368s x1: [4, 3, 1, 1] 368s 0.905 0.578 -0.251 0.851 368s -0.891 0.603 -0.111 0.392 368s -0.942 -0.572 0.173 0.473 368s 368s y: [1, 3, 1, 1] 368s 0.711 368s 0.393 368s 0.589 368s testing: mul_mat, [3, 1, 1, 4] = [4, 3, 1, 4] * [4, 1, 1, 4] 368s x0: [4, 3, 1, 4] 368s 0.584 -0.671 0.184 -0.527 368s 0.721 0.242 -0.099 -0.106 368s -0.285 0.955 0.771 0.438 368s 368s x1: [4, 1, 1, 4] 368s -0.153 -0.517 -0.154 -0.977 368s 368s y: [3, 1, 1, 4] 368s 0.744 -0.118 -0.997 368s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 368s x0: [4, 1, 1, 1] 368s -0.168 0.049 -0.665 0.499 368s 368s x1: [4, 3, 1, 1] 368s -0.345 -0.882 -0.370 0.285 368s -0.622 0.601 -0.626 0.447 368s -0.600 -0.895 0.349 -0.068 368s 368s y: [1, 3, 1, 1] 368s 0.403 368s 0.773 368s -0.208 368s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 368s x0: [1, 3, 1, 1] 368s -0.965 368s -0.425 368s 0.896 368s 368s x1: [1, 4, 1, 1] 368s 0.639 368s -0.665 368s 0.410 368s 0.129 368s 368s y: [3, 4, 1, 1] 368s -0.616 -0.272 0.572 368s 0.641 0.283 -0.596 368s -0.395 -0.174 0.367 368s -0.125 -0.055 0.116 368s testing: mul_mat, [3, 1, 1, 4] = [3, 3, 1, 4] * [3, 1, 1, 4] 368s x0: [3, 3, 1, 4] 368s -0.483 0.866 -0.302 368s 0.823 -0.245 0.804 368s -0.134 -0.078 0.256 368s 368s x1: [3, 1, 1, 4] 368s -0.901 -0.771 -0.732 368s 368s y: [3, 1, 1, 4] 368s -0.011 -1.142 -0.007 368s test-mul-mat0: iter:483/500 368s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 368s x0: [2, 2, 1, 1] 368s -0.030 -0.385 368s 0.983 -0.654 368s 368s x1: [2, 3, 1, 1] 368s -0.007 0.251 368s 0.074 -0.400 368s 0.823 0.897 368s 368s y: [2, 3, 1, 1] 368s -0.096 -0.171 368s 0.152 0.335 368s -0.370 0.223 368s testing: mul_mat, [1, 2, 3, 1] = [2, 1, 3, 1] * [2, 2, 3, 1] 368s x0: [2, 1, 3, 1] 368s -0.120 -0.013 368s 368s x1: [2, 2, 3, 1] 368s 0.601 -0.829 368s -0.702 0.780 368s 368s y: [1, 2, 3, 1] 368s -0.061 368s 0.074 368s testing: mul_mat, [3, 1, 3, 4] = [2, 3, 3, 4] * [2, 1, 3, 4] 368s x0: [2, 3, 3, 4] 368s -0.173 -0.571 368s 0.457 0.934 368s -0.788 0.723 368s 368s x1: [2, 1, 3, 4] 368s -0.411 -0.224 368s 368s y: [3, 1, 3, 4] 368s 0.199 -0.397 0.162 368s testing: mul_mat, [4, 3, 1, 1] = [2, 4, 1, 1] * [2, 3, 1, 1] 368s x0: [2, 4, 1, 1] 368s 0.022 -0.342 368s 0.664 0.587 368s 0.685 0.543 368s 0.085 -0.758 368s 368s x1: [2, 3, 1, 1] 368s -0.946 0.438 368s 0.878 -0.578 368s 0.457 0.923 368s 368s y: [4, 3, 1, 1] 368s -0.171 -0.371 -0.410 -0.413 368s 0.217 0.244 0.287 0.513 368s -0.306 0.845 0.815 -0.661 368s testing: mul_mat, [2, 2, 3, 1] = [4, 2, 3, 1] * [4, 2, 3, 1] 368s x0: [4, 2, 3, 1] 368s 0.219 -0.836 -0.137 0.256 368s 0.319 -0.195 -0.594 0.813 368s 368s x1: [4, 2, 3, 1] 368s -0.549 -0.050 -0.040 -0.776 368s -0.253 0.812 0.735 0.364 368s 368s y: [2, 2, 3, 1] 368s -0.272 -0.773 368s -0.741 -0.379 368s testing: mul_mat, [2, 4, 3, 4] = [2, 2, 3, 4] * [2, 4, 3, 4] 368s x0: [2, 2, 3, 4] 368s -0.616 0.902 368s 0.691 0.114 368s 368s x1: [2, 4, 3, 4] 368s -0.617 0.516 368s -0.273 -0.774 368s 0.050 -0.039 368s 0.732 -0.732 368s 368s y: [2, 4, 3, 4] 368s 0.846 -0.368 368s -0.530 -0.277 368s -0.066 0.030 368s -1.111 0.423 368s test-mul-mat0: iter:484/500 368s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 368s x0: [2, 1, 1, 1] 368s -0.751 -0.854 368s 368s x1: [2, 4, 1, 1] 368s 0.151 -0.508 368s -0.137 -0.223 368s -0.646 -0.363 368s 0.387 0.058 368s 368s y: [1, 4, 1, 1] 368s 0.320 368s 0.293 368s 0.795 368s -0.340 368s testing: mul_mat, [1, 1, 2, 1] = [2, 1, 2, 1] * [2, 1, 2, 1] 368s x0: [2, 1, 2, 1] 368s -0.343 -0.406 368s 368s x1: [2, 1, 2, 1] 368s 0.329 0.014 368s 368s y: [1, 1, 2, 1] 368s -0.118 368s testing: mul_mat, [2, 1, 2, 3] = [2, 2, 2, 3] * [2, 1, 2, 3] 368s x0: [2, 2, 2, 3] 368s -0.052 -0.055 368s 0.770 0.585 368s 368s x1: [2, 1, 2, 3] 368s 0.788 -0.463 368s 368s y: [2, 1, 2, 3] 368s -0.015 0.336 368s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 368s x0: [2, 2, 1, 1] 368s 0.301 0.326 368s -0.630 -0.754 368s 368s x1: [2, 2, 1, 1] 368s -0.970 -0.707 368s 0.133 -0.046 368s 368s y: [2, 2, 1, 1] 368s -0.522 1.145 368s 0.025 -0.049 368s testing: mul_mat, [2, 2, 2, 1] = [2, 2, 2, 1] * [2, 2, 2, 1] 368s x0: [2, 2, 2, 1] 368s 0.267 0.952 368s 0.876 0.248 368s 368s x1: [2, 2, 2, 1] 368s -0.860 -0.089 368s -0.423 -0.031 368s 368s y: [2, 2, 2, 1] 368s -0.315 -0.775 368s -0.143 -0.378 368s testing: mul_mat, [1, 2, 2, 3] = [2, 1, 2, 3] * [2, 2, 2, 3] 368s x0: [2, 1, 2, 3] 368s -0.891 -0.887 368s 368s x1: [2, 2, 2, 3] 368s 0.949 -0.125 368s 0.192 0.689 368s 368s y: [1, 2, 2, 3] 368s -0.735 368s -0.782 368s test-mul-mat0: iter:485/500 368s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 368s x0: [4, 2, 1, 1] 368s -0.946 0.332 -0.840 0.284 368s -0.263 0.758 -0.800 -0.107 368s 368s x1: [4, 1, 1, 1] 368s 0.415 0.820 0.458 0.085 368s 368s y: [2, 1, 1, 1] 368s -0.481 0.137 368s testing: mul_mat, [3, 2, 4, 1] = [4, 3, 4, 1] * [4, 2, 4, 1] 368s x0: [4, 3, 4, 1] 368s 0.939 -0.014 0.581 -0.267 368s 0.725 -0.288 0.267 0.340 368s 0.720 0.231 -0.177 0.829 368s 368s x1: [4, 2, 4, 1] 368s -0.377 -0.008 0.002 -0.264 368s -0.555 -0.206 -0.261 0.131 368s 368s y: [3, 2, 4, 1] 368s -0.282 -0.360 -0.492 368s -0.706 -0.368 -0.293 368s testing: mul_mat, [4, 3, 4, 3] = [4, 4, 4, 3] * [4, 3, 4, 3] 368s x0: [4, 4, 4, 3] 368s -0.556 -0.406 0.874 -0.326 368s 0.073 -0.432 -0.360 -0.579 368s -0.053 0.482 0.751 -0.360 368s -0.565 -0.562 -0.490 -0.311 368s 368s x1: [4, 3, 4, 3] 368s 0.717 0.342 0.390 0.842 368s -0.636 0.857 0.836 0.331 368s -0.632 0.457 -0.558 0.815 368s 368s y: [4, 3, 4, 3] 368s -0.471 -0.723 0.117 -1.050 368s 0.628 -0.909 0.955 -0.635 368s -0.588 -0.514 -0.459 0.120 368s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 368s x0: [4, 2, 1, 1] 368s 0.528 0.890 0.437 0.555 368s -0.749 0.054 -0.572 0.657 368s 368s x1: [4, 4, 1, 1] 368s -0.961 -0.091 -0.463 -0.748 368s 0.417 -0.364 -0.918 0.083 368s 0.007 -0.382 -0.629 0.793 368s 0.525 -0.961 -0.500 -0.615 368s 368s y: [2, 4, 1, 1] 368s -1.206 0.488 368s -0.459 0.247 368s -0.171 0.854 368s -1.137 -0.564 368s testing: mul_mat, [4, 4, 4, 1] = [2, 4, 4, 1] * [2, 4, 4, 1] 368s x0: [2, 4, 4, 1] 368s -0.775 0.744 368s -0.902 0.489 368s 0.847 -0.357 368s -0.834 0.640 368s 368s x1: [2, 4, 4, 1] 368s -0.256 -0.438 368s -0.627 0.606 368s -0.267 0.941 368s 0.645 0.642 368s 368s y: [4, 4, 4, 1] 368s -0.127 0.017 -0.061 -0.067 368s 0.936 0.862 -0.747 0.911 368s 0.906 0.701 -0.562 0.825 368s -0.023 -0.268 0.318 -0.127 368s testing: mul_mat, [1, 2, 4, 3] = [4, 1, 4, 3] * [4, 2, 4, 3] 368s x0: [4, 1, 4, 3] 368s -0.872 -0.073 -0.515 0.123 368s 368s x1: [4, 2, 4, 3] 368s -0.159 -0.341 -0.955 -0.416 368s -0.852 -0.312 -0.775 -0.305 368s 368s y: [1, 2, 4, 3] 368s 0.605 368s 1.128 368s test-mul-mat0: iter:486/500 368s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 368s x0: [4, 4, 1, 1] 368s -0.611 -0.212 -0.118 0.239 368s -0.773 0.578 0.016 -0.849 368s 0.397 -0.537 -0.182 -0.652 368s -0.989 0.060 -0.962 0.896 368s 368s x1: [4, 1, 1, 1] 368s 0.046 -0.495 0.088 -0.913 368s 368s y: [4, 1, 1, 1] 368s -0.152 0.455 0.863 -0.978 368s testing: mul_mat, [1, 4, 2, 1] = [4, 1, 2, 1] * [4, 4, 2, 1] 368s x0: [4, 1, 2, 1] 368s -0.290 -0.130 -0.943 0.407 368s 368s x1: [4, 4, 2, 1] 368s -0.316 -0.991 -0.526 0.715 368s 0.266 -0.303 0.255 -0.706 368s 0.721 -0.865 0.340 -0.773 368s 0.223 0.428 -0.460 0.612 368s 368s y: [1, 4, 2, 1] 368s 1.007 368s -0.566 368s -0.733 368s 0.563 368s testing: mul_mat, [2, 1, 2, 1] = [4, 2, 2, 1] * [4, 1, 2, 1] 368s x0: [4, 2, 2, 1] 368s 0.778 -0.683 0.813 -0.629 368s -0.287 -0.858 0.783 0.458 368s 368s x1: [4, 1, 2, 1] 368s 0.692 0.356 0.483 0.119 368s 368s y: [2, 1, 2, 1] 368s 0.613 -0.070 368s testing: mul_mat, [3, 2, 1, 1] = [4, 3, 1, 1] * [4, 2, 1, 1] 368s x0: [4, 3, 1, 1] 368s -0.750 0.586 0.364 0.735 368s 0.217 -0.466 -0.149 0.564 368s 0.244 0.190 0.003 0.145 368s 368s x1: [4, 2, 1, 1] 368s 0.088 -0.163 -0.595 0.439 368s 0.966 -0.335 0.131 0.321 368s 368s y: [3, 2, 1, 1] 368s -0.055 0.431 0.052 368s -0.636 0.527 0.219 368s testing: mul_mat, [3, 4, 2, 1] = [3, 3, 2, 1] * [3, 4, 2, 1] 368s x0: [3, 3, 2, 1] 368s 0.150 -0.115 -0.597 368s -0.499 -0.934 0.088 368s -0.883 0.262 -0.111 368s 368s x1: [3, 4, 2, 1] 368s 0.518 0.022 0.627 368s 0.772 0.262 -0.654 368s 0.181 -0.707 0.775 368s 0.461 0.381 -0.388 368s 368s y: [3, 4, 2, 1] 368s -0.300 -0.224 -0.521 368s 0.476 -0.687 -0.541 368s -0.355 0.638 -0.431 368s 0.257 -0.620 -0.265 368s testing: mul_mat, [3, 3, 2, 1] = [3, 3, 2, 1] * [3, 3, 2, 1] 368s x0: [3, 3, 2, 1] 368s 0.217 0.391 -0.253 368s -0.481 -0.131 0.511 368s 0.203 -0.562 0.448 368s 368s x1: [3, 3, 2, 1] 368s -0.316 -0.158 -0.800 368s 0.216 -0.207 -0.901 368s 0.896 -0.007 0.215 368s 368s y: [3, 3, 2, 1] 368s 0.072 -0.236 -0.334 368s 0.193 -0.537 -0.244 368s 0.137 -0.320 0.282 368s test-mul-mat0: iter:487/500 368s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 368s x0: [2, 3, 1, 1] 368s -0.130 -0.184 368s 0.168 0.684 368s -0.752 0.482 368s 368s x1: [2, 1, 1, 1] 368s -0.059 0.450 368s 368s y: [3, 1, 1, 1] 368s -0.075 0.298 0.261 368s testing: mul_mat, [1, 3, 2, 1] = [2, 1, 2, 1] ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s * [2, 3, 2, 1] 368s x0: [2, 1, 2, 1] 368s 0.056 -0.087 368s 368s x1: [2, 3, 2, 1] 368s -0.099 -0.233 368s -0.315 -0.708 368s 0.636 0.123 368s 368s y: [1, 3, 2, 1] 368s 0.015 368s 0.044 368s 0.025 368s testing: mul_mat, [1, 1, 2, 3] = [2, 1, 2, 3] * [2, 1, 2, 3] 368s x0: [2, 1, 2, 3] 368s 0.626 -0.218 368s 368s x1: [2, 1, 2, 3] 368s -0.717 0.678 368s 368s y: [1, 1, 2, 3] 368s -0.597 368s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 368s x0: [2, 4, 1, 1] 368s -0.441 0.566 368s 0.888 0.277 368s -0.109 0.710 368s -0.158 -0.575 368s 368s x1: [2, 1, 1, 1] 368s 0.646 0.571 368s 368s y: [4, 1, 1, 1] 368s 0.038 0.732 0.335 -0.431 368s testing: mul_mat, [3, 2, 2, 1] = [4, 3, 2, 1] * [4, 2, 2, 1] 368s x0: [4, 3, 2, 1] 368s -0.082 -0.436 0.123 0.965 368s 0.368 -0.061 -0.172 -0.606 368s 0.019 0.685 -0.424 0.853 368s 368s x1: [4, 2, 2, 1] 368s -0.397 -0.972 0.400 -0.200 368s 0.162 0.544 -0.319 -0.456 368s 368s y: [3, 2, 2, 1] 368s 0.313 -0.035 -1.014 368s -0.730 0.358 0.122 368s testing: mul_mat, [1, 4, 2, 3] = [3, 1, 2, 3] * [3, 4, 2, 3] 368s x0: [3, 1, 2, 3] 368s -0.424 -0.812 0.078 368s 368s x1: [3, 4, 2, 3] 368s -0.899 -0.587 0.935 368s -0.422 0.326 -0.415 368s 0.986 -0.756 0.953 368s 0.004 -0.193 -0.108 368s 368s y: [1, 4, 2, 3] 368s 0.931 368s -0.118 368s 0.270 368s 0.146 368s test-mul-mat0: iter:488/500 368s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 368s x0: [2, 2, 1, 1] 368s 0.049 -0.683 368s 0.473 -0.126 368s 368s x1: [2, 4, 1, 1] 368s 0.017 -0.221 368s -0.993 0.426 368s 0.846 0.883 368s -0.527 -0.871 368s 368s y: [2, 4, 1, 1] 368s 0.152 0.036 368s -0.340 -0.523 368s -0.562 0.289 368s 0.570 -0.139 368s testing: mul_mat, [2, 2, 2, 1] = [2, 2, 2, 1] * [2, 2, 2, 1] 368s x0: [2, 2, 2, 1] 368s 0.041 0.752 368s 0.763 -0.292 368s 368s x1: [2, 2, 2, 1] 368s 0.675 0.010 368s -0.938 0.323 368s 368s y: [2, 2, 2, 1] 368s 0.036 0.512 368s 0.205 -0.810 368s testing: mul_mat, [1, 2, 2, 1] = [2, 1, 2, 1] * [2, 2, 2, 1] 368s x0: [2, 1, 2, 1] 368s 0.501 0.965 368s 368s x1: [2, 2, 2, 1] 368s 0.216 0.017 368s 0.744 0.223 368s 368s y: [1, 2, 2, 1] 368s 0.124 368s 0.588 368s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 368s x0: [2, 2, 1, 1] 368s -0.099 -0.438 368s 0.034 -0.554 368s 368s x1: [2, 1, 1, 1] 368s -0.161 0.710 368s 368s y: [2, 1, 1, 1] 368s -0.295 -0.399 368s testing: mul_mat, [1, 2, 2, 1] = [2, 1, 2, 1] * [2, 2, 2, 1] 368s x0: [2, 1, 2, 1] 368s 0.618 0.768 368s 368s x1: [2, 2, 2, 1] 368s 0.869 -0.993 368s -0.080 -0.090 368s 368s y: [1, 2, 2, 1] 368s -0.225 368s -0.118 368s testing: mul_mat, [3, 2, 2, 1] = [1, 3, 2, 1] * [1, 2, 2, 1] 368s x0: [1, 3, 2, 1] 368s 0.993 368s -0.181 368s -0.013 368s 368s x1: [1, 2, 2, 1] 368s 0.511 368s -0.620 368s 368s y: [3, 2, 2, 1] 368s 0.508 -0.092 -0.006 368s -0.616 0.112 0.008 368s test-mul-mat0: iter:489/500 368s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 368s x0: [1, 2, 1, 1] 368s 0.065 368s -0.623 368s 368s x1: [1, 4, 1, 1] 368s 0.598 368s -0.492 368s -0.856 368s 0.467 368s 368s y: [2, 4, 1, 1] 368s 0.039 -0.372 368s -0.032 0.307 368s -0.055 0.533 368s 0.030 -0.291 368s testing: mul_mat, [2, 2, 3, 1] = [1, 2, 3, 1] * [1, 2, 3, 1] 368s x0: [1, 2, 3, 1] 368s -0.153 368s -0.753 368s 368s x1: [1, 2, 3, 1] 368s -0.725 368s 0.748 368s 368s y: [2, 2, 3, 1] 368s 0.111 0.546 368s -0.114 -0.563 368s testing: mul_mat, [1, 2, 3, 4] = [1, 1, 3, 4] * [1, 2, 3, 4] 368s x0: [1, 1, 3, 4] 368s 0.265 368s 368s x1: [1, 2, 3, 4] 368s -0.286 368s 0.318 368s 368s y: [1, 2, 3, 4] 368s -0.076 368s 0.084 368s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 368s x0: [1, 2, 1, 1] 368s 0.740 368s -0.823 368s 368s x1: [1, 1, 1, 1] 368s -0.431 368s 368s y: [2, 1, 1, 1] 368s -0.319 0.354 368s testing: mul_mat, [1, 1, 3, 1] = [2, 1, 3, 1] * [2, 1, 3, 1] 368s x0: [2, 1, 3, 1] 368s -0.073 0.829 368s 368s x1: [2, 1, 3, 1] 368s -0.245 0.032 368s 368s y: [1, 1, 3, 1] 368s 0.045 368s testing: mul_mat, [3, 2, 3, 4] = [1, 3, 3, 4] * [1, 2, 3, 4] 368s x0: [1, 3, 3, 4] 368s -0.086 368s -0.542 368s 0.091 368s 368s x1: [1, 2, 3, 4] 368s -0.198 368s -0.362 368s 368s y: [3, 2, 3, 4] 368s 0.017 0.107 -0.018 368s 0.031 0.196 -0.033 368s test-mul-mat0: iter:490/500 368s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 368s x0: [2, 2, 1, 1] 368s -0.758 0.463 368s 0.574 0.666 368s 368s x1: [2, 1, 1, 1] 368s -0.436 -0.726 368s 368s y: [2, 1, 1, 1] 368s -0.006 -0.734 368s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 368s x0: [2, 2, 1, 1] 368s 0.461 0.172 368s -0.757 0.570 368s 368s x1: [2, 2, 1, 1] 368s -0.970 -0.498 368s 0.744 -0.981 368s 368s y: [2, 2, 1, 1] 368s -0.533 0.451 368s 0.174 -1.123 368s testing: mul_mat, [1, 2, 1, 2] = [2, 1, 1, 2] * [2, 2, 1, 2] 368s x0: [2, 1, 1, 2] 368s 0.327 -0.553 368s 368s x1: [2, 2, 1, 2] 368s -0.479 0.525 368s -0.257 0.857 368s 368s y: [1, 2, 1, 2] 368s -0.447 368s -0.558 368s testing: mul_mat, [1, 1, 1, 1] = [2, 1, 1, 1] * [2, 1, 1, 1] 368s x0: [2, 1, 1, 1] 368s -0.732 -0.406 368s 368s x1: [2, 1, 1, 1] 368s 0.993 0.281 368s 368s y: [1, 1, 1, 1] 368s -0.841 368s testing: mul_mat, [1, 2, 1, 1] = [1, 1, 1, 1] * [1, 2, 1, 1] 368s x0: [1, 1, 1, 1] 368s -0.205 368s 368s x1: [1, 2, 1, 1] 368s 0.129 368s 0.731 368s 368s y: [1, 2, 1, 1] 368s -0.026 368s -0.150 368s testing: mul_mat, [1, 1, 1, 2] = [1, 1, 1, 2] * [1, 1, 1, 2] 368s x0: [1, 1, 1, 2] 368s 0.780 368s 368s x1: [1, 1, 1, 2] 368s 0.761 368s 368s y: [1, 1, 1, 2] 368s 0.593 368s test-mul-mat0: iter:491/500 368s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 368s x0: [2, 4, 1, 1] 368s -0.237 -0.204 368s 0.016 0.085 368s -0.211 -0.703 368s 0.972 0.057 368s 368s x1: [2, 4, 1, 1] 368s 0.473 0.104 368s -0.687 0.329 368s -0.458 -0.059 368s 0.435 -0.651 368s 368s y: [4, 4, 1, 1] 368s -0.133 0.017 -0.173 0.465 368s 0.096 0.017 -0.087 -0.649 368s 0.121 -0.012 0.138 -0.449 368s 0.030 -0.049 0.366 0.386 368s testing: mul_mat, [2, 4, 2, 1] = [2, 2, 2, 1] * [2, 4, 2, 1] 368s x0: [2, 2, 2, 1] 368s -0.918 -0.708 368s 0.257 0.518 368s 368s x1: [2, 4, 2, 1] 368s -0.109 0.101 368s -0.212 0.059 368s 0.895 -0.451 368s 0.730 -0.566 368s 368s y: [2, 4, 2, 1] 368s 0.029 0.024 368s 0.152 -0.024 368s -0.503 -0.004 368s -0.269 -0.106 368s testing: mul_mat, [1, 2, 2, 4] = [2, 1, 2, 4] * [2, 2, 2, 4] 368s x0: [2, 1, 2, 4] 368s -0.143 -0.485 368s 368s x1: [2, 2, 2, 4] 368s -0.180 0.366 368s -0.774 0.116 368s 368s y: [1, 2, 2, 4] 368s -0.152 368s 0.054 368s testing: mul_mat, [1, 1, 1, 1] = [2, 1, 1, 1] * [2, 1, 1, 1] 368s x0: [2, 1, 1, 1] 368s 0.019 0.031 368s 368s x1: [2, 1, 1, 1] 368s -0.241 0.512 368s 368s y: [1, 1, 1, 1] 368s 0.011 368s testing: mul_mat, [4, 2, 2, 1] = [1, 4, 2, 1] * [1, 2, 2, 1] 368s x0: [1, 4, 2, 1] 368s -0.615 368s 0.509 368s 0.395 368s 0.784 368s 368s x1: [1, 2, 2, 1] 368s -0.937 368s 0.571 368s 368s y: [4, 2, 2, 1] 368s 0.576 -0.477 -0.370 -0.734 368s -0.351 0.291 0.226 0.448 368s testing: mul_mat, [2, 1, 2, 4] = [4, 2, 2, 4] * [4, 1, 2, 4] 368s x0: [4, 2, 2, 4] 368s -0.192 0.114 0.242 0.216 368s 0.803 -0.541 0.909 -0.535 368s 368s x1: [4, 1, 2, 4] 368s -0.607 -0.179 0.311 -0.394 368s 368s y: [2, 1, 2, 4] 368s 0.086 0.103 368s test-mul-mat0: iter:492/500 368s testing: mul_mat, [4, 2, 1, 1] = [1, 4, 1, 1] * [1, 2, 1, 1] 368s x0: [1, 4, 1, 1] 368s 0.601 368s 0.369 368s 0.551 368s -0.455 368s 368s x1: [1, 2, 1, 1] 368s 0.517 368s -0.520 368s 368s y: [4, 2, 1, 1] 368s 0.311 0.191 0.285 -0.235 368s -0.312 -0.192 -0.286 0.236 368s testing: mul_mat, [2, 4, 2, 1] = [1, 2, 2, 1] * [1, 4, 2, 1] 368s x0: [1, 2, 2, 1] 368s 0.353 368s -0.961 368s 368s x1: [1, 4, 2, 1] 368s -0.806 368s 0.716 368s 0.960 368s 0.253 368s 368s y: [2, 4, 2, 1] 368s -0.285 0.774 368s 0.253 -0.688 368s 0.339 -0.922 368s 0.089 -0.243 368s testing: mul_mat, [4, 2, 2, 2] = [1, 4, 2, 2] * [1, 2, 2, 2] 368s x0: [1, 4, 2, 2] 368s -0.592 368s -0.435 368s -0.593 368s -0.112 368s 368s x1: [1, 2, 2, 2] 368s 0.453 368s 0.919 368s 368s y: [4, 2, 2, 2] 368s -0.268 -0.197 -0.269 -0.051 368s -0.544 -0.400 -0.545 -0.103 368s testing: mul_mat, [4, 4, 1, 1] = [1, 4, 1, 1] * [1, 4, 1, 1] 368s x0: [1, 4, 1, 1] 368s 0.585 368s 0.239 368s -0.556 368s 0.628 368s 368s x1: [1, 4, 1, 1] 368s 0.747 368s -0.980 368s -0.455 368s 0.786 368s 368s y: [4, 4, 1, 1] 368s 0.437 0.179 -0.415 0.469 368s -0.573 -0.235 0.545 -0.616 368s -0.266 -0.109 0.253 -0.286 368s 0.460 0.188 -0.437 0.494 368s testing: mul_mat, [1, 1, 2, 1] = [4, 1, 2, 1] * [4, 1, 2, 1] 368s x0: [4, 1, 2, 1] 368s -0.805 0.164 -0.894 -0.548 368s 368s x1: [4, 1, 2, 1] 368s 0.783 0.636 -0.198 -0.102 368s 368s y: [1, 1, 2, 1] 368s -0.295 368s testing: mul_mat, [4, 4, 2, 2] = [1, 4, 2, 2] * [1, 4, 2, 2] 368s x0: [1, 4, 2, 2] 368s 0.129 368s -0.358 368s -0.949 368s -0.565 368s 368s x1: [1, 4, 2, 2] 368s 0.172 368s -0.394 368s 0.674 368s -0.055 368s 368s y: [4, 4, 2, 2] 368s 0.022 -0.061 -0.163 -0.097 368s -0.051 0.141 0.374 0.222 368s 0.087 -0.241 -0.640 -0.381 368s -0.007 0.020 0.052 0.031 368s test-mul-mat0: iter:493/500 368s testing: mul_mat, [2, 2, 1, 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 368s x0: [4, 2, 1, 1] 368s -0.011 -0.424 -0.176 0.632 368s -0.373 0.259 0.911 -0.224 368s 368s x1: [4, 2, 1, 1] 368s 0.996 -0.268 0.019 0.486 368s -0.339 0.465 0.420 -0.783 368s 368s y: [2, 2, 1, 1] 368s 0.407 -0.533 368s -0.762 0.805 368s testing: mul_mat, [4, 2, 4, 1] = [4, 4, 4, 1] * [4, 2, 4, 1] 368s x0: [4, 4, 4, 1] 368s -0.446 0.077 -0.107 -0.498 368s -0.969 -0.352 -0.464 0.575 368s -0.208 0.602 0.508 -0.252 368s -0.335 -0.827 0.723 0.213 368s 368s x1: [4, 2, 4, 1] 368s 0.889 0.354 0.032 -0.030 368s 0.038 0.947 -0.046 0.754 368s 368s y: [4, 2, 4, 1] 368s -0.357 -1.018 0.053 -0.574 368s -0.314 0.085 0.349 -0.668 368s testing: mul_mat, [3, 4, 4, 1] = [4, 3, 4, 1] * [4, 4, 4, 1] 368s x0: [4, 3, 4, 1] 368s 0.987 0.465 0.360 -0.199 368s 0.598 0.774 -0.323 -0.885 368s -0.798 0.505 0.517 -0.908 368s 368s x1: [4, 4, 4, 1] 368s -0.151 0.452 -0.979 0.476 368s -0.527 -0.712 -0.105 0.096 368s -0.936 0.062 -0.616 -0.987 368s -0.048 -0.115 0.462 0.004 368s 368s y: [3, 4, 4, 1] 368s -0.386 0.155 -0.590 368s -0.908 -0.917 -0.080 368s -0.920 0.560 1.355 368s 0.064 -0.271 0.216 368s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 368s x0: [4, 1, 1, 1] 368s 0.433 0.895 0.279 -0.482 368s 368s x1: [4, 3, 1, 1] 368s -0.536 -0.637 -0.802 -0.553 368s 0.225 0.374 0.178 -0.007 368s -0.826 0.772 0.709 0.041 368s 368s y: [1, 3, 1, 1] 368s -0.759 368s 0.486 368s 0.511 368s testing: mul_mat, [3, 4, 4, 1] = [1, 3, 4, 1] * [1, 4, 4, 1] 368s x0: [1, 3, 4, 1] 368s 0.587 368s -0.828 368s 0.105 368s 368s x1: [1, 4, 4, 1] 368s 0.549 368s 0.364 368s 0.376 368s 0.881 368s 368s y: [3, 4, 4, 1] 368s 0.323 -0.455 0.058 368s 0.214 -0.301 0.038 368s 0.221 -0.311 0.040 368s 0.517 -0.729 0.093 368s testing: mul_mat, [4, 1, 4, 1] = [3, 4, 4, 1] * [3, 1, 4, 1] 368s x0: [3, 4, 4, 1] 368s -0.096 0.782 0.671 368s -0.165 -0.434 -0.602 368s -0.165 0.632 0.120 368s -0.955 0.954 -0.050 368s 368s x1: [3, 1, 4, 1] 368s -0.389 -0.175 0.764 368s 368s y: [4, 1, 4, 1] 368s 0.413 -0.320 0.045 0.167 368s test-mul-mat0: iter:494/500 368s testing: mul_mat, [4, 2, 1, 1] = [1, 4, 1, 1] * [1, 2, 1, 1] 368s x0: [1, 4, 1, 1] 368s -0.504 368s -0.088 368s 0.691 368s 0.705 368s 368s x1: [1, 2, 1, 1] 368s -0.895 368s -0.054 368s 368s y: [4, 2, 1, 1] 368s 0.451 0.079 -0.618 -0.631 368s 0.027 0.005 -0.037 -0.038 368s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 368s x0: [1, 3, 1, 1] 368s 0.564 368s 0.345 368s -0.336 368s 368s x1: [1, 4, 1, 1] 368s 0.105 368s -0.802 368s -0.783 368s -0.798 368s 368s y: [3, 4, 1, 1] 368s 0.059 0.036 -0.035 368s -0.452 -0.276 0.270 368s -0.442 -0.270 0.263 368s -0.450 -0.275 0.268 368s testing: mul_mat, [1, 3, 1, 1] = [1, 1, 1, 1] * [1, 3, 1, 1] 368s x0: [1, 1, 1, 1] 368s 0.291 368s 368s x1: [1, 3, 1, 1] 368s 0.477 368s -0.881 368s 0.742 368s 368s y: [1, 3, 1, 1] 368s 0.139 368s -0.256 368s 0.216 368s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 368s x0: [1, 4, 1, 1] 368s 0.398 368s -0.869 368s 0.402 368s 0.773 368s 368s x1: [1, 1, 1, 1] 368s 0.939 368s 368s y: [4, 1, 1, 1] 368s 0.374 -0.816 0.378 0.726 368s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 368s x0: [4, 1, 1, 1] 368s 0.131 0.189 -0.331 0.627 368s 368s x1: [4, 1, 1, 1] 368s -0.377 -0.307 0.026 -0.758 368s 368s y: [1, 1, 1, 1] 368s -0.591 368s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 368s x0: [1, 3, 1, 1] 368s 0.549 368s 0.408 368s -0.902 368s 368s x1: [1, 4, 1, 1] 368s -0.900 368s -0.640 368s 0.332 368s 0.205 368s 368s y: [3, 4, 1, 1] 368s -0.494 -0.367 0.812 368s -0.351 -0.261 0.578 368s 0.182 0.135 -0.300 368s 0.113 0.084 -0.185 368s test-mul-mat0: iter:495/500 368s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 368s x0: [4, 4, 1, 1] 368s 0.248 0.560 -0.427 -0.350 368s 0.333 0.197 0.343 -0.641 368s 0.439 -0.423 0.490 -0.372 368s 0.246 0.117 -0.272 0.606 368s 368s x1: [4, 1, 1, 1] 368s -0.129 -0.497 0.651 -0.838 368s 368s y: [4, 1, 1, 1] 368s -0.295 0.620 0.784 -0.775 368s testing: mul_mat, [2, 4, 4, 1] = [4, 2, 4, 1] * [4, 4, 4, 1] 368s x0: [4, 2, 4, 1] 368s -0.189 0.055 -0.713 0.357 368s -0.040 0.760 0.603 0.829 368s 368s x1: [4, 4, 4, 1] 368s -0.551 0.934 -0.836 0.998 368s 0.341 -0.738 0.111 -0.906 368s 0.024 0.701 -0.035 0.527 368s 0.352 0.127 0.970 -0.400 368s 368s y: [2, 4, 4, 1] 368s 1.108 1.055 368s -0.508 -1.259 368s 0.247 0.947 368s -0.894 0.335 368s testing: mul_mat, [3, 2, 4, 2] = [4, 3, 4, 2] * [4, 2, 4, 2] 368s x0: [4, 3, 4, 2] 368s 0.402 0.806 0.619 -0.421 368s 0.662 0.456 -0.390 0.268 368s -0.774 -0.998 0.866 -0.224 368s 368s x1: [4, 2, 4, 2] 368s -0.759 0.500 -0.269 0.200 368s 0.260 -0.666 0.029 -0.980 368s 368s y: [3, 2, 4, 2] 368s -0.153 -0.116 -0.190 368s -0.002 -0.405 0.709 368s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 368s x0: [4, 4, 1, 1] 368s -0.956 -0.632 -0.161 -0.415 368s 0.367 0.880 0.972 0.683 368s 0.790 0.262 -0.603 -0.549 368s -0.235 0.330 -0.890 -0.806 368s 368s x1: [4, 3, 1, 1] 368s -0.722 0.149 0.068 0.229 368s 0.722 -0.593 -0.538 0.012 368s -0.419 0.945 -0.030 0.579 368s 368s y: [4, 3, 1, 1] 368s 0.490 0.088 -0.697 -0.026 368s -0.234 -0.772 0.733 0.103 368s -0.432 1.044 -0.383 -0.030 368s testing: mul_mat, [3, 4, 4, 1] = [4, 3, 4, 1] * [4, 4, 4, 1] 368s x0: [4, 3, 4, 1] 368s 0.729 -0.907 0.449 -0.233 368s 0.464 -0.625 0.581 -0.340 368s 0.719 -0.161 -0.934 0.875 368s 368s x1: [4, 4, 4, 1] 368s 0.109 -0.761 -0.528 -0.742 368s 0.307 0.701 0.980 0.713 368s -0.837 -0.008 -0.706 -0.892 368s 0.962 0.874 -0.419 -0.994 368s 368s y: [3, 4, 4, 1] 368s 0.705 0.472 0.043 368s -0.138 0.031 -0.183 368s -0.712 -0.490 -0.723 368s -0.048 -0.005 0.071 368s testing: mul_mat, [1, 4, 4, 2] = [3, 1, 4, 2] * [3, 4, 4, 2] 368s x0: [3, 1, 4, 2] 368s -0.286 0.212 0.611 368s 368s x1: [3, 4, 4, 2] 368s -0.523 0.161 0.108 368s -0.515 0.607 0.079 368s 0.215 -0.225 -0.731 368s -0.681 0ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 368s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 368s .449 -0.197 368s 368s y: [1, 4, 4, 2] 368s 0.250 368s 0.324 368s -0.556 368s 0.169 368s test-mul-mat0: iter:496/500 368s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 368s x0: [4, 3, 1, 1] 368s -0.336 -0.698 -0.726 -0.180 368s -0.261 0.918 -0.675 0.124 368s -0.423 0.693 -0.584 0.629 368s 368s x1: [4, 3, 1, 1] 368s 0.209 -0.734 0.065 0.923 368s 0.478 -0.323 -0.674 -0.230 368s 0.864 -0.279 0.143 -0.169 368s 368s y: [3, 3, 1, 1] 368s 0.228 -0.658 -0.055 368s 0.595 0.005 -0.177 368s -0.169 -0.599 -0.749 368s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 368s x0: [4, 1, 1, 1] 368s 0.428 -0.408 0.407 -0.429 368s 368s x1: [4, 3, 1, 1] 368s 0.899 0.978 -0.532 0.228 368s -0.085 -0.014 -0.563 0.181 368s -0.948 -0.640 -0.342 -0.272 368s 368s y: [1, 3, 1, 1] 368s -0.329 368s -0.338 368s -0.168 368s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 368s x0: [4, 1, 1, 1] 368s -0.945 -0.536 0.895 -0.620 368s 368s x1: [4, 1, 1, 1] 368s 0.423 0.702 0.235 0.725 368s 368s y: [1, 1, 1, 1] 368s -1.015 368s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 368s x0: [4, 2, 1, 1] 368s 0.971 -0.431 -0.103 0.636 368s 0.982 0.199 0.555 -0.922 368s 368s x1: [4, 1, 1, 1] 368s 0.588 -0.528 -0.928 -0.996 368s 368s y: [2, 1, 1, 1] 368s 0.260 0.876 368s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 368s x0: [2, 2, 1, 1] 368s -0.650 0.971 368s 0.792 -0.925 368s 368s x1: [2, 4, 1, 1] 368s 0.607 0.996 368s -0.264 -0.665 368s -0.318 -0.836 368s -0.073 -0.910 368s 368s y: [2, 4, 1, 1] 368s 0.572 -0.441 368s -0.474 0.406 368s -0.605 0.522 368s -0.836 0.784 368s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 368s x0: [2, 3, 1, 1] 368s -0.873 -0.869 368s -0.863 -0.762 368s -0.522 -0.551 368s 368s x1: [2, 2, 1, 1] 368s 0.769 -0.974 368s -0.461 0.665 368s 368s y: [3, 2, 1, 1] 368s 0.175 0.079 0.135 368s -0.175 -0.108 -0.125 368s test-mul-mat0: iter:497/500 368s testing: mul_mat, [4, 4, 1, 1] = [4, 4, 1, 1] * [4, 4, 1, 1] 368s x0: [4, 4, 1, 1] 368s -0.280 -0.806 0.279 0.125 368s -0.680 0.416 0.603 -0.549 368s 0.654 -0.949 -0.436 -0.539 368s 0.699 0.573 -0.177 -0.017 368s 368s x1: [4, 4, 1, 1] 368s -0.638 0.284 0.087 0.969 368s 0.280 0.823 -0.696 0.963 368s 0.988 0.231 -0.948 -0.277 368s 0.580 0.845 -0.306 0.655 368s 368s y: [4, 4, 1, 1] 368s 0.095 0.072 -1.247 -0.315 368s -0.816 -0.796 -0.814 0.775 368s -0.761 -0.994 0.989 0.995 368s -0.847 -0.587 -0.642 0.933 368s testing: mul_mat, [4, 4, 2, 1] = [4, 4, 2, 1] * [4, 4, 2, 1] 368s x0: [4, 4, 2, 1] 368s -0.666 0.809 0.256 -0.440 368s -0.719 0.582 -0.161 0.476 368s 0.489 -0.038 0.366 0.050 368s -0.301 0.975 -0.195 0.715 368s 368s x1: [4, 4, 2, 1] 368s -0.339 -0.208 -0.737 -0.516 368s 0.096 -0.774 -0.528 -0.674 368s -0.722 0.195 0.907 -0.877 368s 0.889 0.562 0.737 -0.391 368s 368s y: [4, 4, 2, 1] 368s 0.096 -0.004 -0.453 -0.326 368s -0.529 -0.755 -0.151 -1.162 368s 1.256 0.069 -0.073 -0.396 368s 0.223 -0.618 0.663 -0.143 368s testing: mul_mat, [2, 4, 2, 2] = [4, 2, 2, 2] * [4, 4, 2, 2] 368s x0: [4, 2, 2, 2] 368s 0.635 -0.614 0.462 0.491 368s -0.966 0.180 -0.234 0.365 368s 368s x1: [4, 4, 2, 2] 368s -0.703 0.562 -0.514 -0.422 368s 0.144 0.325 -0.946 -0.367 368s -0.714 0.420 0.683 -0.014 368s 0.394 -0.512 -0.299 0.103 368s 368s y: [2, 4, 2, 2] 368s -1.236 0.746 368s -0.725 0.007 368s -0.402 0.599 368s 0.477 -0.365 368s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 368s x0: [4, 1, 1, 1] 368s 0.092 -0.687 -0.757 0.465 368s 368s x1: [4, 2, 1, 1] 368s -0.492 0.960 -0.167 -0.458 368s 0.140 0.599 0.907 0.024 368s 368s y: [1, 2, 1, 1] 368s -0.792 368s -1.075 368s testing: mul_mat, [3, 4, 2, 1] = [1, 3, 2, 1] * [1, 4, 2, 1] 368s x0: [1, 3, 2, 1] 368s -0.967 368s 0.793 368s -0.474 368s 368s x1: [1, 4, 2, 1] 368s 0.853 368s 0.373 368s 0.804 368s -0.276 368s 368s y: [3, 4, 2, 1] 368s -0.824 0.676 -0.404 368s -0.361 0.296 -0.177 368s -0.777 0.637 -0.381 368s 0.267 -0.219 0.131 368s testing: mul_mat, [2, 1, 2, 2] = [3, 2, 2, 2] * [3, 1, 2, 2] 368s x0: [3, 2, 2, 2] 368s 0.195 -0.582 -0.340 368s 0.287 0.048 -0.778 368s 368s x1: [3, 1, 2, 2] 368s -0.095 -0.242 0.576 368s 368s y: [2, 1, 2, 2] 368s -0.074 -0.487 368s test-mul-mat0: iter:498/500 368s testing: mul_mat, [1, 4, 1, 1] = [3, 1, 1, 1] * [3, 4, 1, 1] 368s x0: [3, 1, 1, 1] 368s -0.414 -0.033 -0.655 368s 368s x1: [3, 4, 1, 1] 368s 0.186 -0.942 -0.542 368s -0.619 0.345 -0.124 368s 0.429 -0.995 0.098 368s 0.201 0.671 -0.293 368s 368s y: [1, 4, 1, 1] 368s 0.309 368s 0.326 368s -0.209 368s 0.087 368s testing: mul_mat, [4, 1, 3, 1] = [3, 4, 3, 1] * [3, 1, 3, 1] 368s x0: [3, 4, 3, 1] 368s -0.623 0.326 -0.265 368s 0.391 -0.169 0.920 368s 0.450 0.289 -0.699 368s -0.205 -0.836 0.730 368s 368s x1: [3, 1, 3, 1] 368s 0.294 0.754 0.717 368s 368s y: [4, 1, 3, 1] 368s -0.128 0.647 -0.152 -0.167 368s testing: mul_mat, [3, 4, 3, 3] = [3, 3, 3, 3] * [3, 4, 3, 3] 368s x0: [3, 3, 3, 3] 368s -0.168 0.395 -0.083 368s -0.996 -0.326 0.380 368s -0.553 0.716 -0.592 368s 368s x1: [3, 4, 3, 3] 368s 0.687 -0.947 0.872 368s 0.988 -0.152 -0.964 368s 0.718 0.648 0.299 368s -0.351 -0.881 0.269 368s 368s y: [3, 4, 3, 3] 368s -0.562 -0.045 -1.574 368s -0.147 -1.301 -0.085 368s 0.110 -0.813 -0.110 368s -0.311 0.738 -0.596 368s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 368s x0: [3, 3, 1, 1] 368s -0.494 -0.352 0.064 368s -0.305 0.945 0.394 368s -0.511 0.853 0.905 368s 368s x1: [3, 3, 1, 1] 368s -0.262 -0.188 -0.620 368s 0.001 0.626 0.736 368s -0.519 0.963 -0.735 368s 368s y: [3, 3, 1, 1] 368s 0.156 -0.343 -0.588 368s -0.174 0.882 1.199 368s -0.130 0.779 0.422 368s testing: mul_mat, [1, 3, 3, 1] = [3, 1, 3, 1] * [3, 3, 3, 1] 368s x0: [3, 1, 3, 1] 368s -0.603 0.406 0.957 368s 368s x1: [3, 3, 3, 1] 368s 0.469 0.451 -0.420 368s -0.207 -0.842 0.891 368s -0.644 -0.571 0.747 368s 368s y: [1, 3, 3, 1] 368s -0.502 368s 0.636 368s 0.872 368s testing: mul_mat, [1, 3, 3, 3] = [1, 1, 3, 3] * [1, 3, 3, 3] 368s x0: [1, 1, 3, 3] 368s 0.154 368s 368s x1: [1, 3, 3, 3] 368s 0.015 368s 0.445 368s -0.825 368s 368s y: [1, 3, 3, 3] 368s 0.002 368s 0.069 368s -0.127 368s test-mul-mat0: iter:499/500 368s testing: mul_mat, [1, 2, 1, 1] = [1, 1, 1, 1] * [1, 2, 1, 1] 368s x0: [1, 1, 1, 1] 368s 0.097 368s 368s x1: [1, 2, 1, 1] 368s -0.115 368s -0.391 368s 368s y: [1, 2, 1, 1] 368s -0.011 368s -0.038 368s testing: mul_mat, [4, 1, 4, 1] = [1, 4, 4, 1] * [1, 1, 4, 1] 368s x0: [1, 4, 4, 1] 368s -0.776 368s 0.796 368s 0.573 368s -0.148 368s 368s x1: [1, 1, 4, 1] 368s 0.623 368s 368s y: [4, 1, 4, 1] 368s -0.483 0.496 0.357 -0.092 368s testing: mul_mat, [4, 4, 4, 2] = [1, 4, 4, 2] * [1, 4, 4, 2] 368s x0: [1, 4, 4, 2] 368s -0.434 368s 0.259 368s 0.574 368s 0.023 368s 368s x1: [1, 4, 4, 2] 368s -0.122 368s 0.144 368s 0.287 368s 0.732 368s 368s y: [4, 4, 4, 2] 368s 0.053 -0.032 -0.070 -0.003 368s -0.062 0.037 0.082 0.003 368s -0.124 0.074 0.165 0.007 368s -0.318 0.190 0.420 0.017 368s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 368s x0: [1, 3, 1, 1] 368s 0.799 368s -0.840 368s -0.539 368s 368s x1: [1, 4, 1, 1] 368s -0.308 368s 0.534 368s -0.057 368s -0.431 368s 368s y: [3, 4, 1, 1] 368s -0.246 0.258 0.166 368s 0.426 -0.449 -0.288 368s -0.045 0.048 0.031 368s -0.344 0.362 0.232 368s testing: mul_mat, [4, 1, 4, 1] = [3, 4, 4, 1] * [3, 1, 4, 1] 368s x0: [3, 4, 4, 1] 368s 0.616 0.929 -0.493 368s -0.978 -0.453 -0.035 368s -0.949 0.044 -0.946 368s 0.691 0.362 -0.959 368s 368s x1: [3, 1, 4, 1] 368s 0.778 0.458 -0.860 368s 368s y: [4, 1, 4, 1] 368s 1.328 -0.937 0.095 1.527 368s testing: mul_mat, [4, 3, 4, 2] = [4, 4, 4, 2] * [4, 3, 4, 2] 368s x0: [4, 4, 4, 2] 368s -0.098 0.579 -0.110 0.879 368s -0.433 0.923 -0.015 0.200 368s 0.544 -0.770 0.666 0.494 368s -0.418 0.165 0.131 -0.611 368s 368s x1: [4, 3, 4, 2] 368s 0.187 -0.115 0.229 0.655 368s -0.346 -0.844 -0.215 0.320 368s 0.784 -0.766 0.291 -0.746 368s 368s y: [4, 3, 4, 2] 368s 0.466 -0.060 0.667 -0.468 368s -0.150 -0.562 0.476 -0.218 368s -1.208 -1.200 0.841 0.040 392s compression: 0.000000 392s convert time: 0.000000 ms / method = 0 392s -0.040188 0.785085 392s 0.405617 0.183192 392s 0.016901 0.564737 392s 0.001560 0.452003 392s -0.111647 -0.158779 392s 0.602449 0.452541 392s 0.464777 0.596264 392s 0.031770 0.389379 392s 0.522225 -0.094984 392s 0.246030 0.173205 392s 0.322603 -0.063522 392s 0.171129 0.240454 392s 0.435216 0.577889 392s 0.286599 -0.102425 392s -0.152230 0.570174 392s -0.116195 0.552043 392s main: elapsed ticks: 23889053 392s main: elapsed us: 24022316 / 24022.316000 ms 392s 135.421090 392s Testing 1 devices 392s 392s Backend 1/1: CPU 392s Device description: CPU 392s Device memory: 3996 MB (3996 MB free) 392s 392s test_dataset(shuffle=no, ndata_shard=1, ndata_batch=1): OK 392s test_dataset(shuffle=no, ndata_shard=1, ndata_batch=2): OK 392s test_dataset(shuffle=no, ndata_shard=1, ndata_batch=3): OK 392s test_dataset(shuffle=no, ndata_shard=1, ndata_batch=4): OK 392s test_dataset(shuffle=no, ndata_shard=1, ndata_batch=5): OK 392s test_dataset(shuffle=no, ndata_shard=1, ndata_batch=6): OK 392s test_dataset(shuffle=no, ndata_shard=2, ndata_batch=2): OK 392s test_dataset(shuffle=no, ndata_shard=2, ndata_batch=4): OK 392s test_dataset(shuffle=no, ndata_shard=2, ndata_batch=6): OK 392s test_dataset(shuffle=no, ndata_shard=3, ndata_batch=3): OK 392s test_dataset(shuffle=no, ndata_shard=3, ndata_batch=6): OK 392s test_dataset(shuffle=no, ndata_shard=4, ndata_batch=4): OK 392s test_dataset(shuffle=no, ndata_shard=5, ndata_batch=5): OK 392s test_dataset(shuffle=no, ndata_shard=6, ndata_batch=6): OK 392s test_dataset(shuffle=yes, ndata_shard=1, ndata_batch=1): OK 392s test_dataset(shuffle=yes, ndata_shard=1, ndata_batch=2): OK 392s test_dataset(shuffle=yes, ndata_shard=1, ndata_batch=3): OK 392s test_dataset(shuffle=yes, ndata_shard=1, ndata_batch=4): OK 392s test_dataset(shuffle=yes, ndata_shard=1, ndata_batch=5): OK 392s test_dataset(shuffle=yes, ndata_shard=1, ndata_batch=6): OK 392s test_dataset(shuffle=yes, ndata_shard=2, ndata_batch=2): OK 392s test_dataset(shuffle=yes, ndata_shard=2, ndata_batch=4): OK 392s test_dataset(shuffle=yes, ndata_shard=2, ndata_batch=6): OK 392s test_dataset(shuffle=yes, ndata_shard=3, ndata_batch=3): OK 392s test_dataset(shuffle=yes, ndata_shard=3, ndata_batch=6): OK 392s test_dataset(shuffle=yes, ndata_shard=4, ndata_batch=4): OK 392s test_dataset(shuffle=yes, ndata_shard=5, ndata_batch=5): OK 392s test_dataset(shuffle=yes, ndata_shard=6, ndata_batch=6): OK 392s test_grad(): OK 392s test_forward_backward(high_level=no, shuffle=no, subtest=results_initial): OK 392s test_forward_backward(high_level=no, shuffle=no, subtest=weights_after_forward): OK 392s test_forward_backward(high_level=no, shuffle=no, subtest=results_after_forward): OK 392s test_forward_backward(high_level=no, shuffle=no, subtest=weights_after_forward_backward): OK 392s test_forward_backward(high_level=no, shuffle=no, subtest=result_after_forward_backward): OK 392s test_forward_backward(high_level=yes, shuffle=no, subtest=results_initial): OK 392s test_forward_backward(high_level=yes, shuffle=no, subtest=weights_after_forward): OK 392s test_forward_backward(high_level=yes, shuffle=no, subtest=results_after_forward): OK 392s test_forward_backward(high_level=yes, shuffle=no, subtest=weights_after_forward_backward): OK 392s test_forward_backward(high_level=yes, shuffle=no, subtest=result_after_forward_backward): OK 392s test_forward_backward(high_level=yes, shuffle=yes, subtest=results_initial): OK 392s test_forward_backward(high_level=yes, shuffle=yes, subtest=weights_after_forward): OK 392s test_forward_backward(high_level=yes, shuffle=yes, subtest=results_after_forward): OK 392s test_forward_backward(high_level=yes, shuffle=yes, subtest=weights_after_forward_backward): OK 392s test_forward_backward(high_level=yes, shuffle=yes, subtest=result_after_forward_backward): OK 392s test_epoch_vs_fit(): OK 392s test_idata_split(high_level=no, epoch=1, subtest=weights): OK 392s test_idata_split(high_level=no, epoch=1, subtest=results_backward): OK 392s test_idata_split(high_level=no, epoch=1, subtest=results_forward): OK 392s test_idata_split(high_level=no, epoch=2, subtest=weights): OK 392s test_idata_split(high_level=no, epoch=2, subtest=results_backward): OK 392s test_idata_split(high_level=no, epoch=2, subtest=results_forward): OK 392s test_idata_split(high_level=no, epoch=3, subtest=weights): OK 392s test_idata_split(high_level=no, epoch=3, subtest=results_backward): OK 392s test_idata_split(high_level=no, epoch=3, subtest=results_forward): OK 392s test_idata_split(high_level=no, epoch=4, subtest=weights): OK 392s test_idata_split(high_level=no, epoch=4, subtest=results_backward): OK 392s test_idata_split(high_level=no, epoch=4, subtest=results_forward): OK 392s test_idata_split(high_level=yes, epoch=1, subtest=weights): OK 392s test_idata_split(high_level=yes, epoch=1, subtest=results_backward): OK 392s test_idata_split(high_level=yes, epoch=1, subtest=results_forward): OK 392s test_idata_split(high_level=yes, epoch=2, subtest=weights): OK 392s test_idata_split(high_level=yes, epoch=2, subtest=results_backward): OK 392s test_idata_split(high_level=yes, epoch=2, subtest=results_forward): OK 392s test_idata_split(high_level=yes, epoch=3, subtest=weights): OK 392s test_idata_split(high_level=yes, epoch=3, subtest=results_backward): OK 392s test_idata_split(high_level=yes, epoch=3, subtest=results_forward): OK 392s test_idata_split(high_level=yes, epoch=4, subtest=weights): OK 392s test_idata_split(high_level=yes, epoch=4, subtest=results_backward): OK 392s test_idata_split(high_level=yes, epoch=4, subtest=results_forward): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=sum, epoch=1, subtest=grads): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=sum, epoch=1, subtest=weights): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=sum, epoch=1, subtest=results): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=sum, epoch=2, subtest=grads): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=sum, epoch=2, subtest=weights): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=sum, epoch=2, subtest=results): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=sum, epoch=3, subtest=grads): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=sum, epoch=3, subtest=weights): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=sum, epoch=3, subtest=results): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=sum, epoch=4, subtest=grads): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=sum, epoch=4, subtest=weights): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=sum, epoch=4, subtest=results): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=mean, epoch=1, subtest=grads): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=mean, epoch=1, subtest=weights): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=mean, epoch=1, subtest=results): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=mean, epoch=2, subtest=grads): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=mean, epoch=2, subtest=weights): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=mean, epoch=2, subtest=results): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=mean, epoch=3, subtest=grads): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=mean, epoch=3, subtest=weights): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=mean, epoch=3, subtest=results): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=mean, epoch=4, subtest=grads): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=mean, epoch=4, subtest=weights): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=mean, epoch=4, subtest=results): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=sum, epoch=1, subtest=grads): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=sum, epoch=1, subtest=weights): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=sum, epoch=1, subtest=results): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=sum, epoch=2, subtest=grads): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=sum, epoch=2, subtest=weights): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=sum, epoch=2, subtest=results): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=sum, epoch=3, subtest=grads): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=sum, epoch=3, subtest=weights): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=sum, epoch=3, subtest=results): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=sum, epoch=4, subtest=grads): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=sum, epoch=4, subtest=weights): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=sum, epoch=4, subtest=results): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=mean, epoch=1, subtest=grads): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=mean, epoch=1, subtest=weights): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=mean, epoch=1, subtest=results): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=mean, epoch=2, subtest=grads): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=mean, epoch=2, subtest=weights): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=mean, epoch=2, subtest=results): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=mean, epoch=3, subtest=grads): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=mean, epoch=3, subtest=weights): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=mean, epoch=3, subtest=results): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=mean, epoch=4, subtest=grads): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=mean, epoch=4, subtest=weights): OK 392s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=mean, epoch=4, subtest=results): OK 392s test_regression(subtest=weights): OK 392s 118/118 tests passed 392s Backend CPU: OK 392s 392s 1/1 backends passed 392s OK 392s test_pad_reflect_1d: using CPU backend 526s Testing f32 526s Testing f16 526s Testing q4_0 526s Testing q4_1 526s Testing q5_0 526s Testing q5_1 526s Testing q8_0 526s q8_0 reference implementation error: FAILED (0.000175) 526s Testing q8_1 526s Testing q2_K 526s Testing q3_K 526s Testing q4_K 526s Testing q5_K 526s Testing q6_K 526s Testing q8_K 526s Testing iq2_xxs 526s Testing iq2_xs 526s Testing iq3_xxs 526s Testing iq1_s 526s Testing iq4_nl 526s Testing iq3_s 526s Testing iq2_s 526s Testing iq4_xs 526s Testing i8 526s Testing i16 526s Testing i32 526s Testing i64 526s Testing f64 526s Testing iq1_m 526s Testing bf16 526s Testing tq1_0 526s Testing tq2_0 526s 1 tests failed 526s f16 526s quantize_row_q_reference 526s 4096 values (0.02 MB) 526s min cycles/32 vals : 0.00 526s avg cycles/32 vals : 0.00 526s float32 throughput : 0.32 GB/s 526s quantized throughput : 0.16 GB/s 526s 526s quantize_row_q 526s 4096 values (0.02 MB) 526s min cycles/32 vals : 0.00 526s avg cycles/32 vals : 0.00 526s float32 throughput : 0.32 GB/s 526s quantized throughput : 0.16 GB/s 526s 526s dequantize_row_q 526s 4096 values (0.02 MB) 526s min cycles/32 vals : 0.00 526s avg cycles/32 vals : 0.00 526s float32 throughput : 6.94 GB/s 526s quantized throughput : 3.47 GB/s 526s 526s quantize_row_q_dot 526s 4096 values (0.02 MB) 526s min cycles/32 vals : 0.00 526s avg cycles/32 vals : 0.00 526s float32 throughput : 0.32 GB/s 526s quantized throughput : 0.16 GB/s 526s 526s vec_dot_q 526s 4096 values (0.02 MB) 526s min cycles/32 vals : 0.00 526s avg cycles/32 vals : 0.00 526s float32 throughput : 1.86 GB/s 526s quantized throughput : 0.93 GB/s 526s 526s q4_0 526s quantize_row_q_reference 526s 4096 values (0.02 MB) 526s min cycles/32 vals : 0.00 526s avg cycles/32 vals : 0.00 526s float32 throughput : 1.06 GB/s 526s quantized throughput : 0.15 GB/s 526s 526s quantize_row_q 526s 4096 values (0.02 MB) 526s min cycles/32 vals : 0.00 526s avg cycles/32 vals : 0.00 526s float32 throughput : 0.72 GB/s 526s quantized throughput : 0.10 GB/s 526s 526s dequantize_row_q 526s 4096 values (0.02 MB) 526s min cycles/32 vals : 0.00 526s avg cycles/32 vals : 0.00 526s float32 throughput : 2.42 GB/s 526s quantized throughput : 0.34 GB/s 526s 526s quantize_row_q_dot 526s 4096 values (0.02 MB) 526s min cycles/32 vals : 0.00 526s avg cycles/32 vals : 0.00 526s float32 throughput : 1.86 GB/s 526s quantized throughput : 0.26 GB/s 526s 526s vec_dot_q 526s 4096 values (0.02 MB) 526s min cycles/32 vals : 0.00 526s avg cycles/32 vals : 0.00 526s float32 throughput : 38.15 GB/s 526s quantized throughput : 5.36 GB/s 526s 526s q4_1 526s quantize_row_q_reference 526s 4096 values (0.02 MB) 526s min cycles/32 vals : 0.00 526s avg cycles/32 vals : 0.00 526s float32 throughput : 0.83 GB/s 526s quantized throughput : 0.13 GB/s 526s 526s quantize_row_q 526s 4096 values (0.02 MB) 526s min cycles/32 vals : 0.00 526s avg cycles/32 vals : 0.00 526s float32 throughput : 0.83 GB/s 526s quantized throughput : 0.13 GB/s 526s 526s dequantize_row_q 526s 4096 values (0.02 MB) 526s min cycles/32 vals : 0.00 526s avg cycles/32 vals : 0.00 526s float32 throughput : 3.55 GB/s 526s quantized throughput : 0.55 GB/s 526s 526s quantize_row_q_dot 526s 4096 values (0.02 MB) 526s min cycles/32 vals : 0.00 526s avg cycles/32 vals : 0.00 526s float32 throughput : 1.75 GB/s 526s quantized throughput : 0.27 GB/s 526s 526s vec_dot_q 526s 4096 values (0.02 MB) 526s min cycles/32 vals : 0.00 526s avg cycles/32 vals : 0.00 526s float32 throughput : 30.52 GB/s 526s quantized throughput : 4.77 GB/s 526s 526s q5_0 526s quantize_row_q_reference 526s 4096 values (0.02 MB) 526s min cycles/32 vals : 0.00 526s avg cycles/32 vals : 0.00 526s float32 throughput : 0.87 GB/s 526s quantized throughput : 0.15 GB/s 526s 526s quantize_row_q 526s 4096 values (0.02 MB) 526s min cycles/32 vals : 0.00 526s avg cycles/32 vals : 0.00 526s float32 throughput : 1.15 GB/s 526s quantized throughput : 0.20 GB/s 526s 526s dequantize_row_q 526s 4096 values (0.02 MB) 526s min cycles/32 vals : 0.00 526s avg cycles/32 vals : 0.00 526s float32 throughput : 2.42 GB/s 526s quantized throughput : 0.42 GB/s 526s 526s quantize_row_q_dot 526s 4096 values (0.02 MB) 526s min cycles/32 vals : 0.00 526s avg cycles/32 vals : 0.00 526s float32 throughput : 1.88 GB/s 526s quantized throughput : 0.32 GB/s 526s 526s vec_dot_q 526s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 1.96 GB/s 527s quantized throughput : 0.34 GB/s 527s 527s q5_1 527s quantize_row_q_reference 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 0.71 GB/s 527s quantized throughput : 0.13 GB/s 527s 527s quantize_row_q 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 0.59 GB/s 527s quantized throughput : 0.11 GB/s 527s 527s dequantize_row_q 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 3.32 GB/s 527s quantized throughput : 0.62 GB/s 527s 527s quantize_row_q_dot 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 2.15 GB/s 527s quantized throughput : 0.40 GB/s 527s 527s vec_dot_q 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 2.38 GB/s 527s quantized throughput : 0.45 GB/s 527s 527s q8_0 527s quantize_row_q_reference 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 0.90 GB/s 527s quantized throughput : 0.24 GB/s 527s 527s quantize_row_q 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 1.19 GB/s 527s quantized throughput : 0.32 GB/s 527s 527s dequantize_row_q 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 4.77 GB/s 527s quantized throughput : 1.27 GB/s 527s 527s quantize_row_q_dot 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 1.84 GB/s 527s quantized throughput : 0.49 GB/s 527s 527s vec_dot_q 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 38.15 GB/s 527s quantized throughput : 10.13 GB/s 527s 527s q2_K 527s quantize_row_q_reference 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 0.00 GB/s 527s quantized throughput : 0.00 GB/s 527s 527s quantize_row_q 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 0.00 GB/s 527s quantized throughput : 0.00 GB/s 527s 527s dequantize_row_q 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 3.11 GB/s 527s quantized throughput : 0.26 GB/s 527s 527s quantize_row_q_dot 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 1.66 GB/s 527s quantized throughput : 0.14 GB/s 527s 527s vec_dot_q 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 2.31 GB/s 527s quantized throughput : 0.19 GB/s 527s 527s q3_K 527s quantize_row_q_reference 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 0.28 GB/s 527s quantized throughput : 0.03 GB/s 527s 527s quantize_row_q 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 0.28 GB/s 527s quantized throughput : 0.03 GB/s 527s 527s dequantize_row_q 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 1.19 GB/s 527s quantized throughput : 0.13 GB/s 527s 527s quantize_row_q_dot 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 1.66 GB/s 527s quantized throughput : 0.18 GB/s 527s 527s vec_dot_q 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 15.26 GB/s 527s quantized throughput : 1.64 GB/s 527s 527s q4_K 527s quantize_row_q_reference 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 0.01 GB/s 527s quantized throughput : 0.00 GB/s 527s 527s quantize_row_q 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 0.00 GB/s 527s quantized throughput : 0.00 GB/s 527s 527s dequantize_row_q 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 3.32 GB/s 527s quantized throughput : 0.47 GB/s 527s 527s quantize_row_q_dot 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 1.64 GB/s 527s quantized throughput : 0.23 GB/s 527s 527s vec_dot_q 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 30.52 GB/s 527s quantized throughput : 4.29 GB/s 527s 527s q5_K 527s quantize_row_q_reference 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 0.01 GB/s 527s quantized throughput : 0.00 GB/s 527s 527s quantize_row_q 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 0.00 GB/s 527s quantized throughput : 0.00 GB/s 527s 527s dequantize_row_q 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 1.61 GB/s 527s quantized throughput : 0.28 GB/s 527s 527s quantize_row_q_dot 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 1.64 GB/s 527s quantized throughput : 0.28 GB/s 527s 527s vec_dot_q 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 30.52 GB/s 527s quantized throughput : 5.25 GB/s 527s 527s q6_K 527s quantize_row_q_reference 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 0.00 GB/s 527s quantized throughput : 0.00 GB/s 527s 527s quantize_row_q 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 0.05 GB/s 527s quantized throughput : 0.01 GB/s 527s 527s dequantize_row_q 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 2.63 GB/s 527s quantized throughput : 0.54 GB/s 527s 527s quantize_row_q_dot 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 1.66 GB/s 527s quantized throughput : 0.34 GB/s 527s 527s vec_dot_q 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 16.95 GB/s 527s quantized throughput : 3.48 GB/s 527s 527s iq4_nl 527s quantize_row_q_reference 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 0.04 GB/s 527s quantized throughput : 0.01 GB/s 527s 527s quantize_row_q 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 527s float32 throughput : 0.01 GB/s 527s quantized throughput : 0.00 GB/s 527s 527s dequantize_row_q 527s 4096 values (0.02 MB) 527s min cycles/32 vals : 0.00 527s avg cycles/32 vals : 0.00 528s float32 throughput : 2.50 GB/s 528s quantized throughput : 0.35 GB/s 528s 528s quantize_row_q_dot 528s 4096 values (0.02 MB) 528s min cycles/32 vals : 0.00 528s avg cycles/32 vals : 0.00 528s float32 throughput : 1.88 GB/s 528s quantized throughput : 0.26 GB/s 528s 528s vec_dot_q 528s 4096 values (0.02 MB) 528s min cycles/32 vals : 0.00 528s avg cycles/32 vals : 0.00 528s float32 throughput : 25.43 GB/s 528s quantized throughput : 3.58 GB/s 528s 528s iq4_xs 528s quantize_row_q_reference 528s 4096 values (0.02 MB) 528s min cycles/32 vals : 0.00 528s avg cycles/32 vals : 0.00 528s float32 throughput : 0.00 GB/s 528s quantized throughput : 0.00 GB/s 528s 528s quantize_row_q 528s 4096 values (0.02 MB) 528s min cycles/32 vals : 0.00 528s avg cycles/32 vals : 0.00 528s float32 throughput : 0.00 GB/s 528s quantized throughput : 0.00 GB/s 528s 528s dequantize_row_q 528s 4096 values (0.02 MB) 528s min cycles/32 vals : 0.00 528s avg cycles/32 vals : 0.00 528s float32 throughput : 2.42 GB/s 528s quantized throughput : 0.32 GB/s 528s 528s quantize_row_q_dot 528s 4096 values (0.02 MB) 528s min cycles/32 vals : 0.00 528s avg cycles/32 vals : 0.00 528s float32 throughput : 1.66 GB/s 528s quantized throughput : 0.22 GB/s 528s 528s vec_dot_q 528s 4096 values (0.02 MB) 528s min cycles/32 vals : 0.00 528s avg cycles/32 vals : 0.00 528s float32 throughput : 30.52 GB/s 528s quantized throughput : 4.05 GB/s 528s 528s bf16 528s quantize_row_q_reference 528s 4096 values (0.02 MB) 528s min cycles/32 vals : 0.00 528s avg cycles/32 vals : 0.00 528s float32 throughput : 2.77 GB/s 528s quantized throughput : 1.39 GB/s 528s 528s quantize_row_q 528s 4096 values (0.02 MB) 528s min cycles/32 vals : 0.00 528s avg cycles/32 vals : 0.00 528s float32 throughput : 2.99 GB/s 528s quantized throughput : 1.50 GB/s 528s 528s dequantize_row_q 528s 4096 values (0.02 MB) 528s min cycles/32 vals : 0.00 528s avg cycles/32 vals : 0.00 528s float32 throughput : 9.54 GB/s 528s quantized throughput : 4.77 GB/s 528s 528s quantize_row_q_dot 528s 4096 values (0.02 MB) 528s min cycles/32 vals : 0.00 528s avg cycles/32 vals : 0.00 528s float32 throughput : 2.88 GB/s 528s quantized throughput : 1.44 GB/s 528s 528s vec_dot_q 528s 4096 values (0.02 MB) 528s min cycles/32 vals : 0.00 528s avg cycles/32 vals : 0.00 528s float32 throughput : 2.09 GB/s 528s quantized throughput : 1.05 GB/s 528s 528s tq1_0 528s quantize_row_q_reference 528s 4096 values (0.02 MB) 528s min cycles/32 vals : 0.00 528s avg cycles/32 vals : 0.00 528s float32 throughput : 0.82 GB/s 528s quantized throughput : 0.04 GB/s 528s 528s quantize_row_q 528s 4096 values (0.02 MB) 528s min cycles/32 vals : 0.00 528s avg cycles/32 vals : 0.00 528s float32 throughput : 0.89 GB/s 528s quantized throughput : 0.05 GB/s 528s 528s dequantize_row_q 528s 4096 values (0.02 MB) 528s min cycles/32 vals : 0.00 528s avg cycles/32 vals : 0.00 528s float32 throughput : 1.26 GB/s 528s quantized throughput : 0.07 GB/s 528s 528s quantize_row_q_dot 528s 4096 values (0.02 MB) 528s min cycles/32 vals : 0.00 528s avg cycles/32 vals : 0.00 528s float32 throughput : 1.66 GB/s 528s quantized throughput : 0.09 GB/s 528s 528s vec_dot_q 528s 4096 values (0.02 MB) 528s min cycles/32 vals : 0.00 528s avg cycles/32 vals : 0.00 528s float32 throughput : 3.63 GB/s 528s quantized throughput : 0.19 GB/s 528s 528s tq2_0 528s quantize_row_q_reference 528s 4096 values (0.02 MB) 528s min cycles/32 vals : 0.00 528s avg cycles/32 vals : 0.00 528s float32 throughput : 0.87 GB/s 528s quantized throughput : 0.06 GB/s 528s 528s quantize_row_q 528s 4096 values (0.02 MB) 528s min cycles/32 vals : 0.00 528s avg cycles/32 vals : 0.00 528s float32 throughput : 1.00 GB/s 528s quantized throughput : 0.06 GB/s 528s 528s dequantize_row_q 528s 4096 values (0.02 MB) 528s min cycles/32 vals : 0.00 528s avg cycles/32 vals : 0.00 528s float32 throughput : 2.31 GB/s 528s quantized throughput : 0.15 GB/s 528s 528s quantize_row_q_dot 528s 4096 values (0.02 MB) 528s min cycles/32 vals : 0.00 528s avg cycles/32 vals : 0.00 528s float32 throughput : 1.44 GB/s 528s quantized throughput : 0.09 GB/s 528s 528s vec_dot_q 528s 4096 values (0.02 MB) 528s min cycles/32 vals : 0.00 528s avg cycles/32 vals : 0.00 528s float32 throughput : 15.26 GB/s 528s quantized throughput : 0.98 GB/s 528s 528s ggml_roll(3(1), 7(0), 4(-1), 2(0), contiguous): PASSED 528s ggml_roll(37(-4), 42(3), 59(-7), 2(1), contiguous): PASSED 528s ggml_roll(37(-4), 42(3), 59(-7), 2(1), permuted): PASSED 528s 0.8439 -0.9970 0.6497 0.9733 0.9981 0.9999 1.0000 -0.5366 -0.0776 0.7602 0.2296 0.0621 0.0167 0.0045 0.0000 0.0000 0.4242 0.9880 -0.1558 0.8946 0.9923 0.9994 1.0000 -0.9056 0.1547 0.9878 0.4470 0.1240 0.0333 0.0089 0.0000 0.0000 528s ----------------------------------- 528s 0.8439 -0.9970 0.6497 0.9733 0.9981 0.9999 1.0000 -0.5366 -0.0776 0.7602 0.2296 0.0621 0.0167 0.0045 0.0000 0.0000 0.4242 0.9880 -0.1558 0.8946 0.9923 0.9994 1.0000 -0.9056 0.1547 0.9878 0.4470 0.1240 0.0333 0.0089 0.0000 0.0000 561s main: elapsed ticks: 29352433 561s 2951358969841371840512.000000 562s ggml_print_objects: objects in context 0x2aa34cdd130: 562s - ggml_object: type = 0, offset = 32, size = 384, next = 0x3ff9c67f2a0 562s - ggml_object: type = 0, offset = 448, size = 736, next = 0x3ff9c67f5a0 562s - ggml_object: type = 0, offset = 1216, size = 24336, next = (nil) 562s ggml_print_objects: --- end --- 562s autopkgtest [22:09:32]: test libggml-cpu: -----------------------] 563s libggml-cpu FAIL non-zero exit status 1 563s autopkgtest [22:09:33]: test libggml-cpu: - - - - - - - - - - results - - - - - - - - - - 564s autopkgtest [22:09:34]: test libggml-cpu: preparing testbed 586s Creating nova instance adt-questing-s390x-ggml-20250707-220010-juju-7f2275-prod-proposed-migration-environment-15-dcac3358-3866-45f1-9911-a717e2eb1c01 from image adt/ubuntu-questing-s390x-server-20250707.img (UUID 141e4834-1e59-4811-9d22-ee73c0890197)... 775s autopkgtest [22:13:05]: testbed dpkg architecture: s390x 777s autopkgtest [22:13:07]: testbed apt version: 3.1.3 779s autopkgtest [22:13:09]: @@@@@@@@@@@@@@@@@@@@ test bed setup 780s autopkgtest [22:13:10]: testbed release detected to be: questing 783s autopkgtest [22:13:13]: updating testbed package index (apt update) 783s Get:1 http://ftpmaster.internal/ubuntu questing-proposed InRelease [249 kB] 783s Hit:2 http://ftpmaster.internal/ubuntu questing InRelease 783s Hit:3 http://ftpmaster.internal/ubuntu questing-updates InRelease 783s Hit:4 http://ftpmaster.internal/ubuntu questing-security InRelease 783s Get:5 http://ftpmaster.internal/ubuntu questing-proposed/multiverse Sources [15.8 kB] 783s Get:6 http://ftpmaster.internal/ubuntu questing-proposed/main Sources [28.2 kB] 783s Get:7 http://ftpmaster.internal/ubuntu questing-proposed/universe Sources [182 kB] 783s Get:8 http://ftpmaster.internal/ubuntu questing-proposed/main s390x Packages [46.7 kB] 783s Get:9 http://ftpmaster.internal/ubuntu questing-proposed/universe s390x Packages [154 kB] 784s Get:10 http://ftpmaster.internal/ubuntu questing-proposed/multiverse s390x Packages [5272 B] 784s Fetched 681 kB in 0s (1643 kB/s) 784s Reading package lists... 786s autopkgtest [22:13:16]: upgrading testbed (apt dist-upgrade and autopurge) 787s Reading package lists... 787s Building dependency tree... 787s Reading state information... 787s Calculating upgrade... 787s The following packages will be upgraded: 787s apparmor libapparmor1 libc-bin libc-dev-bin libc6 libc6-dev libnss-systemd 787s libpam-systemd libsystemd-shared libsystemd0 libudev1 libxml2-16 locales 787s systemd systemd-cryptsetup systemd-resolved systemd-sysv systemd-timesyncd 787s udev 787s 19 upgraded, 0 newly installed, 0 to remove and 0 not upgraded. 787s Need to get 19.4 MB of archives. 787s After this operation, 54.3 kB of additional disk space will be used. 787s Get:1 http://ftpmaster.internal/ubuntu questing-proposed/main s390x libc6-dev s390x 2.41-9ubuntu1 [1678 kB] 787s Get:2 http://ftpmaster.internal/ubuntu questing-proposed/main s390x libc-dev-bin s390x 2.41-9ubuntu1 [23.0 kB] 787s Get:3 http://ftpmaster.internal/ubuntu questing-proposed/main s390x libc6 s390x 2.41-9ubuntu1 [2896 kB] 787s Get:4 http://ftpmaster.internal/ubuntu questing-proposed/main s390x libc-bin s390x 2.41-9ubuntu1 [672 kB] 787s Get:5 http://ftpmaster.internal/ubuntu questing/main s390x libsystemd0 s390x 257.7-1ubuntu1 [538 kB] 787s Get:6 http://ftpmaster.internal/ubuntu questing/main s390x libnss-systemd s390x 257.7-1ubuntu1 [179 kB] 787s Get:7 http://ftpmaster.internal/ubuntu questing/main s390x systemd-sysv s390x 257.7-1ubuntu1 [11.9 kB] 787s Get:8 http://ftpmaster.internal/ubuntu questing/main s390x systemd-resolved s390x 257.7-1ubuntu1 [333 kB] 787s Get:9 http://ftpmaster.internal/ubuntu questing/main s390x libpam-systemd s390x 257.7-1ubuntu1 [263 kB] 787s Get:10 http://ftpmaster.internal/ubuntu questing/main s390x libsystemd-shared s390x 257.7-1ubuntu1 [2335 kB] 787s Get:11 http://ftpmaster.internal/ubuntu questing/main s390x systemd s390x 257.7-1ubuntu1 [3018 kB] 788s Get:12 http://ftpmaster.internal/ubuntu questing/main s390x systemd-timesyncd s390x 257.7-1ubuntu1 [41.5 kB] 788s Get:13 http://ftpmaster.internal/ubuntu questing/main s390x systemd-cryptsetup s390x 257.7-1ubuntu1 [128 kB] 788s Get:14 http://ftpmaster.internal/ubuntu questing/main s390x udev s390x 257.7-1ubuntu1 [1416 kB] 788s Get:15 http://ftpmaster.internal/ubuntu questing/main s390x libudev1 s390x 257.7-1ubuntu1 [207 kB] 788s Get:16 http://ftpmaster.internal/ubuntu questing/main s390x libapparmor1 s390x 4.1.1-0ubuntu3 [57.1 kB] 788s Get:17 http://ftpmaster.internal/ubuntu questing/main s390x libxml2-16 s390x 2.14.4+dfsg-0exp1 [628 kB] 788s Get:18 http://ftpmaster.internal/ubuntu questing-proposed/main s390x locales all 2.41-9ubuntu1 [4253 kB] 788s Get:19 http://ftpmaster.internal/ubuntu questing/main s390x apparmor s390x 4.1.1-0ubuntu3 [762 kB] 788s Preconfiguring packages ... 788s Fetched 19.4 MB in 1s (17.5 MB/s) 789s (Reading database ... (Reading database ... 5% (Reading database ... 10% (Reading database ... 15% (Reading database ... 20% (Reading database ... 25% (Reading database ... 30% (Reading database ... 35% (Reading database ... 40% (Reading database ... 45% (Reading database ... 50% (Reading database ... 55% (Reading database ... 60% (Reading database ... 65% (Reading database ... 70% (Reading database ... 75% (Reading database ... 80% (Reading database ... 85% (Reading database ... 90% (Reading database ... 95% (Reading database ... 100% (Reading database ... 85970 files and directories currently installed.) 789s Preparing to unpack .../libc6-dev_2.41-9ubuntu1_s390x.deb ... 789s Unpacking libc6-dev:s390x (2.41-9ubuntu1) over (2.41-6ubuntu2) ... 789s Preparing to unpack .../libc-dev-bin_2.41-9ubuntu1_s390x.deb ... 789s Unpacking libc-dev-bin (2.41-9ubuntu1) over (2.41-6ubuntu2) ... 789s Preparing to unpack .../libc6_2.41-9ubuntu1_s390x.deb ... 790s Unpacking libc6:s390x (2.41-9ubuntu1) over (2.41-6ubuntu2) ... 790s Setting up libc6:s390x (2.41-9ubuntu1) ... 791s (Reading database ... (Reading database ... 5% (Reading database ... 10% (Reading database ... 15% (Reading database ... 20% (Reading database ... 25% (Reading database ... 30% (Reading database ... 35% (Reading database ... 40% (Reading database ... 45% (Reading database ... 50% (Reading database ... 55% (Reading database ... 60% (Reading database ... 65% (Reading database ... 70% (Reading database ... 75% (Reading database ... 80% (Reading database ... 85% (Reading database ... 90% (Reading database ... 95% (Reading database ... 100% (Reading database ... 85970 files and directories currently installed.) 791s Preparing to unpack .../libc-bin_2.41-9ubuntu1_s390x.deb ... 791s Unpacking libc-bin (2.41-9ubuntu1) over (2.41-6ubuntu2) ... 791s Setting up libc-bin (2.41-9ubuntu1) ... 791s (Reading database ... (Reading database ... 5% (Reading database ... 10% (Reading database ... 15% (Reading database ... 20% (Reading database ... 25% (Reading database ... 30% (Reading database ... 35% (Reading database ... 40% (Reading database ... 45% (Reading database ... 50% (Reading database ... 55% (Reading database ... 60% (Reading database ... 65% (Reading database ... 70% (Reading database ... 75% (Reading database ... 80% (Reading database ... 85% (Reading database ... 90% (Reading database ... 95% (Reading database ... 100% (Reading database ... 85970 files and directories currently installed.) 791s Preparing to unpack .../libsystemd0_257.7-1ubuntu1_s390x.deb ... 791s Unpacking libsystemd0:s390x (257.7-1ubuntu1) over (257.6-1ubuntu1) ... 791s Setting up libsystemd0:s390x (257.7-1ubuntu1) ... 791s (Reading database ... (Reading database ... 5% (Reading database ... 10% (Reading database ... 15% (Reading database ... 20% (Reading database ... 25% (Reading database ... 30% (Reading database ... 35% (Reading database ... 40% (Reading database ... 45% (Reading database ... 50% (Reading database ... 55% (Reading database ... 60% (Reading database ... 65% (Reading database ... 70% (Reading database ... 75% (Reading database ... 80% (Reading database ... 85% (Reading database ... 90% (Reading database ... 95% (Reading database ... 100% (Reading database ... 85970 files and directories currently installed.) 791s Preparing to unpack .../libnss-systemd_257.7-1ubuntu1_s390x.deb ... 791s Unpacking libnss-systemd:s390x (257.7-1ubuntu1) over (257.6-1ubuntu1) ... 791s Preparing to unpack .../systemd-sysv_257.7-1ubuntu1_s390x.deb ... 791s Unpacking systemd-sysv (257.7-1ubuntu1) over (257.6-1ubuntu1) ... 791s Preparing to unpack .../systemd-resolved_257.7-1ubuntu1_s390x.deb ... 791s Unpacking systemd-resolved (257.7-1ubuntu1) over (257.6-1ubuntu1) ... 791s Preparing to unpack .../libpam-systemd_257.7-1ubuntu1_s390x.deb ... 791s Unpacking libpam-systemd:s390x (257.7-1ubuntu1) over (257.6-1ubuntu1) ... 791s Preparing to unpack .../libsystemd-shared_257.7-1ubuntu1_s390x.deb ... 791s Unpacking libsystemd-shared:s390x (257.7-1ubuntu1) over (257.6-1ubuntu1) ... 791s Setting up libsystemd-shared:s390x (257.7-1ubuntu1) ... 792s (Reading database ... (Reading database ... 5% (Reading database ... 10% (Reading database ... 15% (Reading database ... 20% (Reading database ... 25% (Reading database ... 30% (Reading database ... 35% (Reading database ... 40% (Reading database ... 45% (Reading database ... 50% (Reading database ... 55% (Reading database ... 60% (Reading database ... 65% (Reading database ... 70% (Reading database ... 75% (Reading database ... 80% (Reading database ... 85% (Reading database ... 90% (Reading database ... 95% (Reading database ... 100% (Reading database ... 85970 files and directories currently installed.) 792s Preparing to unpack .../systemd_257.7-1ubuntu1_s390x.deb ... 792s Unpacking systemd (257.7-1ubuntu1) over (257.6-1ubuntu1) ... 792s Preparing to unpack .../systemd-timesyncd_257.7-1ubuntu1_s390x.deb ... 792s Unpacking systemd-timesyncd (257.7-1ubuntu1) over (257.6-1ubuntu1) ... 792s Preparing to unpack .../systemd-cryptsetup_257.7-1ubuntu1_s390x.deb ... 792s Unpacking systemd-cryptsetup (257.7-1ubuntu1) over (257.6-1ubuntu1) ... 792s Preparing to unpack .../udev_257.7-1ubuntu1_s390x.deb ... 792s Unpacking udev (257.7-1ubuntu1) over (257.6-1ubuntu1) ... 792s Preparing to unpack .../libudev1_257.7-1ubuntu1_s390x.deb ... 792s Unpacking libudev1:s390x (257.7-1ubuntu1) over (257.6-1ubuntu1) ... 792s Setting up libudev1:s390x (257.7-1ubuntu1) ... 792s (Reading database ... (Reading database ... 5% (Reading database ... 10% (Reading database ... 15% (Reading database ... 20% (Reading database ... 25% (Reading database ... 30% (Reading database ... 35% (Reading database ... 40% (Reading database ... 45% (Reading database ... 50% (Reading database ... 55% (Reading database ... 60% (Reading database ... 65% (Reading database ... 70% (Reading database ... 75% (Reading database ... 80% (Reading database ... 85% (Reading database ... 90% (Reading database ... 95% (Reading database ... 100% (Reading database ... 85971 files and directories currently installed.) 792s Preparing to unpack .../libapparmor1_4.1.1-0ubuntu3_s390x.deb ... 792s Unpacking libapparmor1:s390x (4.1.1-0ubuntu3) over (4.1.1-0ubuntu2) ... 793s Preparing to unpack .../libxml2-16_2.14.4+dfsg-0exp1_s390x.deb ... 793s Unpacking libxml2-16:s390x (2.14.4+dfsg-0exp1) over (2.14.3+dfsg-0exp3) ... 793s Preparing to unpack .../locales_2.41-9ubuntu1_all.deb ... 793s Unpacking locales (2.41-9ubuntu1) over (2.41-6ubuntu2) ... 793s Preparing to unpack .../apparmor_4.1.1-0ubuntu3_s390x.deb ... 794s Unpacking apparmor (4.1.1-0ubuntu3) over (4.1.1-0ubuntu2) ... 795s Setting up libapparmor1:s390x (4.1.1-0ubuntu3) ... 795s Setting up libxml2-16:s390x (2.14.4+dfsg-0exp1) ... 795s Setting up systemd (257.7-1ubuntu1) ... 795s /usr/lib/tmpfiles.d/legacy.conf:14: Duplicate line for path "/run/lock", ignoring. 795s /usr/lib/tmpfiles.d/legacy.conf:14: Duplicate line for path "/run/lock", ignoring. 796s Setting up locales (2.41-9ubuntu1) ... 797s Generating locales (this might take a while)... 801s en_US.UTF-8... done 801s Generation complete. 801s Setting up apparmor (4.1.1-0ubuntu3) ... 801s Installing new version of config file /etc/apparmor.d/lsusb ... 801s Installing new version of config file /etc/apparmor.d/mbsync ... 804s Reloading AppArmor profiles 809s Setting up systemd-cryptsetup (257.7-1ubuntu1) ... 809s Setting up systemd-timesyncd (257.7-1ubuntu1) ... 810s systemd-time-wait-sync.service is a disabled or a static unit not running, not starting it. 810s Setting up udev (257.7-1ubuntu1) ... 812s Creating group 'clock' with GID 987. 813s Setting up libc-dev-bin (2.41-9ubuntu1) ... 813s Setting up systemd-resolved (257.7-1ubuntu1) ... 816s Setting up systemd-sysv (257.7-1ubuntu1) ... 816s Setting up libnss-systemd:s390x (257.7-1ubuntu1) ... 816s Setting up libpam-systemd:s390x (257.7-1ubuntu1) ... 816s Setting up libc6-dev:s390x (2.41-9ubuntu1) ... 816s Processing triggers for libc-bin (2.41-9ubuntu1) ... 817s Processing triggers for man-db (2.13.1-1) ... 822s Processing triggers for dbus (1.16.2-2ubuntu1) ... 822s Processing triggers for shared-mime-info (2.4-5build2) ... 823s Processing triggers for procps (2:4.0.4-8ubuntu2) ... 823s Processing triggers for initramfs-tools (0.149ubuntu1) ... 823s update-initramfs: Generating /boot/initrd.img-6.15.0-3-generic 823s W: No lz4 in /usr/bin:/sbin:/bin, using gzip 834s Using config file '/etc/zipl.conf' 834s Building bootmap in '/boot' 834s Adding IPL section 'ubuntu' (default) 835s Preparing boot device for LD-IPL: vda (0000). 835s Done. 836s Reading package lists... 836s Building dependency tree... 836s Reading state information... 836s Solving dependencies... 836s 0 upgraded, 0 newly installed, 0 to remove and 0 not upgraded. 837s autopkgtest [22:14:07]: rebooting testbed after setup commands that affected boot 870s Reading package lists... 870s Building dependency tree... 870s Reading state information... 870s Solving dependencies... 870s The following NEW packages will be installed: 870s libgfortran5 libggml libggml-blas libgomp1 libopenblas0-pthread 870s 0 upgraded, 5 newly installed, 0 to remove and 0 not upgraded. 870s Need to get 8390 kB of archives. 870s After this operation, 26.7 MB of additional disk space will be used. 870s Get:1 http://ftpmaster.internal/ubuntu questing/main s390x libgfortran5 s390x 15.1.0-8ubuntu1 [620 kB] 870s Get:2 http://ftpmaster.internal/ubuntu questing/universe s390x libopenblas0-pthread s390x 0.3.29+ds-3 [6952 kB] 871s Get:3 http://ftpmaster.internal/ubuntu questing/main s390x libgomp1 s390x 15.1.0-8ubuntu1 [154 kB] 871s Get:4 http://ftpmaster.internal/ubuntu questing/universe s390x libggml-blas s390x 0.0~git20250618.9e4bee1-1 [358 kB] 871s Get:5 http://ftpmaster.internal/ubuntu questing/universe s390x libggml s390x 0.0~git20250618.9e4bee1-1 [307 kB] 872s Fetched 8390 kB in 1s (8822 kB/s) 872s Selecting previously unselected package libgfortran5:s390x. 872s (Reading database ... (Reading database ... 5% (Reading database ... 10% (Reading database ... 15% (Reading database ... 20% (Reading database ... 25% (Reading database ... 30% (Reading database ... 35% (Reading database ... 40% (Reading database ... 45% (Reading database ... 50% (Reading database ... 55% (Reading database ... 60% (Reading database ... 65% (Reading database ... 70% (Reading database ... 75% (Reading database ... 80% (Reading database ... 85% (Reading database ... 90% (Reading database ... 95% (Reading database ... 100% (Reading database ... 85973 files and directories currently installed.) 872s Preparing to unpack .../libgfortran5_15.1.0-8ubuntu1_s390x.deb ... 872s Unpacking libgfortran5:s390x (15.1.0-8ubuntu1) ... 872s Selecting previously unselected package libopenblas0-pthread:s390x. 872s Preparing to unpack .../libopenblas0-pthread_0.3.29+ds-3_s390x.deb ... 872s Unpacking libopenblas0-pthread:s390x (0.3.29+ds-3) ... 873s Selecting previously unselected package libgomp1:s390x. 873s Preparing to unpack .../libgomp1_15.1.0-8ubuntu1_s390x.deb ... 873s Unpacking libgomp1:s390x (15.1.0-8ubuntu1) ... 873s Selecting previously unselected package libggml-blas:s390x. 873s Preparing to unpack .../libggml-blas_0.0~git20250618.9e4bee1-1_s390x.deb ... 873s Unpacking libggml-blas:s390x (0.0~git20250618.9e4bee1-1) ... 873s Selecting previously unselected package libggml:s390x. 873s Preparing to unpack .../libggml_0.0~git20250618.9e4bee1-1_s390x.deb ... 873s Unpacking libggml:s390x (0.0~git20250618.9e4bee1-1) ... 873s Setting up libgomp1:s390x (15.1.0-8ubuntu1) ... 873s Setting up libgfortran5:s390x (15.1.0-8ubuntu1) ... 873s Setting up libopenblas0-pthread:s390x (0.3.29+ds-3) ... 873s update-alternatives: using /usr/lib/s390x-linux-gnu/openblas-pthread/libblas.so.3 to provide /usr/lib/s390x-linux-gnu/libblas.so.3 (libblas.so.3-s390x-linux-gnu) in auto mode 873s update-alternatives: using /usr/lib/s390x-linux-gnu/openblas-pthread/liblapack.so.3 to provide /usr/lib/s390x-linux-gnu/liblapack.so.3 (liblapack.so.3-s390x-linux-gnu) in auto mode 873s update-alternatives: using /usr/lib/s390x-linux-gnu/openblas-pthread/libopenblas.so.0 to provide /usr/lib/s390x-linux-gnu/libopenblas.so.0 (libopenblas.so.0-s390x-linux-gnu) in auto mode 873s Setting up libggml-blas:s390x (0.0~git20250618.9e4bee1-1) ... 873s Setting up libggml:s390x (0.0~git20250618.9e4bee1-1) ... 873s Processing triggers for libc-bin (2.41-9ubuntu1) ... 878s autopkgtest [22:14:48]: test libggml-cpu: /usr/libexec/ggml/runtests 878s autopkgtest [22:14:48]: test libggml-cpu: [----------------------- 878s dpkg-architecture: warning: cannot determine CC system type, falling back to default (native compilation) 878s 0.00 1.00 2.00 878s Testing 2 devices 878s 878s Backend 1/2: BLAS 878s Device description: BLAS 878s Device memory: 0 MB (0 MB free) 878s 878s ABS(type=f16,ne_a=[128,2,2,2],v=0): not supported [BLAS] 878s ABS(type=f16,ne_a=[5,7,11,13],v=0): not supported [BLAS] 878s SGN(type=f16,ne_a=[128,2,2,2],v=0): not supported [BLAS] 878s SGN(type=f16,ne_a=[5,7,11,13],v=0): not supported [BLAS] 878s NEG(type=f16,ne_a=[128,2,2,2],v=0): not supported [BLAS] 878s NEG(type=f16,ne_a=[5,7,11,13],v=0): not supported [BLAS] 878s STEP(type=f16,ne_a=[128,2,2,2],v=0): not supported [BLAS] 878s STEP(type=f16,ne_a=[5,7,11,13],v=0): not supported [BLAS] 878s TANH(type=f16,ne_a=[128,2,2,2],v=0): not supported [BLAS] 878s TANH(type=f16,ne_a=[5,7,11,13],v=0): not supported [BLAS] 878s ELU(type=f16,ne_a=[128,2,2,2],v=0): not supported [BLAS] 878s ELU(type=f16,ne_a=[5,7,11,13],v=0): not supported [BLAS] 878s RELU(type=f16,ne_a=[128,2,2,2],v=0): not supported [BLAS] 878s RELU(type=f16,ne_a=[5,7,11,13],v=0): not supported [BLAS] 878s SIGMOID(type=f16,ne_a=[128,2,2,2],v=0): not supported [BLAS] 878s SIGMOID(type=f16,ne_a=[5,7,11,13],v=0): not supported [BLAS] 878s GELU(type=f16,ne_a=[128,2,2,2],v=0): not supported [BLAS] 878s GELU(type=f16,ne_a=[5,7,11,13],v=0): not supported [BLAS] 878s GELU_QUICK(type=f16,ne_a=[128,2,2,2],v=0): not supported [BLAS] 878s GELU_QUICK(type=f16,ne_a=[5,7,11,13],v=0): not supported [BLAS] 878s SILU(type=f16,ne_a=[128,2,2,2],v=0): not supported [BLAS] 878s SILU(type=f16,ne_a=[5,7,11,13],v=0): not supported [BLAS] 878s HARDSWISH(type=f16,ne_a=[128,2,2,2],v=0): not supported [BLAS] 878s HARDSWISH(type=f16,ne_a=[5,7,11,13],v=0): not supported [BLAS] 878s HARDSIGMOID(type=f16,ne_a=[128,2,2,2],v=0): not supported [BLAS] 878s HARDSIGMOID(type=f16,ne_a=[5,7,11,13],v=0): not supported [BLAS] 878s EXP(type=f16,ne_a=[128,2,2,2],v=0): not supported [BLAS] 878s EXP(type=f16,ne_a=[5,7,11,13],v=0): not supported [BLAS] 878s GELU_ERF(type=f16,ne_a=[128,2,2,2],v=0): not supported [BLAS] 878s GELU_ERF(type=f16,ne_a=[5,7,11,13],v=0): not supported [BLAS] 878s ABS(type=f16,ne_a=[128,2,2,2],v=1): not supported [BLAS] 878s ABS(type=f16,ne_a=[5,7,11,13],v=1): not supported [BLAS] 878s SGN(type=f16,ne_a=[128,2,2,2],v=1): not supported [BLAS] 878s SGN(type=f16,ne_a=[5,7,11,13],v=1): not supported [BLAS] 878s NEG(type=f16,ne_a=[128,2,2,2],v=1): not supported [BLAS] 878s NEG(type=f16,ne_a=[5,7,11,13],v=1): not supported [BLAS] 878s STEP(type=f16,ne_a=[128,2,2,2],v=1): not supported [BLAS] 878s STEP(type=f16,ne_a=[5,7,11,13],v=1): not supported [BLAS] 878s TANH(type=f16,ne_a=[128,2,2,2],v=1): not supported [BLAS] 878s TANH(type=f16,ne_a=[5,7,11,13],v=1): not supported [BLAS] 878s ELU(type=f16,ne_a=[128,2,2,2],v=1): not supported [BLAS] 878s ELU(type=f16,ne_a=[5,7,11,13],v=1): not supported [BLAS] 878s RELU(type=f16,ne_a=[128,2,2,2],v=1): not supported [BLAS] 878s RELU(type=f16,ne_a=[5,7,11,13],v=1): not supported [BLAS] 878s SIGMOID(type=f16,ne_a=[128,2,2,2],v=1): not supported [BLAS] 878s SIGMOID(type=f16,ne_a=[5,7,11,13],v=1): not supported [BLAS] 878s GELU(type=f16,ne_a=[128,2,2,2],v=1): not supported [BLAS] 878s GELU(type=f16,ne_a=[5,7,11,13],v=1): not supported [BLAS] 878s GELU_QUICK(type=f16,ne_a=[128,2,2,2],v=1): not supported [BLAS] 878s GELU_QUICK(type=f16,ne_a=[5,7,11,13],v=1): not supported [BLAS] 878s SILU(type=f16,ne_a=[128,2,2,2],v=1): not supported [BLAS] 878s SILU(type=f16,ne_a=[5,7,11,13],v=1): not supported [BLAS] 878s HARDSWISH(type=f16,ne_a=[128,2,2,2],v=1): not supported [BLAS] 878s HARDSWISH(type=f16,ne_a=[5,7,11,13],v=1): not supported [BLAS] 878s HARDSIGMOID(type=f16,ne_a=[128,2,2,2],v=1): not supported [BLAS] 878s HARDSIGMOID(type=f16,ne_a=[5,7,11,13],v=1): not supported [BLAS] 878s EXP(type=f16,ne_a=[128,2,2,2],v=1): not supported [BLAS] 878s EXP(type=f16,ne_a=[5,7,11,13],v=1): not supported [BLAS] 878s GELU_ERF(type=f16,ne_a=[128,2,2,2],v=1): not supported [BLAS] 878s GELU_ERF(type=f16,ne_a=[5,7,11,13],v=1): not supported [BLAS] 878s ABS(type=f32,ne_a=[128,2,2,2],v=0): not supported [BLAS] 878s ABS(type=f32,ne_a=[5,7,11,13],v=0): not supported [BLAS] 878s SGN(type=f32,ne_a=[128,2,2,2],v=0): not supported [BLAS] 878s SGN(type=f32,ne_a=[5,7,11,13],v=0): not supported [BLAS] 878s NEG(type=f32,ne_a=[128,2,2,2],v=0): not supported [BLAS] 878s NEG(type=f32,ne_a=[5,7,11,13],v=0): not supported [BLAS] 878s STEP(type=f32,ne_a=[128,2,2,2],v=0): not supported [BLAS] 878s STEP(type=f32,ne_a=[5,7,11,13],v=0): not supported [BLAS] 878s TANH(type=f32,ne_a=[128,2,2,2],v=0): not supported [BLAS] 878s TANH(type=f32,ne_a=[5,7,11,13],v=0): not supported [BLAS] 878s ELU(type=f32,ne_a=[128,2,2,2],v=0): not supported [BLAS] 878s ELU(type=f32,ne_a=[5,7,11,13],v=0): not supported [BLAS] 878s RELU(type=f32,ne_a=[128,2,2,2],v=0): not supported [BLAS] 878s RELU(type=f32,ne_a=[5,7,11,13],v=0): not supported [BLAS] 878s SIGMOID(type=f32,ne_a=[128,2,2,2],v=0): not supported [BLAS] 878s SIGMOID(type=f32,ne_a=[5,7,11,13],v=0): not supported [BLAS] 878s GELU(type=f32,ne_a=[128,2,2,2],v=0): not supported [BLAS] 878s GELU(type=f32,ne_a=[5,7,11,13],v=0): not supported [BLAS] 878s GELU_QUICK(type=f32,ne_a=[128,2,2,2],v=0): not supported [BLAS] 878s GELU_QUICK(type=f32,ne_a=[5,7,11,13],v=0): not supported [BLAS] 878s SILU(type=f32,ne_a=[128,2,2,2],v=0): not supported [BLAS] 878s SILU(type=f32,ne_a=[5,7,11,13],v=0): not supported [BLAS] 878s HARDSWISH(type=f32,ne_a=[128,2,2,2],v=0): not supported [BLAS] 878s HARDSWISH(type=f32,ne_a=[5,7,11,13],v=0): not supported [BLAS] 878s HARDSIGMOID(type=f32,ne_a=[128,2,2,2],v=0): not supported [BLAS] 878s HARDSIGMOID(type=f32,ne_a=[5,7,11,13],v=0): not supported [BLAS] 878s EXP(type=f32,ne_a=[128,2,2,2],v=0): not supported [BLAS] 878s EXP(type=f32,ne_a=[5,7,11,13],v=0): not supported [BLAS] 878s GELU_ERF(type=f32,ne_a=[128,2,2,2],v=0): not supported [BLAS] 878s GELU_ERF(type=f32,ne_a=[5,7,11,13],v=0): not supported [BLAS] 878s ABS(type=f32,ne_a=[128,2,2,2],v=1): not supported [BLAS] 878s ABS(type=f32,ne_a=[5,7,11,13],v=1): not supported [BLAS] 878s SGN(type=f32,ne_a=[128,2,2,2],v=1): not supported [BLAS] 878s SGN(type=f32,ne_a=[5,7,11,13],v=1): not supported [BLAS] 878s NEG(type=f32,ne_a=[128,2,2,2],v=1): not supported [BLAS] 878s NEG(type=f32,ne_a=[5,7,11,13],v=1): not supported [BLAS] 878s STEP(type=f32,ne_a=[128,2,2,2],v=1): not supported [BLAS] 878s STEP(type=f32,ne_a=[5,7,11,13],v=1): not supported [BLAS] 878s TANH(type=f32,ne_a=[128,2,2,2],v=1): not supported [BLAS] 878s TANH(type=f32,ne_a=[5,7,11,13],v=1): not supported [BLAS] 878s ELU(type=f32,ne_a=[128,2,2,2],v=1): not supported [BLAS] 878s ELU(type=f32,ne_a=[5,7,11,13],v=1): not supported [BLAS] 878s RELU(type=f32,ne_a=[128,2,2,2],v=1): not supported [BLAS] 878s RELU(type=f32,ne_a=[5,7,11,13],v=1): not supported [BLAS] 878s SIGMOID(type=f32,ne_a=[128,2,2,2],v=1): not supported [BLAS] 878s SIGMOID(type=f32,ne_a=[5,7,11,13],v=1): not supported [BLAS] 878s GELU(type=f32,ne_a=[128,2,2,2],v=1): not supported [BLAS] 878s GELU(type=f32,ne_a=[5,7,11,13],v=1): not supported [BLAS] 878s GELU_QUICK(type=f32,ne_a=[128,2,2,2],v=1): not supported [BLAS] 878s GELU_QUICK(type=f32,ne_a=[5,7,11,13],v=1): not supported [BLAS] 878s SILU(type=f32,ne_a=[128,2,2,2],v=1): not supported [BLAS] 878s SILU(type=f32,ne_a=[5,7,11,13],v=1): not supported [BLAS] 878s HARDSWISH(type=f32,ne_a=[128,2,2,2],v=1): not supported [BLAS] 878s HARDSWISH(type=f32,ne_a=[5,7,11,13],v=1): not supported [BLAS] 878s HARDSIGMOID(type=f32,ne_a=[128,2,2,2],v=1): not supported [BLAS] 878s HARDSIGMOID(type=f32,ne_a=[5,7,11,13],v=1): not supported [BLAS] 878s EXP(type=f32,ne_a=[128,2,2,2],v=1): not supported [BLAS] 878s EXP(type=f32,ne_a=[5,7,11,13],v=1): not supported [BLAS] 878s GELU_ERF(type=f32,ne_a=[128,2,2,2],v=1): not supported [BLAS] 878s GELU_ERF(type=f32,ne_a=[5,7,11,13],v=1): not supported [BLAS] 878s GET_ROWS(type=f32,n=1,m=8,r=2,b=1,v=0): not supported [BLAS] 878s GET_ROWS(type=f32,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] 878s GET_ROWS(type=f32,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] 878s GET_ROWS(type=f32,n=256,m=5,r=4,b=7,v=0): not supported [BLAS] 878s GET_ROWS(type=f32,n=256,m=5,r=4,b=7,v=1): not supported [BLAS] 878s GET_ROWS(type=f16,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] 878s GET_ROWS(type=f16,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] 878s GET_ROWS(type=f16,n=256,m=5,r=4,b=7,v=0): not supported [BLAS] 878s GET_ROWS(type=f16,n=256,m=5,r=4,b=7,v=1): not supported [BLAS] 878s GET_ROWS(type=bf16,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] 878s GET_ROWS(type=bf16,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] 878s GET_ROWS(type=bf16,n=256,m=5,r=4,b=7,v=0): not supported [BLAS] 878s GET_ROWS(type=bf16,n=256,m=5,r=4,b=7,v=1): not supported [BLAS] 878s GET_ROWS(type=q4_0,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] 878s GET_ROWS(type=q4_0,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] 878s GET_ROWS(type=q4_0,n=256,m=5,r=4,b=7,v=0): not supported [BLAS] 878s GET_ROWS(type=q4_0,n=256,m=5,r=4,b=7,v=1): not supported [BLAS] 878s GET_ROWS(type=q4_1,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] 878s GET_ROWS(type=q4_1,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] 878s GET_ROWS(type=q4_1,n=256,m=5,r=4,b=7,v=0): not supported [BLAS] 878s GET_ROWS(type=q4_1,n=256,m=5,r=4,b=7,v=1): not supported [BLAS] 878s GET_ROWS(type=q5_0,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] 878s GET_ROWS(type=q5_0,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] 878s GET_ROWS(type=q5_0,n=256,m=5,r=4,b=7,v=0): not supported [BLAS] 878s GET_ROWS(type=q5_0,n=256,m=5,r=4,b=7,v=1): not supported [BLAS] 878s GET_ROWS(type=q5_1,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] 878s GET_ROWS(type=q5_1,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] 878s GET_ROWS(type=q5_1,n=256,m=5,r=4,b=7,v=0): not supported [BLAS] 878s GET_ROWS(type=q5_1,n=256,m=5,r=4,b=7,v=1): not supported [BLAS] 878s GET_ROWS(type=q8_0,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] 878s GET_ROWS(type=q8_0,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] 878s GET_ROWS(type=q8_0,n=256,m=5,r=4,b=7,v=0): not supported [BLAS] 878s GET_ROWS(type=q8_0,n=256,m=5,r=4,b=7,v=1): not supported [BLAS] 878s GET_ROWS(type=q2_K,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] 878s GET_ROWS(type=q2_K,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] 878s GET_ROWS(type=q2_K,n=256,m=5,r=4,b=7,v=0): not supported [BLAS] 878s GET_ROWS(type=q2_K,n=256,m=5,r=4,b=7,v=1): not supported [BLAS] 878s GET_ROWS(type=q3_K,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] 878s GET_ROWS(type=q3_K,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] 878s GET_ROWS(type=q3_K,n=256,m=5,r=4,b=7,v=0): not supported [BLAS] 878s GET_ROWS(type=q3_K,n=256,m=5,r=4,b=7,v=1): not supported [BLAS] 878s GET_ROWS(type=q4_K,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] 878s GET_ROWS(type=q4_K,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] 878s GET_ROWS(type=q4_K,n=256,m=5,r=4,b=7,v=0): not supported [BLAS] 878s GET_ROWS(type=q4_K,n=256,m=5,r=4,b=7,v=1): not supported [BLAS] 878s GET_ROWS(type=q5_K,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] 878s GET_ROWS(type=q5_K,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] 878s GET_ROWS(type=q5_K,n=256,m=5,r=4,b=7,v=0): not supported [BLAS] 878s GET_ROWS(type=q5_K,n=256,m=5,r=4,b=7,v=1): not supported [BLAS] 878s GET_ROWS(type=q6_K,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] 878s GET_ROWS(type=q6_K,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] 878s GET_ROWS(type=q6_K,n=256,m=5,r=4,b=7,v=0): not supported [BLAS] 878s GET_ROWS(type=q6_K,n=256,m=5,r=4,b=7,v=1): not supported [BLAS] 878s GET_ROWS(type=iq2_xxs,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] 878s GET_ROWS(type=iq2_xxs,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] 878s GET_ROWS(type=iq2_xxs,n=256,m=5,r=4,b=7,v=0): not supported [BLAS] 878s GET_ROWS(type=iq2_xxs,n=256,m=5,r=4,b=7,v=1): not supported [BLAS] 878s GET_ROWS(type=iq2_xs,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] 878s GET_ROWS(type=iq2_xs,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] 878s GET_ROWS(type=iq2_xs,n=256,m=5,r=4,b=7,v=0): not supported [BLAS] 878s GET_ROWS(type=iq2_xs,n=256,m=5,r=4,b=7,v=1): not supported [BLAS] 878s GET_ROWS(type=iq2_s,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] 878s GET_ROWS(type=iq2_s,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] 878s GET_ROWS(type=iq2_s,n=256,m=5,r=4,b=7,v=0): not supported [BLAS] 878s GET_ROWS(type=iq2_s,n=256,m=5,r=4,b=7,v=1): not supported [BLAS] 878s GET_ROWS(type=iq3_xxs,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] 878s GET_ROWS(type=iq3_xxs,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] 878s GET_ROWS(type=iq3_xxs,n=256,m=5,r=4,b=7,v=0): not supported [BLAS] 878s GET_ROWS(type=iq3_xxs,n=256,m=5,r=4,b=7,v=1): not supported [BLAS] 878s GET_ROWS(type=iq1_s,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] 878s GET_ROWS(type=iq1_s,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] 878s GET_ROWS(type=iq1_s,n=256,m=5,r=4,b=7,v=0): not supported [BLAS] 878s GET_ROWS(type=iq1_s,n=256,m=5,r=4,b=7,v=1): not supported [BLAS] 878s GET_ROWS(type=iq1_m,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] 878s GET_ROWS(type=iq1_m,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] 878s GET_ROWS(type=iq1_m,n=256,m=5,r=4,b=7,v=0): not supported [BLAS] 878s GET_ROWS(type=iq1_m,n=256,m=5,r=4,b=7,v=1): not supported [BLAS] 878s GET_ROWS(type=iq4_nl,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] 878s GET_ROWS(type=iq4_nl,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] 878s GET_ROWS(type=iq4_nl,n=256,m=5,r=4,b=7,v=0): not supported [BLAS] 878s GET_ROWS(type=iq4_nl,n=256,m=5,r=4,b=7,v=1): not supported [BLAS] 878s GET_ROWS(type=iq3_s,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] 878s GET_ROWS(type=iq3_s,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] 878s GET_ROWS(type=iq3_s,n=256,m=5,r=4,b=7,v=0): not supported [BLAS] 878s GET_ROWS(type=iq3_s,n=256,m=5,r=4,b=7,v=1): not supported [BLAS] 878s GET_ROWS(type=iq4_xs,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] 878s GET_ROWS(type=iq4_xs,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] 878s GET_ROWS(type=iq4_xs,n=256,m=5,r=4,b=7,v=0): not supported [BLAS] 878s GET_ROWS(type=iq4_xs,n=256,m=5,r=4,b=7,v=1): not supported [BLAS] 878s GET_ROWS(type=i32,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] 878s GET_ROWS(type=i32,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] 878s GET_ROWS(type=i32,n=256,m=5,r=4,b=7,v=0): not supported [BLAS] 878s GET_ROWS(type=i32,n=256,m=5,r=4,b=7,v=1): not supported [BLAS] 878s GET_ROWS_BACK(type=f32,n=1,m=8,r=2,b=1,v=0): not supported [BLAS] 878s GET_ROWS_BACK(type=f32,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] 878s GET_ROWS_BACK(type=f32,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] 878s GET_ROWS_BACK(type=f16,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] 878s GET_ROWS_BACK(type=f16,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] 878s GET_ROWS_BACK(type=bf16,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=bf16,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=q4_0,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=q4_0,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=q4_1,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=q4_1,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=q5_0,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=q5_0,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=q5_1,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=q5_1,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=q8_0,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=q8_0,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=q2_K,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=q2_K,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=q3_K,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=q3_K,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=q4_K,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=q4_K,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=q5_K,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=q5_K,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=q6_K,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=q6_K,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=iq2_xxs,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=iq2_xxs,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=iq2_xs,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=iq2_xs,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=iq2_s,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=iq2_s,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=iq3_xxs,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=iq3_xxs,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=iq1_s,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=iq1_s,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=iq1_m,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=iq1_m,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=iq4_nl,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=iq4_nl,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=iq3_s,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=iq3_s,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=iq4_xs,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=iq4_xs,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=i32,n=256,m=5,r=4,b=1,v=0): not supported [BLAS] not supported [CPU] 878s GET_ROWS_BACK(type=i32,n=256,m=5,r=4,b=1,v=1): not supported [BLAS] not supported [CPU] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=1,s0=1,s1=1,p0=0,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=1,s0=1,s1=1,p0=0,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=1,s0=1,s1=1,p0=1,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=1,s0=1,s1=1,p0=1,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=1,s0=1,s1=2,p0=0,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=1,s0=1,s1=2,p0=0,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=1,s0=1,s1=2,p0=1,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=1,s0=1,s1=2,p0=1,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=1,s0=2,s1=1,p0=0,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=1,s0=2,s1=1,p0=0,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=1,s0=2,s1=1,p0=1,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=1,s0=2,s1=1,p0=1,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=1,s0=2,s1=2,p0=0,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=1,s0=2,s1=2,p0=0,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=1,s0=2,s1=2,p0=1,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=1,s0=2,s1=2,p0=1,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=3,s0=1,s1=1,p0=0,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=3,s0=1,s1=1,p0=0,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=3,s0=1,s1=1,p0=1,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=3,s0=1,s1=1,p0=1,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=3,s0=1,s1=2,p0=0,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=3,s0=1,s1=2,p0=0,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=3,s0=1,s1=2,p0=1,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=3,s0=1,s1=2,p0=1,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=3,s0=2,s1=1,p0=0,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=3,s0=2,s1=1,p0=0,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=3,s0=2,s1=1,p0=1,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=3,s0=2,s1=1,p0=1,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=3,s0=2,s1=2,p0=0,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=3,s0=2,s1=2,p0=0,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=3,s0=2,s1=2,p0=1,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=3,s0=2,s1=2,p0=1,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=1,s0=1,s1=1,p0=0,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=1,s0=1,s1=1,p0=0,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=1,s0=1,s1=1,p0=1,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=1,s0=1,s1=1,p0=1,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=1,s0=1,s1=2,p0=0,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=1,s0=1,s1=2,p0=0,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=1,s0=1,s1=2,p0=1,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=1,s0=1,s1=2,p0=1,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=1,s0=2,s1=1,p0=0,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=1,s0=2,s1=1,p0=0,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=1,s0=2,s1=1,p0=1,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=1,s0=2,s1=1,p0=1,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=1,s0=2,s1=2,p0=0,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=1,s0=2,s1=2,p0=0,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=1,s0=2,s1=2,p0=1,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=1,s0=2,s1=2,p0=1,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=3,s0=1,s1=1,p0=0,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=3,s0=1,s1=1,p0=0,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=3,s0=1,s1=1,p0=1,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=3,s0=1,s1=1,p0=1,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=3,s0=1,s1=2,p0=0,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=3,s0=1,s1=2,p0=0,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=3,s0=1,s1=2,p0=1,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=3,s0=1,s1=2,p0=1,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=3,s0=2,s1=1,p0=0,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=3,s0=2,s1=1,p0=0,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=3,s0=2,s1=1,p0=1,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=3,s0=2,s1=1,p0=1,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=3,s0=2,s1=2,p0=0,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=3,s0=2,s1=2,p0=0,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=3,s0=2,s1=2,p0=1,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=avg,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=3,s0=2,s1=2,p0=1,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=1,s0=1,s1=1,p0=0,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=1,s0=1,s1=1,p0=0,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=1,s0=1,s1=1,p0=1,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=1,s0=1,s1=1,p0=1,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=1,s0=1,s1=2,p0=0,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=1,s0=1,s1=2,p0=0,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=1,s0=1,s1=2,p0=1,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=1,s0=1,s1=2,p0=1,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=1,s0=2,s1=1,p0=0,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=1,s0=2,s1=1,p0=0,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=1,s0=2,s1=1,p0=1,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=1,s0=2,s1=1,p0=1,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=1,s0=2,s1=2,p0=0,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=1,s0=2,s1=2,p0=0,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=1,s0=2,s1=2,p0=1,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=1,s0=2,s1=2,p0=1,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=3,s0=1,s1=1,p0=0,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=3,s0=1,s1=1,p0=0,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=3,s0=1,s1=1,p0=1,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=3,s0=1,s1=1,p0=1,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=3,s0=1,s1=2,p0=0,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=3,s0=1,s1=2,p0=0,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=3,s0=1,s1=2,p0=1,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=3,s0=1,s1=2,p0=1,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=3,s0=2,s1=1,p0=0,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=3,s0=2,s1=1,p0=0,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=3,s0=2,s1=1,p0=1,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=3,s0=2,s1=1,p0=1,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=3,s0=2,s1=2,p0=0,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=3,s0=2,s1=2,p0=0,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=3,s0=2,s1=2,p0=1,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=1,k1=3,s0=2,s1=2,p0=1,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=1,s0=1,s1=1,p0=0,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=1,s0=1,s1=1,p0=0,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=1,s0=1,s1=1,p0=1,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=1,s0=1,s1=1,p0=1,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=1,s0=1,s1=2,p0=0,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=1,s0=1,s1=2,p0=0,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=1,s0=1,s1=2,p0=1,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=1,s0=1,s1=2,p0=1,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=1,s0=2,s1=1,p0=0,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=1,s0=2,s1=1,p0=0,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=1,s0=2,s1=1,p0=1,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=1,s0=2,s1=1,p0=1,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=1,s0=2,s1=2,p0=0,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=1,s0=2,s1=2,p0=0,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=1,s0=2,s1=2,p0=1,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=1,s0=2,s1=2,p0=1,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=3,s0=1,s1=1,p0=0,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=3,s0=1,s1=1,p0=0,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=3,s0=1,s1=1,p0=1,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=3,s0=1,s1=1,p0=1,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=3,s0=1,s1=2,p0=0,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=3,s0=1,s1=2,p0=0,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=3,s0=1,s1=2,p0=1,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=3,s0=1,s1=2,p0=1,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=3,s0=2,s1=1,p0=0,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=3,s0=2,s1=1,p0=0,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=3,s0=2,s1=1,p0=1,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=3,s0=2,s1=1,p0=1,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=3,s0=2,s1=2,p0=0,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=3,s0=2,s1=2,p0=0,p1=1): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=3,s0=2,s1=2,p0=1,p1=0): not supported [BLAS] 878s POOL_2D(pool_type=max,type_input=f32,ne_input=[10,10,3,1],k0=3,k1=3,s0=2,s1=2,p0=1,p1=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[3000,128,1,1],ne_kernel=[3,128,1280,1],s0=1,s1=0,p0=1,p1=0,d0=1,d1=0,is_2D=0): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f16,dst_type=f32,ne_input=[3000,128,1,1],ne_kernel=[3,128,1280,1],s0=1,s1=0,p0=1,p1=0,d0=1,d1=0,is_2D=0): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f16,dst_type=f16,ne_input=[3000,128,1,1],ne_kernel=[3,128,1280,1],s0=1,s1=0,p0=1,p1=0,d0=1,d1=0,is_2D=0): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,2,2,1],ne_kernel=[3,2,2,1],s0=1,s1=0,p0=0,p1=0,d0=1,d1=0,is_2D=0): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,2,2,1],ne_kernel=[3,2,2,1],s0=1,s1=0,p0=0,p1=0,d0=3,d1=0,is_2D=0): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,2,2,1],ne_kernel=[3,2,2,1],s0=1,s1=0,p0=3,p1=0,d0=1,d1=0,is_2D=0): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,2,2,1],ne_kernel=[3,2,2,1],s0=1,s1=0,p0=3,p1=0,d0=3,d1=0,is_2D=0): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,2,2,1],ne_kernel=[3,2,2,1],s0=3,s1=0,p0=0,p1=0,d0=1,d1=0,is_2D=0): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,2,2,1],ne_kernel=[3,2,2,1],s0=3,s1=0,p0=0,p1=0,d0=3,d1=0,is_2D=0): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,2,2,1],ne_kernel=[3,2,2,1],s0=3,s1=0,p0=3,p1=0,d0=1,d1=0,is_2D=0): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,2,2,1],ne_kernel=[3,2,2,1],s0=3,s1=0,p0=3,p1=0,d0=3,d1=0,is_2D=0): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[10,10,3,1],ne_kernel=[3,3,3,1],s0=1,s1=1,p0=1,p1=1,d0=1,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f16,dst_type=f32,ne_input=[10,10,3,1],ne_kernel=[3,3,3,1],s0=1,s1=1,p0=1,p1=1,d0=1,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f16,dst_type=f16,ne_input=[10,10,3,1],ne_kernel=[3,3,3,1],s0=1,s1=1,p0=1,p1=1,d0=1,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=1,s1=1,p0=0,p1=0,d0=1,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=1,s1=1,p0=0,p1=0,d0=1,d1=3,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=1,s1=1,p0=0,p1=0,d0=3,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=1,s1=1,p0=0,p1=0,d0=3,d1=3,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=1,s1=1,p0=0,p1=3,d0=1,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=1,s1=1,p0=0,p1=3,d0=1,d1=3,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=1,s1=1,p0=0,p1=3,d0=3,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=1,s1=1,p0=0,p1=3,d0=3,d1=3,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=1,s1=1,p0=3,p1=0,d0=1,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=1,s1=1,p0=3,p1=0,d0=1,d1=3,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=1,s1=1,p0=3,p1=0,d0=3,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=1,s1=1,p0=3,p1=0,d0=3,d1=3,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=1,s1=1,p0=3,p1=3,d0=1,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=1,s1=1,p0=3,p1=3,d0=1,d1=3,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=1,s1=1,p0=3,p1=3,d0=3,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=1,s1=1,p0=3,p1=3,d0=3,d1=3,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=1,s1=3,p0=0,p1=0,d0=1,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=1,s1=3,p0=0,p1=0,d0=1,d1=3,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=1,s1=3,p0=0,p1=0,d0=3,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=1,s1=3,p0=0,p1=0,d0=3,d1=3,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=1,s1=3,p0=0,p1=3,d0=1,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=1,s1=3,p0=0,p1=3,d0=1,d1=3,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=1,s1=3,p0=0,p1=3,d0=3,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=1,s1=3,p0=0,p1=3,d0=3,d1=3,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=1,s1=3,p0=3,p1=0,d0=1,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=1,s1=3,p0=3,p1=0,d0=1,d1=3,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=1,s1=3,p0=3,p1=0,d0=3,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=1,s1=3,p0=3,p1=0,d0=3,d1=3,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=1,s1=3,p0=3,p1=3,d0=1,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=1,s1=3,p0=3,p1=3,d0=1,d1=3,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=1,s1=3,p0=3,p1=3,d0=3,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=1,s1=3,p0=3,p1=3,d0=3,d1=3,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=3,s1=1,p0=0,p1=0,d0=1,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=3,s1=1,p0=0,p1=0,d0=1,d1=3,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=3,s1=1,p0=0,p1=0,d0=3,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=3,s1=1,p0=0,p1=0,d0=3,d1=3,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=3,s1=1,p0=0,p1=3,d0=1,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=3,s1=1,p0=0,p1=3,d0=1,d1=3,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=3,s1=1,p0=0,p1=3,d0=3,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=3,s1=1,p0=0,p1=3,d0=3,d1=3,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=3,s1=1,p0=3,p1=0,d0=1,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=3,s1=1,p0=3,p1=0,d0=1,d1=3,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=3,s1=1,p0=3,p1=0,d0=3,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=3,s1=1,p0=3,p1=0,d0=3,d1=3,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=3,s1=1,p0=3,p1=3,d0=1,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=3,s1=1,p0=3,p1=3,d0=1,d1=3,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=3,s1=1,p0=3,p1=3,d0=3,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=3,s1=1,p0=3,p1=3,d0=3,d1=3,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=3,s1=3,p0=0,p1=0,d0=1,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=3,s1=3,p0=0,p1=0,d0=1,d1=3,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=3,s1=3,p0=0,p1=0,d0=3,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=3,s1=3,p0=0,p1=0,d0=3,d1=3,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=3,s1=3,p0=0,p1=3,d0=1,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=3,s1=3,p0=0,p1=3,d0=1,d1=3,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=3,s1=3,p0=0,p1=3,d0=3,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=3,s1=3,p0=0,p1=3,d0=3,d1=3,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=3,s1=3,p0=3,p1=0,d0=1,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=3,s1=3,p0=3,p1=0,d0=1,d1=3,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=3,s1=3,p0=3,p1=0,d0=3,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=3,s1=3,p0=3,p1=0,d0=3,d1=3,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=3,s1=3,p0=3,p1=3,d0=1,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=3,s1=3,p0=3,p1=3,d0=1,d1=3,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=3,s1=3,p0=3,p1=3,d0=3,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f32,dst_type=f32,ne_input=[20,20,2,2],ne_kernel=[3,3,2,2],s0=3,s1=3,p0=3,p1=3,d0=3,d1=3,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f16,dst_type=f16,ne_input=[12,12,1,32],ne_kernel=[3,3,1,32],s0=1,s1=1,p0=1,p1=1,d0=1,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f16,dst_type=f16,ne_input=[12,12,2,32],ne_kernel=[3,3,2,32],s0=1,s1=1,p0=1,p1=1,d0=1,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f16,dst_type=f16,ne_input=[12,12,1,1024],ne_kernel=[3,3,1,1024],s0=1,s1=1,p0=1,p1=1,d0=1,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f16,dst_type=f16,ne_input=[12,12,2,1024],ne_kernel=[3,3,2,1024],s0=1,s1=1,p0=1,p1=1,d0=1,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f16,dst_type=f16,ne_input=[12,12,1,2048],ne_kernel=[3,3,1,2048],s0=1,s1=1,p0=1,p1=1,d0=1,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f16,dst_type=f16,ne_input=[12,12,2,2048],ne_kernel=[3,3,2,2048],s0=1,s1=1,p0=1,p1=1,d0=1,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f16,dst_type=f16,ne_input=[12,12,1,2560],ne_kernel=[3,3,1,2560],s0=1,s1=1,p0=1,p1=1,d0=1,d1=1,is_2D=1): not supported [BLAS] 879s IM2COL(type_input=f32,type_kernel=f16,dst_type=f16,ne_input=[12,12,2,2560],ne_kernel=[3,3,2,2560],s0=1,s1=1,p0=1,p1=1,d0=1,d1=1,is_2D=1): not supported [BLAS] 879s CONV_2D_DW(ne_input=[17,34,9,1],ne_kernel=[3,3,1,9],stride=1,padding=0,dilation=1,cwhn=0): not supported [BLAS] 879s CONV_2D_DW(ne_input=[17,34,9,1],ne_kernel=[3,3,1,9],stride=1,padding=0,dilation=1,cwhn=1): not supported [BLAS] 879s CONV_2D_DW(ne_input=[32,8,64,1],ne_kernel=[3,3,1,64],stride=2,padding=1,dilation=1,cwhn=0): not supported [BLAS] 879s CONV_2D_DW(ne_input=[32,8,64,1],ne_kernel=[3,3,1,64],stride=2,padding=1,dilation=1,cwhn=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[1,1,1,1],ne_kernel=[1,1,1,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[1,1,1,1],ne_kernel=[1,1,1,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[1,1,1,1],ne_kernel=[1,1,1,1],s0=3,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[2,1,1,1],ne_kernel=[1,1,1,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[2,1,1,1],ne_kernel=[1,1,1,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[2,1,1,1],ne_kernel=[1,1,1,1],s0=3,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[13,1,1,1],ne_kernel=[1,1,1,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[13,1,1,1],ne_kernel=[1,1,1,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[13,1,1,1],ne_kernel=[1,1,1,1],s0=3,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[1,1,1,1],ne_kernel=[3,1,1,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[1,1,1,1],ne_kernel=[3,1,1,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[1,1,1,1],ne_kernel=[3,1,1,1],s0=3,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[2,1,1,1],ne_kernel=[3,1,1,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[2,1,1,1],ne_kernel=[3,1,1,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[2,1,1,1],ne_kernel=[3,1,1,1],s0=3,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[13,1,1,1],ne_kernel=[3,1,1,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[13,1,1,1],ne_kernel=[3,1,1,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[13,1,1,1],ne_kernel=[3,1,1,1],s0=3,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[1,1,1,1],ne_kernel=[1337,1,1,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[1,1,1,1],ne_kernel=[1337,1,1,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[1,1,1,1],ne_kernel=[1337,1,1,1],s0=3,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[2,1,1,1],ne_kernel=[1337,1,1,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[2,1,1,1],ne_kernel=[1337,1,1,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[2,1,1,1],ne_kernel=[1337,1,1,1],s0=3,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[13,1,1,1],ne_kernel=[1337,1,1,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[13,1,1,1],ne_kernel=[1337,1,1,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[13,1,1,1],ne_kernel=[1337,1,1,1],s0=3,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[1,7,1,1],ne_kernel=[1,1,7,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[1,7,1,1],ne_kernel=[1,1,7,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[1,7,1,1],ne_kernel=[1,1,7,1],s0=3,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[2,7,1,1],ne_kernel=[1,1,7,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[2,7,1,1],ne_kernel=[1,1,7,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[2,7,1,1],ne_kernel=[1,1,7,1],s0=3,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[13,7,1,1],ne_kernel=[1,1,7,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[13,7,1,1],ne_kernel=[1,1,7,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[13,7,1,1],ne_kernel=[1,1,7,1],s0=3,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[1,7,1,1],ne_kernel=[3,1,7,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[1,7,1,1],ne_kernel=[3,1,7,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[1,7,1,1],ne_kernel=[3,1,7,1],s0=3,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[2,7,1,1],ne_kernel=[3,1,7,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[2,7,1,1],ne_kernel=[3,1,7,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[2,7,1,1],ne_kernel=[3,1,7,1],s0=3,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[13,7,1,1],ne_kernel=[3,1,7,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[13,7,1,1],ne_kernel=[3,1,7,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[13,7,1,1],ne_kernel=[3,1,7,1],s0=3,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[1,7,1,1],ne_kernel=[1337,1,7,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[1,7,1,1],ne_kernel=[1337,1,7,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[1,7,1,1],ne_kernel=[1337,1,7,1],s0=3,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[2,7,1,1],ne_kernel=[1337,1,7,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[2,7,1,1],ne_kernel=[1337,1,7,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[2,7,1,1],ne_kernel=[1337,1,7,1],s0=3,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[13,7,1,1],ne_kernel=[1337,1,7,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[13,7,1,1],ne_kernel=[1337,1,7,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[13,7,1,1],ne_kernel=[1337,1,7,1],s0=3,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[1,1,1,1],ne_kernel=[1,9,1,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[1,1,1,1],ne_kernel=[1,9,1,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[1,1,1,1],ne_kernel=[1,9,1,1],s0=3,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[2,1,1,1],ne_kernel=[1,9,1,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[2,1,1,1],ne_kernel=[1,9,1,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[2,1,1,1],ne_kernel=[1,9,1,1],s0=3,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[13,1,1,1],ne_kernel=[1,9,1,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[13,1,1,1],ne_kernel=[1,9,1,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[13,1,1,1],ne_kernel=[1,9,1,1],s0=3,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[1,1,1,1],ne_kernel=[3,9,1,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[1,1,1,1],ne_kernel=[3,9,1,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[1,1,1,1],ne_kernel=[3,9,1,1],s0=3,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[2,1,1,1],ne_kernel=[3,9,1,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[2,1,1,1],ne_kernel=[3,9,1,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[2,1,1,1],ne_kernel=[3,9,1,1],s0=3,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[13,1,1,1],ne_kernel=[3,9,1,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[13,1,1,1],ne_kernel=[3,9,1,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[13,1,1,1],ne_kernel=[3,9,1,1],s0=3,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[1,1,1,1],ne_kernel=[1337,9,1,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[1,1,1,1],ne_kernel=[1337,9,1,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[1,1,1,1],ne_kernel=[1337,9,1,1],s0=3,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[2,1,1,1],ne_kernel=[1337,9,1,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[2,1,1,1],ne_kernel=[1337,9,1,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[2,1,1,1],ne_kernel=[1337,9,1,1],s0=3,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[13,1,1,1],ne_kernel=[1337,9,1,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[13,1,1,1],ne_kernel=[1337,9,1,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[13,1,1,1],ne_kernel=[1337,9,1,1],s0=3,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[1,7,1,1],ne_kernel=[1,9,7,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[1,7,1,1],ne_kernel=[1,9,7,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[1,7,1,1],ne_kernel=[1,9,7,1],s0=3,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[2,7,1,1],ne_kernel=[1,9,7,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[2,7,1,1],ne_kernel=[1,9,7,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[2,7,1,1],ne_kernel=[1,9,7,1],s0=3,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[13,7,1,1],ne_kernel=[1,9,7,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[13,7,1,1],ne_kernel=[1,9,7,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[13,7,1,1],ne_kernel=[1,9,7,1],s0=3,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[1,7,1,1],ne_kernel=[3,9,7,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[1,7,1,1],ne_kernel=[3,9,7,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[1,7,1,1],ne_kernel=[3,9,7,1],s0=3,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[2,7,1,1],ne_kernel=[3,9,7,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[2,7,1,1],ne_kernel=[3,9,7,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[2,7,1,1],ne_kernel=[3,9,7,1],s0=3,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[13,7,1,1],ne_kernel=[3,9,7,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[13,7,1,1],ne_kernel=[3,9,7,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[13,7,1,1],ne_kernel=[3,9,7,1],s0=3,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[1,7,1,1],ne_kernel=[1337,9,7,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[1,7,1,1],ne_kernel=[1337,9,7,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[1,7,1,1],ne_kernel=[1337,9,7,1],s0=3,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[2,7,1,1],ne_kernel=[1337,9,7,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[2,7,1,1],ne_kernel=[1337,9,7,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[2,7,1,1],ne_kernel=[1337,9,7,1],s0=3,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[13,7,1,1],ne_kernel=[1337,9,7,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[13,7,1,1],ne_kernel=[1337,9,7,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[13,7,1,1],ne_kernel=[1337,9,7,1],s0=3,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[197,32,1,1],ne_kernel=[16,32,32,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[3,2,1,1],ne_kernel=[2,3,2,1],s0=3,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[3,2,1,1],ne_kernel=[2,3,2,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[3,2,1,1],ne_kernel=[2,3,2,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[3,2,1,1],ne_kernel=[3,2,2,1],s0=2,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[3,2,1,1],ne_kernel=[3,2,2,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[3,2,1,1],ne_kernel=[3,1,2,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s CONV_TRANSPOSE_1D(ne_input=[2,1,1,1],ne_kernel=[3,1,1,1],s0=1,p0=0,d0=1): not supported [BLAS] 879s COUNT_EQUAL(type=f32,ne=[4,500,1,1]): not supported [BLAS] 879s COUNT_EQUAL(type=f32,ne=[4,5000,1,1]): not supported [BLAS] 879s ARGMAX(type=f32,ne=[32,1,1,1]): not supported [BLAS] 879s ARGMAX(type=f32,ne=[100,10,1,1]): not supported [BLAS] 879s ARGMAX(type=f32,ne=[1024,10,1,1]): not supported [BLAS] 879s ARGMAX(type=f32,ne=[1024,12,1,1]): not supported [BLAS] 879s ARGMAX(type=f32,ne=[2000,10,1,1]): not supported [BLAS] 879s ARGMAX(type=f32,ne=[5438,3,1,1]): not supported [BLAS] 879s REPEAT(type=f32,ne=[10,5,4,1],nr=[1,1,1,1]): not supported [BLAS] 879s REPEAT(type=f32,ne=[10,5,4,1],nr=[2,1,1,1]): not supported [BLAS] 879s REPEAT(type=f32,ne=[10,5,4,1],nr=[1,2,1,1]): not supported [BLAS] 879s REPEAT(type=f32,ne=[10,5,4,1],nr=[1,1,2,1]): not supported [BLAS] 879s REPEAT(type=f32,ne=[10,5,4,1],nr=[1,1,1,2]): not supported [BLAS] 879s REPEAT(type=i32,ne=[10,5,4,1],nr=[2,1,1,1]): not supported [BLAS] 879s REPEAT(type=i16,ne=[10,5,4,1],nr=[1,1,1,2]): not supported [BLAS] 879s REPEAT(type=f32,ne=[10,5,4,3],nr=[1,1,1,1]): not supported [BLAS] 879s REPEAT(type=f32,ne=[10,5,4,3],nr=[2,1,1,1]): not supported [BLAS] 879s REPEAT(type=f32,ne=[10,5,4,3],nr=[1,2,1,1]): not supported [BLAS] 879s REPEAT(type=f32,ne=[10,5,4,3],nr=[1,1,2,1]): not supported [BLAS] 879s REPEAT(type=f32,ne=[10,5,4,3],nr=[1,1,1,2]): not supported [BLAS] 879s REPEAT(type=i32,ne=[10,5,4,3],nr=[2,1,1,1]): not supported [BLAS] 879s REPEAT(type=i16,ne=[10,5,4,3],nr=[1,1,1,2]): not supported [BLAS] 879s REPEAT_BACK(type=f32,ne=[8,6,4,2],nr=[1,1,1,1],v=0): not supported [BLAS] 879s REPEAT_BACK(type=f32,ne=[8,6,4,2],nr=[2,1,1,1],v=0): not supported [BLAS] 879s REPEAT_BACK(type=f32,ne=[8,6,4,2],nr=[1,2,1,1],v=0): not supported [BLAS] 879s REPEAT_BACK(type=f32,ne=[8,6,4,2],nr=[1,1,2,1],v=0): not supported [BLAS] 879s REPEAT_BACK(type=f32,ne=[8,6,4,2],nr=[1,1,1,2],v=0): not supported [BLAS] 879s REPEAT_BACK(type=f32,ne=[8,6,4,2],nr=[1,1,1,1],v=1): not supported [BLAS] 879s REPEAT_BACK(type=f32,ne=[8,6,4,2],nr=[2,1,1,1],v=1): not supported [BLAS] 879s REPEAT_BACK(type=f32,ne=[8,6,4,2],nr=[1,2,1,1],v=1): not supported [BLAS] 879s REPEAT_BACK(type=f32,ne=[8,6,4,2],nr=[1,1,2,1],v=1): not supported [BLAS] 879s REPEAT_BACK(type=f32,ne=[8,6,4,2],nr=[1,1,1,2],v=1): not supported [BLAS] 879s DUP(type=f32,ne=[10,10,20,1]): not supported [BLAS] 879s DUP(type=f16,ne=[10,10,20,1]): not supported [BLAS] 879s DUP(type=i32,ne=[10,10,20,1]): not supported [BLAS] 879s DUP(type=i16,ne=[10,10,20,1]): not supported [BLAS] 879s DUP(type=f32,ne=[10,10,5,1],permute=[0,2,1,3]): not supported [BLAS] 879s DUP(type=f16,ne=[10,10,5,1],permute=[0,2,1,3]): not supported [BLAS] 879s DUP(type=f32,ne=[10,10,5,1],permute=[1,0,2,3]): not supported [BLAS] 879s DUP(type=f16,ne=[10,10,5,1],permute=[1,0,2,3]): not supported [BLAS] 879s DUP(type=i16,ne=[10,8,3,1],permute=[0,2,1,3]): not supported [BLAS] 879s DUP(type=i16,ne=[10,8,3,1],permute=[1,2,0,3]): not supported [BLAS] 879s SET(type_src=f32,type_dst=f32,ne=[6,5,4,3],dim=1): not supported [BLAS] 879s SET(type_src=f32,type_dst=f32,ne=[6,5,4,3],dim=2): not supported [BLAS] 879s SET(type_src=f32,type_dst=f32,ne=[6,5,4,3],dim=3): not supported [BLAS] 879s SET(type_src=i32,type_dst=i32,ne=[6,5,4,3],dim=1): not supported [BLAS] 879s SET(type_src=i32,type_dst=i32,ne=[6,5,4,3],dim=2): not supported [BLAS] 879s SET(type_src=i32,type_dst=i32,ne=[6,5,4,3],dim=3): not supported [BLAS] 879s CPY(type_src=f32,type_dst=f32,ne=[1,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=f32,ne=[1,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=f32,ne=[1,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=f32,ne=[2,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=f32,ne=[2,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=f32,ne=[2,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=f32,ne=[3,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=f32,ne=[3,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=f32,ne=[3,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=f16,ne=[1,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=f16,ne=[1,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=f16,ne=[1,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=f16,ne=[2,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=f16,ne=[2,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=f16,ne=[2,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=f16,ne=[3,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=f16,ne=[3,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=f16,ne=[3,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=bf16,ne=[1,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=bf16,ne=[1,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=bf16,ne=[1,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=bf16,ne=[2,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=bf16,ne=[2,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=bf16,ne=[2,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=bf16,ne=[3,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=bf16,ne=[3,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=bf16,ne=[3,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=q4_0,type_dst=q4_0,ne=[32,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q4_0,type_dst=q4_0,ne=[32,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q4_0,type_dst=q4_0,ne=[32,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=q4_0,type_dst=q4_0,ne=[64,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q4_0,type_dst=q4_0,ne=[64,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q4_0,type_dst=q4_0,ne=[64,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=q4_0,type_dst=q4_0,ne=[96,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q4_0,type_dst=q4_0,ne=[96,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q4_0,type_dst=q4_0,ne=[96,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=q4_1,type_dst=q4_1,ne=[32,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q4_1,type_dst=q4_1,ne=[32,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q4_1,type_dst=q4_1,ne=[32,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=q4_1,type_dst=q4_1,ne=[64,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q4_1,type_dst=q4_1,ne=[64,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q4_1,type_dst=q4_1,ne=[64,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=q4_1,type_dst=q4_1,ne=[96,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q4_1,type_dst=q4_1,ne=[96,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q4_1,type_dst=q4_1,ne=[96,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=q5_0,type_dst=q5_0,ne=[32,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q5_0,type_dst=q5_0,ne=[32,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q5_0,type_dst=q5_0,ne=[32,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=q5_0,type_dst=q5_0,ne=[64,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q5_0,type_dst=q5_0,ne=[64,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q5_0,type_dst=q5_0,ne=[64,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=q5_0,type_dst=q5_0,ne=[96,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q5_0,type_dst=q5_0,ne=[96,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q5_0,type_dst=q5_0,ne=[96,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=q5_1,type_dst=q5_1,ne=[32,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q5_1,type_dst=q5_1,ne=[32,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q5_1,type_dst=q5_1,ne=[32,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=q5_1,type_dst=q5_1,ne=[64,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q5_1,type_dst=q5_1,ne=[64,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q5_1,type_dst=q5_1,ne=[64,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=q5_1,type_dst=q5_1,ne=[96,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q5_1,type_dst=q5_1,ne=[96,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q5_1,type_dst=q5_1,ne=[96,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=q8_0,type_dst=q8_0,ne=[32,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q8_0,type_dst=q8_0,ne=[32,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q8_0,type_dst=q8_0,ne=[32,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=q8_0,type_dst=q8_0,ne=[64,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q8_0,type_dst=q8_0,ne=[64,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q8_0,type_dst=q8_0,ne=[64,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=q8_0,type_dst=q8_0,ne=[96,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q8_0,type_dst=q8_0,ne=[96,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q8_0,type_dst=q8_0,ne=[96,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=q2_K,type_dst=q2_K,ne=[256,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q2_K,type_dst=q2_K,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q2_K,type_dst=q2_K,ne=[256,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=q2_K,type_dst=q2_K,ne=[512,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q2_K,type_dst=q2_K,ne=[512,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q2_K,type_dst=q2_K,ne=[512,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=q2_K,type_dst=q2_K,ne=[768,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q2_K,type_dst=q2_K,ne=[768,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q2_K,type_dst=q2_K,ne=[768,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=q3_K,type_dst=q3_K,ne=[256,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q3_K,type_dst=q3_K,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q3_K,type_dst=q3_K,ne=[256,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=q3_K,type_dst=q3_K,ne=[512,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q3_K,type_dst=q3_K,ne=[512,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q3_K,type_dst=q3_K,ne=[512,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=q3_K,type_dst=q3_K,ne=[768,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q3_K,type_dst=q3_K,ne=[768,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q3_K,type_dst=q3_K,ne=[768,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=q4_K,type_dst=q4_K,ne=[256,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q4_K,type_dst=q4_K,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q4_K,type_dst=q4_K,ne=[256,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=q4_K,type_dst=q4_K,ne=[512,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q4_K,type_dst=q4_K,ne=[512,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q4_K,type_dst=q4_K,ne=[512,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=q4_K,type_dst=q4_K,ne=[768,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q4_K,type_dst=q4_K,ne=[768,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q4_K,type_dst=q4_K,ne=[768,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=q5_K,type_dst=q5_K,ne=[256,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q5_K,type_dst=q5_K,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q5_K,type_dst=q5_K,ne=[256,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=q5_K,type_dst=q5_K,ne=[512,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q5_K,type_dst=q5_K,ne=[512,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q5_K,type_dst=q5_K,ne=[512,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=q5_K,type_dst=q5_K,ne=[768,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q5_K,type_dst=q5_K,ne=[768,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q5_K,type_dst=q5_K,ne=[768,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=q6_K,type_dst=q6_K,ne=[256,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q6_K,type_dst=q6_K,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q6_K,type_dst=q6_K,ne=[256,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=q6_K,type_dst=q6_K,ne=[512,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q6_K,type_dst=q6_K,ne=[512,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q6_K,type_dst=q6_K,ne=[512,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=q6_K,type_dst=q6_K,ne=[768,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q6_K,type_dst=q6_K,ne=[768,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q6_K,type_dst=q6_K,ne=[768,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=iq2_xxs,type_dst=iq2_xxs,ne=[256,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq2_xxs,type_dst=iq2_xxs,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq2_xxs,type_dst=iq2_xxs,ne=[256,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq2_xxs,type_dst=iq2_xxs,ne=[512,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq2_xxs,type_dst=iq2_xxs,ne=[512,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq2_xxs,type_dst=iq2_xxs,ne=[512,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq2_xxs,type_dst=iq2_xxs,ne=[768,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq2_xxs,type_dst=iq2_xxs,ne=[768,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq2_xxs,type_dst=iq2_xxs,ne=[768,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq2_xs,type_dst=iq2_xs,ne=[256,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq2_xs,type_dst=iq2_xs,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq2_xs,type_dst=iq2_xs,ne=[256,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq2_xs,type_dst=iq2_xs,ne=[512,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq2_xs,type_dst=iq2_xs,ne=[512,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq2_xs,type_dst=iq2_xs,ne=[512,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq2_xs,type_dst=iq2_xs,ne=[768,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq2_xs,type_dst=iq2_xs,ne=[768,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq2_xs,type_dst=iq2_xs,ne=[768,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq2_s,type_dst=iq2_s,ne=[256,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq2_s,type_dst=iq2_s,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq2_s,type_dst=iq2_s,ne=[256,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq2_s,type_dst=iq2_s,ne=[512,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq2_s,type_dst=iq2_s,ne=[512,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq2_s,type_dst=iq2_s,ne=[512,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq2_s,type_dst=iq2_s,ne=[768,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq2_s,type_dst=iq2_s,ne=[768,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq2_s,type_dst=iq2_s,ne=[768,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq3_xxs,type_dst=iq3_xxs,ne=[256,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq3_xxs,type_dst=iq3_xxs,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq3_xxs,type_dst=iq3_xxs,ne=[256,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq3_xxs,type_dst=iq3_xxs,ne=[512,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq3_xxs,type_dst=iq3_xxs,ne=[512,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq3_xxs,type_dst=iq3_xxs,ne=[512,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq3_xxs,type_dst=iq3_xxs,ne=[768,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq3_xxs,type_dst=iq3_xxs,ne=[768,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq3_xxs,type_dst=iq3_xxs,ne=[768,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq1_s,type_dst=iq1_s,ne=[256,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq1_s,type_dst=iq1_s,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq1_s,type_dst=iq1_s,ne=[256,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq1_s,type_dst=iq1_s,ne=[512,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq1_s,type_dst=iq1_s,ne=[512,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq1_s,type_dst=iq1_s,ne=[512,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq1_s,type_dst=iq1_s,ne=[768,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq1_s,type_dst=iq1_s,ne=[768,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq1_s,type_dst=iq1_s,ne=[768,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq1_m,type_dst=iq1_m,ne=[256,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq1_m,type_dst=iq1_m,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq1_m,type_dst=iq1_m,ne=[256,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq1_m,type_dst=iq1_m,ne=[512,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq1_m,type_dst=iq1_m,ne=[512,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq1_m,type_dst=iq1_m,ne=[512,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq1_m,type_dst=iq1_m,ne=[768,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq1_m,type_dst=iq1_m,ne=[768,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq1_m,type_dst=iq1_m,ne=[768,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq4_nl,type_dst=iq4_nl,ne=[32,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=iq4_nl,type_dst=iq4_nl,ne=[32,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=iq4_nl,type_dst=iq4_nl,ne=[32,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=iq4_nl,type_dst=iq4_nl,ne=[64,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=iq4_nl,type_dst=iq4_nl,ne=[64,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=iq4_nl,type_dst=iq4_nl,ne=[64,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=iq4_nl,type_dst=iq4_nl,ne=[96,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=iq4_nl,type_dst=iq4_nl,ne=[96,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=iq4_nl,type_dst=iq4_nl,ne=[96,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=iq3_s,type_dst=iq3_s,ne=[256,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq3_s,type_dst=iq3_s,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq3_s,type_dst=iq3_s,ne=[256,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq3_s,type_dst=iq3_s,ne=[512,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq3_s,type_dst=iq3_s,ne=[512,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq3_s,type_dst=iq3_s,ne=[512,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq3_s,type_dst=iq3_s,ne=[768,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq3_s,type_dst=iq3_s,ne=[768,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq3_s,type_dst=iq3_s,ne=[768,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=iq4_xs,type_dst=iq4_xs,ne=[256,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=iq4_xs,type_dst=iq4_xs,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=iq4_xs,type_dst=iq4_xs,ne=[256,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=iq4_xs,type_dst=iq4_xs,ne=[512,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=iq4_xs,type_dst=iq4_xs,ne=[512,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=iq4_xs,type_dst=iq4_xs,ne=[512,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=iq4_xs,type_dst=iq4_xs,ne=[768,2,3,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=iq4_xs,type_dst=iq4_xs,ne=[768,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=iq4_xs,type_dst=iq4_xs,ne=[768,2,3,4],permute_src=[0,3,1,2],permute_dst=[0,2,1,3]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=f32,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=f32,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=f16,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=f16,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=bf16,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=bf16,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=q4_0,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=q4_0,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=q4_1,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=q4_1,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=q5_0,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=q5_0,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=q5_1,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=q5_1,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=q8_0,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=q8_0,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=q2_K,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=q2_K,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=q3_K,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=q3_K,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=q4_K,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=q4_K,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=q5_K,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=q5_K,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=q6_K,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=q6_K,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=iq2_xxs,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=f16,type_dst=iq2_xxs,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=f16,type_dst=iq2_xs,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=f16,type_dst=iq2_xs,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=f16,type_dst=iq2_s,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=f16,type_dst=iq2_s,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=f16,type_dst=iq3_xxs,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=f16,type_dst=iq3_xxs,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=f16,type_dst=iq1_s,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=f16,type_dst=iq1_s,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=f16,type_dst=iq1_m,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=f16,type_dst=iq1_m,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=f16,type_dst=iq4_nl,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=iq4_nl,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=iq3_s,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=f16,type_dst=iq3_s,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=f16,type_dst=iq4_xs,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=iq4_xs,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=f32,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=f32,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=f16,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=f16,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=bf16,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=bf16,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=q4_0,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=q4_0,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=q4_1,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=q4_1,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=q5_0,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=q5_0,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=q5_1,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=q5_1,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=q8_0,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=q8_0,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=q2_K,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=q2_K,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=q3_K,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=q3_K,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=q4_K,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=q4_K,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=q5_K,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=q5_K,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=q6_K,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=q6_K,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=iq2_xxs,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=bf16,type_dst=iq2_xxs,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=bf16,type_dst=iq2_xs,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=bf16,type_dst=iq2_xs,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=bf16,type_dst=iq2_s,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=bf16,type_dst=iq2_s,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=bf16,type_dst=iq3_xxs,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=bf16,type_dst=iq3_xxs,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=bf16,type_dst=iq1_s,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=bf16,type_dst=iq1_s,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=bf16,type_dst=iq1_m,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=bf16,type_dst=iq1_m,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=bf16,type_dst=iq4_nl,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=iq4_nl,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=iq3_s,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=bf16,type_dst=iq3_s,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=bf16,type_dst=iq4_xs,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=iq4_xs,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=f32,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=f32,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=f16,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=f16,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=bf16,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=bf16,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=q4_0,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=q4_0,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=q4_1,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=q4_1,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=q5_0,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=q5_0,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=q5_1,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=q5_1,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=q8_0,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=q8_0,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=q2_K,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=q2_K,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=q3_K,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=q3_K,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=q4_K,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=q4_K,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=q5_K,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=q5_K,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=q6_K,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=q6_K,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=iq2_xxs,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=f32,type_dst=iq2_xxs,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=f32,type_dst=iq2_xs,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=f32,type_dst=iq2_xs,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=f32,type_dst=iq2_s,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=f32,type_dst=iq2_s,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=f32,type_dst=iq3_xxs,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=f32,type_dst=iq3_xxs,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=f32,type_dst=iq1_s,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=f32,type_dst=iq1_s,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=f32,type_dst=iq1_m,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=f32,type_dst=iq1_m,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=f32,type_dst=iq4_nl,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=iq4_nl,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=iq3_s,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=f32,type_dst=iq3_s,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] not supported [CPU] 879s CPY(type_src=f32,type_dst=iq4_xs,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=iq4_xs,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=f32,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=f32,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=f32,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=f32,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=f32,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=bf16,type_dst=f32,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q4_0,type_dst=f32,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q4_0,type_dst=f32,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q4_1,type_dst=f32,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q4_1,type_dst=f32,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q5_0,type_dst=f32,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q5_0,type_dst=f32,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q5_1,type_dst=f32,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q5_1,type_dst=f32,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q8_0,type_dst=f32,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q8_0,type_dst=f32,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q2_K,type_dst=f32,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q2_K,type_dst=f32,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q3_K,type_dst=f32,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q3_K,type_dst=f32,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q4_K,type_dst=f32,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q4_K,type_dst=f32,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q5_K,type_dst=f32,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q5_K,type_dst=f32,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q6_K,type_dst=f32,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=q6_K,type_dst=f32,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=iq2_xxs,type_dst=f32,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=iq2_xxs,type_dst=f32,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=iq2_xs,type_dst=f32,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=iq2_xs,type_dst=f32,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=iq2_s,type_dst=f32,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=iq2_s,type_dst=f32,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=iq3_xxs,type_dst=f32,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=iq3_xxs,type_dst=f32,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=iq1_s,type_dst=f32,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=iq1_s,type_dst=f32,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=iq1_m,type_dst=f32,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=iq1_m,type_dst=f32,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=iq4_nl,type_dst=f32,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=iq4_nl,type_dst=f32,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=iq3_s,type_dst=f32,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=iq3_s,type_dst=f32,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=iq4_xs,type_dst=f32,ne=[256,4,4,4],permute_src=[0,0,0,0],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=iq4_xs,type_dst=f32,ne=[256,2,3,4],permute_src=[0,2,1,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=f16,ne=[256,2,3,4],permute_src=[1,0,2,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f16,type_dst=f32,ne=[256,2,3,4],permute_src=[1,0,2,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=f16,ne=[256,2,3,4],permute_src=[1,0,2,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CPY(type_src=f32,type_dst=f32,ne=[256,2,3,4],permute_src=[1,0,2,3],permute_dst=[0,0,0,0]): not supported [BLAS] 879s CONT(type=f32,ne=[10,10,10,1]): not supported [BLAS] 879s CONT(type=f32,ne=[2,1,1,1]): not supported [BLAS] 879s CONT(type=f32,ne=[2,1,3,5]): not supported [BLAS] 879s CONT(type=f32,ne=[2,3,5,7]): not supported [BLAS] 879s CONT(type=f16,ne=[2,1,1,1]): not supported [BLAS] 879s CONT(type=f16,ne=[2,1,3,5]): not supported [BLAS] 879s CONT(type=f16,ne=[2,3,5,7]): not supported [BLAS] 879s CONT(type=bf16,ne=[2,1,1,1]): not supported [BLAS] 879s CONT(type=bf16,ne=[2,1,3,5]): not supported [BLAS] 879s CONT(type=bf16,ne=[2,3,5,7]): not supported [BLAS] 879s ADD(type=f16,ne=[1,1,8,1],nr=[1,1,1,1]): not supported [BLAS] 879s SUB(type=f16,ne=[1,1,8,1],nr=[1,1,1,1]): not supported [BLAS] 879s MUL(type=f16,ne=[1,1,8,1],nr=[1,1,1,1]): not supported [BLAS] 879s DIV(type=f16,ne=[1,1,8,1],nr=[1,1,1,1]): not supported [BLAS] 879s ADD(type=f16,ne=[1,1,1,1],nr=[32,1,1,1]): not supported [BLAS] 879s SUB(type=f16,ne=[1,1,1,1],nr=[32,1,1,1]): not supported [BLAS] 879s MUL(type=f16,ne=[1,1,1,1],nr=[32,1,1,1]): not supported [BLAS] 879s DIV(type=f16,ne=[1,1,1,1],nr=[32,1,1,1]): not supported [BLAS] 879s ADD(type=f16,ne=[1,1,320,320],nr=[1,1,1,1]): not supported [BLAS] 879s SUB(type=f16,ne=[1,1,320,320],nr=[1,1,1,1]): not supported [BLAS] 879s MUL(type=f16,ne=[1,1,320,320],nr=[1,1,1,1]): not supported [BLAS] 879s DIV(type=f16,ne=[1,1,320,320],nr=[1,1,1,1]): not supported [BLAS] 879s ADD(type=f16,ne=[10,5,1,1],nr=[1,1,1,1]): not supported [BLAS] 879s SUB(type=f16,ne=[10,5,1,1],nr=[1,1,1,1]): not supported [BLAS] 879s MUL(type=f16,ne=[10,5,1,1],nr=[1,1,1,1]): not supported [BLAS] 879s DIV(type=f16,ne=[10,5,1,1],nr=[1,1,1,1]): not supported [BLAS] 879s ADD(type=f16,ne=[10,5,4,1],nr=[1,1,1,1]): not supported [BLAS] 879s SUB(type=f16,ne=[10,5,4,1],nr=[1,1,1,1]): not supported [BLAS] 879s MUL(type=f16,ne=[10,5,4,1],nr=[1,1,1,1]): not supported [BLAS] 879s DIV(type=f16,ne=[10,5,4,1],nr=[1,1,1,1]): not supported [BLAS] 879s ADD(type=f16,ne=[10,5,4,3],nr=[1,1,1,1]): not supported [BLAS] 879s SUB(type=f16,ne=[10,5,4,3],nr=[1,1,1,1]): not supported [BLAS] 879s MUL(type=f16,ne=[10,5,4,3],nr=[1,1,1,1]): not supported [BLAS] 879s DIV(type=f16,ne=[10,5,4,3],nr=[1,1,1,1]): not supported [BLAS] 879s ADD(type=f16,ne=[10,5,4,3],nr=[2,1,1,1]): not supported [BLAS] 879s SUB(type=f16,ne=[10,5,4,3],nr=[2,1,1,1]): not supported [BLAS] 879s MUL(type=f16,ne=[10,5,4,3],nr=[2,1,1,1]): not supported [BLAS] 879s DIV(type=f16,ne=[10,5,4,3],nr=[2,1,1,1]): not supported [BLAS] 879s ADD(type=f16,ne=[10,5,4,3],nr=[1,2,1,1]): not supported [BLAS] 879s SUB(type=f16,ne=[10,5,4,3],nr=[1,2,1,1]): not supported [BLAS] 879s MUL(type=f16,ne=[10,5,4,3],nr=[1,2,1,1]): not supported [BLAS] 879s DIV(type=f16,ne=[10,5,4,3],nr=[1,2,1,1]): not supported [BLAS] 879s ADD(type=f16,ne=[10,5,4,3],nr=[1,1,2,1]): not supported [BLAS] 879s SUB(type=f16,ne=[10,5,4,3],nr=[1,1,2,1]): not supported [BLAS] 879s MUL(type=f16,ne=[10,5,4,3],nr=[1,1,2,1]): not supported [BLAS] 879s DIV(type=f16,ne=[10,5,4,3],nr=[1,1,2,1]): not supported [BLAS] 879s ADD(type=f16,ne=[10,5,4,3],nr=[1,1,1,2]): not supported [BLAS] 879s SUB(type=f16,ne=[10,5,4,3],nr=[1,1,1,2]): not supported [BLAS] 879s MUL(type=f16,ne=[10,5,4,3],nr=[1,1,1,2]): not supported [BLAS] 879s DIV(type=f16,ne=[10,5,4,3],nr=[1,1,1,2]): not supported [BLAS] 879s ADD(type=f16,ne=[10,5,4,3],nr=[1,1,2,2]): not supported [BLAS] 879s SUB(type=f16,ne=[10,5,4,3],nr=[1,1,2,2]): not supported [BLAS] 879s MUL(type=f16,ne=[10,5,4,3],nr=[1,1,2,2]): not supported [BLAS] 879s DIV(type=f16,ne=[10,5,4,3],nr=[1,1,2,2]): not supported [BLAS] 879s ADD(type=f16,ne=[10,5,4,3],nr=[1,2,2,2]): not supported [BLAS] 879s SUB(type=f16,ne=[10,5,4,3],nr=[1,2,2,2]): not supported [BLAS] 879s MUL(type=f16,ne=[10,5,4,3],nr=[1,2,2,2]): not supported [BLAS] 879s DIV(type=f16,ne=[10,5,4,3],nr=[1,2,2,2]): not supported [BLAS] 879s ADD(type=f16,ne=[10,5,4,3],nr=[2,2,2,2]): not supported [BLAS] 879s SUB(type=f16,ne=[10,5,4,3],nr=[2,2,2,2]): not supported [BLAS] 879s MUL(type=f16,ne=[10,5,4,3],nr=[2,2,2,2]): not supported [BLAS] 879s DIV(type=f16,ne=[10,5,4,3],nr=[2,2,2,2]): not supported [BLAS] 879s ADD(type=f16,ne=[1280,1,1,1],nr=[1,1,1,1]): not supported [BLAS] 879s SUB(type=f16,ne=[1280,1,1,1],nr=[1,1,1,1]): not supported [BLAS] 879s MUL(type=f16,ne=[1280,1,1,1],nr=[1,1,1,1]): not supported [BLAS] 879s DIV(type=f16,ne=[1280,1,1,1],nr=[1,1,1,1]): not supported [BLAS] 879s ADD(type=f16,ne=[1280,1,1,1],nr=[1,16,16,1]): not supported [BLAS] 879s SUB(type=f16,ne=[1280,1,1,1],nr=[1,16,16,1]): not supported [BLAS] 879s MUL(type=f16,ne=[1280,1,1,1],nr=[1,16,16,1]): not supported [BLAS] 879s DIV(type=f16,ne=[1280,1,1,1],nr=[1,16,16,1]): not supported [BLAS] 879s ADD(type=f16,ne=[1280,16,16,1],nr=[1,1,1,1]): not supported [BLAS] 879s SUB(type=f16,ne=[1280,16,16,1],nr=[1,1,1,1]): not supported [BLAS] 879s MUL(type=f16,ne=[1280,16,16,1],nr=[1,1,1,1]): not supported [BLAS] 879s DIV(type=f16,ne=[1280,16,16,1],nr=[1,1,1,1]): not supported [BLAS] 879s ADD(type=f16,ne=[1280,1,1,1],nr=[1,256,1,1]): not supported [BLAS] 879s SUB(type=f16,ne=[1280,1,1,1],nr=[1,256,1,1]): not supported [BLAS] 879s MUL(type=f16,ne=[1280,1,1,1],nr=[1,256,1,1]): not supported [BLAS] 879s DIV(type=f16,ne=[1280,1,1,1],nr=[1,256,1,1]): not supported [BLAS] 879s ADD(type=f16,ne=[1,1,1280,1],nr=[16,16,1,1]): not supported [BLAS] 879s SUB(type=f16,ne=[1,1,1280,1],nr=[16,16,1,1]): not supported [BLAS] 879s MUL(type=f16,ne=[1,1,1280,1],nr=[16,16,1,1]): not supported [BLAS] 879s DIV(type=f16,ne=[1,1,1280,1],nr=[16,16,1,1]): not supported [BLAS] 879s ADD(type=f16,ne=[16,16,1280,1],nr=[1,1,1,1]): not supported [BLAS] 879s SUB(type=f16,ne=[16,16,1280,1],nr=[1,1,1,1]): not supported [BLAS] 879s MUL(type=f16,ne=[16,16,1280,1],nr=[1,1,1,1]): not supported [BLAS] 879s DIV(type=f16,ne=[16,16,1280,1],nr=[1,1,1,1]): not supported [BLAS] 879s ADD(type=f16,ne=[1,1,1920,1],nr=[16,16,1,1]): not supported [BLAS] 879s SUB(type=f16,ne=[1,1,1920,1],nr=[16,16,1,1]): not supported [BLAS] 879s MUL(type=f16,ne=[1,1,1920,1],nr=[16,16,1,1]): not supported [BLAS] 879s DIV(type=f16,ne=[1,1,1920,1],nr=[16,16,1,1]): not supported [BLAS] 879s ADD(type=f16,ne=[1,1,2560,1],nr=[16,16,1,1]): not supported [BLAS] 879s SUB(type=f16,ne=[1,1,2560,1],nr=[16,16,1,1]): not supported [BLAS] 879s MUL(type=f16,ne=[1,1,2560,1],nr=[16,16,1,1]): not supported [BLAS] 879s DIV(type=f16,ne=[1,1,2560,1],nr=[16,16,1,1]): not supported [BLAS] 879s ADD(type=f16,ne=[1,1,1280,1],nr=[32,32,1,1]): not supported [BLAS] 879s SUB(type=f16,ne=[1,1,1280,1],nr=[32,32,1,1]): not supported [BLAS] 879s MUL(type=f16,ne=[1,1,1280,1],nr=[32,32,1,1]): not supported [BLAS] 879s DIV(type=f16,ne=[1,1,1280,1],nr=[32,32,1,1]): not supported [BLAS] 879s ADD(type=f16,ne=[1,1,1920,1],nr=[32,32,1,1]): not supported [BLAS] 879s SUB(type=f16,ne=[1,1,1920,1],nr=[32,32,1,1]): not supported [BLAS] 879s MUL(type=f16,ne=[1,1,1920,1],nr=[32,32,1,1]): not supported [BLAS] 879s DIV(type=f16,ne=[1,1,1920,1],nr=[32,32,1,1]): not supported [BLAS] 879s ADD(type=f16,ne=[1,1,640,1],nr=[32,32,1,1]): not supported [BLAS] 879s SUB(type=f16,ne=[1,1,640,1],nr=[32,32,1,1]): not supported [BLAS] 879s MUL(type=f16,ne=[1,1,640,1],nr=[32,32,1,1]): not supported [BLAS] 879s DIV(type=f16,ne=[1,1,640,1],nr=[32,32,1,1]): not supported [BLAS] 879s ADD(type=f16,ne=[5120,1,1,1],nr=[1,256,1,1]): not supported [BLAS] 879s SUB(type=f16,ne=[5120,1,1,1],nr=[1,256,1,1]): not supported [BLAS] 879s MUL(type=f16,ne=[5120,1,1,1],nr=[1,256,1,1]): not supported [BLAS] 879s DIV(type=f16,ne=[5120,1,1,1],nr=[1,256,1,1]): not supported [BLAS] 879s ADD(type=f16,ne=[640,1,1,1],nr=[1,1,1,1]): not supported [BLAS] 879s SUB(type=f16,ne=[640,1,1,1],nr=[1,1,1,1]): not supported [BLAS] 879s MUL(type=f16,ne=[640,1,1,1],nr=[1,1,1,1]): not supported [BLAS] 879s DIV(type=f16,ne=[640,1,1,1],nr=[1,1,1,1]): not supported [BLAS] 879s ADD(type=f32,ne=[1,1,8,1],nr=[1,1,1,1]): not supported [BLAS] 879s SUB(type=f32,ne=[1,1,8,1],nr=[1,1,1,1]): not supported [BLAS] 879s MUL(type=f32,ne=[1,1,8,1],nr=[1,1,1,1]): not supported [BLAS] 879s DIV(type=f32,ne=[1,1,8,1],nr=[1,1,1,1]): not supported [BLAS] 879s ADD(type=f32,ne=[1,1,1,1],nr=[32,1,1,1]): not supported [BLAS] 879s SUB(type=f32,ne=[1,1,1,1],nr=[32,1,1,1]): not supported [BLAS] 879s MUL(type=f32,ne=[1,1,1,1],nr=[32,1,1,1]): not supported [BLAS] 879s DIV(type=f32,ne=[1,1,1,1],nr=[32,1,1,1]): not supported [BLAS] 879s ADD(type=f32,ne=[1,1,320,320],nr=[1,1,1,1]): not supported [BLAS] 879s SUB(type=f32,ne=[1,1,320,320],nr=[1,1,1,1]): not supported [BLAS] 879s MUL(type=f32,ne=[1,1,320,320],nr=[1,1,1,1]): not supported [BLAS] 879s DIV(type=f32,ne=[1,1,320,320],nr=[1,1,1,1]): not supported [BLAS] 879s ADD(type=f32,ne=[10,5,1,1],nr=[1,1,1,1]): not supported [BLAS] 879s SUB(type=f32,ne=[10,5,1,1],nr=[1,1,1,1]): not supported [BLAS] 879s MUL(type=f32,ne=[10,5,1,1],nr=[1,1,1,1]): not supported [BLAS] 879s DIV(type=f32,ne=[10,5,1,1],nr=[1,1,1,1]): not supported [BLAS] 879s ADD(type=f32,ne=[10,5,4,1],nr=[1,1,1,1]): not supported [BLAS] 879s SUB(type=f32,ne=[10,5,4,1],nr=[1,1,1,1]): not supported [BLAS] 879s MUL(type=f32,ne=[10,5,4,1],nr=[1,1,1,1]): not supported [BLAS] 879s DIV(type=f32,ne=[10,5,4,1],nr=[1,1,1,1]): not supported [BLAS] 879s ADD(type=f32,ne=[10,5,4,3],nr=[1,1,1,1]): not supported [BLAS] 879s SUB(type=f32,ne=[10,5,4,3],nr=[1,1,1,1]): not supported [BLAS] 879s MUL(type=f32,ne=[10,5,4,3],nr=[1,1,1,1]): not supported [BLAS] 879s DIV(type=f32,ne=[10,5,4,3],nr=[1,1,1,1]): not supported [BLAS] 879s ADD(type=f32,ne=[10,5,4,3],nr=[2,1,1,1]): not supported [BLAS] 879s SUB(type=f32,ne=[10,5,4,3],nr=[2,1,1,1]): not supported [BLAS] 879s MUL(type=f32,ne=[10,5,4,3],nr=[2,1,1,1]): not supported [BLAS] 879s DIV(type=f32,ne=[10,5,4,3],nr=[2,1,1,1]): not supported [BLAS] 879s ADD(type=f32,ne=[10,5,4,3],nr=[1,2,1,1]): not supported [BLAS] 879s SUB(type=f32,ne=[10,5,4,3],nr=[1,2,1,1]): not supported [BLAS] 879s MUL(type=f32,ne=[10,5,4,3],nr=[1,2,1,1]): not supported [BLAS] 879s DIV(type=f32,ne=[10,5,4,3],nr=[1,2,1,1]): not supported [BLAS] 879s ADD(type=f32,ne=[10,5,4,3],nr=[1,1,2,1]): not supported [BLAS] 879s SUB(type=f32,ne=[10,5,4,3],nr=[1,1,2,1]): not supported [BLAS] 879s MUL(type=f32,ne=[10,5,4,3],nr=[1,1,2,1]): not supported [BLAS] 879s DIV(type=f32,ne=[10,5,4,3],nr=[1,1,2,1]): not supported [BLAS] 879s ADD(type=f32,ne=[10,5,4,3],nr=[1,1,1,2]): not supported [BLAS] 879s SUB(type=f32,ne=[10,5,4,3],nr=[1,1,1,2]): not supported [BLAS] 879s MUL(type=f32,ne=[10,5,4,3],nr=[1,1,1,2]): not supported [BLAS] 879s DIV(type=f32,ne=[10,5,4,3],nr=[1,1,1,2]): not supported [BLAS] 879s ADD(type=f32,ne=[10,5,4,3],nr=[1,1,2,2]): not supported [BLAS] 879s SUB(type=f32,ne=[10,5,4,3],nr=[1,1,2,2]): not supported [BLAS] 879s MUL(type=f32,ne=[10,5,4,3],nr=[1,1,2,2]): not supported [BLAS] 879s DIV(type=f32,ne=[10,5,4,3],nr=[1,1,2,2]): not supported [BLAS] 879s ADD(type=f32,ne=[10,5,4,3],nr=[1,2,2,2]): not supported [BLAS] 879s SUB(type=f32,ne=[10,5,4,3],nr=[1,2,2,2]): not supported [BLAS] 879s MUL(type=f32,ne=[10,5,4,3],nr=[1,2,2,2]): not supported [BLAS] 879s DIV(type=f32,ne=[10,5,4,3],nr=[1,2,2,2]): not supported [BLAS] 879s ADD(type=f32,ne=[10,5,4,3],nr=[2,2,2,2]): not supported [BLAS] 879s SUB(type=f32,ne=[10,5,4,3],nr=[2,2,2,2]): not supported [BLAS] 879s MUL(type=f32,ne=[10,5,4,3],nr=[2,2,2,2]): not supported [BLAS] 879s DIV(type=f32,ne=[10,5,4,3],nr=[2,2,2,2]): not supported [BLAS] 879s ADD(type=f32,ne=[1280,1,1,1],nr=[1,1,1,1]): not supported [BLAS] 879s SUB(type=f32,ne=[1280,1,1,1],nr=[1,1,1,1]): not supported [BLAS] 879s MUL(type=f32,ne=[1280,1,1,1],nr=[1,1,1,1]): not supported [BLAS] 879s DIV(type=f32,ne=[1280,1,1,1],nr=[1,1,1,1]): not supported [BLAS] 879s ADD(type=f32,ne=[1280,1,1,1],nr=[1,16,16,1]): not supported [BLAS] 879s SUB(type=f32,ne=[1280,1,1,1],nr=[1,16,16,1]): not supported [BLAS] 879s MUL(type=f32,ne=[1280,1,1,1],nr=[1,16,16,1]): not supported [BLAS] 879s DIV(type=f32,ne=[1280,1,1,1],nr=[1,16,16,1]): not supported [BLAS] 879s ADD(type=f32,ne=[1280,16,16,1],nr=[1,1,1,1]): not supported [BLAS] 879s SUB(type=f32,ne=[1280,16,16,1],nr=[1,1,1,1]): not supported [BLAS] 879s MUL(type=f32,ne=[1280,16,16,1],nr=[1,1,1,1]): not supported [BLAS] 879s DIV(type=f32,ne=[1280,16,16,1],nr=[1,1,1,1]): not supported [BLAS] 879s ADD(type=f32,ne=[1280,1,1,1],nr=[1,256,1,1]): not supported [BLAS] 879s SUB(type=f32,ne=[1280,1,1,1],nr=[1,256,1,1]): not supported [BLAS] 879s MUL(type=f32,ne=[1280,1,1,1],nr=[1,256,1,1]): not supported [BLAS] 879s DIV(type=f32,ne=[1280,1,1,1],nr=[1,256,1,1]): not supported [BLAS] 879s ADD(type=f32,ne=[1,1,1280,1],nr=[16,16,1,1]): not supported [BLAS] 879s SUB(type=f32,ne=[1,1,1280,1],nr=[16,16,1,1]): not supported [BLAS] 879s MUL(type=f32,ne=[1,1,1280,1],nr=[16,16,1,1]): not supported [BLAS] 879s DIV(type=f32,ne=[1,1,1280,1],nr=[16,16,1,1]): not supported [BLAS] 879s ADD(type=f32,ne=[16,16,1280,1],nr=[1,1,1,1]): not supported [BLAS] 879s SUB(type=f32,ne=[16,16,1280,1],nr=[1,1,1,1]): not supported [BLAS] 879s MUL(type=f32,ne=[16,16,1280,1],nr=[1,1,1,1]): not supported [BLAS] 879s DIV(type=f32,ne=[16,16,1280,1],nr=[1,1,1,1]): not supported [BLAS] 879s ADD(type=f32,ne=[1,1,1920,1],nr=[16,16,1,1]): not supported [BLAS] 879s SUB(type=f32,ne=[1,1,1920,1],nr=[16,16,1,1]): not supported [BLAS] 879s MUL(type=f32,ne=[1,1,1920,1],nr=[16,16,1,1]): not supported [BLAS] 879s DIV(type=f32,ne=[1,1,1920,1],nr=[16,16,1,1]): not supported [BLAS] 879s ADD(type=f32,ne=[1,1,2560,1],nr=[16,16,1,1]): not supported [BLAS] 879s SUB(type=f32,ne=[1,1,2560,1],nr=[16,16,1,1]): not supported [BLAS] 879s MUL(type=f32,ne=[1,1,2560,1],nr=[16,16,1,1]): not supported [BLAS] 879s DIV(type=f32,ne=[1,1,2560,1],nr=[16,16,1,1]): not supported [BLAS] 879s ADD(type=f32,ne=[1,1,1280,1],nr=[32,32,1,1]): not supported [BLAS] 879s SUB(type=f32,ne=[1,1,1280,1],nr=[32,32,1,1]): not supported [BLAS] 879s MUL(type=f32,ne=[1,1,1280,1],nr=[32,32,1,1]): not supported [BLAS] 879s DIV(type=f32,ne=[1,1,1280,1],nr=[32,32,1,1]): not supported [BLAS] 879s ADD(type=f32,ne=[1,1,1920,1],nr=[32,32,1,1]): not supported [BLAS] 879s SUB(type=f32,ne=[1,1,1920,1],nr=[32,32,1,1]): not supported [BLAS] 879s MUL(type=f32,ne=[1,1,1920,1],nr=[32,32,1,1]): not supported [BLAS] 879s DIV(type=f32,ne=[1,1,1920,1],nr=[32,32,1,1]): not supported [BLAS] 879s ADD(type=f32,ne=[1,1,640,1],nr=[32,32,1,1]): not supported [BLAS] 879s SUB(type=f32,ne=[1,1,640,1],nr=[32,32,1,1]): not supported [BLAS] 879s MUL(type=f32,ne=[1,1,640,1],nr=[32,32,1,1]): not supported [BLAS] 879s DIV(type=f32,ne=[1,1,640,1],nr=[32,32,1,1]): not supported [BLAS] 879s ADD(type=f32,ne=[5120,1,1,1],nr=[1,256,1,1]): not supported [BLAS] 879s SUB(type=f32,ne=[5120,1,1,1],nr=[1,256,1,1]): not supported [BLAS] 879s MUL(type=f32,ne=[5120,1,1,1],nr=[1,256,1,1]): not supported [BLAS] 879s DIV(type=f32,ne=[5120,1,1,1],nr=[1,256,1,1]): not supported [BLAS] 879s ADD(type=f32,ne=[640,1,1,1],nr=[1,1,1,1]): not supported [BLAS] 879s SUB(type=f32,ne=[640,1,1,1],nr=[1,1,1,1]): not supported [BLAS] 879s MUL(type=f32,ne=[640,1,1,1],nr=[1,1,1,1]): not supported [BLAS] 879s DIV(type=f32,ne=[640,1,1,1],nr=[1,1,1,1]): not supported [BLAS] 879s ADD1(type=f32,ne=[10,5,4,3]): not supported [BLAS] 879s SCALE(type=f32,ne=[10,10,10,10],scale=2.000000): not supported [BLAS] 879s SILU_BACK(type=f32,ne=[64,5,4,3],eps=0.000001): not supported [BLAS] 879s NORM(type=f32,ne=[64,5,4,3],v=0,eps=0.000000): not supported [BLAS] 879s RMS_NORM(type=f32,ne=[64,5,4,3],v=0,eps=0.000000): not supported [BLAS] 879s NORM(type=f32,ne=[64,5,4,3],v=1,eps=0.000000): not supported [BLAS] 879s RMS_NORM(type=f32,ne=[64,5,4,3],v=1,eps=0.000000): not supported [BLAS] 879s RMS_NORM_BACK(type=f32,ne=[64,5,4,3],eps=0.000000): not supported [BLAS] 879s L2_NORM(type=f32,ne=[64,5,4,3]): not supported [BLAS] 879s NORM(type=f32,ne=[64,5,4,3],v=0,eps=0.000001): not supported [BLAS] 879s RMS_NORM(type=f32,ne=[64,5,4,3],v=0,eps=0.000001): not supported [BLAS] 879s NORM(type=f32,ne=[64,5,4,3],v=1,eps=0.000001): not supported [BLAS] 879s RMS_NORM(type=f32,ne=[64,5,4,3],v=1,eps=0.000001): not supported [BLAS] 879s RMS_NORM_BACK(type=f32,ne=[64,5,4,3],eps=0.000001): not supported [BLAS] 879s L2_NORM(type=f32,ne=[64,5,4,3]): not supported [BLAS] 879s NORM(type=f32,ne=[64,5,4,3],v=0,eps=0.000100): not supported [BLAS] 879s RMS_NORM(type=f32,ne=[64,5,4,3],v=0,eps=0.000100): not supported [BLAS] 879s NORM(type=f32,ne=[64,5,4,3],v=1,eps=0.000100): not supported [BLAS] 879s RMS_NORM(type=f32,ne=[64,5,4,3],v=1,eps=0.000100): not supported [BLAS] 879s RMS_NORM_BACK(type=f32,ne=[64,5,4,3],eps=0.000100): not supported [BLAS] 879s L2_NORM(type=f32,ne=[64,5,4,3]): not supported [BLAS] 879s NORM(type=f32,ne=[64,5,4,3],v=0,eps=0.100000): not supported [BLAS] 879s RMS_NORM(type=f32,ne=[64,5,4,3],v=0,eps=0.100000): not supported [BLAS] 879s NORM(type=f32,ne=[64,5,4,3],v=1,eps=0.100000): not supported [BLAS] 879s RMS_NORM(type=f32,ne=[64,5,4,3],v=1,eps=0.100000): not supported [BLAS] 879s RMS_NORM_BACK(type=f32,ne=[64,5,4,3],eps=0.100000): not supported [BLAS] 879s L2_NORM(type=f32,ne=[64,5,4,3]): not supported [BLAS] 879s L2_NORM(type=f32,ne=[64,5,4,3]): not supported [BLAS] 879s SSM_CONV(type=f32,ne_a=[4,1536,1,1],ne_b=[4,1536,1,1]): not supported [BLAS] 879s SSM_CONV(type=f32,ne_a=[8,1536,1,1],ne_b=[4,1536,1,1]): not supported [BLAS] 879s SSM_CONV(type=f32,ne_a=[4,1536,4,1],ne_b=[4,1536,1,1]): not supported [BLAS] 879s SSM_SCAN(type=f32,d_state=16,d_inner=1024,n_seq_tokens=32,n_seqs=4): not supported [BLAS] 879s RWKV_WKV6(type=f32,head_count=32,head_size=64,n_seq_tokens=1,n_seqs=1): not supported [BLAS] 879s RWKV_WKV6(type=f32,head_count=32,head_size=64,n_seq_tokens=32,n_seqs=1): not supported [BLAS] 879s RWKV_WKV6(type=f32,head_count=32,head_size=64,n_seq_tokens=32,n_seqs=4): not supported [BLAS] 879s RWKV_WKV6(type=f32,head_count=32,head_size=64,n_seq_tokens=128,n_seqs=4): not supported [BLAS] 879s RWKV_WKV7(type=f32,head_count=32,head_size=64,n_seq_tokens=1,n_seqs=1): not supported [BLAS] 879s RWKV_WKV7(type=f32,head_count=32,head_size=64,n_seq_tokens=32,n_seqs=1): not supported [BLAS] 879s RWKV_WKV7(type=f32,head_count=32,head_size=64,n_seq_tokens=32,n_seqs=4): not supported [BLAS] 879s RWKV_WKV7(type=f32,head_count=32,head_size=64,n_seq_tokens=128,n_seqs=4): not supported [BLAS] 879s GATED_LINEAR_ATTN(type=f32,head_count=32,head_size=64,n_seq_tokens=1,n_seqs=1): not supported [BLAS] 879s GATED_LINEAR_ATTN(type=f32,head_count=32,head_size=64,n_seq_tokens=32,n_seqs=1): not supported [BLAS] 879s GATED_LINEAR_ATTN(type=f32,head_count=32,head_size=64,n_seq_tokens=32,n_seqs=4): not supported [BLAS] 879s GATED_LINEAR_ATTN(type=f32,head_count=32,head_size=64,n_seq_tokens=128,n_seqs=4): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=2,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=3,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=4,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=5,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=6,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=7,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=8,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=9,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=2,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=3,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=4,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=5,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=6,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=7,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=8,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=9,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=bf16,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=bf16,type_b=f32,m=16,n=2,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=bf16,type_b=f32,m=16,n=3,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=bf16,type_b=f32,m=16,n=4,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=bf16,type_b=f32,m=16,n=5,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=bf16,type_b=f32,m=16,n=6,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=bf16,type_b=f32,m=16,n=7,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=bf16,type_b=f32,m=16,n=8,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=bf16,type_b=f32,m=16,n=9,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=2,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=3,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=4,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=5,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=6,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=7,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=8,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=9,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=2,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=3,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=4,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=5,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=6,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=7,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=8,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=9,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q5_0,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q5_0,type_b=f32,m=16,n=2,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q5_0,type_b=f32,m=16,n=3,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q5_0,type_b=f32,m=16,n=4,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q5_0,type_b=f32,m=16,n=5,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q5_0,type_b=f32,m=16,n=6,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q5_0,type_b=f32,m=16,n=7,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q5_0,type_b=f32,m=16,n=8,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q5_0,type_b=f32,m=16,n=9,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q5_1,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q5_1,type_b=f32,m=16,n=2,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q5_1,type_b=f32,m=16,n=3,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q5_1,type_b=f32,m=16,n=4,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q5_1,type_b=f32,m=16,n=5,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q5_1,type_b=f32,m=16,n=6,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q5_1,type_b=f32,m=16,n=7,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q5_1,type_b=f32,m=16,n=8,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q5_1,type_b=f32,m=16,n=9,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=2,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=3,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=4,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=5,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=6,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=7,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=8,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=9,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q2_K,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q2_K,type_b=f32,m=16,n=2,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q2_K,type_b=f32,m=16,n=3,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q2_K,type_b=f32,m=16,n=4,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q2_K,type_b=f32,m=16,n=5,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q2_K,type_b=f32,m=16,n=6,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q2_K,type_b=f32,m=16,n=7,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q2_K,type_b=f32,m=16,n=8,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q2_K,type_b=f32,m=16,n=9,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q3_K,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q3_K,type_b=f32,m=16,n=2,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q3_K,type_b=f32,m=16,n=3,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q3_K,type_b=f32,m=16,n=4,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q3_K,type_b=f32,m=16,n=5,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q3_K,type_b=f32,m=16,n=6,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q3_K,type_b=f32,m=16,n=7,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q3_K,type_b=f32,m=16,n=8,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q3_K,type_b=f32,m=16,n=9,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=2,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=3,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=4,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=5,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=6,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=7,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=8,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=9,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q5_K,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q5_K,type_b=f32,m=16,n=2,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q5_K,type_b=f32,m=16,n=3,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q5_K,type_b=f32,m=16,n=4,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q5_K,type_b=f32,m=16,n=5,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q5_K,type_b=f32,m=16,n=6,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q5_K,type_b=f32,m=16,n=7,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q5_K,type_b=f32,m=16,n=8,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q5_K,type_b=f32,m=16,n=9,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q6_K,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q6_K,type_b=f32,m=16,n=2,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q6_K,type_b=f32,m=16,n=3,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q6_K,type_b=f32,m=16,n=4,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q6_K,type_b=f32,m=16,n=5,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q6_K,type_b=f32,m=16,n=6,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q6_K,type_b=f32,m=16,n=7,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q6_K,type_b=f32,m=16,n=8,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q6_K,type_b=f32,m=16,n=9,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=2,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=3,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=4,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=5,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=6,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=7,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=8,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=9,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xs,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xs,type_b=f32,m=16,n=2,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xs,type_b=f32,m=16,n=3,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xs,type_b=f32,m=16,n=4,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xs,type_b=f32,m=16,n=5,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xs,type_b=f32,m=16,n=6,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xs,type_b=f32,m=16,n=7,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xs,type_b=f32,m=16,n=8,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xs,type_b=f32,m=16,n=9,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_s,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_s,type_b=f32,m=16,n=2,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_s,type_b=f32,m=16,n=3,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_s,type_b=f32,m=16,n=4,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_s,type_b=f32,m=16,n=5,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_s,type_b=f32,m=16,n=6,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_s,type_b=f32,m=16,n=7,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_s,type_b=f32,m=16,n=8,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_s,type_b=f32,m=16,n=9,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq3_xxs,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq3_xxs,type_b=f32,m=16,n=2,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq3_xxs,type_b=f32,m=16,n=3,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq3_xxs,type_b=f32,m=16,n=4,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq3_xxs,type_b=f32,m=16,n=5,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq3_xxs,type_b=f32,m=16,n=6,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq3_xxs,type_b=f32,m=16,n=7,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq3_xxs,type_b=f32,m=16,n=8,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq3_xxs,type_b=f32,m=16,n=9,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq1_s,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq1_s,type_b=f32,m=16,n=2,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq1_s,type_b=f32,m=16,n=3,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq1_s,type_b=f32,m=16,n=4,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq1_s,type_b=f32,m=16,n=5,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq1_s,type_b=f32,m=16,n=6,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq1_s,type_b=f32,m=16,n=7,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq1_s,type_b=f32,m=16,n=8,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq1_s,type_b=f32,m=16,n=9,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq1_m,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq1_m,type_b=f32,m=16,n=2,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq1_m,type_b=f32,m=16,n=3,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq1_m,type_b=f32,m=16,n=4,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq1_m,type_b=f32,m=16,n=5,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq1_m,type_b=f32,m=16,n=6,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq1_m,type_b=f32,m=16,n=7,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq1_m,type_b=f32,m=16,n=8,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq1_m,type_b=f32,m=16,n=9,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq4_nl,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq4_nl,type_b=f32,m=16,n=2,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq4_nl,type_b=f32,m=16,n=3,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq4_nl,type_b=f32,m=16,n=4,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq4_nl,type_b=f32,m=16,n=5,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq4_nl,type_b=f32,m=16,n=6,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq4_nl,type_b=f32,m=16,n=7,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq4_nl,type_b=f32,m=16,n=8,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq4_nl,type_b=f32,m=16,n=9,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq3_s,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq3_s,type_b=f32,m=16,n=2,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq3_s,type_b=f32,m=16,n=3,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq3_s,type_b=f32,m=16,n=4,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq3_s,type_b=f32,m=16,n=5,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq3_s,type_b=f32,m=16,n=6,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq3_s,type_b=f32,m=16,n=7,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq3_s,type_b=f32,m=16,n=8,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq3_s,type_b=f32,m=16,n=9,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq4_xs,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq4_xs,type_b=f32,m=16,n=2,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq4_xs,type_b=f32,m=16,n=3,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq4_xs,type_b=f32,m=16,n=4,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq4_xs,type_b=f32,m=16,n=5,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq4_xs,type_b=f32,m=16,n=6,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq4_xs,type_b=f32,m=16,n=7,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq4_xs,type_b=f32,m=16,n=8,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq4_xs,type_b=f32,m=16,n=9,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=1,k=256,bs=[3,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=1,k=256,bs=[3,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=1,k=256,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=1,k=256,bs=[3,2],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=1,k=256,bs=[3,2],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=1,k=256,bs=[3,2],nr=[2,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=16,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=16,k=256,bs=[1,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=16,k=256,bs=[1,1],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=16,k=256,bs=[3,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=16,k=256,bs=[3,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=16,k=256,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=16,k=256,bs=[3,2],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=16,k=256,bs=[3,2],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=16,k=256,bs=[3,2],nr=[2,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=1,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=8,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f32,m=16,n=16,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f32,type_b=f16,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=f32,type_b=f16,m=16,n=1,k=256,bs=[1,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=f32,type_b=f16,m=16,n=1,k=256,bs=[1,1],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=f32,type_b=f16,m=16,n=1,k=256,bs=[3,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=f32,type_b=f16,m=16,n=1,k=256,bs=[3,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=f32,type_b=f16,m=16,n=1,k=256,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=f32,type_b=f16,m=16,n=1,k=256,bs=[3,2],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=f32,type_b=f16,m=16,n=1,k=256,bs=[3,2],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=f32,type_b=f16,m=16,n=1,k=256,bs=[3,2],nr=[2,2],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=f32,type_b=f16,m=16,n=16,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=f32,type_b=f16,m=16,n=16,k=256,bs=[1,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=f32,type_b=f16,m=16,n=16,k=256,bs=[1,1],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=f32,type_b=f16,m=16,n=16,k=256,bs=[3,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=f32,type_b=f16,m=16,n=16,k=256,bs=[3,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=f32,type_b=f16,m=16,n=16,k=256,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=f32,type_b=f16,m=16,n=16,k=256,bs=[3,2],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=f32,type_b=f16,m=16,n=16,k=256,bs=[3,2],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=f32,type_b=f16,m=16,n=16,k=256,bs=[3,2],nr=[2,2],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=f32,type_b=f16,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=f32,type_b=f16,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=f32,type_b=f16,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=f32,type_b=f16,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=f32,type_b=f16,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=f32,type_b=f16,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=f32,type_b=f16,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=f32,type_b=f16,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=f32,type_b=f16,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=f32,type_b=f16,m=16,n=1,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=f32,type_b=f16,m=16,n=8,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=f32,type_b=f16,m=16,n=16,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=1,k=256,bs=[3,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=1,k=256,bs=[3,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=1,k=256,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=1,k=256,bs=[3,2],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=1,k=256,bs=[3,2],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=1,k=256,bs=[3,2],nr=[2,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=16,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=16,k=256,bs=[1,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=16,k=256,bs=[1,1],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=16,k=256,bs=[3,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=16,k=256,bs=[3,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=16,k=256,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=16,k=256,bs=[3,2],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=16,k=256,bs=[3,2],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=16,k=256,bs=[3,2],nr=[2,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=1,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=8,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=16,n=16,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f16,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f16,m=16,n=1,k=256,bs=[1,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f16,m=16,n=1,k=256,bs=[1,1],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f16,m=16,n=1,k=256,bs=[3,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f16,m=16,n=1,k=256,bs=[3,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f16,m=16,n=1,k=256,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f16,m=16,n=1,k=256,bs=[3,2],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f16,m=16,n=1,k=256,bs=[3,2],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f16,m=16,n=1,k=256,bs=[3,2],nr=[2,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f16,m=16,n=16,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f16,m=16,n=16,k=256,bs=[1,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f16,m=16,n=16,k=256,bs=[1,1],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f16,m=16,n=16,k=256,bs=[3,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f16,m=16,n=16,k=256,bs=[3,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f16,m=16,n=16,k=256,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f16,m=16,n=16,k=256,bs=[3,2],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f16,m=16,n=16,k=256,bs=[3,2],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f16,m=16,n=16,k=256,bs=[3,2],nr=[2,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f16,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f16,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f16,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f16,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f16,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f16,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f16,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f16,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f16,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f16,m=16,n=1,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f16,m=16,n=8,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f16,m=16,n=16,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=1,k=256,bs=[3,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=1,k=256,bs=[3,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=1,k=256,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=1,k=256,bs=[3,2],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=1,k=256,bs=[3,2],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=1,k=256,bs=[3,2],nr=[2,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=16,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=16,k=256,bs=[1,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=16,k=256,bs=[1,1],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=16,k=256,bs=[3,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=16,k=256,bs=[3,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=16,k=256,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=16,k=256,bs=[3,2],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=16,k=256,bs=[3,2],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=16,k=256,bs=[3,2],nr=[2,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=1,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=8,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=16,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f16,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q8_0,type_b=f16,m=16,n=1,k=256,bs=[1,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q8_0,type_b=f16,m=16,n=1,k=256,bs=[1,1],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q8_0,type_b=f16,m=16,n=1,k=256,bs=[3,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q8_0,type_b=f16,m=16,n=1,k=256,bs=[3,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q8_0,type_b=f16,m=16,n=1,k=256,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q8_0,type_b=f16,m=16,n=1,k=256,bs=[3,2],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q8_0,type_b=f16,m=16,n=1,k=256,bs=[3,2],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q8_0,type_b=f16,m=16,n=1,k=256,bs=[3,2],nr=[2,2],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q8_0,type_b=f16,m=16,n=16,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q8_0,type_b=f16,m=16,n=16,k=256,bs=[1,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q8_0,type_b=f16,m=16,n=16,k=256,bs=[1,1],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q8_0,type_b=f16,m=16,n=16,k=256,bs=[3,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q8_0,type_b=f16,m=16,n=16,k=256,bs=[3,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q8_0,type_b=f16,m=16,n=16,k=256,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q8_0,type_b=f16,m=16,n=16,k=256,bs=[3,2],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q8_0,type_b=f16,m=16,n=16,k=256,bs=[3,2],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q8_0,type_b=f16,m=16,n=16,k=256,bs=[3,2],nr=[2,2],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q8_0,type_b=f16,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q8_0,type_b=f16,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q8_0,type_b=f16,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q8_0,type_b=f16,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q8_0,type_b=f16,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q8_0,type_b=f16,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q8_0,type_b=f16,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q8_0,type_b=f16,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q8_0,type_b=f16,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q8_0,type_b=f16,m=16,n=1,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q8_0,type_b=f16,m=16,n=8,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q8_0,type_b=f16,m=16,n=16,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=1,k=256,bs=[3,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=1,k=256,bs=[3,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=1,k=256,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=1,k=256,bs=[3,2],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=1,k=256,bs=[3,2],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=1,k=256,bs=[3,2],nr=[2,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=16,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=16,k=256,bs=[1,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=16,k=256,bs=[1,1],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=16,k=256,bs=[3,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=16,k=256,bs=[3,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=16,k=256,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=16,k=256,bs=[3,2],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=16,k=256,bs=[3,2],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=16,k=256,bs=[3,2],nr=[2,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=1,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=8,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f32,m=16,n=16,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_0,type_b=f16,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_0,type_b=f16,m=16,n=1,k=256,bs=[1,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_0,type_b=f16,m=16,n=1,k=256,bs=[1,1],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_0,type_b=f16,m=16,n=1,k=256,bs=[3,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_0,type_b=f16,m=16,n=1,k=256,bs=[3,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_0,type_b=f16,m=16,n=1,k=256,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_0,type_b=f16,m=16,n=1,k=256,bs=[3,2],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_0,type_b=f16,m=16,n=1,k=256,bs=[3,2],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_0,type_b=f16,m=16,n=1,k=256,bs=[3,2],nr=[2,2],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_0,type_b=f16,m=16,n=16,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_0,type_b=f16,m=16,n=16,k=256,bs=[1,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_0,type_b=f16,m=16,n=16,k=256,bs=[1,1],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_0,type_b=f16,m=16,n=16,k=256,bs=[3,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_0,type_b=f16,m=16,n=16,k=256,bs=[3,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_0,type_b=f16,m=16,n=16,k=256,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_0,type_b=f16,m=16,n=16,k=256,bs=[3,2],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_0,type_b=f16,m=16,n=16,k=256,bs=[3,2],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_0,type_b=f16,m=16,n=16,k=256,bs=[3,2],nr=[2,2],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_0,type_b=f16,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_0,type_b=f16,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_0,type_b=f16,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_0,type_b=f16,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_0,type_b=f16,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_0,type_b=f16,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_0,type_b=f16,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_0,type_b=f16,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_0,type_b=f16,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_0,type_b=f16,m=16,n=1,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_0,type_b=f16,m=16,n=8,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_0,type_b=f16,m=16,n=16,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=1,k=256,bs=[3,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=1,k=256,bs=[3,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=1,k=256,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=1,k=256,bs=[3,2],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=1,k=256,bs=[3,2],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=1,k=256,bs=[3,2],nr=[2,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=16,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=16,k=256,bs=[1,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=16,k=256,bs=[1,1],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=16,k=256,bs=[3,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=16,k=256,bs=[3,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=16,k=256,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=16,k=256,bs=[3,2],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=16,k=256,bs=[3,2],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=16,k=256,bs=[3,2],nr=[2,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=1,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=8,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=16,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f16,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_1,type_b=f16,m=16,n=1,k=256,bs=[1,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_1,type_b=f16,m=16,n=1,k=256,bs=[1,1],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_1,type_b=f16,m=16,n=1,k=256,bs=[3,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_1,type_b=f16,m=16,n=1,k=256,bs=[3,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_1,type_b=f16,m=16,n=1,k=256,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_1,type_b=f16,m=16,n=1,k=256,bs=[3,2],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_1,type_b=f16,m=16,n=1,k=256,bs=[3,2],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_1,type_b=f16,m=16,n=1,k=256,bs=[3,2],nr=[2,2],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_1,type_b=f16,m=16,n=16,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_1,type_b=f16,m=16,n=16,k=256,bs=[1,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_1,type_b=f16,m=16,n=16,k=256,bs=[1,1],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_1,type_b=f16,m=16,n=16,k=256,bs=[3,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_1,type_b=f16,m=16,n=16,k=256,bs=[3,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_1,type_b=f16,m=16,n=16,k=256,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_1,type_b=f16,m=16,n=16,k=256,bs=[3,2],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_1,type_b=f16,m=16,n=16,k=256,bs=[3,2],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_1,type_b=f16,m=16,n=16,k=256,bs=[3,2],nr=[2,2],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_1,type_b=f16,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_1,type_b=f16,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_1,type_b=f16,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_1,type_b=f16,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_1,type_b=f16,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_1,type_b=f16,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_1,type_b=f16,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_1,type_b=f16,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_1,type_b=f16,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_1,type_b=f16,m=16,n=1,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_1,type_b=f16,m=16,n=8,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_1,type_b=f16,m=16,n=16,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=1,k=256,bs=[3,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=1,k=256,bs=[3,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=1,k=256,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=1,k=256,bs=[3,2],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=1,k=256,bs=[3,2],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=1,k=256,bs=[3,2],nr=[2,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=16,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=16,k=256,bs=[1,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=16,k=256,bs=[1,1],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=16,k=256,bs=[3,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=16,k=256,bs=[3,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=16,k=256,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=16,k=256,bs=[3,2],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=16,k=256,bs=[3,2],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=16,k=256,bs=[3,2],nr=[2,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=1,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=8,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f32,m=16,n=16,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_K,type_b=f16,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_K,type_b=f16,m=16,n=1,k=256,bs=[1,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_K,type_b=f16,m=16,n=1,k=256,bs=[1,1],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_K,type_b=f16,m=16,n=1,k=256,bs=[3,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_K,type_b=f16,m=16,n=1,k=256,bs=[3,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_K,type_b=f16,m=16,n=1,k=256,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_K,type_b=f16,m=16,n=1,k=256,bs=[3,2],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_K,type_b=f16,m=16,n=1,k=256,bs=[3,2],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_K,type_b=f16,m=16,n=1,k=256,bs=[3,2],nr=[2,2],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_K,type_b=f16,m=16,n=16,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_K,type_b=f16,m=16,n=16,k=256,bs=[1,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_K,type_b=f16,m=16,n=16,k=256,bs=[1,1],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_K,type_b=f16,m=16,n=16,k=256,bs=[3,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_K,type_b=f16,m=16,n=16,k=256,bs=[3,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_K,type_b=f16,m=16,n=16,k=256,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_K,type_b=f16,m=16,n=16,k=256,bs=[3,2],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_K,type_b=f16,m=16,n=16,k=256,bs=[3,2],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_K,type_b=f16,m=16,n=16,k=256,bs=[3,2],nr=[2,2],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_K,type_b=f16,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_K,type_b=f16,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_K,type_b=f16,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_K,type_b=f16,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_K,type_b=f16,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_K,type_b=f16,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_K,type_b=f16,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_K,type_b=f16,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_K,type_b=f16,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_K,type_b=f16,m=16,n=1,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_K,type_b=f16,m=16,n=8,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_K,type_b=f16,m=16,n=16,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=1,k=256,bs=[3,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=1,k=256,bs=[3,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=1,k=256,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=1,k=256,bs=[3,2],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=1,k=256,bs=[3,2],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=1,k=256,bs=[3,2],nr=[2,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=16,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=16,k=256,bs=[1,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=16,k=256,bs=[1,1],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=16,k=256,bs=[3,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=16,k=256,bs=[3,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=16,k=256,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=16,k=256,bs=[3,2],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=16,k=256,bs=[3,2],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=16,k=256,bs=[3,2],nr=[2,2],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=1,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=8,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f32,m=16,n=16,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xxs,type_b=f16,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=iq2_xxs,type_b=f16,m=16,n=1,k=256,bs=[1,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=iq2_xxs,type_b=f16,m=16,n=1,k=256,bs=[1,1],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=iq2_xxs,type_b=f16,m=16,n=1,k=256,bs=[3,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=iq2_xxs,type_b=f16,m=16,n=1,k=256,bs=[3,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=iq2_xxs,type_b=f16,m=16,n=1,k=256,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=iq2_xxs,type_b=f16,m=16,n=1,k=256,bs=[3,2],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=iq2_xxs,type_b=f16,m=16,n=1,k=256,bs=[3,2],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=iq2_xxs,type_b=f16,m=16,n=1,k=256,bs=[3,2],nr=[2,2],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=iq2_xxs,type_b=f16,m=16,n=16,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=iq2_xxs,type_b=f16,m=16,n=16,k=256,bs=[1,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=iq2_xxs,type_b=f16,m=16,n=16,k=256,bs=[1,1],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=iq2_xxs,type_b=f16,m=16,n=16,k=256,bs=[3,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=iq2_xxs,type_b=f16,m=16,n=16,k=256,bs=[3,1],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=iq2_xxs,type_b=f16,m=16,n=16,k=256,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=iq2_xxs,type_b=f16,m=16,n=16,k=256,bs=[3,2],nr=[2,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=iq2_xxs,type_b=f16,m=16,n=16,k=256,bs=[3,2],nr=[1,2],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=iq2_xxs,type_b=f16,m=16,n=16,k=256,bs=[3,2],nr=[2,2],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=iq2_xxs,type_b=f16,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=iq2_xxs,type_b=f16,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=iq2_xxs,type_b=f16,m=16,n=1,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=iq2_xxs,type_b=f16,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=iq2_xxs,type_b=f16,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=iq2_xxs,type_b=f16,m=16,n=8,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=iq2_xxs,type_b=f16,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=iq2_xxs,type_b=f16,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,1,3,2],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=iq2_xxs,type_b=f16,m=16,n=16,k=256,bs=[2,3],nr=[1,1],per=[0,3,2,1],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=iq2_xxs,type_b=f16,m=16,n=1,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=iq2_xxs,type_b=f16,m=16,n=8,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=iq2_xxs,type_b=f16,m=16,n=16,k=1024,bs=[3,2],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] not supported [CPU] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=1,k=32,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q4_1,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q5_0,type_b=f32,m=16,n=1,k=32,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q5_0,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q5_1,type_b=f32,m=16,n=1,k=32,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q5_1,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=1,k=32,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q8_0,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q2_K,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q3_K,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q5_K,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=q6_K,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_xs,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq2_s,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq3_xxs,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq1_s,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq1_m,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq4_nl,type_b=f32,m=16,n=1,k=32,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq4_nl,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq3_s,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=iq4_xs,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=bf16,type_b=f32,m=16,n=1,k=1,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=bf16,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=64,n=2,k=128,bs=[8,1],nr=[1,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=83,n=2,k=128,bs=[8,1],nr=[4,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=64,n=2,k=64,bs=[8,1],nr=[4,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=83,n=2,k=64,bs=[8,1],nr=[4,1],per=[0,1,2,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=64,n=45,k=128,bs=[8,1],nr=[4,1],per=[0,1,2,3],v=0): OK 879s MUL_MAT(type_a=f16,type_b=f32,m=128,n=45,k=64,bs=[8,1],nr=[4,1],per=[0,1,2,3],v=0): OK 879s MUL_MAT(type_a=f16,type_b=f32,m=1056,n=1,k=193,bs=[1,1],nr=[4,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=1056,n=1,k=67,bs=[1,1],nr=[4,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=1056,n=1,k=128,bs=[1,1],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=128,n=1,k=1056,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=1): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=1056,n=1,k=129,bs=[1,1],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=128,n=1,k=1057,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=1): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=1057,n=1,k=128,bs=[1,1],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=129,n=1,k=1056,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=1): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=1057,n=1,k=129,bs=[1,1],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=129,n=1,k=1057,bs=[1,1],nr=[1,1],per=[0,1,2,3],v=1): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=1056,n=1,k=128,bs=[1,1],nr=[4,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=128,n=1,k=1056,bs=[1,1],nr=[4,1],per=[0,1,2,3],v=1): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=1056,n=1,k=129,bs=[1,1],nr=[4,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=128,n=1,k=1057,bs=[1,1],nr=[4,1],per=[0,1,2,3],v=1): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=1057,n=1,k=128,bs=[1,1],nr=[4,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=129,n=1,k=1056,bs=[1,1],nr=[4,1],per=[0,1,2,3],v=1): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=1057,n=1,k=129,bs=[1,1],nr=[4,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=129,n=1,k=1057,bs=[1,1],nr=[4,1],per=[0,1,2,3],v=1): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=1056,n=1,k=128,bs=[2,1],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=128,n=1,k=1056,bs=[2,1],nr=[1,1],per=[0,1,2,3],v=1): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=1056,n=1,k=129,bs=[2,1],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=128,n=1,k=1057,bs=[2,1],nr=[1,1],per=[0,1,2,3],v=1): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=1057,n=1,k=128,bs=[2,1],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=129,n=1,k=1056,bs=[2,1],nr=[1,1],per=[0,1,2,3],v=1): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=1057,n=1,k=129,bs=[2,1],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=129,n=1,k=1057,bs=[2,1],nr=[1,1],per=[0,1,2,3],v=1): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=1056,n=1,k=128,bs=[2,1],nr=[4,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=128,n=1,k=1056,bs=[2,1],nr=[4,1],per=[0,1,2,3],v=1): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=1056,n=1,k=129,bs=[2,1],nr=[4,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=128,n=1,k=1057,bs=[2,1],nr=[4,1],per=[0,1,2,3],v=1): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=1057,n=1,k=128,bs=[2,1],nr=[4,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=129,n=1,k=1056,bs=[2,1],nr=[4,1],per=[0,1,2,3],v=1): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=1057,n=1,k=129,bs=[2,1],nr=[4,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=129,n=1,k=1057,bs=[2,1],nr=[4,1],per=[0,1,2,3],v=1): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=1056,n=1,k=128,bs=[4,1],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=128,n=1,k=1056,bs=[4,1],nr=[1,1],per=[0,1,2,3],v=1): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=1056,n=1,k=129,bs=[4,1],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=128,n=1,k=1057,bs=[4,1],nr=[1,1],per=[0,1,2,3],v=1): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=1057,n=1,k=128,bs=[4,1],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=129,n=1,k=1056,bs=[4,1],nr=[1,1],per=[0,1,2,3],v=1): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=1057,n=1,k=129,bs=[4,1],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=129,n=1,k=1057,bs=[4,1],nr=[1,1],per=[0,1,2,3],v=1): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=1056,n=1,k=128,bs=[4,1],nr=[4,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=128,n=1,k=1056,bs=[4,1],nr=[4,1],per=[0,1,2,3],v=1): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=1056,n=1,k=129,bs=[4,1],nr=[4,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=128,n=1,k=1057,bs=[4,1],nr=[4,1],per=[0,1,2,3],v=1): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=1057,n=1,k=128,bs=[4,1],nr=[4,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=129,n=1,k=1056,bs=[4,1],nr=[4,1],per=[0,1,2,3],v=1): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=1057,n=1,k=129,bs=[4,1],nr=[4,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=129,n=1,k=1057,bs=[4,1],nr=[4,1],per=[0,1,2,3],v=1): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=1056,n=1,k=128,bs=[8,1],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=128,n=1,k=1056,bs=[8,1],nr=[1,1],per=[0,1,2,3],v=1): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=1056,n=1,k=129,bs=[8,1],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=128,n=1,k=1057,bs=[8,1],nr=[1,1],per=[0,1,2,3],v=1): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=1057,n=1,k=128,bs=[8,1],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=129,n=1,k=1056,bs=[8,1],nr=[1,1],per=[0,1,2,3],v=1): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=1057,n=1,k=129,bs=[8,1],nr=[1,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=129,n=1,k=1057,bs=[8,1],nr=[1,1],per=[0,1,2,3],v=1): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=1056,n=1,k=128,bs=[8,1],nr=[4,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=128,n=1,k=1056,bs=[8,1],nr=[4,1],per=[0,1,2,3],v=1): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=1056,n=1,k=129,bs=[8,1],nr=[4,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=128,n=1,k=1057,bs=[8,1],nr=[4,1],per=[0,1,2,3],v=1): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=1057,n=1,k=128,bs=[8,1],nr=[4,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=129,n=1,k=1056,bs=[8,1],nr=[4,1],per=[0,1,2,3],v=1): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=1057,n=1,k=129,bs=[8,1],nr=[4,1],per=[0,2,1,3],v=0): not supported [BLAS] 879s MUL_MAT(type_a=f16,type_b=f32,m=129,n=1,k=1057,bs=[8,1],nr=[4,1],per=[0,1,2,3],v=1): not supported [BLAS] 879s MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=4,n_used=1,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=4,n_used=1,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=4,n_used=1,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=4,n_used=1,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=4,n_used=1,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=4,n_used=1,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=4,n_used=2,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=4,n_used=2,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=4,n_used=2,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=4,n_used=4,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=4,n_used=4,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=4,n_used=4,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=4,n_used=4,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=4,n_used=4,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=4,n_used=4,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=8,n_used=1,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=8,n_used=1,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=8,n_used=1,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=8,n_used=1,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=8,n_used=1,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=8,n_used=1,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=8,n_used=2,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=8,n_used=2,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=8,n_used=2,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=8,n_used=2,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=8,n_used=2,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=8,n_used=2,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=8,n_used=4,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=8,n_used=4,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=8,n_used=4,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=8,n_used=4,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=8,n_used=4,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=8,n_used=4,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f16,type_b=f32,n_mats=4,n_used=1,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f16,type_b=f32,n_mats=4,n_used=1,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f16,type_b=f32,n_mats=4,n_used=1,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f16,type_b=f32,n_mats=4,n_used=1,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f16,type_b=f32,n_mats=4,n_used=1,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f16,type_b=f32,n_mats=4,n_used=1,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f16,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f16,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f16,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f16,type_b=f32,n_mats=4,n_used=2,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f16,type_b=f32,n_mats=4,n_used=2,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f16,type_b=f32,n_mats=4,n_used=2,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f16,type_b=f32,n_mats=4,n_used=4,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f16,type_b=f32,n_mats=4,n_used=4,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f16,type_b=f32,n_mats=4,n_used=4,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f16,type_b=f32,n_mats=4,n_used=4,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f16,type_b=f32,n_mats=4,n_used=4,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f16,type_b=f32,n_mats=4,n_used=4,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f16,type_b=f32,n_mats=8,n_used=1,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f16,type_b=f32,n_mats=8,n_used=1,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f16,type_b=f32,n_mats=8,n_used=1,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f16,type_b=f32,n_mats=8,n_used=1,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f16,type_b=f32,n_mats=8,n_used=1,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f16,type_b=f32,n_mats=8,n_used=1,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f16,type_b=f32,n_mats=8,n_used=2,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f16,type_b=f32,n_mats=8,n_used=2,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f16,type_b=f32,n_mats=8,n_used=2,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f16,type_b=f32,n_mats=8,n_used=2,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f16,type_b=f32,n_mats=8,n_used=2,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f16,type_b=f32,n_mats=8,n_used=2,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f16,type_b=f32,n_mats=8,n_used=4,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f16,type_b=f32,n_mats=8,n_used=4,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f16,type_b=f32,n_mats=8,n_used=4,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f16,type_b=f32,n_mats=8,n_used=4,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f16,type_b=f32,n_mats=8,n_used=4,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=f16,type_b=f32,n_mats=8,n_used=4,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=4,n_used=1,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=4,n_used=1,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=4,n_used=1,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=4,n_used=1,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=4,n_used=1,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=4,n_used=1,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=4,n_used=2,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=4,n_used=2,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=4,n_used=2,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=4,n_used=4,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=4,n_used=4,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=4,n_used=4,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=4,n_used=4,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=4,n_used=4,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=4,n_used=4,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=8,n_used=1,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=8,n_used=1,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=8,n_used=1,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=8,n_used=1,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=8,n_used=1,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=8,n_used=1,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=8,n_used=2,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=8,n_used=2,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=8,n_used=2,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=8,n_used=2,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=8,n_used=2,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=8,n_used=2,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=8,n_used=4,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=8,n_used=4,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=8,n_used=4,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=8,n_used=4,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=8,n_used=4,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=8,n_used=4,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_0,type_b=f32,n_mats=4,n_used=1,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_0,type_b=f32,n_mats=4,n_used=1,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_0,type_b=f32,n_mats=4,n_used=1,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_0,type_b=f32,n_mats=4,n_used=1,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_0,type_b=f32,n_mats=4,n_used=1,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_0,type_b=f32,n_mats=4,n_used=1,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_0,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_0,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_0,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_0,type_b=f32,n_mats=4,n_used=2,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_0,type_b=f32,n_mats=4,n_used=2,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_0,type_b=f32,n_mats=4,n_used=2,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_0,type_b=f32,n_mats=4,n_used=4,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_0,type_b=f32,n_mats=4,n_used=4,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_0,type_b=f32,n_mats=4,n_used=4,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_0,type_b=f32,n_mats=4,n_used=4,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_0,type_b=f32,n_mats=4,n_used=4,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_0,type_b=f32,n_mats=4,n_used=4,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_0,type_b=f32,n_mats=8,n_used=1,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_0,type_b=f32,n_mats=8,n_used=1,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_0,type_b=f32,n_mats=8,n_used=1,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_0,type_b=f32,n_mats=8,n_used=1,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_0,type_b=f32,n_mats=8,n_used=1,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_0,type_b=f32,n_mats=8,n_used=1,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_0,type_b=f32,n_mats=8,n_used=2,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_0,type_b=f32,n_mats=8,n_used=2,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_0,type_b=f32,n_mats=8,n_used=2,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_0,type_b=f32,n_mats=8,n_used=2,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_0,type_b=f32,n_mats=8,n_used=2,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_0,type_b=f32,n_mats=8,n_used=2,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_0,type_b=f32,n_mats=8,n_used=4,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_0,type_b=f32,n_mats=8,n_used=4,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_0,type_b=f32,n_mats=8,n_used=4,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_0,type_b=f32,n_mats=8,n_used=4,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_0,type_b=f32,n_mats=8,n_used=4,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_0,type_b=f32,n_mats=8,n_used=4,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=4,n_used=1,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=4,n_used=1,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=4,n_used=1,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=4,n_used=1,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=4,n_used=1,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=4,n_used=1,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=4,n_used=2,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=4,n_used=2,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=4,n_used=2,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=4,n_used=4,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=4,n_used=4,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=4,n_used=4,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=4,n_used=4,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=4,n_used=4,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=4,n_used=4,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=8,n_used=1,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=8,n_used=1,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=8,n_used=1,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=8,n_used=1,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=8,n_used=1,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=8,n_used=1,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=8,n_used=2,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=8,n_used=2,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=8,n_used=2,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=8,n_used=2,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=8,n_used=2,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=8,n_used=2,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=8,n_used=4,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=8,n_used=4,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=8,n_used=4,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=8,n_used=4,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=8,n_used=4,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=8,n_used=4,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_K,type_b=f32,n_mats=4,n_used=1,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_K,type_b=f32,n_mats=4,n_used=1,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_K,type_b=f32,n_mats=4,n_used=1,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_K,type_b=f32,n_mats=4,n_used=1,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_K,type_b=f32,n_mats=4,n_used=1,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_K,type_b=f32,n_mats=4,n_used=1,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_K,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_K,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_K,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_K,type_b=f32,n_mats=4,n_used=2,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_K,type_b=f32,n_mats=4,n_used=2,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_K,type_b=f32,n_mats=4,n_used=2,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_K,type_b=f32,n_mats=4,n_used=4,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_K,type_b=f32,n_mats=4,n_used=4,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_K,type_b=f32,n_mats=4,n_used=4,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_K,type_b=f32,n_mats=4,n_used=4,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_K,type_b=f32,n_mats=4,n_used=4,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_K,type_b=f32,n_mats=4,n_used=4,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_K,type_b=f32,n_mats=8,n_used=1,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_K,type_b=f32,n_mats=8,n_used=1,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_K,type_b=f32,n_mats=8,n_used=1,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_K,type_b=f32,n_mats=8,n_used=1,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_K,type_b=f32,n_mats=8,n_used=1,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_K,type_b=f32,n_mats=8,n_used=1,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_K,type_b=f32,n_mats=8,n_used=2,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_K,type_b=f32,n_mats=8,n_used=2,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_K,type_b=f32,n_mats=8,n_used=2,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_K,type_b=f32,n_mats=8,n_used=2,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_K,type_b=f32,n_mats=8,n_used=2,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_K,type_b=f32,n_mats=8,n_used=2,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_K,type_b=f32,n_mats=8,n_used=4,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_K,type_b=f32,n_mats=8,n_used=4,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_K,type_b=f32,n_mats=8,n_used=4,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_K,type_b=f32,n_mats=8,n_used=4,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_K,type_b=f32,n_mats=8,n_used=4,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_K,type_b=f32,n_mats=8,n_used=4,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xxs,type_b=f32,n_mats=4,n_used=1,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xxs,type_b=f32,n_mats=4,n_used=1,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xxs,type_b=f32,n_mats=4,n_used=1,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xxs,type_b=f32,n_mats=4,n_used=1,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xxs,type_b=f32,n_mats=4,n_used=1,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xxs,type_b=f32,n_mats=4,n_used=1,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xxs,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xxs,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xxs,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xxs,type_b=f32,n_mats=4,n_used=2,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xxs,type_b=f32,n_mats=4,n_used=2,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xxs,type_b=f32,n_mats=4,n_used=2,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xxs,type_b=f32,n_mats=4,n_used=4,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xxs,type_b=f32,n_mats=4,n_used=4,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xxs,type_b=f32,n_mats=4,n_used=4,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xxs,type_b=f32,n_mats=4,n_used=4,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xxs,type_b=f32,n_mats=4,n_used=4,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xxs,type_b=f32,n_mats=4,n_used=4,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xxs,type_b=f32,n_mats=8,n_used=1,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xxs,type_b=f32,n_mats=8,n_used=1,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xxs,type_b=f32,n_mats=8,n_used=1,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xxs,type_b=f32,n_mats=8,n_used=1,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xxs,type_b=f32,n_mats=8,n_used=1,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xxs,type_b=f32,n_mats=8,n_used=1,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xxs,type_b=f32,n_mats=8,n_used=2,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xxs,type_b=f32,n_mats=8,n_used=2,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xxs,type_b=f32,n_mats=8,n_used=2,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xxs,type_b=f32,n_mats=8,n_used=2,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xxs,type_b=f32,n_mats=8,n_used=2,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xxs,type_b=f32,n_mats=8,n_used=2,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xxs,type_b=f32,n_mats=8,n_used=4,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xxs,type_b=f32,n_mats=8,n_used=4,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xxs,type_b=f32,n_mats=8,n_used=4,b=0,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xxs,type_b=f32,n_mats=8,n_used=4,b=1,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xxs,type_b=f32,n_mats=8,n_used=4,b=1,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xxs,type_b=f32,n_mats=8,n_used=4,b=1,m=512,n=129,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q4_1,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q5_0,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q5_0,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q5_1,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q5_1,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q8_0,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q2_K,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q2_K,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q3_K,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q3_K,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q5_K,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q5_K,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q6_K,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=q6_K,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xs,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_xs,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_s,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq2_s,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq3_xxs,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq3_xxs,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq1_s,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq1_s,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq1_m,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq1_m,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq4_nl,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq4_nl,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq3_s,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq3_s,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq4_xs,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=iq4_xs,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=32,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=bf16,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=1,k=256): not supported [BLAS] 879s MUL_MAT_ID(type_a=bf16,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=32,k=256): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=1,k=1,bs=[1,1],nr=[1,1],trans_b=0): OK 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=1,k=1,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=1,k=1,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=1,k=1,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=1,k=1,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=1,k=1,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=1,k=1,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=1,k=1,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=1,k=1,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=1,k=1,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=1,k=1,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=1,k=1,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=1,k=1,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=1,k=1,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=1,k=1,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=1,k=1,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=1,k=16,bs=[1,1],nr=[1,1],trans_b=0): OK 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=1,k=16,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=1,k=16,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=1,k=16,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=1,k=16,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=1,k=16,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=1,k=16,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=1,k=16,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=1,k=16,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=1,k=16,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=1,k=16,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=1,k=16,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=1,k=16,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=1,k=16,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=1,k=16,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=1,k=16,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=16,k=1,bs=[1,1],nr=[1,1],trans_b=0): OK 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=16,k=1,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=16,k=1,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=16,k=1,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=16,k=1,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=16,k=1,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=16,k=1,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=16,k=1,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=16,k=1,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=16,k=1,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=16,k=1,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=16,k=1,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=16,k=1,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=16,k=1,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=16,k=1,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=16,k=1,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=16,k=16,bs=[1,1],nr=[1,1],trans_b=0): OK 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=16,k=16,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=16,k=16,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=16,k=16,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=16,k=16,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=16,k=16,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=16,k=16,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=16,k=16,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=16,k=16,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=16,k=16,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=16,k=16,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=16,k=16,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=16,k=16,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=16,k=16,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=16,k=16,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f32,m=256,n=16,k=16,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=1,k=1,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=1,k=1,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=1,k=1,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=1,k=1,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=1,k=1,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=1,k=1,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=1,k=1,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=1,k=1,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=1,k=1,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=1,k=1,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=1,k=1,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=1,k=1,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=1,k=1,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=1,k=1,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=1,k=1,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=1,k=1,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=1,k=16,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=1,k=16,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=1,k=16,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=1,k=16,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=1,k=16,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=1,k=16,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=1,k=16,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=1,k=16,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=1,k=16,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=1,k=16,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=1,k=16,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=1,k=16,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=1,k=16,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=1,k=16,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=1,k=16,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=1,k=16,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=16,k=1,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=16,k=1,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=16,k=1,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=16,k=1,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=16,k=1,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=16,k=1,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=16,k=1,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=16,k=1,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=16,k=1,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=16,k=1,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=16,k=1,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=16,k=1,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=16,k=1,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=16,k=1,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=16,k=1,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=16,k=1,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=16,k=16,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=16,k=16,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=16,k=16,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=16,k=16,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=16,k=16,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=16,k=16,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=16,k=16,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=16,k=16,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=16,k=16,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=16,k=16,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=16,k=16,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=16,k=16,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=16,k=16,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=16,k=16,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=16,k=16,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f32,type_b=f16,m=256,n=16,k=16,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=1,k=1,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=1,k=1,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=1,k=1,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=1,k=1,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=1,k=1,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=1,k=1,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=1,k=1,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=1,k=1,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=1,k=1,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=1,k=1,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=1,k=1,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=1,k=1,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=1,k=1,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=1,k=1,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=1,k=1,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=1,k=1,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=1,k=16,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=1,k=16,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=1,k=16,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=1,k=16,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=1,k=16,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=1,k=16,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=1,k=16,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=1,k=16,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=1,k=16,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=1,k=16,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=1,k=16,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=1,k=16,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=1,k=16,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=1,k=16,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=1,k=16,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=1,k=16,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=16,k=1,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=16,k=1,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=16,k=1,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=16,k=1,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=16,k=1,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=16,k=1,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=16,k=1,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=16,k=1,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=16,k=1,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=16,k=1,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=16,k=1,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=16,k=1,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=16,k=1,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=16,k=1,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=16,k=1,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=16,k=1,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=16,k=16,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=16,k=16,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=16,k=16,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=16,k=16,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=16,k=16,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=16,k=16,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=16,k=16,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=16,k=16,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=16,k=16,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=16,k=16,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=16,k=16,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=16,k=16,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=16,k=16,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=16,k=16,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=16,k=16,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f32,m=256,n=16,k=16,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=1,k=1,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=1,k=1,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=1,k=1,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=1,k=1,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=1,k=1,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=1,k=1,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=1,k=1,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=1,k=1,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=1,k=1,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=1,k=1,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=1,k=1,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=1,k=1,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=1,k=1,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=1,k=1,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=1,k=1,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=1,k=1,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=1,k=16,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=1,k=16,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=1,k=16,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=1,k=16,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=1,k=16,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=1,k=16,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=1,k=16,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=1,k=16,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=1,k=16,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=1,k=16,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=1,k=16,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=1,k=16,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=1,k=16,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=1,k=16,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=1,k=16,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=1,k=16,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=16,k=1,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=16,k=1,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=16,k=1,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=16,k=1,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=16,k=1,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=16,k=1,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=16,k=1,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=16,k=1,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=16,k=1,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=16,k=1,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=16,k=1,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=16,k=1,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=16,k=1,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=16,k=1,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=16,k=1,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=16,k=1,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=16,k=16,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=16,k=16,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=16,k=16,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=16,k=16,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=16,k=16,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=16,k=16,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=16,k=16,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=16,k=16,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=16,k=16,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=16,k=16,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=16,k=16,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=16,k=16,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=16,k=16,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=16,k=16,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=16,k=16,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=f16,type_b=f16,m=256,n=16,k=16,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=1,k=1,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=1,k=1,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=1,k=1,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=1,k=1,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=1,k=1,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=1,k=1,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=1,k=1,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=1,k=1,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=1,k=1,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=1,k=1,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=1,k=1,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=1,k=1,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=1,k=1,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=1,k=1,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=1,k=1,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=1,k=1,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=1,k=16,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=1,k=16,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=1,k=16,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=1,k=16,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=1,k=16,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=1,k=16,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=1,k=16,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=1,k=16,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=1,k=16,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=1,k=16,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=1,k=16,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=1,k=16,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=1,k=16,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=1,k=16,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=1,k=16,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=1,k=16,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=16,k=1,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=16,k=1,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=16,k=1,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=16,k=1,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=16,k=1,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=16,k=1,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=16,k=1,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=16,k=1,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=16,k=1,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=16,k=1,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=16,k=1,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=16,k=1,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=16,k=1,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=16,k=1,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=16,k=1,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=16,k=1,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=16,k=16,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=16,k=16,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=16,k=16,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=16,k=16,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=16,k=16,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=16,k=16,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=16,k=16,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=16,k=16,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=16,k=16,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=16,k=16,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=16,k=16,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=16,k=16,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=16,k=16,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=16,k=16,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=16,k=16,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f32,m=256,n=16,k=16,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=1,k=1,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=1,k=1,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=1,k=1,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=1,k=1,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=1,k=1,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=1,k=1,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=1,k=1,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=1,k=1,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=1,k=1,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=1,k=1,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=1,k=1,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=1,k=1,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=1,k=1,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=1,k=1,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=1,k=1,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=1,k=1,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=1,k=16,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=1,k=16,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=1,k=16,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=1,k=16,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=1,k=16,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=1,k=16,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=1,k=16,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=1,k=16,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=1,k=16,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=1,k=16,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=1,k=16,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=1,k=16,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=1,k=16,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=1,k=16,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=1,k=16,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=1,k=16,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=16,k=1,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=16,k=1,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=16,k=1,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=16,k=1,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=16,k=1,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=16,k=1,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=16,k=1,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=16,k=1,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=16,k=1,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=16,k=1,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=16,k=1,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=16,k=1,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=16,k=1,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=16,k=1,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=16,k=1,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=16,k=1,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=16,k=16,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=16,k=16,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=16,k=16,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=16,k=16,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=16,k=16,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=16,k=16,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=16,k=16,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=16,k=16,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=16,k=16,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=16,k=16,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=16,k=16,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=16,k=16,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=16,k=16,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=16,k=16,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=16,k=16,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q8_0,type_b=f16,m=256,n=16,k=16,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=1,k=1,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=1,k=1,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=1,k=1,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=1,k=1,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=1,k=1,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=1,k=1,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=1,k=1,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=1,k=1,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=1,k=1,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=1,k=1,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=1,k=1,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=1,k=1,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=1,k=1,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=1,k=1,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=1,k=1,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=1,k=1,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=1,k=16,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=1,k=16,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=1,k=16,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=1,k=16,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=1,k=16,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=1,k=16,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=1,k=16,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=1,k=16,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=1,k=16,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=1,k=16,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=1,k=16,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=1,k=16,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=1,k=16,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=1,k=16,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=1,k=16,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=1,k=16,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=16,k=1,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=16,k=1,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=16,k=1,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=16,k=1,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=16,k=1,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=16,k=1,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=16,k=1,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=16,k=1,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=16,k=1,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=16,k=1,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=16,k=1,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=16,k=1,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=16,k=1,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=16,k=1,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=16,k=1,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=16,k=1,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=16,k=16,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=16,k=16,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=16,k=16,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=16,k=16,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=16,k=16,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=16,k=16,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=16,k=16,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=16,k=16,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=16,k=16,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=16,k=16,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=16,k=16,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=16,k=16,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=16,k=16,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=16,k=16,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=16,k=16,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f32,m=256,n=16,k=16,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=1,k=1,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=1,k=1,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=1,k=1,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=1,k=1,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=1,k=1,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=1,k=1,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=1,k=1,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=1,k=1,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=1,k=1,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=1,k=1,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=1,k=1,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=1,k=1,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=1,k=1,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=1,k=1,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=1,k=1,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=1,k=1,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=1,k=16,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=1,k=16,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=1,k=16,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=1,k=16,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=1,k=16,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=1,k=16,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=1,k=16,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=1,k=16,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=1,k=16,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=1,k=16,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=1,k=16,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=1,k=16,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=1,k=16,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=1,k=16,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=1,k=16,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=1,k=16,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=16,k=1,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=16,k=1,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=16,k=1,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=16,k=1,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=16,k=1,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=16,k=1,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=16,k=1,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=16,k=1,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=16,k=1,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=16,k=1,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=16,k=1,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=16,k=1,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=16,k=1,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=16,k=1,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=16,k=1,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=16,k=1,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=16,k=16,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=16,k=16,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=16,k=16,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=16,k=16,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=16,k=16,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=16,k=16,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=16,k=16,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=16,k=16,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=16,k=16,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=16,k=16,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=16,k=16,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=16,k=16,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=16,k=16,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=16,k=16,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=16,k=16,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_0,type_b=f16,m=256,n=16,k=16,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=1,k=1,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=1,k=1,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=1,k=1,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=1,k=1,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=1,k=1,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=1,k=1,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=1,k=1,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=1,k=1,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=1,k=1,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=1,k=1,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=1,k=1,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=1,k=1,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=1,k=1,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=1,k=1,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=1,k=1,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=1,k=1,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=1,k=16,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=1,k=16,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=1,k=16,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=1,k=16,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=1,k=16,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=1,k=16,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=1,k=16,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=1,k=16,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=1,k=16,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=1,k=16,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=1,k=16,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=1,k=16,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=1,k=16,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=1,k=16,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=1,k=16,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=1,k=16,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=16,k=1,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=16,k=1,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=16,k=1,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=16,k=1,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=16,k=1,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=16,k=1,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=16,k=1,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=16,k=1,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=16,k=1,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=16,k=1,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=16,k=1,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=16,k=1,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=16,k=1,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=16,k=1,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=16,k=1,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=16,k=1,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=16,k=16,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=16,k=16,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=16,k=16,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=16,k=16,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=16,k=16,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=16,k=16,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=16,k=16,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=16,k=16,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=16,k=16,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=16,k=16,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=16,k=16,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=16,k=16,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=16,k=16,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=16,k=16,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=16,k=16,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f32,m=256,n=16,k=16,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=1,k=1,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=1,k=1,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=1,k=1,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=1,k=1,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=1,k=1,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=1,k=1,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=1,k=1,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=1,k=1,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=1,k=1,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=1,k=1,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=1,k=1,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=1,k=1,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=1,k=1,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=1,k=1,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=1,k=1,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=1,k=1,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=1,k=16,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=1,k=16,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=1,k=16,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=1,k=16,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=1,k=16,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=1,k=16,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=1,k=16,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=1,k=16,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=1,k=16,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=1,k=16,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=1,k=16,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=1,k=16,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=1,k=16,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=1,k=16,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=1,k=16,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=1,k=16,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=16,k=1,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=16,k=1,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=16,k=1,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=16,k=1,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=16,k=1,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=16,k=1,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=16,k=1,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=16,k=1,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=16,k=1,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=16,k=1,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=16,k=1,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=16,k=1,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=16,k=1,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=16,k=1,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=16,k=1,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=16,k=1,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=16,k=16,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=16,k=16,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=16,k=16,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=16,k=16,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=16,k=16,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=16,k=16,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=16,k=16,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=16,k=16,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=16,k=16,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=16,k=16,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=16,k=16,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=16,k=16,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=16,k=16,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=16,k=16,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=16,k=16,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_1,type_b=f16,m=256,n=16,k=16,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=1,k=1,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=1,k=1,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=1,k=1,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=1,k=1,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=1,k=1,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=1,k=1,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=1,k=1,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=1,k=1,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=1,k=1,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=1,k=1,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=1,k=1,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=1,k=1,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=1,k=1,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=1,k=1,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=1,k=1,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=1,k=1,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=1,k=16,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=1,k=16,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=1,k=16,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=1,k=16,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=1,k=16,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=1,k=16,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=1,k=16,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=1,k=16,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=1,k=16,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=1,k=16,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=1,k=16,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=1,k=16,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=1,k=16,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=1,k=16,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=1,k=16,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=1,k=16,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=16,k=1,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=16,k=1,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=16,k=1,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=16,k=1,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=16,k=1,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=16,k=1,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=16,k=1,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=16,k=1,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=16,k=1,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=16,k=1,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=16,k=1,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=16,k=1,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=16,k=1,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=16,k=1,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=16,k=1,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=16,k=1,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=16,k=16,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=16,k=16,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=16,k=16,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=16,k=16,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=16,k=16,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=16,k=16,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=16,k=16,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=16,k=16,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=16,k=16,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=16,k=16,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=16,k=16,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=16,k=16,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=16,k=16,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=16,k=16,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=16,k=16,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f32,m=256,n=16,k=16,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=1,k=1,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=1,k=1,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=1,k=1,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=1,k=1,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=1,k=1,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=1,k=1,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=1,k=1,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=1,k=1,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=1,k=1,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=1,k=1,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=1,k=1,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=1,k=1,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=1,k=1,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=1,k=1,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=1,k=1,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=1,k=1,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=1,k=16,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=1,k=16,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=1,k=16,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=1,k=16,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=1,k=16,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=1,k=16,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=1,k=16,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=1,k=16,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=1,k=16,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=1,k=16,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=1,k=16,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=1,k=16,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=1,k=16,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=1,k=16,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=1,k=16,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=1,k=16,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=16,k=1,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=16,k=1,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=16,k=1,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=16,k=1,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=16,k=1,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=16,k=1,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=16,k=1,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=16,k=1,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=16,k=1,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=16,k=1,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=16,k=1,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=16,k=1,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=16,k=1,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=16,k=1,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=16,k=1,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=16,k=1,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=16,k=16,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=16,k=16,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=16,k=16,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=16,k=16,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=16,k=16,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=16,k=16,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=16,k=16,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=16,k=16,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=16,k=16,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=16,k=16,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=16,k=16,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=16,k=16,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=16,k=16,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=16,k=16,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=16,k=16,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=q4_K,type_b=f16,m=256,n=16,k=16,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=1,k=1,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=1,k=1,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=1,k=1,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=1,k=1,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=1,k=1,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=1,k=1,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=1,k=1,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=1,k=1,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=1,k=1,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=1,k=1,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=1,k=1,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=1,k=1,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=1,k=1,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=1,k=1,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=1,k=1,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=1,k=1,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=1,k=16,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=1,k=16,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=1,k=16,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=1,k=16,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=1,k=16,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=1,k=16,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=1,k=16,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=1,k=16,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=1,k=16,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=1,k=16,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=1,k=16,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=1,k=16,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=1,k=16,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=1,k=16,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=1,k=16,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=1,k=16,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=16,k=1,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=16,k=1,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=16,k=1,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=16,k=1,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=16,k=1,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=16,k=1,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=16,k=1,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=16,k=1,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=16,k=1,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=16,k=1,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=16,k=1,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=16,k=1,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=16,k=1,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=16,k=1,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=16,k=1,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=16,k=1,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=16,k=16,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=16,k=16,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=16,k=16,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=16,k=16,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=16,k=16,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=16,k=16,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=16,k=16,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=16,k=16,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=16,k=16,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=16,k=16,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=16,k=16,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=16,k=16,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=16,k=16,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=16,k=16,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=16,k=16,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f32,m=256,n=16,k=16,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=1,k=1,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=1,k=1,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=1,k=1,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=1,k=1,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=1,k=1,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=1,k=1,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=1,k=1,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=1,k=1,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=1,k=1,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=1,k=1,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=1,k=1,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=1,k=1,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=1,k=1,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=1,k=1,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=1,k=1,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=1,k=1,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=1,k=16,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=1,k=16,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=1,k=16,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=1,k=16,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=1,k=16,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=1,k=16,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=1,k=16,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=1,k=16,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=1,k=16,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=1,k=16,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=1,k=16,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=1,k=16,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=1,k=16,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=1,k=16,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=1,k=16,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=1,k=16,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=16,k=1,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=16,k=1,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=16,k=1,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=16,k=1,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=16,k=1,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=16,k=1,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=16,k=1,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=16,k=1,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=16,k=1,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=16,k=1,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=16,k=1,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=16,k=1,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=16,k=1,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=16,k=1,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=16,k=1,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=16,k=1,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=16,k=16,bs=[1,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=16,k=16,bs=[1,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=16,k=16,bs=[1,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=16,k=16,bs=[1,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=16,k=16,bs=[1,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=16,k=16,bs=[1,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=16,k=16,bs=[1,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=16,k=16,bs=[1,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=16,k=16,bs=[3,1],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=16,k=16,bs=[3,1],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=16,k=16,bs=[3,1],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=16,k=16,bs=[3,1],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=16,k=16,bs=[3,3],nr=[1,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=16,k=16,bs=[3,3],nr=[1,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=16,k=16,bs=[3,3],nr=[2,1],trans_b=0): not supported [BLAS] not supported [CPU] 879s OUT_PROD(type_a=iq2_xxs,type_b=f16,m=256,n=16,k=16,bs=[3,3],nr=[2,2],trans_b=0): not supported [BLAS] not supported [CPU] 879s SQR(type=f16,ne=[10,5,4,3]): not supported [BLAS] 879s SQRT(type=f16,ne=[10,3,3,2]): not supported [BLAS] 879s LOG(type=f16,ne=[10,5,4,3]): not supported [BLAS] 879s SIN(type=f16,ne=[10,2,2,2]): not supported [BLAS] 879s COS(type=f16,ne=[10,2,2,2]): not supported [BLAS] 879s CLAMP(type=f16,ne=[10,5,4,3],min=-0.500000,max=0.500000): not supported [BLAS] 879s SQR(type=f32,ne=[10,5,4,3]): not supported [BLAS] 879s SQRT(type=f32,ne=[10,3,3,2]): not supported [BLAS] 879s LOG(type=f32,ne=[10,5,4,3]): not supported [BLAS] 879s SIN(type=f32,ne=[10,2,2,2]): not supported [BLAS] 879s COS(type=f32,ne=[10,2,2,2]): not supported [BLAS] 879s CLAMP(type=f32,ne=[10,5,4,3],min=-0.500000,max=0.500000): not supported [BLAS] 879s DIAG_MASK_INF(type=f32,ne=[10,10,1,1],n_past=5): not supported [BLAS] 879s DIAG_MASK_INF(type=f32,ne=[10,10,3,1],n_past=5): not supported [BLAS] 879s DIAG_MASK_INF(type=f32,ne=[10,10,3,2],n_past=5): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[16,16,1,1],mask=0,m_prec=f32,scale=1.000000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[15,15,1,1],mask=0,m_prec=f32,scale=1.000000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[16,1024,1,1],mask=0,m_prec=f32,scale=1.000000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[15,1023,1,1],mask=0,m_prec=f32,scale=1.000000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1024,16,1,1],mask=0,m_prec=f32,scale=1.000000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1023,15,1,1],mask=0,m_prec=f32,scale=1.000000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1024,1024,1,1],mask=0,m_prec=f32,scale=1.000000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1023,1023,1,1],mask=0,m_prec=f32,scale=1.000000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[16,16,1,1],mask=0,m_prec=f32,scale=0.100000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[15,15,1,1],mask=0,m_prec=f32,scale=0.100000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[16,1024,1,1],mask=0,m_prec=f32,scale=0.100000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[15,1023,1,1],mask=0,m_prec=f32,scale=0.100000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1024,16,1,1],mask=0,m_prec=f32,scale=0.100000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1023,15,1,1],mask=0,m_prec=f32,scale=0.100000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1024,1024,1,1],mask=0,m_prec=f32,scale=0.100000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1023,1023,1,1],mask=0,m_prec=f32,scale=0.100000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[16,16,1,1],mask=1,m_prec=f32,scale=1.000000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[15,15,1,1],mask=1,m_prec=f32,scale=1.000000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[16,16,1,1],mask=1,m_prec=f16,scale=1.000000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[15,15,1,1],mask=1,m_prec=f16,scale=1.000000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[16,1024,1,1],mask=1,m_prec=f32,scale=1.000000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[15,1023,1,1],mask=1,m_prec=f32,scale=1.000000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[16,1024,1,1],mask=1,m_prec=f16,scale=1.000000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[15,1023,1,1],mask=1,m_prec=f16,scale=1.000000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1024,16,1,1],mask=1,m_prec=f32,scale=1.000000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1023,15,1,1],mask=1,m_prec=f32,scale=1.000000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1024,16,1,1],mask=1,m_prec=f16,scale=1.000000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1023,15,1,1],mask=1,m_prec=f16,scale=1.000000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1024,1024,1,1],mask=1,m_prec=f32,scale=1.000000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1023,1023,1,1],mask=1,m_prec=f32,scale=1.000000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1024,1024,1,1],mask=1,m_prec=f16,scale=1.000000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1023,1023,1,1],mask=1,m_prec=f16,scale=1.000000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[16,16,1,1],mask=1,m_prec=f32,scale=0.100000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[15,15,1,1],mask=1,m_prec=f32,scale=0.100000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[16,16,1,1],mask=1,m_prec=f16,scale=0.100000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[15,15,1,1],mask=1,m_prec=f16,scale=0.100000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[16,1024,1,1],mask=1,m_prec=f32,scale=0.100000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[15,1023,1,1],mask=1,m_prec=f32,scale=0.100000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[16,1024,1,1],mask=1,m_prec=f16,scale=0.100000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[15,1023,1,1],mask=1,m_prec=f16,scale=0.100000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1024,16,1,1],mask=1,m_prec=f32,scale=0.100000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1023,15,1,1],mask=1,m_prec=f32,scale=0.100000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1024,16,1,1],mask=1,m_prec=f16,scale=0.100000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1023,15,1,1],mask=1,m_prec=f16,scale=0.100000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1024,1024,1,1],mask=1,m_prec=f32,scale=0.100000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1023,1023,1,1],mask=1,m_prec=f32,scale=0.100000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1024,1024,1,1],mask=1,m_prec=f16,scale=0.100000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1023,1023,1,1],mask=1,m_prec=f16,scale=0.100000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[16,16,1,1],mask=1,m_prec=f32,scale=1.000000,max_bias=8.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[15,15,1,1],mask=1,m_prec=f32,scale=1.000000,max_bias=8.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[16,16,1,1],mask=1,m_prec=f16,scale=1.000000,max_bias=8.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[15,15,1,1],mask=1,m_prec=f16,scale=1.000000,max_bias=8.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[16,1024,1,1],mask=1,m_prec=f32,scale=1.000000,max_bias=8.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[15,1023,1,1],mask=1,m_prec=f32,scale=1.000000,max_bias=8.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[16,1024,1,1],mask=1,m_prec=f16,scale=1.000000,max_bias=8.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[15,1023,1,1],mask=1,m_prec=f16,scale=1.000000,max_bias=8.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1024,16,1,1],mask=1,m_prec=f32,scale=1.000000,max_bias=8.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1023,15,1,1],mask=1,m_prec=f32,scale=1.000000,max_bias=8.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1024,16,1,1],mask=1,m_prec=f16,scale=1.000000,max_bias=8.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1023,15,1,1],mask=1,m_prec=f16,scale=1.000000,max_bias=8.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1024,1024,1,1],mask=1,m_prec=f32,scale=1.000000,max_bias=8.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1023,1023,1,1],mask=1,m_prec=f32,scale=1.000000,max_bias=8.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1024,1024,1,1],mask=1,m_prec=f16,scale=1.000000,max_bias=8.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1023,1023,1,1],mask=1,m_prec=f16,scale=1.000000,max_bias=8.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[16,16,1,1],mask=1,m_prec=f32,scale=0.100000,max_bias=8.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[15,15,1,1],mask=1,m_prec=f32,scale=0.100000,max_bias=8.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[16,16,1,1],mask=1,m_prec=f16,scale=0.100000,max_bias=8.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[15,15,1,1],mask=1,m_prec=f16,scale=0.100000,max_bias=8.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[16,1024,1,1],mask=1,m_prec=f32,scale=0.100000,max_bias=8.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[15,1023,1,1],mask=1,m_prec=f32,scale=0.100000,max_bias=8.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[16,1024,1,1],mask=1,m_prec=f16,scale=0.100000,max_bias=8.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[15,1023,1,1],mask=1,m_prec=f16,scale=0.100000,max_bias=8.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1024,16,1,1],mask=1,m_prec=f32,scale=0.100000,max_bias=8.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1023,15,1,1],mask=1,m_prec=f32,scale=0.100000,max_bias=8.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1024,16,1,1],mask=1,m_prec=f16,scale=0.100000,max_bias=8.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1023,15,1,1],mask=1,m_prec=f16,scale=0.100000,max_bias=8.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1024,1024,1,1],mask=1,m_prec=f32,scale=0.100000,max_bias=8.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1023,1023,1,1],mask=1,m_prec=f32,scale=0.100000,max_bias=8.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1024,1024,1,1],mask=1,m_prec=f16,scale=0.100000,max_bias=8.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[1023,1023,1,1],mask=1,m_prec=f16,scale=0.100000,max_bias=8.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[16,2,32,1],mask=1,m_prec=f32,scale=0.100000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[16,2,32,1],mask=1,m_prec=f16,scale=0.100000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[16,2,32,1],mask=0,m_prec=f32,scale=0.100000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[32,2,32,1],mask=1,m_prec=f32,scale=0.100000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[32,2,32,1],mask=1,m_prec=f16,scale=0.100000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[32,2,32,1],mask=1,m_prec=f32,scale=0.100000,max_bias=8.000000): not supported [BLAS] 879s SOFT_MAX(type=f32,ne=[32,2,32,1],mask=1,m_prec=f16,scale=0.100000,max_bias=8.000000): not supported [BLAS] 879s SOFT_MAX_BACK(type=f32,ne=[16,16,1,1],scale=1.000000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX_BACK(type=f32,ne=[15,15,1,1],scale=1.000000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX_BACK(type=f32,ne=[16,1024,1,1],scale=1.000000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX_BACK(type=f32,ne=[15,1023,1,1],scale=1.000000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX_BACK(type=f32,ne=[1024,16,1,1],scale=1.000000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX_BACK(type=f32,ne=[1023,15,1,1],scale=1.000000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX_BACK(type=f32,ne=[1024,1024,1,1],scale=1.000000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX_BACK(type=f32,ne=[1023,1023,1,1],scale=1.000000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX_BACK(type=f32,ne=[16,16,1,1],scale=0.100000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX_BACK(type=f32,ne=[15,15,1,1],scale=0.100000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX_BACK(type=f32,ne=[16,1024,1,1],scale=0.100000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX_BACK(type=f32,ne=[15,1023,1,1],scale=0.100000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX_BACK(type=f32,ne=[1024,16,1,1],scale=0.100000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX_BACK(type=f32,ne=[1023,15,1,1],scale=0.100000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX_BACK(type=f32,ne=[1024,1024,1,1],scale=0.100000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX_BACK(type=f32,ne=[1023,1023,1,1],scale=0.100000,max_bias=0.000000): not supported [BLAS] 879s SOFT_MAX_BACK(type=f32,ne=[16,16,1,1],scale=1.000000,max_bias=8.000000): not supported [BLAS] not supported [CPU] 879s SOFT_MAX_BACK(type=f32,ne=[15,15,1,1],scale=1.000000,max_bias=8.000000): not supported [BLAS] not supported [CPU] 879s SOFT_MAX_BACK(type=f32,ne=[16,1024,1,1],scale=1.000000,max_bias=8.000000): not supported [BLAS] not supported [CPU] 879s SOFT_MAX_BACK(type=f32,ne=[15,1023,1,1],scale=1.000000,max_bias=8.000000): not supported [BLAS] not supported [CPU] 879s SOFT_MAX_BACK(type=f32,ne=[1024,16,1,1],scale=1.000000,max_bias=8.000000): not supported [BLAS] not supported [CPU] 879s SOFT_MAX_BACK(type=f32,ne=[1023,15,1,1],scale=1.000000,max_bias=8.000000): not supported [BLAS] not supported [CPU] 879s SOFT_MAX_BACK(type=f32,ne=[1024,1024,1,1],scale=1.000000,max_bias=8.000000): not supported [BLAS] not supported [CPU] 879s SOFT_MAX_BACK(type=f32,ne=[1023,1023,1,1],scale=1.000000,max_bias=8.000000): not supported [BLAS] not supported [CPU] 879s SOFT_MAX_BACK(type=f32,ne=[16,16,1,1],scale=0.100000,max_bias=8.000000): not supported [BLAS] not supported [CPU] 879s SOFT_MAX_BACK(type=f32,ne=[15,15,1,1],scale=0.100000,max_bias=8.000000): not supported [BLAS] not supported [CPU] 879s SOFT_MAX_BACK(type=f32,ne=[16,1024,1,1],scale=0.100000,max_bias=8.000000): not supported [BLAS] not supported [CPU] 879s SOFT_MAX_BACK(type=f32,ne=[15,1023,1,1],scale=0.100000,max_bias=8.000000): not supported [BLAS] not supported [CPU] 879s SOFT_MAX_BACK(type=f32,ne=[1024,16,1,1],scale=0.100000,max_bias=8.000000): not supported [BLAS] not supported [CPU] 879s SOFT_MAX_BACK(type=f32,ne=[1023,15,1,1],scale=0.100000,max_bias=8.000000): not supported [BLAS] not supported [CPU] 879s SOFT_MAX_BACK(type=f32,ne=[1024,1024,1,1],scale=0.100000,max_bias=8.000000): not supported [BLAS] not supported [CPU] 879s SOFT_MAX_BACK(type=f32,ne=[1023,1023,1,1],scale=0.100000,max_bias=8.000000): not supported [BLAS] not supported [CPU] 879s ROPE(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,40,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,52,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,64,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,1,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,71,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,8,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[80,32,2,1],n_dims=20,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[80,32,2,1],n_dims=32,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,12,2,1],n_dims=128,mode=8,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,28,2,1],n_dims=128,mode=8,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[80,16,2,1],n_dims=80,mode=24,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,40,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,52,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,64,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,1,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,71,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,8,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[80,32,2,1],n_dims=20,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[80,32,2,1],n_dims=32,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,12,2,1],n_dims=128,mode=8,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,28,2,1],n_dims=128,mode=8,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[80,16,2,1],n_dims=80,mode=24,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,40,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,52,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,64,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,1,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,71,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,8,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[80,32,2,1],n_dims=20,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[80,32,2,1],n_dims=32,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,12,2,1],n_dims=128,mode=8,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,28,2,1],n_dims=128,mode=8,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[80,16,2,1],n_dims=80,mode=24,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,40,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,52,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,64,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,1,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,71,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,8,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[80,32,2,1],n_dims=20,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[80,32,2,1],n_dims=32,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,12,2,1],n_dims=128,mode=8,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,28,2,1],n_dims=128,mode=8,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[80,16,2,1],n_dims=80,mode=24,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.424500,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.424500,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.424500,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.424500,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.424500,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.424500,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.424500,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.424500,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.746500,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.746500,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.746500,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.746500,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.746500,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.746500,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.746500,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.746500,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.746500,af=1.424500,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.746500,af=1.424500,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.746500,af=1.424500,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.746500,af=1.424500,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.746500,af=1.424500,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.746500,af=1.424500,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.746500,af=1.424500,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.746500,af=1.424500,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.000000,af=1.424500,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.000000,af=1.424500,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.000000,af=1.424500,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.000000,af=1.424500,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.000000,af=1.424500,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.000000,af=1.424500,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.000000,af=1.424500,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.000000,af=1.424500,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.746500,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.746500,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.746500,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.746500,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.746500,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.746500,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.746500,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.746500,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.746500,af=1.424500,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.746500,af=1.424500,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.746500,af=1.424500,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.746500,af=1.424500,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.746500,af=1.424500,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.746500,af=1.424500,ff=0,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.746500,af=1.424500,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.746500,af=1.424500,ff=1,v=0): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=1): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=1): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=1): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=1): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=1): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=1): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=1): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=1): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.424500,ff=0,v=1): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.424500,ff=0,v=1): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.424500,ff=1,v=1): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.424500,ff=1,v=1): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.424500,ff=0,v=1): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.424500,ff=0,v=1): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.424500,ff=1,v=1): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.424500,ff=1,v=1): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.746500,af=1.000000,ff=0,v=1): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.746500,af=1.000000,ff=0,v=1): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.746500,af=1.000000,ff=1,v=1): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.746500,af=1.000000,ff=1,v=1): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.746500,af=1.000000,ff=0,v=1): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.746500,af=1.000000,ff=0,v=1): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.746500,af=1.000000,ff=1,v=1): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.746500,af=1.000000,ff=1,v=1): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.746500,af=1.424500,ff=0,v=1): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.746500,af=1.424500,ff=0,v=1): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.746500,af=1.424500,ff=1,v=1): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.746500,af=1.424500,ff=1,v=1): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.746500,af=1.424500,ff=0,v=1): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.746500,af=1.424500,ff=0,v=1): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.746500,af=1.424500,ff=1,v=1): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.746500,af=1.424500,ff=1,v=1): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.000000,af=1.000000,ff=0,v=1): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.000000,af=1.000000,ff=0,v=1): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.000000,af=1.000000,ff=1,v=1): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.000000,af=1.000000,ff=1,v=1): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.000000,af=1.000000,ff=0,v=1): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.000000,af=1.000000,ff=0,v=1): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.000000,af=1.000000,ff=1,v=1): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.000000,af=1.000000,ff=1,v=1): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.000000,af=1.424500,ff=0,v=1): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.000000,af=1.424500,ff=0,v=1): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.000000,af=1.424500,ff=1,v=1): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.000000,af=1.424500,ff=1,v=1): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.000000,af=1.424500,ff=0,v=1): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.000000,af=1.424500,ff=0,v=1): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.000000,af=1.424500,ff=1,v=1): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.000000,af=1.424500,ff=1,v=1): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.746500,af=1.000000,ff=0,v=1): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.746500,af=1.000000,ff=0,v=1): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.746500,af=1.000000,ff=1,v=1): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.746500,af=1.000000,ff=1,v=1): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.746500,af=1.000000,ff=0,v=1): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.746500,af=1.000000,ff=0,v=1): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.746500,af=1.000000,ff=1,v=1): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.746500,af=1.000000,ff=1,v=1): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.746500,af=1.424500,ff=0,v=1): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.746500,af=1.424500,ff=0,v=1): not supported [BLAS] 879s ROPE(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.746500,af=1.424500,ff=1,v=1): not supported [BLAS] 879s ROPE(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.746500,af=1.424500,ff=1,v=1): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.746500,af=1.424500,ff=0,v=1): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.746500,af=1.424500,ff=0,v=1): not supported [BLAS] 879s ROPE(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.746500,af=1.424500,ff=1,v=1): not supported [BLAS] 879s ROPE(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.746500,af=1.424500,ff=1,v=1): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,40,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,52,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,64,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,1,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,71,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,8,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[80,32,2,1],n_dims=20,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[80,32,2,1],n_dims=32,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,12,2,1],n_dims=128,mode=8,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,28,2,1],n_dims=128,mode=8,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[80,16,2,1],n_dims=80,mode=24,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,40,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,52,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,64,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,1,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,71,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,8,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[80,32,2,1],n_dims=20,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[80,32,2,1],n_dims=32,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,12,2,1],n_dims=128,mode=8,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,28,2,1],n_dims=128,mode=8,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[80,16,2,1],n_dims=80,mode=24,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,40,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,52,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,64,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,1,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,71,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,8,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[80,32,2,1],n_dims=20,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[80,32,2,1],n_dims=32,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,12,2,1],n_dims=128,mode=8,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,28,2,1],n_dims=128,mode=8,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[80,16,2,1],n_dims=80,mode=24,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,40,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,52,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,64,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,1,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,71,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,8,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[80,32,2,1],n_dims=20,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[80,32,2,1],n_dims=32,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,12,2,1],n_dims=128,mode=8,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,28,2,1],n_dims=128,mode=8,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[80,16,2,1],n_dims=80,mode=24,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.424500,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.424500,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.424500,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.424500,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.424500,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.424500,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.424500,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.424500,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.746500,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.746500,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.746500,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.746500,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.746500,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.746500,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.746500,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.746500,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.746500,af=1.424500,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.746500,af=1.424500,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.746500,af=1.424500,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.746500,af=1.424500,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.746500,af=1.424500,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.746500,af=1.424500,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.746500,af=1.424500,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.746500,af=1.424500,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.000000,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.000000,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.000000,af=1.424500,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.000000,af=1.424500,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.000000,af=1.424500,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.000000,af=1.424500,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.000000,af=1.424500,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.000000,af=1.424500,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.000000,af=1.424500,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.000000,af=1.424500,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.746500,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.746500,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.746500,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.746500,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.746500,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.746500,af=1.000000,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.746500,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.746500,af=1.000000,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.746500,af=1.424500,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.746500,af=1.424500,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.746500,af=1.424500,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.746500,af=1.424500,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.746500,af=1.424500,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.746500,af=1.424500,ff=0,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.746500,af=1.424500,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.746500,af=1.424500,ff=1,v=0): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=1): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=1): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=1): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=1): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=1): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=0,v=1): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=1): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.000000,ff=1,v=1): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.424500,ff=0,v=1): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.424500,ff=0,v=1): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.424500,ff=1,v=1): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.424500,ff=1,v=1): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.424500,ff=0,v=1): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.424500,ff=0,v=1): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.000000,af=1.424500,ff=1,v=1): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.000000,af=1.424500,ff=1,v=1): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.746500,af=1.000000,ff=0,v=1): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.746500,af=1.000000,ff=0,v=1): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.746500,af=1.000000,ff=1,v=1): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.746500,af=1.000000,ff=1,v=1): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.746500,af=1.000000,ff=0,v=1): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.746500,af=1.000000,ff=0,v=1): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.746500,af=1.000000,ff=1,v=1): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.746500,af=1.000000,ff=1,v=1): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.746500,af=1.424500,ff=0,v=1): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.746500,af=1.424500,ff=0,v=1): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.746500,af=1.424500,ff=1,v=1): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.746500,af=1.424500,ff=1,v=1): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.746500,af=1.424500,ff=0,v=1): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.746500,af=1.424500,ff=0,v=1): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.000000,ef=0.746500,af=1.424500,ff=1,v=1): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.000000,ef=0.746500,af=1.424500,ff=1,v=1): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.000000,af=1.000000,ff=0,v=1): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.000000,af=1.000000,ff=0,v=1): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.000000,af=1.000000,ff=1,v=1): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.000000,af=1.000000,ff=1,v=1): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.000000,af=1.000000,ff=0,v=1): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.000000,af=1.000000,ff=0,v=1): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.000000,af=1.000000,ff=1,v=1): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.000000,af=1.000000,ff=1,v=1): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.000000,af=1.424500,ff=0,v=1): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.000000,af=1.424500,ff=0,v=1): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.000000,af=1.424500,ff=1,v=1): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.000000,af=1.424500,ff=1,v=1): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.000000,af=1.424500,ff=0,v=1): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.000000,af=1.424500,ff=0,v=1): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.000000,af=1.424500,ff=1,v=1): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.000000,af=1.424500,ff=1,v=1): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.746500,af=1.000000,ff=0,v=1): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.746500,af=1.000000,ff=0,v=1): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.746500,af=1.000000,ff=1,v=1): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.746500,af=1.000000,ff=1,v=1): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.746500,af=1.000000,ff=0,v=1): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.746500,af=1.000000,ff=0,v=1): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.746500,af=1.000000,ff=1,v=1): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.746500,af=1.000000,ff=1,v=1): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.746500,af=1.424500,ff=0,v=1): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.746500,af=1.424500,ff=0,v=1): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.746500,af=1.424500,ff=1,v=1): not supported [BLAS] 879s ROPE_BACK(type=f32,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.746500,af=1.424500,ff=1,v=1): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.746500,af=1.424500,ff=0,v=1): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.746500,af=1.424500,ff=0,v=1): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[128,32,2,1],n_dims=128,mode=0,n_ctx=512,fs=1.424500,ef=0.746500,af=1.424500,ff=1,v=1): not supported [BLAS] 879s ROPE_BACK(type=f16,ne_a=[64,128,2,1],n_dims=64,mode=2,n_ctx=512,fs=1.424500,ef=0.746500,af=1.424500,ff=1,v=1): not supported [BLAS] 879s CONCAT(type=f32,ne_a=[11,12,13,14],ne_b_d=7,dim=0,v=0): not supported [BLAS] 879s CONCAT(type=i32,ne_a=[11,12,13,14],ne_b_d=7,dim=0,v=0): not supported [BLAS] 879s CONCAT(type=f32,ne_a=[11,12,13,14],ne_b_d=7,dim=1,v=0): not supported [BLAS] 879s CONCAT(type=i32,ne_a=[11,12,13,14],ne_b_d=7,dim=1,v=0): not supported [BLAS] 879s CONCAT(type=f32,ne_a=[11,12,13,14],ne_b_d=7,dim=2,v=0): not supported [BLAS] 879s CONCAT(type=i32,ne_a=[11,12,13,14],ne_b_d=7,dim=2,v=0): not supported [BLAS] 879s CONCAT(type=f32,ne_a=[11,12,13,14],ne_b_d=7,dim=3,v=0): not supported [BLAS] 879s CONCAT(type=i32,ne_a=[11,12,13,14],ne_b_d=7,dim=3,v=0): not supported [BLAS] 879s CONCAT(type=f32,ne_a=[11,12,13,14],ne_b_d=7,dim=0,v=1): not supported [BLAS] 879s CONCAT(type=i32,ne_a=[11,12,13,14],ne_b_d=7,dim=0,v=1): not supported [BLAS] 879s CONCAT(type=f32,ne_a=[11,12,13,14],ne_b_d=7,dim=1,v=1): not supported [BLAS] 879s CONCAT(type=i32,ne_a=[11,12,13,14],ne_b_d=7,dim=1,v=1): not supported [BLAS] 879s CONCAT(type=f32,ne_a=[11,12,13,14],ne_b_d=7,dim=2,v=1): not supported [BLAS] 879s CONCAT(type=i32,ne_a=[11,12,13,14],ne_b_d=7,dim=2,v=1): not supported [BLAS] 879s CONCAT(type=f32,ne_a=[11,12,13,14],ne_b_d=7,dim=3,v=1): not supported [BLAS] 879s CONCAT(type=i32,ne_a=[11,12,13,14],ne_b_d=7,dim=3,v=1): not supported [BLAS] 879s CONCAT(type=f32,ne_a=[11,12,13,14],ne_b_d=7,dim=0,v=2): not supported [BLAS] 879s CONCAT(type=i32,ne_a=[11,12,13,14],ne_b_d=7,dim=0,v=2): not supported [BLAS] 879s CONCAT(type=f32,ne_a=[11,12,13,14],ne_b_d=7,dim=1,v=2): not supported [BLAS] 879s CONCAT(type=i32,ne_a=[11,12,13,14],ne_b_d=7,dim=1,v=2): not supported [BLAS] 879s CONCAT(type=f32,ne_a=[11,12,13,14],ne_b_d=7,dim=2,v=2): not supported [BLAS] 879s CONCAT(type=i32,ne_a=[11,12,13,14],ne_b_d=7,dim=2,v=2): not supported [BLAS] 879s CONCAT(type=f32,ne_a=[11,12,13,14],ne_b_d=7,dim=3,v=2): not supported [BLAS] 879s CONCAT(type=i32,ne_a=[11,12,13,14],ne_b_d=7,dim=3,v=2): not supported [BLAS] 879s CONCAT(type=f32,ne_a=[11,12,13,14],ne_b_d=7,dim=0,v=3): not supported [BLAS] 879s CONCAT(type=i32,ne_a=[11,12,13,14],ne_b_d=7,dim=0,v=3): not supported [BLAS] 879s CONCAT(type=f32,ne_a=[11,12,13,14],ne_b_d=7,dim=1,v=3): not supported [BLAS] 879s CONCAT(type=i32,ne_a=[11,12,13,14],ne_b_d=7,dim=1,v=3): not supported [BLAS] 879s CONCAT(type=f32,ne_a=[11,12,13,14],ne_b_d=7,dim=2,v=3): not supported [BLAS] 879s CONCAT(type=i32,ne_a=[11,12,13,14],ne_b_d=7,dim=2,v=3): not supported [BLAS] 879s CONCAT(type=f32,ne_a=[11,12,13,14],ne_b_d=7,dim=3,v=3): not supported [BLAS] 879s CONCAT(type=i32,ne_a=[11,12,13,14],ne_b_d=7,dim=3,v=3): not supported [BLAS] 879s ARGSORT(type=f32,ne=[8,1,1,1],order=0): not supported [BLAS] 879s ARGSORT(type=f32,ne=[16,10,10,10],order=0): not supported [BLAS] 879s ARGSORT(type=f32,ne=[60,10,10,10],order=0): not supported [BLAS] 879s ARGSORT(type=f32,ne=[8,1,1,1],order=1): not supported [BLAS] 879s ARGSORT(type=f32,ne=[16,10,10,10],order=1): not supported [BLAS] 879s ARGSORT(type=f32,ne=[60,10,10,10],order=1): not supported [BLAS] 879s UPSCALE(type=f32,ne=[512,512,3,2],scale_factor=2,mode=nearest,transpose=0): not supported [BLAS] 879s UPSCALE(type=f32,ne=[512,512,3,2],scale_factor=2,mode=nearest,transpose=1): not supported [BLAS] 879s UPSCALE(type=f32,ne=[2,5,7,11],ne_tgt=[5,7,11,13],mode=nearest): not supported [BLAS] 879s UPSCALE(type=f32,ne=[512,512,3,2],scale_factor=2,mode=bilinear,transpose=0): not supported [BLAS] 879s UPSCALE(type=f32,ne=[512,512,3,2],scale_factor=2,mode=bilinear,transpose=1): not supported [BLAS] 879s UPSCALE(type=f32,ne=[2,5,7,11],ne_tgt=[5,7,11,13],mode=bilinear): not supported [BLAS] 879s SUM(type=f32,ne=[10,5,4,3]): not supported [BLAS] 879s SUM_ROWS(type=f32,ne=[10,5,4,3]): not supported [BLAS] 879s MEAN(type=f32,ne=[10,5,4,3]): not supported [BLAS] 879s GROUP_NORM(type=f32,ne=[64,64,320,1],num_groups=32,eps=0.000001): not supported [BLAS] 879s GROUP_NORM(type=f32,ne=[9,9,1280,1],num_groups=32,eps=0.000001): not supported [BLAS] 879s ACC(type=f32,ne_a=[256,17,1,1],ne_b=[256,16,1,1]): not supported [BLAS] 879s PAD(type=f32,ne_a=[512,512,1,1],pad_0=1,pad_1=1): not supported [BLAS] 879s PAD_REFLECT_1D(type=f32,ne_a=[512,34,2,1],pad_0=10,pad_1=9): not supported [BLAS] 879s ARANGE(type=f32,start=0.000000,stop=10.000000,step=1.000000): not supported [BLAS] 879s TIMESTEP_EMBEDDING(type=f32,ne_a=[2,1,1,1],dim=320,max_period=10000): not supported [BLAS] 879s LEAKY_RELU(type=f32,ne_a=[10,5,4,3],negative_slope=0.100000): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=64,hsv=64,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=80,hsv=80,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=128,hsv=128,nh=4,nr=16,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=10.000000,prec=def,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=128,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=192,hsv=192,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=256,hsv=256,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,2,1,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=1,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=3,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=32,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=35,mask=1,max_bias=8.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=1,kv=1024,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=1,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=3,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=32,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=f16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=bf16,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q8_0,permute=[0,1,2,3]): not supported [BLAS] 879s FLASH_ATTN_EXT(hsk=576,hsv=512,nh=4,nr=4,kv=512,nb=35,mask=0,max_bias=0.000000,logit_softcap=0.000000,prec=f32,type_KV=q4_0,permute=[0,1,2,3]): not supported [BLAS] 879s CROSS_ENTROPY_LOSS(type=f32,ne=[10,5,4,3]): not supported [BLAS] 879s CROSS_ENTROPY_LOSS(type=f32,ne=[30000,1,1,1]): not supported [BLAS] 879s CROSS_ENTROPY_LOSS_BACK(type=f32,ne=[10,5,4,3]): not supported [BLAS] 879s CROSS_ENTROPY_LOSS_BACK(type=f32,ne=[30000,1,1,1]): not supported [BLAS] 879s OPT_STEP_ADAMW(type=f32,ne=[10,5,4,3]): not supported [BLAS] 879s 5635/5635 tests passed 879s Backend BLAS: OK 879s 879s Backend 2/2: CPU 879s Skipping CPU backend 879s 2/2 backends passed 879s OK 879s main: compute buffer size: 0.19 MB 879s load_model: ggml tensor size = 336 bytes 879s load_model: backend buffer size = 0.09 MB 879s 879s Performing test: 879s ggml_conv_1d_transpose (4): PASSED 879s ggml_conv_1d_transpose (5): PASSED 879s ggml_conv_1d_transpose (10): PASSED 879s ggml_conv_1d_transpose (14): PASSED 879s ggml_conv_1d_transpose (12): PASSED 879s ggml_conv_1d_transpose (18): PASSED 879s ggml_conv_1d_transpose (24): PASSED 879s ggml_conv_1d_transpose (50688): PASSED 879s main: compute buffer size: 0.00 MB 879s load_model: ggml tensor size = 336 bytes 879s load_model: backend buffer size = 0.00 MB 879s 879s Performing test: 879s ggml_im2col (240): PASSED 879s ggml_conv1d (80): PASSED 879s main: compute buffer size: 0.02 MB 879s load_model: ggml tensor size = 336 bytes 879s load_model: backend buffer size = 0.00 MB 879s 879s Performing test: 879s ggml_im2col (4320): PASSED 879s ggml_conv2d (480): PASSED 879s ggml_conv_2d_dw(channels=3, kernel=1x1, stride=1, pad=0, dilation=1, layout=WHCN): PASSED 879s ggml_conv_2d_dw(channels=3, kernel=1x1, stride=1, pad=0, dilation=1, layout=CWHN): PASSED 879s ggml_conv_2d_dw(channels=42, kernel=3x3, stride=2, pad=1, dilation=1, layout=WHCN): PASSED 879s ggml_conv_2d_dw(channels=42, kernel=3x3, stride=2, pad=1, dilation=1, layout=CWHN): PASSED 879s ggml_conv_2d_dw(channels=8, kernel=5x5, stride=1, pad=2, dilation=2, layout=WHCN): PASSED 879s ggml_conv_2d_dw(channels=8, kernel=5x5, stride=1, pad=2, dilation=2, layout=CWHN): PASSED 879s Testing dup on i16 -> i16 copy 879s Testing dup on i32 -> i32 copy 879s Testing dup on f16 -> f16 copy 879s Testing dup on f16 -> f32 copy 879s Testing dup on f32 -> f16 copy 879s Testing dup on f32 -> f32 copy 879s 879s Performing gemm_f16_out_f32 test: 879s 1224.0f,1023.0f,1158.0f,1259.0f,1359.0f,1194.0f,1535.0f,1247.0f,1185.0f,1029.0f,889.0f,1182.0f,955.0f,1179.0f,1147.0f,1048.0f, 879s 1216.0f,1087.0f,1239.0f,1361.0f,1392.0f,1260.0f,1247.0f,1563.0f,1167.0f,1052.0f,942.0f,1214.0f,1045.0f,1134.0f,1264.0f,1126.0f, 879s 1125.0f,966.0f,1079.0f,1333.0f,1287.0f,1101.0f,1185.0f,1167.0f,1368.0f,990.0f,967.0f,1121.0f,971.0f,1086.0f,1130.0f,980.0f, 879s 999.0f,902.0f,1020.0f,1056.0f,1076.0f,929.0f,1029.0f,1052.0f,990.0f,1108.0f,823.0f,989.0f,759.0f,1041.0f,1003.0f,870.0f, 879s gemm_mult (64): PASSED 879s load_model: ggml tensor size = 336 bytes 879s load_model: backend buffer size = 3584 bytes 879s Matrix A: [36, 4] 879s Matrix B: [36, 16] 879s 879s Performing ggml_mul_mat test: 879s 1224.0 1023.0 1158.0 1259.0 1359.0 1194.0 1535.0 1247.0 1185.0 1029.0 889.0 1182.0 955.0 1179.0 1147.0 1048.0 879s 1216.0 1087.0 1239.0 1361.0 1392.0 1260.0 1247.0 1563.0 1167.0 1052.0 942.0 1214.0 1045.0 1134.0 1264.0 1126.0 879s 1125.0 966.0 1079.0 1333.0 1287.0 1101.0 1185.0 1167.0 1368.0 990.0 967.0 1121.0 971.0 1086.0 1130.0 980.0 879s 999.0 902.0 1020.0 1056.0 1076.0 929.0 1029.0 1052.0 990.0 1108.0 823.0 989.0 759.0 1041.0 1003.0 870.0 879s ggml_mul_mat (64): PASSED 879s main: compute buffer size: 0.00 MB 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s test-mul-mat0: iter:0/500 879s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 879s x0: [4, 1, 1, 1] 879s 0.435 -0.717 0.214 -0.967 879s 879s x1: [4, 3, 1, 1] 879s 0.823 -0.605 -0.330 0.536 879s -0.444 0.108 -0.045 0.258 879s -0.270 0.027 0.904 0.832 879s 879s y: [1, 3, 1, 1] 879s 0.202 879s -0.530 879s -0.748 879s testing: mul_mat, [4, 1, 2, 1] = [4, 4, 2, 1] * [4, 1, 2, 1] 879s x0: [4, 4, 2, 1] 879s 0.026 0.678 0.225 -0.408 879s 0.275 0.049 -0.013 0.946 879s -0.415 0.543 0.053 0.540 879s -0.200 0.783 -0.433 -0.295 879s 879s x1: [4, 1, 2, 1] 879s -0.514 -0.726 0.608 -0.687 879s 879s y: [4, 1, 2, 1] 879s -0.088 -0.834 -0.519 -0.527 879s testing: mul_mat, [2, 4, 2, 4] = [4, 2, 2, 4] * [4, 4, 2, 4] 879s x0: [4, 2, 2, 4] 879s 0.755 0.496 0.258 -0.929 879s 0.496 0.666 0.851 0.747 879s 879s x1: [4, 4, 2, 4] 879s 0.804 0.702 -0.467 0.080 879s -0.250 0.520 0.025 0.335 879s 0.063 -0.921 -0.125 0.864 879s 0.862 0.442 -0.431 0.477 879s 879s y: [2, 4, 2, 4] 879s 0.761 0.529 879s -0.236 0.495 879s -1.244 -0.044 879s 0.315 0.711 879s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 879s x0: [4, 1, 1, 1] 879s 0.065 -0.825 -0.479 0.755 879s 879s x1: [4, 2, 1, 1] 879s 0.254 0.461 -0.343 0.481 879s -0.596 0.842 0.370 0.306 879s 879s y: [1, 2, 1, 1] 879s 0.163 879s -0.679 879s testing: mul_mat, [1, 4, 2, 1] = [1, 1, 2, 1] * [1, 4, 2, 1] 879s x0: [1, 1, 2, 1] 879s -0.423 879s 879s x1: [1, 4, 2, 1] 879s 0.372 879s -0.813 879s -0.777 879s -0.277 879s 879s y: [1, 4, 2, 1] 879s -0.158 879s 0.344 879s 0.329 879s 0.117 879s testing: mul_mat, [2, 1, 2, 4] = [1, 2, 2, 4] * [1, 1, 2, 4] 879s x0: [1, 2, 2, 4] 879s 0.530 879s 0.398 879s 879s x1: [1, 1, 2, 4] 879s -0.620 879s 879s y: [2, 1, 2, 4] 879s -0.329 -0.247 879s test-mul-mat0: iter:1/500 879s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 879s x0: [1, 1, 1, 1] 879s -0.418 879s 879s x1: [1, 4, 1, 1] 879s 0.729 879s -0.015 879s -0.073 879s 0.698 879s 879s y: [1, 4, 1, 1] 879s -0.305 879s 0.006 879s 0.030 879s -0.292 879s testing: mul_mat, [1, 1, 2, 1] = [1, 1, 2, 1] * [1, 1, 2, 1] 879s x0: [1, 1, 2, 1] 879s -0.722 879s 879s x1: [1, 1, 2, 1] 879s -0.639 879s 879s y: [1, 1, 2, 1] 879s 0.461 879s testing: mul_mat, [2, 1, 2, 1] = [1, 2, 2, 1] * [1, 1, 2, 1] 879s x0: [1, 2, 2, 1] 879s -0.644 879s -0.556 879s 879s x1: [1, 1, 2, 1] 879s -0.015 879s 879s y: [2, 1, 2, 1] 879s 0.010 0.008 879s testing: mul_mat, [2, 2, 1, 1] = [1, 2, 1, 1] * [1, 2, 1, 1] 879s x0: [1, 2, 1, 1] 879s 0.864 879s 0.817 879s 879s x1: [1, 2, 1, 1] 879s -0.724 879s -0.279 879s 879s y: [2, 2, 1, 1] 879s -0.625 -0.591 879s -0.241 -0.228 879s testing: mul_mat, [1, 1, 2, 1] = [2, 1, 2, 1] * [2, 1, 2, 1] 879s x0: [2, 1, 2, 1] 879s -0.211 0.318 879s 879s x1: [2, 1, 2, 1] 879s 0.244 0.674 879s 879s y: [1, 1, 2, 1] 879s 0.162 879s testing: mul_mat, [1, 2, 2, 1] = [1, 1, 2, 1] * [1, 2, 2, 1] 879s x0: [1, 1, 2, 1] 879s -0.423 879s 879s x1: [1, 2, 2, 1] 879s -0.698 879s -0.855 879s 879s y: [1, 2, 2, 1] 879s 0.295 879s 0.362 879s test-mul-mat0: iter:2/500 879s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 879s x0: [1, 2, 1, 1] 879s 0.524 879s -0.025 879s 879s x1: [1, 1, 1, 1] 879s -0.469 879s 879s y: [2, 1, 1, 1] 879s -0.246 0.012 879s testing: mul_mat, [3, 2, 1, 1] = [1, 3, 1, 1] * [1, 2, 1, 1] 879s x0: [1, 3, 1, 1] 879s 0.035 879s -0.584 879s 0.115 879s 879s x1: [1, 2, 1, 1] 879s -0.685 879s 0.766 879s 879s y: [3, 2, 1, 1] 879s -0.024 0.401 -0.079 879s 0.027 -0.448 0.088 879s testing: mul_mat, [2, 3, 1, 1] = [1, 2, 1, 1] * [1, 3, 1, 1] 879s x0: [1, 2, 1, 1] 879s -0.348 879s 0.459 879s 879s x1: [1, 3, 1, 1] 879s -0.148 879s 0.660 879s -0.211 879s 879s y: [2, 3, 1, 1] 879s 0.051 -0.068 879s -0.230 0.303 879s 0.073 -0.097 879s testing: mul_mat, [4, 2, 1, 1] = [1, 4, 1, 1] * [1, 2, 1, 1] 879s x0: [1, 4, 1, 1] 879s 0.795 879s -0.728 879s -0.178 879s -0.989 879s 879s x1: [1, 2, 1, 1] 879s 0.277 879s 0.970 879s 879s y: [4, 2, 1, 1] 879s 0.220 -0.202 -0.049 -0.274 879s 0.771 -0.706 -0.173 -0.959 879s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 879s x0: [4, 1, 1, 1] 879s 0.442 -0.902 -0.102 0.973 879s 879s x1: [4, 1, 1, 1] 879s 0.567 0.549 -0.413 -0.771 879s 879s y: [1, 1, 1, 1] 879s -0.952 879s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 879s x0: [1, 2, 1, 1] 879s -0.801 879s -0.234 879s 879s x1: [1, 4, 1, 1] 879s 0.416 879s -0.578 879s -0.052 879s 0.730 879s 879s y: [2, 4, 1, 1] 879s -0.333 -0.097 879s 0.463 0.135 879s 0.042 0.012 879s -0.585 -0.171 879s test-mul-mat0: iter:3/500 879s testing: mul_mat, [1, 4, 1, 1] = [3, 1, 1, 1] * [3, 4, 1, 1] 879s x0: [3, 1, 1, 1] 879s 0.777 -0.047 -0.660 879s 879s x1: [3, 4, 1, 1] 879s -0.897 -0.893 -0.085 879s 0.562 0.384 -0.115 879s -0.762 0.179 0.157 879s 0.060 0.190 -0.276 879s 879s y: [1, 4, 1, 1] 879s -0.600 879s 0.495 879s -0.704 879s 0.220 879s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 879s x0: [3, 1, 1, 1] 879s -0.533 0.660 -0.860 879s 879s x1: [3, 1, 1, 1] 879s 0.219 0.051 0.238 879s 879s y: [1, 1, 1, 1] 879s -0.287 879s testing: mul_ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s mat, [4, 1, 1, 3] = [3, 4, 1, 3] * [3, 1, 1, 3] 879s x0: [3, 4, 1, 3] 879s 0.630 0.318 -0.927 879s -0.485 0.557 0.252 879s 0.672 -0.384 -0.558 879s -0.604 0.225 -0.781 879s 879s x1: [3, 1, 1, 3] 879s -0.802 0.847 -0.661 879s 879s y: [4, 1, 1, 3] 879s 0.377 0.694 -0.496 1.191 879s testing: mul_mat, [1, 4, 1, 1] = [3, 1, 1, 1] * [3, 4, 1, 1] 879s x0: [3, 1, 1, 1] 879s -0.694 -0.934 -0.873 879s 879s x1: [3, 4, 1, 1] 879s -0.376 0.292 -0.988 879s 0.066 0.688 0.237 879s 0.285 0.037 -0.199 879s -0.276 0.438 0.604 879s 879s y: [1, 4, 1, 1] 879s 0.851 879s -0.895 879s -0.059 879s -0.745 879s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 879s x0: [1, 3, 1, 1] 879s 0.136 879s -0.998 879s -0.989 879s 879s x1: [1, 3, 1, 1] 879s 0.371 879s -0.625 879s 0.238 879s 879s y: [3, 3, 1, 1] 879s 0.050 -0.371 -0.367 879s -0.085 0.623 0.618 879s 0.032 -0.237 -0.235 879s testing: mul_mat, [1, 1, 1, 3] = [3, 1, 1, 3] * [3, 1, 1, 3] 879s x0: [3, 1, 1, 3] 879s 0.770 -0.627 -0.686 879s 879s x1: [3, 1, 1, 3] 879s -0.390 -0.477 0.311 879s 879s y: [1, 1, 1, 3] 879s -0.214 879s test-mul-mat0: iter:4/500 879s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 879s x0: [2, 3, 1, 1] 879s -0.011 0.105 879s 0.878 0.599 879s 0.628 0.189 879s 879s x1: [2, 2, 1, 1] 879s 0.865 -0.534 879s 0.853 0.103 879s 879s y: [3, 2, 1, 1] 879s -0.066 0.440 0.443 879s 0.001 0.811 0.555 879s testing: mul_mat, [3, 3, 2, 1] = [2, 3, 2, 1] * [2, 3, 2, 1] 879s x0: [2, 3, 2, 1] 879s -0.666 0.591 879s 0.730 0.746 879s 0.329 -0.175 879s 879s x1: [2, 3, 2, 1] 879s 0.314 0.991 879s 0.872 -0.351 879s 0.749 0.178 879s 879s y: [3, 3, 2, 1] 879s 0.376 0.969 -0.070 879s -0.788 0.375 0.348 879s -0.393 0.680 0.215 879s testing: mul_mat, [4, 3, 2, 3] = [2, 4, 2, 3] * [2, 3, 2, 3] 879s x0: [2, 4, 2, 3] 879s 0.905 0.898 879s 0.798 0.534 879s -0.333 0.073 879s -0.562 -0.045 879s 879s x1: [2, 3, 2, 3] 879s -0.934 0.402 879s 0.036 0.665 879s 0.030 -0.775 879s 879s y: [4, 3, 2, 3] 879s -0.485 -0.531 0.340 0.507 879s 0.630 0.384 0.037 -0.050 879s -0.668 -0.390 -0.067 0.018 879s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 879s x0: [2, 1, 1, 1] 879s 0.931 -0.689 879s 879s x1: [2, 4, 1, 1] 879s 0.203 0.541 879s 0.774 0.867 879s -0.654 -0.104 879s -0.025 0.590 879s 879s y: [1, 4, 1, 1] 879s -0.184 879s 0.124 879s -0.537 879s -0.430 879s testing: mul_mat, [3, 2, 2, 1] = [1, 3, 2, 1] * [1, 2, 2, 1] 879s x0: [1, 3, 2, 1] 879s -0.048 879s -0.454 879s 0.893 879s 879s x1: [1, 2, 2, 1] 879s -0.414 879s 0.764 879s 879s y: [3, 2, 2, 1] 879s 0.020 0.188 -0.370 879s -0.037 -0.347 0.683 879s testing: mul_mat, [2, 1, 2, 3] = [3, 2, 2, 3] * [3, 1, 2, 3] 879s x0: [3, 2, 2, 3] 879s -0.859 -0.007 -0.414 879s -0.213 -0.127 -0.512 879s 879s x1: [3, 1, 2, 3] 879s 0.437 0.849 -0.632 879s 879s y: [2, 1, 2, 3] 879s -0.120 0.123 879s test-mul-mat0: iter:5/500 879s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 879s x0: [4, 1, 1, 1] 879s 0.174 -0.662 0.169 -0.047 879s 879s x1: [4, 2, 1, 1] 879s -0.728 0.178 -0.884 0.779 879s 0.891 -0.888 0.850 -0.062 879s 879s y: [1, 2, 1, 1] 879s -0.431 879s 0.890 879s testing: mul_mat, [4, 1, 4, 1] = [4, 4, 4, 1] * [4, 1, 4, 1] 879s x0: [4, 4, 4, 1] 879s 0.290 -0.113 -0.462 0.406 879s -0.334 -0.571 0.518 -0.484 879s 0.367 -0.968 0.690 0.705 879s 0.202 -0.357 0.336 0.054 879s 879s x1: [4, 1, 4, 1] 879s 0.631 0.852 0.053 0.165 879s 879s y: [4, 1, 4, 1] 879s 0.129 -0.750 -0.440 -0.150 879s testing: mul_mat, [4, 4, 4, 2] = [4, 4, 4, 2] * [4, 4, 4, 2] 879s x0: [4, 4, 4, 2] 879s -0.642 -0.280 -0.114 -0.243 879s 0.295 -0.799 -0.349 0.739 879s 0.215 -0.792 0.612 0.499 879s -0.202 -0.266 -0.212 -0.456 879s 879s x1: [4, 4, 4, 2] 879s -0.521 0.619 -0.810 0.493 879s -0.446 -0.653 0.875 0.522 879s -0.807 0.962 0.691 -0.317 879s 0.385 -0.087 -0.131 0.308 879s 879s y: [4, 4, 4, 2] 879s 0.134 -0.002 -0.851 -0.113 879s 0.243 0.471 1.217 -0.159 879s 0.246 -1.481 -0.671 -0.095 879s -0.283 0.456 0.225 -0.167 879s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 879s x0: [4, 2, 1, 1] 879s 0.843 -0.821 0.937 -0.991 879s 0.857 0.642 0.018 -0.624 879s 879s x1: [4, 4, 1, 1] 879s 0.204 0.671 0.127 -0.596 879s 0.606 0.345 -0.857 0.925 879s -0.050 -0.231 -0.284 0.862 879s 0.834 -0.794 0.802 0.751 879s 879s y: [2, 4, 1, 1] 879s 0.329 0.979 879s -1.492 0.150 879s -0.971 -0.733 879s 1.361 -0.249 879s testing: mul_mat, [1, 4, 4, 1] = [2, 1, 4, 1] * [2, 4, 4, 1] 879s x0: [2, 1, 4, 1] 879s -0.704 0.851 879s 879s x1: [2, 4, 4, 1] 879s -0.426 0.255 879s -0.476 0.497 879s -0.927 0.444 879s -0.299 0.744 879s 879s y: [1, 4, 4, 1] 879s 0.516 879s 0.758 879s 1.030 879s 0.843 879s testing: mul_mat, [3, 2, 4, 2] = [1, 3, 4, 2] * [1, 2, 4, 2] 879s x0: [1, 3, 4, 2] 879s 0.231 879s 0.035 879s -0.242 879s 879s x1: [1, 2, 4, 2] 879s -0.554 879s -0.774 879s 879s y: [3, 2, 4, 2] 879s -0.128 -0.019 0.134 879s -0.179 -0.027 0.188 879s test-mul-mat0ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s : iter:6/500 879s testing: mul_mat, [1, 2, 1, 1] = [2, 1, 1, 1] * [2, 2, 1, 1] 879s x0: [2, 1, 1, 1] 879s 0.047 -0.506 879s 879s x1: [2, 2, 1, 1] 879s -0.741 -0.974 879s -0.710 0.490 879s 879s y: [1, 2, 1, 1] 879s 0.458 879s -0.282 879s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 879s x0: [2, 2, 1, 1] 879s 0.688 -0.529 879s -0.166 0.479 879s 879s x1: [2, 1, 1, 1] 879s -0.551 0.083 879s 879s y: [2, 1, 1, 1] 879s -0.423 0.131 879s testing: mul_mat, [4, 2, 1, 2] = [2, 4, 1, 2] * [2, 2, 1, 2] 879s x0: [2, 4, 1, 2] 879s -0.103 -0.083 879s -0.395 0.173 879s 0.752 0.030 879s 0.944 0.308 879s 879s x1: [2, 2, 1, 2] 879s -0.046 -0.815 879s -0.073 0.882 879s 879s y: [4, 2, 1, 2] 879s 0.073 -0.123 -0.059 -0.294 879s -0.066 0.182 -0.029 0.202 879s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 879s x0: [2, 2, 1, 1] 879s 0.035 -0.145 879s 0.848 0.568 879s 879s x1: [2, 4, 1, 1] 879s 0.728 0.067 879s 0.169 -0.801 879s 0.902 -0.352 879s 0.153 -0.913 879s 879s y: [2, 4, 1, 1] 879s 0.016 0.656 879s 0.122 -0.312 879s 0.083 0.565 879s 0.138 -0.389 879s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 879s x0: [2, 4, 1, 1] 879s 0.638 -0.746 879s -0.054 -0.666 879s 0.045 0.552 879s 0.582 0.851 879s 879s x1: [2, 2, 1, 1] 879s -0.722 0.411 879s -0.535 0.194 879s 879s y: [4, 2, 1, 1] 879s -0.767 -0.234 0.194 -0.071 879s -0.486 -0.100 0.083 -0.146 879s testing: mul_mat, [4, 2, 1, 2] = [4, 4, 1, 2] * [4, 2, 1, 2] 879s x0: [4, 4, 1, 2] 879s -0.820 -0.237 -0.220 0.417 879s 0.569 0.735 0.865 0.814 879s 0.719 -0.716 0.963 -0.772 879s -0.355 0.709 0.115 -1.000 879s 879s x1: [4, 2, 1, 2] 879s 0.114 -0.417 -0.504 -0.613 879s -0.937 -0.776 0.455 0.232 879s 879s y: [4, 2, 1, 2] 879s -0.139 -1.177 0.368 0.219 879s 0.948 -0.521 0.141 -0.398 879s test-mul-mat0: iter:7/500 879s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 879s x0: [2, 2, 1, 1] 879s 0.538 0.979 879s -0.487 0.538 879s 879s x1: [2, 3, 1, 1] 879s -0.610 -0.745 879s -0.798 0.170 879s -0.879 -0.835 879s 879s y: [2, 3, 1, 1] 879s -1.057 -0.104 879s -0.263 0.480 879s -1.290 -0.021 879s testing: mul_mat, [4, 2, 2, 1] = [2, 4, 2, 1] * [2, 2, 2, 1] 879s x0: [2, 4, 2, 1] 879s -0.685 0.494 879s -0.301 0.461 879s 0.655 0.635 879s -0.212 0.385 879s 879s x1: [2, 2, 2, 1] 879s -0.711 0.129 879s 0.602 -0.177 879s 879s y: [4, 2, 2, 1] 879s 0.551 0.273 -0.384 0.200 879s -0.499 -0.263 0.282 -0.196 879s testing: mul_mat, [2, 4, 2, 4] = [2, 2, 2, 4] * [2, 4, 2, 4] 879s x0: [2, 2, 2, 4] 879s -0.939 0.612 879s -0.225 0.137 879s 879s x1: [2, 4, 2, 4] 879s -0.755 -0.865 879s -0.326 -0.776 879s -0.352 -0.787 879s -0.487 0.777 879s 879s y: [2, 4, 2, 4] 879s 0.179 0.052 879s -0.169 -0.033 879s -0.152 -0.029 879s 0.933 0.216 879s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 879s x0: [2, 2, 1, 1] 879s 0.270 0.090 879s 0.194 -0.650 879s 879s x1: [2, 2, 1, 1] 879s 0.498 -0.052 879s 0.721 0.609 879s 879s y: [2, 2, 1, 1] 879s 0.130 0.130 879s 0.250 -0.256 879s testing: mul_mat, [4, 2, 2, 1] = [2, 4, 2, 1] * [2, 2, 2, 1] 879s x0: [2, 4, 2, 1] 879s 0.226 0.064 879s -0.114 0.987 879s 0.138 -0.882 879s 0.092 -0.940 879s 879s x1: [2, 2, 2, 1] 879s 0.853 0.949 879s -0.609 -0.320 879s 879s y: [4, 2, 2, 1] 879s 0.254 0.839 -0.719 -0.813 879s -0.158 -0.246 0.198 0.244 879s testing: mul_mat, [2, 2, 2, 4] = [4, 2, 2, 4] * [4, 2, 2, 4] 879s x0: [4, 2, 2, 4] 879s 0.218 0.277 -0.239 0.011 879s 0.639 -0.276 0.482 0.000 879s 879s x1: [4, 2, 2, 4] 879s 0.054 0.003 -0.782 -0.752 879s -0.907 -0.432 -0.899 -0.958 879s 879s y: [2, 2, 2, 4] 879s 0.192 -0.343 879s -0.113 -0.894 879s test-mul-mat0: iter:8/500 879s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 879s x0: [4, 4, 1, 1] 879s -0.236 0.357 0.853 0.689 879s 0.244 -0.226 0.368 -0.607 879s -0.700 0.613 0.360 -0.539 879s 0.642 0.786 -0.464 -0.819 879s 879s x1: [4, 3, 1, 1] 879s -0.942 0.863 0.800 -0.918 879s 0.023 0.488 -0.465 -0.347 879s 0.065 -0.130 0.936 0.159 879s 879s y: [4, 3, 1, 1] 879s 0.580 0.427 1.971 0.454 879s -0.467 -0.065 0.302 0.898 879s 0.846 0.293 0.126 -0.624 879s testing: mul_mat, [1, 4, 3, 1] = [4, 1, 3, 1] * [4, 4, 3, 1] 879s x0: [4, 1, 3, 1] 879s -0.407 0.816 -0.495 -0.683 879s 879s x1: [4, 4, 3, 1] 879s -0.570 -0.994 -0.761 -0.708 879s 0.807 -0.680 0.315 0.295 879s -0.145 0.968 -0.640 0.726 879s 0.904 0.519 0.908 -0.332 879s 879s y: [1, 4, 3, 1] 879s 0.281 879s -1.241 879s 0.670 879s -0.167 879s testing: mul_mat, [2, 1, 3, 1] = [4, 2, 3, 1] * [4, 1, 3, 1] 879s x0: [4, 2, 3, 1] 879s 0.261 0.015 0.994 0.497 879s -0.289 0.964 -0.910 -0.473 879s 879s x1: [4, 1, 3, 1] 879s 0.793 0.951 -0.692 0.192 879s 879s y: [2, 1, 3, 1] 879s -0.371 1.226 879s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 879s x0: [4, 4, 1, 1] 879s 0.855 0.772 0.375 -0.457 879s 0.682 -0.482 -0.903 -0.065 879s 0.991 -0.712 -0.760 0.356 879s 0.819 -0.821 -0.565 0.457 879s 879s x1: [4, 2, 1, 1] 879s -0.409 0.232 -0.930 -0.924 879s -0.076 -0.735 0.861 -0.815 879s 879s y: [4, 2, 1, 1] 879s -0.098 0.508 -0.194 -0.423 879s 0.063 -0.422 -0.497 -0.319 879s testing: mul_mat, [3, 4, 3, 1] = [4, 3, 3, 1] * [4, 4, 3, 1] 879s x0: [4, 3, 3, 1] 879s -0.008 -0.415 -0.724 -0.490 879s 0.517 -0.077 -0.429 0.153 879s 0.254 -0.390 -0.600 0.546 879s 879s x1: [4, 4, 3, 1] 879s 0.423 -0.772 -0.321 0.103 879s 0.432 0.855 0.693 -0.336 879s 0.925 0.769 0.588 -0.810 879s 0.630 0.773 0.471 0.486 879s 879s y: [3, 4, 3, 1] 879s 0.498 0.431 0.657 879s -0.696 -0.191 -0.823 879s -0.356 0.042 -0.860 879s -0.905 0.139 -0.159 879s testing: mul_mat, [2, 4, 3, 1] = [3, 2, 3, 1] * [3, 4, 3, 1] 879s x0: [3, 2, 3, 1] 879s 0.883 -0.871 -0.332 879s -0.993 -0.009 0.099 879s 879s x1: [3, 4, 3, 1] 879s -0.758 0.377 -0.608 879s -0.358 0.887 0.545 879s -0.812 -0.911 -0.296 879s 0.108 -0.339 -0.030 879s 879s y: [2, 4, 3, 1] 879s -0.796 0.690 879s -1.271 0.402 879s 0.175 0.786 879s 0.401 -0.107 879s test-mul-mat0: iter:9/500 879s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 879s x0: [3, 4, 1, 1] 879s 0.515 -0.231 0.660 879s 0.522 -0.102 -0.349 879s -0.810 0.996 -0.522 879s -0.683 0.132 -0.661 879s 879s x1: [3, 2, 1, 1] 879s 0.029 0.566 0.464 879s -0.917 0.597 0.344 879s 879s y: [4, 2, 1, 1] 879s 0.191 -0.205 0.298 -0.252 879s -0.383 -0.660 1.158 0.478 879s testing: mul_mat, [2, 4, 2, 1] = [3, 2, 2, 1] * [3, 4, 2, 1] 879s x0: [3, 2, 2, 1] 879s -0.073 -0.203 -0.981 879s 0.243 0.928 -0.642 879s 879s x1: [3, 4, 2, 1] 879s -0.148 0.942 -0.505 879s 0.086 -0.735 -0.962 879s -0.876 0.624 -0.652 879s -0.659 -0.463 -0.675 879s 879s y: [2, 4, 2, 1] 879s 0.314 1.162 879s 1.087 -0.044 879s 0.576 0.784 879s 0.805 -0.158 879s testing: mul_mat, [4, 2, 2, 1] = [3, 4, 2, 1] * [3, 2, 2, 1] 879s x0: [3, 4, 2, 1] 879s 0.314 -0.609 -0.412 879s -0.949 0.191 -0.871 879s -0.023 0.988 -0.852 879s -0.779 0.916 -0.493 879s 879s x1: [3, 2, 2, 1] 879s 0.305 0.760 0.240 879s 0.646 -0.704 0.565 879s 879s y: [4, 2, 2, 1] 879s -0.466 -0.354 0.539 0.340 879s 0.398 -1.240 -1.191 -1.427 879s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 879s x0: [3, 2, 1, 1] 879s -0.182 -0.505 -0.615 879s -0.531 0.039 0.001 879s 879s x1: [3, 4, 1, 1] 879s 0.464 -0.622 -0.588 879s 0.932 -0.524 -0.100 879s 0.630 0.790 0.291 879s -0.782 0.841 -0.518 879s 879s y: [2, 4, 1, 1] 879s 0.592 -0.271 879s 0.157 -0.515 879s -0.692 -0.303 879s 0.036 0.447 879s testing: mul_mat, [3, 3, 2, 1] = [2, 3, 2, 1] * [2, 3, 2, 1] 879s x0: [2, 3, 2, 1] 879s -0.868 -0.392 879s 0.575 -0.524 879s 0.499 0.129 879s 879s x1: [2, 3, 2, 1] 879s 0.355 -0.828 879s -0.638 -0.147 879s -0.906 -0.386 879s 879s y: [3, 3, 2, 1] 879s 0.016 0.639 0.071 879s 0.612 -0.290 -0.338 879s 0.938 -0.319 -0.502 879s testing: mul_mat, [2, 2, 2, 1] = [3, 2, 2, 1] * [3, 2, 2, 1] 879s x0: [3, 2, 2, 1] 879s 0.146 -0.936 0.434 879s -0.729 -0.100 0.632 879s 879s x1: [3, 2, 2, 1] 879s 0.058 -0.282 -0.812 879s -0.903 0.103 0.190 879s 879s y: [2, 2, 2, 1] 879s -0.080 -0.527 879s -0.146 0.768 879s test-mul-mat0: iter:10/500 879s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 879s x0: [1, 2, 1, 1] 879s 0.335 879s -0.781 879s 879s x1: [1, 1, 1, 1] 879s 0.501 879s 879s y: [2, 1, 1, 1] 879s 0.168 -0.391 879s testing: mul_mat, [1, 2, 2, 1] = [1, 1, 2, 1] * [1, 2, 2, 1] 879s x0: [1, 1, 2, 1] 879s -0.403 879s 879s x1: [1, 2, 2, 1] 879s 0.488 879s 0.432 879s 879s y: [1, 2, 2, 1] 879s -0.197 879s -0.174 879s testing: mul_mat, [2, 1, 2, 1] = [1, 2, 2, 1] * [1, 1, 2, 1] 879s x0: [1, 2, 2, 1] 879s 0.150 879s 0.113 879s 879s x1: [1, 1, 2, 1] 879s 0.190 879s 879s y: [2, 1, 2, 1] 879s 0.029 0.021 879s testing: mul_mat, [4, 2, 1, 1] = [1, 4, 1, 1] * [1, 2, 1, 1] 879s x0: [1, 4, 1, 1] 879s 0.044 879s 0.110 879s 0.978 879s -0.317 879s 879s x1: [1, 2, 1, 1] 879s -0.988 879s -0.900 879s 879s y: [4, 2, 1, 1] 879s -0.043 -0.108 -0.966 0.313 879s -0.039 -0.099 -0.880 0.285 879s testing: mul_mat, [1, 1, 2, 1] = [4, 1, 2, 1] * [4, 1, 2, 1] 879s x0: [4, 1, 2, 1] 879s 0.567 -0.366 0.766 -0.111 879s 879s x1: [4, 1, 2, 1] 879s -0.405 -0.762 -0.279 0.848 879s 879s y: [1, 1, 2, 1] 879s -0.258 879s testing: mul_mat, [4, 4, 2, 1] = [1, 4, 2, 1] * [1, 4, 2, 1] 879s x0: [1, 4, 2, 1] 879s 0.052 879s 0.622 879s 0.810 879s 0.030 879s 879s x1: [1, 4, 2, 1] 879s 0.840 879s 0.453 879s 0.935 879s -0.009 879s 879s y: [4, 4, 2, 1] 879s 0.044 0.522 0.681 0.025 879s 0.024 0.282 0.367 0.014 879s 0.049 0.581 0.758 0.028 879s -0.000 -0.006 -0.008 -0.000 879s test-mul-mat0: iter:11/500 879s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 879s x0: [1, 2, 1, 1] 879s -0.943 879s -0.795 879s 879s x1: [1, 4, 1, 1] 879s -0.741 879s -0.642 879s 0.168 879s -0.107 879s 879s y: [2, 4, 1, 1] 879s 0.698 0.589 879s 0.605 0.510 879s -0.159 -0.134 879s 0.101 0.085 879s testing: mul_mat, [3, 2, 3, 1] = [1, 3, 3, ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s 1] * [1, 2, 3, 1] 879s x0: [1, 3, 3, 1] 879s 0.562 879s -0.719 879s 0.811 879s 879s x1: [1, 2, 3, 1] 879s -0.225 879s 0.543 879s 879s y: [3, 2, 3, 1] 879s -0.127 0.162 -0.183 879s 0.305 -0.390 0.440 879s testing: mul_mat, [4, 3, 3, 2] = [1, 4, 3, 2] * [1, 3, 3, 2] 879s x0: [1, 4, 3, 2] 879s -0.183 879s 0.060 879s -0.406 879s -0.621 879s 879s x1: [1, 3, 3, 2] 879s -0.934 879s 0.198 879s 0.162 879s 879s y: [4, 3, 3, 2] 879s 0.171 -0.056 0.379 0.580 879s -0.036 0.012 -0.080 -0.123 879s -0.030 0.010 -0.066 -0.100 879s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 879s x0: [1, 1, 1, 1] 879s 0.574 879s 879s x1: [1, 4, 1, 1] 879s 0.630 879s 0.313 879s -0.805 879s -0.996 879s 879s y: [1, 4, 1, 1] 879s 0.362 879s 0.180 879s -0.462 879s -0.572 879s testing: mul_mat, [2, 1, 3, 1] = [1, 2, 3, 1] * [1, 1, 3, 1] 879s x0: [1, 2, 3, 1] 879s -0.671 879s 0.975 879s 879s x1: [1, 1, 3, 1] 879s -0.007 879s 879s y: [2, 1, 3, 1] 879s 0.005 -0.007 879s testing: mul_mat, [3, 1, 3, 2] = [2, 3, 3, 2] * [2, 1, 3, 2] 879s x0: [2, 3, 3, 2] 879s 0.340 -0.347 879s 0.035 0.230 879s 0.130 0.135 879s 879s x1: [2, 1, 3, 2] 879s 0.128 0.413 879s 879s y: [3, 1, 3, 2] 879s -0.100 0.099 0.072 879s test-mul-mat0: iter:12/500 879s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 879s x0: [4, 2, 1, 1] 879s 0.152 0.027 -0.478 0.091 879s -0.574 -0.496 -0.936 -0.044 879s 879s x1: [4, 4, 1, 1] 879s -0.841 -0.556 0.506 -0.775 879s 0.857 0.212 0.139 0.568 879s -0.824 0.181 -0.308 0.561 879s -0.302 0.541 0.686 0.128 879s 879s y: [2, 4, 1, 1] 879s -0.455 0.319 879s 0.121 -0.752 879s 0.078 0.646 879s -0.348 -0.743 879s testing: mul_mat, [2, 2, 1, 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 879s x0: [4, 2, 1, 1] 879s 0.944 0.575 0.129 0.083 879s 0.144 0.306 -0.736 0.835 879s 879s x1: [4, 2, 1, 1] 879s 0.139 -0.733 -0.380 0.561 879s 0.274 -0.588 0.719 0.718 879s 879s y: [2, 2, 1, 1] 879s -0.293 0.544 879s 0.072 -0.070 879s testing: mul_mat, [1, 2, 1, 4] = [4, 1, 1, 4] * [4, 2, 1, 4] 879s x0: [4, 1, 1, 4] 879s -0.950 0.302 0.044 0.306 879s 879s x1: [4, 2, 1, 4] 879s -0.134 -0.038 0.377 -0.447 879s -0.910 0.256 0.705 0.117 879s 879s y: [1, 2, 1, 4] 879s -0.005 879s 1.009 879s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 879s x0: [4, 1, 1, 1] 879s 0.119 -0.999 -0.125 -0.152 879s 879s x1: [4, 1, 1, 1] 879s -0.812 -0.149 -0.527 -0.693 879s 879s y: [1, 1, 1, 1] 879s 0.224 879s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 879s x0: [1, 1, 1, 1] 879s 0.795 879s 879s x1: [1, 4, 1, 1] 879s -0.162 879s -0.242 879s -0.160 879s 0.511 879s 879s y: [1, 4, 1, 1] 879s -0.129 879s -0.192 879s -0.127 879s 0.407 879s testing: mul_mat, [4, 1, 1, 4] = [1, 4, 1, 4] * [1, 1, 1, 4] 879s x0: [1, 4, 1, 4] 879s 0.000 879s 0.952 879s 0.616 879s -0.296 879s 879s x1: [1, 1, 1, 4] 879s 0.561 879s 879s y: [4, 1, 1, 4] 879s 0.000 0.534 0.346 -0.166 879s test-mul-mat0: iter:13/500 879s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 879s x0: [3, 3, 1, 1] 879s 0.138 -0.208 0.291 879s -0.245 0.496 -0.390 879s 0.032 -0.598 -0.283 879s 879s x1: [3, 3, 1, 1] 879s 0.400 0.009 0.158 879s -0.089 -0.484 -0.047 879s -0.528 0.324 0.792 879s 879s y: [3, 3, 1, 1] 879s 0.099 -0.155 -0.037 879s 0.075 -0.200 0.300 879s 0.090 -0.019 -0.435 879s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 879s x0: [3, 3, 1, 1] 879s 0.332 -0.266 0.873 879s -0.660 0.891 -0.216 879s -0.144 -0.156 0.256 879s 879s x1: [3, 3, 1, 1] 879s 0.098 -0.372 -0.404 879s 0.352 0.705 -0.053 879s 0.912 0.081 -0.584 879s 879s y: [3, 3, 1, 1] 879s -0.221 -0.308 -0.059 879s -0.118 0.408 -0.174 879s -0.228 -0.403 -0.293 879s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 879s x0: [3, 4, 1, 1] 879s 0.326 0.214 -0.797 879s 0.954 0.810 0.555 879s 0.659 -0.243 0.467 879s -0.260 0.173 0.940 879s 879s x1: [3, 3, 1, 1] 879s -0.820 -0.364 -0.405 879s 0.318 0.428 0.887 879s -0.927 -0.076 -0.503 879s 879s y: [4, 3, 1, 1] 879s -0.023 -1.302 -0.641 -0.230 879s -0.511 1.142 0.519 0.824 879s 0.083 -1.225 -0.827 -0.245 879s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 879s x0: [3, 4, 1, 1] 879s -0.294 -0.170 -0.852 879s -0.666 0.944 -0.090 879s -0.161 -0.044 0.854 879s 0.630 -0.955 0.702 879s 879s x1: [3, 4, 1, 1] 879s -0.928 0.907 0.812 879s -0.588 0.798 -0.404 879s 0.268 -0.358 0.852 879s 0.448 0.278 -0.553 879s 879s y: [4, 4, 1, 1] 879s -0.573 1.402 0.803 -0.880 879s 0.382 1.182 -0.286 -1.416 879s -0.744 -0.594 0.700 1.109 879s 0.293 0.014 -0.557 -0.372 879s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 879s x0: [4, 2, 1, 1] 879s -0.870 -0.492 0.816 -0.886 879s 0.369 0.408 0.274 -0.851 879s 879s x1: [4, 3, 1, 1] 879s -0.431 -0.389 0.169 0.309 879s 0.784 0.109 0.380 0.691 879s -0.079 0.792 0.488 0.517 879s 879s y: [2, 3, 1, 1] 879s 0.431 -0.534 879s -1.038 -0.150 879s -0.381 -0.012 879s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s [2, 4, 1, 1] 879s x0: [2, 4, 1, 1] 879s 0.907 0.486 879s -0.822 0.274 879s 0.489 -0.815 879s 0.076 -0.134 879s 879s x1: [2, 4, 1, 1] 879s -0.887 0.744 879s -0.021 -0.943 879s -0.301 0.024 879s -0.795 0.609 879s 879s y: [4, 4, 1, 1] 879s -0.443 0.933 -1.040 -0.167 879s -0.477 -0.241 0.758 0.124 879s -0.261 0.254 -0.167 -0.026 879s -0.425 0.820 -0.885 -0.142 879s test-mul-mat0: iter:14/500 879s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 879s x0: [1, 1, 1, 1] 879s -0.417 879s 879s x1: [1, 1, 1, 1] 879s -0.377 879s 879s y: [1, 1, 1, 1] 879s 0.157 879s testing: mul_mat, [1, 1, 4, 1] = [1, 1, 4, 1] * [1, 1, 4, 1] 879s x0: [1, 1, 4, 1] 879s -0.896 879s 879s x1: [1, 1, 4, 1] 879s 0.992 879s 879s y: [1, 1, 4, 1] 879s -0.888 879s testing: mul_mat, [1, 1, 4, 4] = [1, 1, 4, 4] * [1, 1, 4, 4] 879s x0: [1, 1, 4, 4] 879s 0.346 879s 879s x1: [1, 1, 4, 4] 879s -0.064 879s 879s y: [1, 1, 4, 4] 879s -0.022 879s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 879s x0: [1, 4, 1, 1] 879s 0.951 879s -0.753 879s -0.738 879s 0.734 879s 879s x1: [1, 1, 1, 1] 879s -0.909 879s 879s y: [4, 1, 1, 1] 879s -0.865 0.684 0.670 -0.667 879s testing: mul_mat, [4, 1, 4, 1] = [4, 4, 4, 1] * [4, 1, 4, 1] 879s x0: [4, 4, 4, 1] 879s 0.902 0.986 -0.705 0.584 879s 0.581 -0.796 0.321 0.746 879s -0.340 0.896 -0.988 0.124 879s -0.569 -0.939 -0.458 -0.827 879s 879s x1: [4, 1, 4, 1] 879s -0.418 -0.410 -0.407 -0.598 879s 879s y: [4, 1, 4, 1] 879s -0.844 -0.493 0.102 1.304 879s testing: mul_mat, [4, 4, 4, 4] = [4, 4, 4, 4] * [4, 4, 4, 4] 879s x0: [4, 4, 4, 4] 879s -0.478 0.013 -0.869 0.974 879s -0.615 0.129 -0.643 0.983 879s -0.833 0.941 -0.785 0.662 879s -0.938 0.475 0.312 -0.355 879s 879s x1: [4, 4, 4, 4] 879s -0.266 -0.658 -0.041 -0.169 879s -0.771 0.925 0.223 0.437 879s -0.097 -0.844 -0.772 0.626 879s -0.786 0.193 -0.312 0.797 879s 879s y: [4, 4, 4, 4] 879s -0.011 -0.061 -0.478 -0.017 879s 0.612 0.879 1.627 1.078 879s 1.316 1.063 0.307 -0.772 879s 1.425 1.492 1.608 0.449 879s test-mul-mat0: iter:15/500 879s testing: mul_mat, [4, 4, 1, 1] = [1, 4, 1, 1] * [1, 4, 1, 1] 879s x0: [1, 4, 1, 1] 879s 0.574 879s 0.840 879s 0.803 879s 0.902 879s 879s x1: [1, 4, 1, 1] 879s 0.623 879s 0.952 879s 0.923 879s 0.902 879s 879s y: [4, 4, 1, 1] 879s 0.358 0.524 0.501 0.562 879s 0.546 0.799 0.764 0.858 879s 0.530 0.775 0.741 0.832 879s 0.518 0.757 0.724 0.813 879s testing: mul_mat, [1, 4, 2, 1] = [1, 1, 2, 1] * [1, 4, 2, 1] 879s x0: [1, 1, 2, 1] 879s -0.895 879s 879s x1: [1, 4, 2, 1] 879s 0.097 879s -0.027 879s 0.511 879s -0.836 879s 879s y: [1, 4, 2, 1] 879s -0.087 879s 0.024 879s -0.457 879s 0.749 879s testing: mul_mat, [2, 1, 2, 3] = [1, 2, 2, 3] * [1, 1, 2, 3] 879s x0: [1, 2, 2, 3] 879s -0.061 879s -0.019 879s 879s x1: [1, 1, 2, 3] 879s 0.624 879s 879s y: [2, 1, 2, 3] 879s -0.038 -0.012 879s testing: mul_mat, [3, 2, 1, 1] = [1, 3, 1, 1] * [1, 2, 1, 1] 879s x0: [1, 3, 1, 1] 879s 0.292 879s 0.607 879s -0.197 879s 879s x1: [1, 2, 1, 1] 879s 0.781 879s 0.444 879s 879s y: [3, 2, 1, 1] 879s 0.228 0.475 -0.154 879s 0.130 0.269 -0.087 879s testing: mul_mat, [4, 1, 2, 1] = [3, 4, 2, 1] * [3, 1, 2, 1] 879s x0: [3, 4, 2, 1] 879s 0.186 0.683 0.017 879s -0.472 0.533 0.813 879s -0.720 0.430 0.269 879s -0.509 -0.336 -0.547 879s 879s x1: [3, 1, 2, 1] 879s -0.021 -0.061 0.296 879s 879s y: [4, 1, 2, 1] 879s -0.040 0.218 0.069 -0.131 879s testing: mul_mat, [2, 3, 2, 3] = [4, 2, 2, 3] * [4, 3, 2, 3] 879s x0: [4, 2, 2, 3] 879s -0.190 -0.626 -0.456 -0.768 879s 0.574 0.385 -0.937 -0.483 879s 879s x1: [4, 3, 2, 3] 879s -0.949 -0.888 0.317 -0.927 879s 0.156 -0.873 0.719 -0.658 879s -0.345 0.999 -0.167 -0.662 879s 879s y: [2, 3, 2, 3] 879s 1.304 -0.736 879s 0.695 -0.602 879s 0.025 0.663 879s test-mul-mat0: iter:16/500 879s testing: mul_mat, [2, 2, 1, 1] = [1, 2, 1, 1] * [1, 2, 1, 1] 879s x0: [1, 2, 1, 1] 879s -0.366 879s 0.518 879s 879s x1: [1, 2, 1, 1] 879s -0.666 879s 0.232 879s 879s y: [2, 2, 1, 1] 879s 0.243 -0.345 879s -0.085 0.120 879s testing: mul_mat, [3, 2, 3, 1] = [1, 3, 3, 1] * [1, 2, 3, 1] 879s x0: [1, 3, 3, 1] 879s 0.666 879s 0.093 879s 0.475 879s 879s x1: [1, 2, 3, 1] 879s 0.291 879s -0.654 879s 879s y: [3, 2, 3, 1] 879s 0.194 0.027 0.138 879s -0.436 -0.061 -0.311 879s testing: mul_mat, [3, 3, 3, 4] = [1, 3, 3, 4] * [1, 3, 3, 4] 879s x0: [1, 3, 3, 4] 879s 0.405 879s -0.384 879s -0.712 879s 879s x1: [1, 3, 3, 4] 879s -0.680 879s 0.379 879s -0.392 879s 879s y: [3, 3, 3, 4] 879s -0.275 0.261 0.484 879s 0.154 -0.146 -0.270 879s -0.159 0.151 0.279 879s testing: mul_mat, [4, 3, 1, 1] = [1, 4, 1, 1] * [1, 3, 1, 1] 879s x0: [1, 4, 1, 1] 879s 0.393 879s -0.758 879s -0.179 879s 0.518 879s 879s x1: [1, 3, 1, 1] 879s -0.728 879s 0.255 879s 0.353 879s 879s y: [4, 3, 1, 1] 879s -0.287 0.552 0.130 -0.377 879s 0.100 -0.194 -0.046 0.132 879s 0.139 -0.268 -0.063 0.183 879s testing: mul_mat, [1, 1, 3, 1] = [4, 1, 3, 1] * [4, 1, 3, 1] 879s x0: [4, 1, 3, 1] 879s -0.173 -0.932 0.600 0.040 879s 879s x1: [4, 1, 3, 1] 879s 0.816 -0.3ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s 78 -0.178 0.743 879s 879s y: [1, 1, 3, 1] 879s 0.134 879s testing: mul_mat, [4, 4, 3, 4] = [1, 4, 3, 4] * [1, 4, 3, 4] 879s x0: [1, 4, 3, 4] 879s -0.021 879s -0.828 879s 0.722 879s -0.925 879s 879s x1: [1, 4, 3, 4] 879s 0.553 879s -0.904 879s 0.178 879s 0.794 879s 879s y: [4, 4, 3, 4] 879s -0.012 -0.458 0.399 -0.511 879s 0.019 0.749 -0.652 0.836 879s -0.004 -0.147 0.128 -0.164 879s -0.017 -0.658 0.573 -0.735 879s test-mul-mat0: iter:17/500 879s testing: mul_mat, [1, 2, 1, 1] = [1, 1, 1, 1] * [1, 2, 1, 1] 879s x0: [1, 1, 1, 1] 879s -0.004 879s 879s x1: [1, 2, 1, 1] 879s -0.143 879s 0.623 879s 879s y: [1, 2, 1, 1] 879s 0.001 879s -0.003 879s testing: mul_mat, [3, 1, 1, 1] = [1, 3, 1, 1] * [1, 1, 1, 1] 879s x0: [1, 3, 1, 1] 879s 0.075 879s -0.077 879s 0.254 879s 879s x1: [1, 1, 1, 1] 879s 0.532 879s 879s y: [3, 1, 1, 1] 879s 0.040 -0.041 0.135 879s testing: mul_mat, [1, 3, 1, 4] = [1, 1, 1, 4] * [1, 3, 1, 4] 879s x0: [1, 1, 1, 4] 879s 0.189 879s 879s x1: [1, 3, 1, 4] 879s -0.048 879s 0.293 879s -0.636 879s 879s y: [1, 3, 1, 4] 879s -0.009 879s 0.055 879s -0.120 879s testing: mul_mat, [3, 1, 1, 1] = [1, 3, 1, 1] * [1, 1, 1, 1] 879s x0: [1, 3, 1, 1] 879s -0.897 879s 0.536 879s 0.068 879s 879s x1: [1, 1, 1, 1] 879s -0.888 879s 879s y: [3, 1, 1, 1] 879s 0.797 -0.476 -0.061 879s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 879s x0: [3, 1, 1, 1] 879s -0.309 -0.404 0.032 879s 879s x1: [3, 1, 1, 1] 879s -0.039 0.159 -0.720 879s 879s y: [1, 1, 1, 1] 879s -0.075 879s testing: mul_mat, [2, 3, 1, 4] = [1, 2, 1, 4] * [1, 3, 1, 4] 879s x0: [1, 2, 1, 4] 879s 0.903 879s 0.410 879s 879s x1: [1, 3, 1, 4] 879s 0.614 879s 0.851 879s 0.984 879s 879s y: [2, 3, 1, 4] 879s 0.555 0.251 879s 0.769 0.348 879s 0.889 0.403 879s test-mul-mat0: iter:18/500 879s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 879s x0: [2, 4, 1, 1] 879s -0.471 0.499 879s -0.190 -0.857 879s 0.350 -0.206 879s 0.050 0.565 879s 879s x1: [2, 1, 1, 1] 879s 0.838 -0.098 879s 879s y: [4, 1, 1, 1] 879s -0.443 -0.075 0.313 -0.013 879s testing: mul_mat, [4, 4, 3, 1] = [2, 4, 3, 1] * [2, 4, 3, 1] 879s x0: [2, 4, 3, 1] 879s -0.885 0.793 879s -0.832 -0.091 879s -0.157 0.733 879s -0.763 -0.464 879s 879s x1: [2, 4, 3, 1] 879s 0.328 0.693 879s -0.196 -0.454 879s -0.430 -0.475 879s 0.917 -0.267 879s 879s y: [4, 4, 3, 1] 879s 0.259 -0.337 0.456 -0.572 879s -0.187 0.204 -0.302 0.360 879s 0.004 0.401 -0.281 0.549 879s -1.023 -0.739 -0.340 -0.575 879s testing: mul_mat, [4, 4, 3, 4] = [2, 4, 3, 4] * [2, 4, 3, 4] 879s x0: [2, 4, 3, 4] 879s 0.570 0.862 879s -0.545 0.675 879s -0.919 -0.964 879s 0.703 -0.567 879s 879s x1: [2, 4, 3, 4] 879s -0.523 0.997 879s 0.954 0.796 879s 0.302 0.604 879s -0.386 0.417 879s 879s y: [4, 4, 3, 4] 879s 0.561 0.958 -0.481 -0.933 879s 1.230 0.017 -1.645 0.220 879s 0.693 0.243 -0.860 -0.131 879s 0.139 0.492 -0.047 -0.508 879s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 879s x0: [2, 4, 1, 1] 879s 0.032 0.120 879s 0.142 -0.502 879s -0.080 -0.993 879s -0.282 0.433 879s 879s x1: [2, 4, 1, 1] 879s -0.147 -0.647 879s -0.666 -0.222 879s -0.214 0.791 879s 0.982 -0.533 879s 879s y: [4, 4, 1, 1] 879s -0.082 0.304 0.655 -0.239 879s -0.048 0.017 0.274 0.092 879s 0.088 -0.427 -0.768 0.403 879s -0.033 0.407 0.450 -0.508 879s testing: mul_mat, [2, 2, 3, 1] = [4, 2, 3, 1] * [4, 2, 3, 1] 879s x0: [4, 2, 3, 1] 879s -0.932 0.539 -0.086 -0.388 879s -0.093 -0.434 -0.028 0.243 879s 879s x1: [4, 2, 3, 1] 879s -0.955 -0.670 -0.602 0.494 879s -0.733 0.067 -0.514 0.107 879s 879s y: [2, 2, 3, 1] 879s 0.388 0.517 879s 0.722 0.080 879s testing: mul_mat, [4, 4, 3, 4] = [2, 4, 3, 4] * [2, 4, 3, 4] 879s x0: [2, 4, 3, 4] 879s -0.468 -0.049 879s -0.193 0.995 879s 0.338 0.117 879s 0.320 0.332 879s 879s x1: [2, 4, 3, 4] 879s 0.295 -0.212 879s -0.734 -0.428 879s 0.936 -0.544 879s -0.010 -0.995 879s 879s y: [4, 4, 3, 4] 879s -0.128 -0.268 0.075 0.024 879s 0.364 -0.284 -0.298 -0.377 879s -0.411 -0.722 0.253 0.119 879s 0.053 -0.989 -0.119 -0.333 879s test-mul-mat0: iter:19/500 879s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 879s x0: [3, 2, 1, 1] 879s 0.122 -0.698 -0.011 879s -0.949 0.958 0.025 879s 879s x1: [3, 3, 1, 1] 879s -0.022 0.587 0.161 879s -0.016 -0.000 0.189 879s 0.294 -0.191 0.480 879s 879s y: [2, 3, 1, 1] 879s -0.414 0.586 879s -0.004 0.019 879s 0.164 -0.450 879s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 879s x0: [3, 4, 1, 1] 879s -0.263 -0.148 0.480 879s -0.741 0.941 -0.639 879s 0.918 0.928 0.340 879s 0.504 0.088 -0.676 879s 879s x1: [3, 2, 1, 1] 879s 0.294 0.556 0.789 879s 0.579 -0.284 0.486 879s 879s y: [4, 2, 1, 1] 879s 0.219 -0.198 1.054 -0.336 879s 0.123 -1.007 0.433 -0.062 879s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 879s x0: [3, 4, 1, 1] 879s 0.574 0.084 0.359 879s -0.711 -0.430 0.614 879s 0.026 0.422 0.094 879s 0.285 0.364 0.456 879s 879s x1: [3, 4, 1, 1] 879s -0.496 -0.722 0.618 879s 0.313 0.7ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 879s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 879s 57 0.281 879s -0.565 -0.940 -0.730 879s -0.514 -0.983 0.295 879s 879s y: [4, 4, 1, 1] 879s -0.123 1.043 -0.259 -0.122 879s 0.344 -0.375 0.354 0.492 879s -0.665 0.357 -0.480 -0.836 879s -0.272 0.969 -0.400 -0.369 879s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 879s x0: [3, 3, 1, 1] 879s -0.526 -0.509 -0.935 879s -0.712 0.583 -0.333 879s 0.444 0.224 -0.417 879s 879s x1: [3, 4, 1, 1] 879s 0.202 0.291 -0.205 879s -0.294 -0.621 0.119 879s 0.210 -0.343 -0.263 879s -0.477 -0.586 -0.982 879s 879s y: [3, 4, 1, 1] 879s -0.063 0.094 0.240 879s 0.359 -0.193 -0.319 879s 0.309 -0.262 0.126 879s 1.467 0.325 0.066 879s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 879s x0: [3, 2, 1, 1] 879s 0.315 -0.427 -0.338 879s 0.543 0.694 0.783 879s 879s x1: [3, 3, 1, 1] 879s -0.646 0.238 -0.803 879s 0.380 -0.340 0.292 879s -0.336 -0.977 -0.253 879s 879s y: [2, 3, 1, 1] 879s -0.034 -0.814 879s 0.166 0.198 879s 0.397 -1.058 879s testing: mul_mat, [4, 3, 1, 1] = [2, 4, 1, 1] * [2, 3, 1, 1] 879s x0: [2, 4, 1, 1] 879s 0.706 0.932 879s -0.292 0.795 879s 0.731 0.956 879s 0.289 -0.484 879s 879s x1: [2, 3, 1, 1] 879s -0.649 0.399 879s -0.694 -0.234 879s 0.418 0.264 879s 879s y: [4, 3, 1, 1] 879s -0.086 0.507 -0.092 -0.381 879s -0.708 0.016 -0.731 -0.087 879s 0.541 0.089 0.558 -0.007 879s test-mul-mat0: iter:20/500 879s testing: mul_mat, [4, 3, 1, 1] = [1, 4, 1, 1] * [1, 3, 1, 1] 879s x0: [1, 4, 1, 1] 879s -0.248 879s -0.941 879s 0.191 879s -0.705 879s 879s x1: [1, 3, 1, 1] 879s 0.853 879s -0.995 879s -0.807 879s 879s y: [4, 3, 1, 1] 879s -0.212 -0.803 0.163 -0.602 879s 0.247 0.936 -0.190 0.702 879s 0.200 0.759 -0.154 0.569 879s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 879s x0: [1, 2, 1, 1] 879s -0.644 879s -0.278 879s 879s x1: [1, 4, 1, 1] 879s -0.368 879s -0.115 879s -0.043 879s -0.585 879s 879s y: [2, 4, 1, 1] 879s 0.237 0.102 879s 0.074 0.032 879s 0.028 0.012 879s 0.376 0.163 879s testing: mul_mat, [2, 2, 1, 3] = [1, 2, 1, 3] * [1, 2, 1, 3] 879s x0: [1, 2, 1, 3] 879s -0.232 879s -0.373 879s 879s x1: [1, 2, 1, 3] 879s -0.998 879s 0.774 879s 879s y: [2, 2, 1, 3] 879s 0.231 0.372 879s -0.179 -0.289 879s testing: mul_mat, [2, 2, 1, 1] = [1, 2, 1, 1] * [1, 2, 1, 1] 879s x0: [1, 2, 1, 1] 879s 0.771 879s 0.860 879s 879s x1: [1, 2, 1, 1] 879s 0.918 879s 0.856 879s 879s y: [2, 2, 1, 1] 879s 0.708 0.790 879s 0.660 0.736 879s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 879s x0: [2, 4, 1, 1] 879s -0.302 0.725 879s -0.160 0.863 879s -0.093 0.746 879s 0.330 -0.039 879s 879s x1: [2, 1, 1, 1] 879s -0.361 0.648 879s 879s y: [4, 1, 1, 1] 879s 0.579 0.618 0.517 -0.145 879s testing: mul_mat, [4, 2, 1, 3] = [4, 4, 1, 3] * [4, 2, 1, 3] 879s x0: [4, 4, 1, 3] 879s -0.847 0.720 -0.208 0.701 879s 0.974 0.829 -0.771 0.790 879s -0.132 -0.765 -0.359 -0.826 879s -0.984 -0.813 -0.003 0.942 879s 879s x1: [4, 2, 1, 3] 879s -0.781 -0.532 -0.037 0.993 879s -0.546 -0.795 0.473 0.149 879s 879s y: [4, 2, 1, 3] 879s 0.982 -0.389 -0.297 2.137 879s -0.104 -1.439 0.388 1.322 879s test-mul-mat0: iter:21/500 879s testing: mul_mat, [3, 4, 1, 1] = [2, 3, 1, 1] * [2, 4, 1, 1] 879s x0: [2, 3, 1, 1] 879s 0.203 -0.189 879s 0.891 0.963 879s -0.883 -0.541 879s 879s x1: [2, 4, 1, 1] 879s -0.887 -0.508 879s 0.935 -0.440 879s -0.616 -0.833 879s 0.310 0.521 879s 879s y: [3, 4, 1, 1] 879s -0.084 -1.279 1.058 879s 0.273 0.409 -0.588 879s 0.033 -1.352 0.995 879s -0.036 0.778 -0.555 879s testing: mul_mat, [2, 3, 4, 1] = [2, 2, 4, 1] * [2, 3, 4, 1] 879s x0: [2, 2, 4, 1] 879s 0.819 -0.006 879s -0.783 0.783 879s 879s x1: [2, 3, 4, 1] 879s -0.447 0.658 879s 0.215 0.934 879s 0.428 -0.388 879s 879s y: [2, 3, 4, 1] 879s -0.371 0.865 879s 0.170 0.563 879s 0.353 -0.639 879s testing: mul_mat, [4, 2, 4, 4] = [2, 4, 4, 4] * [2, 2, 4, 4] 879s x0: [2, 4, 4, 4] 879s -0.077 0.364 879s 0.857 0.085 879s -0.751 0.231 879s 0.344 -0.290 879s 879s x1: [2, 2, 4, 4] 879s 0.408 -0.148 879s -0.019 -0.958 879s 879s y: [4, 2, 4, 4] 879s -0.085 0.337 -0.341 0.183 879s -0.347 -0.098 -0.206 0.271 879s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 879s x0: [2, 4, 1, 1] 879s -0.813 0.748 879s 0.386 0.038 879s 0.452 -0.327 879s -0.950 0.986 879s 879s x1: [2, 4, 1, 1] 879s -0.238 -0.631 879s -0.254 -0.834 879s 0.709 -0.256 879s -0.111 0.988 879s 879s y: [4, 4, 1, 1] 879s -0.278 -0.116 0.098 -0.395 879s -0.418 -0.129 0.158 -0.581 879s -0.768 0.264 0.404 -0.926 879s 0.829 -0.005 -0.373 1.079 879s testing: mul_mat, [1, 2, 4, 1] = [4, 1, 4, 1] * [4, 2, 4, 1] 879s x0: [4, 1, 4, 1] 879s 0.348 -0.949 -0.048 -0.628 879s 879s x1: [4, 2, 4, 1] 879s -0.080 -0.249 0.656 -0.543 879s 0.694 0.024 0.224 0.179 879s 879s y: [1, 2, 4, 1] 879s 0.517 879s 0.096 879s testing: mul_mat, [3, 4, 4, 4] = [1, 3, 4, 4] * [1, 4, 4, 4] 879s x0: [1, 3, 4, 4] 879s 0.977 879s -0.652 879s -0.000 879s 879s x1: [1, 4, 4, 4] 879s -0.984 879s -0.035 879s 0.451 879s -0.412 879s 879s y: [3, 4, 4, 4] 880s -0.ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s 962 0.641 0.000 880s -0.034 0.023 0.000 880s 0.441 -0.294 -0.000 880s -0.403 0.268 0.000 880s test-mul-mat0: iter:22/500 880s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 2, 1, 1] 880s -0.019 -0.533 0.959 0.615 880s -0.650 -0.993 -0.403 0.233 880s 880s x1: [4, 1, 1, 1] 880s 0.789 -0.229 -0.211 -0.203 880s 880s y: [2, 1, 1, 1] 880s -0.220 -0.248 880s testing: mul_mat, [3, 2, 3, 1] = [4, 3, 3, 1] * [4, 2, 3, 1] 880s x0: [4, 3, 3, 1] 880s -0.802 -0.976 -0.098 -0.795 880s -0.378 -0.865 0.806 0.241 880s 0.104 -0.630 0.070 -0.911 880s 880s x1: [4, 2, 3, 1] 880s 0.602 -0.381 -0.031 -0.436 880s 0.829 -0.014 -0.339 0.430 880s 880s y: [3, 2, 3, 1] 880s 0.239 -0.028 0.697 880s -0.960 -0.471 -0.320 880s testing: mul_mat, [1, 3, 3, 4] = [4, 1, 3, 4] * [4, 3, 3, 4] 880s x0: [4, 1, 3, 4] 880s -0.791 0.304 -0.679 -0.247 880s 880s x1: [4, 3, 3, 4] 880s -0.394 -0.986 0.033 0.710 880s -0.616 -0.898 0.799 0.415 880s -0.398 0.431 0.398 -0.171 880s 880s y: [1, 3, 3, 4] 880s -0.186 880s -0.430 880s 0.218 880s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 2, 1, 1] 880s 0.624 0.659 0.558 -0.280 880s 0.138 -0.216 0.089 0.294 880s 880s x1: [4, 1, 1, 1] 880s -0.979 0.924 -0.711 0.276 880s 880s y: [2, 1, 1, 1] 880s -0.476 -0.317 880s testing: mul_mat, [1, 4, 3, 1] = [2, 1, 3, 1] * [2, 4, 3, 1] 880s x0: [2, 1, 3, 1] 880s 0.959 0.901 880s 880s x1: [2, 4, 3, 1] 880s 0.954 -0.136 880s -0.668 0.139 880s 0.761 0.018 880s 0.717 -0.535 880s 880s y: [1, 4, 3, 1] 880s 0.792 880s -0.515 880s 0.746 880s 0.206 880s testing: mul_mat, [4, 2, 3, 4] = [1, 4, 3, 4] * [1, 2, 3, 4] 880s x0: [1, 4, 3, 4] 880s 0.835 880s -0.764 880s 0.253 880s -0.117 880s 880s x1: [1, 2, 3, 4] 880s 0.002 880s 0.485 880s 880s y: [4, 2, 3, 4] 880s 0.001 -0.001 0.000 -0.000 880s 0.405 -0.371 0.123 -0.057 880s test-mul-mat0: iter:23/500 880s testing: mul_mat, [2, 3, 1, 1] = [1, 2, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 2, 1, 1] 880s -0.234 880s 0.299 880s 880s x1: [1, 3, 1, 1] 880s -0.912 880s -0.581 880s -0.047 880s 880s y: [2, 3, 1, 1] 880s 0.213 -0.273 880s 0.136 -0.174 880s 0.011 -0.014 880s testing: mul_mat, [4, 2, 3, 1] = [1, 4, 3, 1] * [1, 2, 3, 1] 880s x0: [1, 4, 3, 1] 880s 0.990 880s -0.327 880s 0.616 880s -0.184 880s 880s x1: [1, 2, 3, 1] 880s -0.267 880s -0.010 880s 880s y: [4, 2, 3, 1] 880s -0.264 0.087 -0.164 0.049 880s -0.010 0.003 -0.006 0.002 880s testing: mul_mat, [3, 4, 3, 1] = [1, 3, 3, 1] * [1, 4, 3, 1] 880s x0: [1, 3, 3, 1] 880s 0.828 880s -0.529 880s -0.212 880s 880s x1: [1, 4, 3, 1] 880s 0.681 880s -0.994 880s -0.587 880s -0.129 880s 880s y: [3, 4, 3, 1] 880s 0.564 -0.360 -0.144 880s -0.824 0.526 0.211 880s -0.486 0.310 0.124 880s -0.107 0.068 0.027 880s testing: mul_mat, [1, 3, 1, 1] = [1, 1, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 1, 1, 1] 880s 0.321 880s 880s x1: [1, 3, 1, 1] 880s 0.682 880s -0.170 880s 0.052 880s 880s y: [1, 3, 1, 1] 880s 0.219 880s -0.054 880s 0.017 880s testing: mul_mat, [4, 1, 3, 1] = [1, 4, 3, 1] * [1, 1, 3, 1] 880s x0: [1, 4, 3, 1] 880s 0.431 880s -0.180 880s 0.903 880s -0.699 880s 880s x1: [1, 1, 3, 1] 880s 0.499 880s 880s y: [4, 1, 3, 1] 880s 0.215 -0.090 0.451 -0.349 880s testing: mul_mat, [3, 1, 3, 1] = [4, 3, 3, 1] * [4, 1, 3, 1] 880s x0: [4, 3, 3, 1] 880s 0.986 0.736 0.166 0.467 880s 0.855 -0.331 0.489 0.961 880s -0.388 -0.898 -0.995 -0.855 880s 880s x1: [4, 1, 3, 1] 880s 0.165 0.211 -0.708 0.953 880s 880s y: [3, 1, 3, 1] 880s 0.645 0.641 -0.363 880s test-mul-mat0: iter:24/500 880s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 2, 1, 1] 880s 0.404 -0.566 0.859 -0.299 880s -0.828 -0.041 -0.710 0.863 880s 880s x1: [4, 1, 1, 1] 880s 0.135 0.464 0.928 0.862 880s 880s y: [2, 1, 1, 1] 880s 0.331 -0.046 880s testing: mul_mat, [3, 2, 3, 1] = [4, 3, 3, 1] * [4, 2, 3, 1] 880s x0: [4, 3, 3, 1] 880s -0.859 -0.966 0.605 0.100 880s -0.676 0.468 0.246 0.866 880s 0.220 0.535 -0.638 -0.077 880s 880s x1: [4, 2, 3, 1] 880s -0.854 0.541 0.752 -0.711 880s -0.504 0.703 0.068 0.986 880s 880s y: [3, 2, 3, 1] 880s 0.595 0.400 -0.323 880s -0.107 1.539 0.145 880s testing: mul_mat, [3, 3, 3, 3] = [4, 3, 3, 3] * [4, 3, 3, 3] 880s x0: [4, 3, 3, 3] 880s -0.440 -0.372 0.374 -0.476 880s -0.564 -0.652 -0.125 -0.939 880s 0.894 0.852 0.873 0.627 880s 880s x1: [4, 3, 3, 3] 880s 0.418 -0.048 0.836 -0.362 880s -0.513 -0.802 0.560 0.090 880s -0.455 0.249 0.527 -0.776 880s 880s y: [3, 3, 3, 3] 880s 0.319 0.031 0.835 880s 0.691 0.658 -0.597 880s 0.674 0.757 -0.220 880s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 3, 1, 1] 880s -0.835 -0.869 0.079 -0.775 880s 0.154 0.779 0.926 0.315 880s -0.862 0.357 -0.211 -0.047 880s 880s x1: [4, 3, 1, 1] 880s -0.026 0.825 -0.533 -0.107 880s 0.383 -0.790 -0.711 -0.670 880s -0.722 -0.806 -0.162 -0.605 880s 880s y: [3, 3, 1, 1] 880s -0.654 0.111 0.434 880s 0.830 -1.425 -0.431 880s 1.759 -1.080 0.397 880s testingggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s : mul_mat, [3, 4, 3, 1] = [3, 3, 3, 1] * [3, 4, 3, 1] 880s x0: [3, 3, 3, 1] 880s 0.811 -0.280 0.918 880s -0.649 0.695 0.370 880s 0.981 0.848 0.994 880s 880s x1: [3, 4, 3, 1] 880s -0.360 0.538 0.994 880s 0.042 -0.863 0.191 880s -0.984 0.962 0.658 880s -0.091 0.345 0.868 880s 880s y: [3, 4, 3, 1] 880s 0.470 0.976 1.092 880s 0.451 -0.557 -0.501 880s -0.463 1.551 0.505 880s 0.626 0.620 1.065 880s testing: mul_mat, [1, 3, 3, 3] = [3, 1, 3, 3] * [3, 3, 3, 3] 880s x0: [3, 1, 3, 3] 880s 0.987 0.471 0.533 880s 880s x1: [3, 3, 3, 3] 880s 0.097 0.917 -0.544 880s 0.419 0.728 -0.194 880s 0.400 -0.552 -0.498 880s 880s y: [1, 3, 3, 3] 880s 0.238 880s 0.654 880s -0.131 880s test-mul-mat0: iter:25/500 880s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 1, 1, 1] 880s -0.546 0.533 880s 880s x1: [2, 3, 1, 1] 880s 0.619 -0.924 880s -0.011 -0.384 880s 0.819 -0.509 880s 880s y: [1, 3, 1, 1] 880s -0.831 880s -0.199 880s -0.718 880s testing: mul_mat, [1, 1, 4, 1] = [2, 1, 4, 1] * [2, 1, 4, 1] 880s x0: [2, 1, 4, 1] 880s -0.570 0.968 880s 880s x1: [2, 1, 4, 1] 880s -0.015 -0.330 880s 880s y: [1, 1, 4, 1] 880s -0.310 880s testing: mul_mat, [3, 1, 4, 2] = [2, 3, 4, 2] * [2, 1, 4, 2] 880s x0: [2, 3, 4, 2] 880s -0.215 0.696 880s 0.567 -0.777 880s -0.077 0.230 880s 880s x1: [2, 1, 4, 2] 880s -0.540 0.981 880s 880s y: [3, 1, 4, 2] 880s 0.799 -1.068 0.268 880s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.644 -0.700 880s -0.538 -0.802 880s 880s x1: [2, 3, 1, 1] 880s 0.140 -0.646 880s 0.199 -0.586 880s 0.540 -0.504 880s 880s y: [2, 3, 1, 1] 880s 0.542 0.443 880s 0.538 0.362 880s 0.701 0.114 880s testing: mul_mat, [3, 2, 4, 1] = [2, 3, 4, 1] * [2, 2, 4, 1] 880s x0: [2, 3, 4, 1] 880s -0.790 0.350 880s 0.199 0.553 880s -0.780 0.419 880s 880s x1: [2, 2, 4, 1] 880s -0.916 -0.964 880s 0.988 -0.509 880s 880s y: [3, 2, 4, 1] 880s 0.386 -0.715 0.310 880s -0.958 -0.085 -0.984 880s testing: mul_mat, [3, 2, 4, 2] = [3, 3, 4, 2] * [3, 2, 4, 2] 880s x0: [3, 3, 4, 2] 880s -0.757 0.676 -0.939 880s 0.070 -0.062 0.023 880s 0.120 -0.027 0.094 880s 880s x1: [3, 2, 4, 2] 880s 0.106 0.464 0.810 880s -0.535 -0.333 0.414 880s 880s y: [3, 2, 4, 2] 880s -0.526 -0.002 0.076 880s -0.209 -0.007 -0.016 880s test-mul-mat0: iter:26/500 880s testing: mul_mat, [2, 2, 1, 1] = [1, 2, 1, 1] * [1, 2, 1, 1] 880s x0: [1, 2, 1, 1] 880s -0.898 880s 0.042 880s 880s x1: [1, 2, 1, 1] 880s -0.796 880s 0.779 880s 880s y: [2, 2, 1, 1] 880s 0.715 -0.033 880s -0.699 0.033 880s testing: mul_mat, [3, 2, 2, 1] = [1, 3, 2, 1] * [1, 2, 2, 1] 880s x0: [1, 3, 2, 1] 880s 0.887 880s -0.312 880s 0.885 880s 880s x1: [1, 2, 2, 1] 880s -0.339 880s -0.020 880s 880s y: [3, 2, 2, 1] 880s -0.300 0.106 -0.300 880s -0.018 0.006 -0.018 880s testing: mul_mat, [3, 3, 2, 3] = [1, 3, 2, 3] * [1, 3, 2, 3] 880s x0: [1, 3, 2, 3] 880s 0.644 880s -0.225 880s 0.912 880s 880s x1: [1, 3, 2, 3] 880s 0.762 880s -0.301 880s -0.960 880s 880s y: [3, 3, 2, 3] 880s 0.491 -0.171 0.695 880s -0.194 0.068 -0.275 880s -0.619 0.216 -0.876 880s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 3, 1, 1] 880s -0.097 880s 0.179 880s -0.122 880s 880s x1: [1, 3, 1, 1] 880s 0.582 880s -0.048 880s -0.208 880s 880s y: [3, 3, 1, 1] 880s -0.056 0.104 -0.071 880s 0.005 -0.009 0.006 880s 0.020 -0.037 0.025 880s testing: mul_mat, [2, 1, 2, 1] = [3, 2, 2, 1] * [3, 1, 2, 1] 880s x0: [3, 2, 2, 1] 880s -0.012 0.049 -0.144 880s -0.448 -0.303 -0.987 880s 880s x1: [3, 1, 2, 1] 880s 0.726 -0.901 0.708 880s 880s y: [2, 1, 2, 1] 880s -0.155 -0.751 880s testing: mul_mat, [2, 3, 2, 3] = [2, 2, 2, 3] * [2, 3, 2, 3] 880s x0: [2, 2, 2, 3] 880s -0.401 0.140 880s 0.231 -0.403 880s 880s x1: [2, 3, 2, 3] 880s -0.023 -0.713 880s -0.280 -0.857 880s -0.363 -0.699 880s 880s y: [2, 3, 2, 3] 880s -0.090 0.282 880s -0.007 0.281 880s 0.048 0.198 880s test-mul-mat0: iter:27/500 880s testing: mul_mat, [3, 2, 1, 1] = [1, 3, 1, 1] * [1, 2, 1, 1] 880s x0: [1, 3, 1, 1] 880s 0.520 880s -0.683 880s -0.196 880s 880s x1: [1, 2, 1, 1] 880s -0.079 880s 0.086 880s 880s y: [3, 2, 1, 1] 880s -0.041 0.054 0.015 880s 0.045 -0.059 -0.017 880s testing: mul_mat, [2, 3, 2, 1] = [1, 2, 2, 1] * [1, 3, 2, 1] 880s x0: [1, 2, 2, 1] 880s -0.837 880s -0.426 880s 880s x1: [1, 3, 2, 1] 880s -0.883 880s -0.845 880s 0.375 880s 880s y: [2, 3, 2, 1] 880s 0.740 0.377 880s 0.707 0.360 880s -0.314 -0.160 880s testing: mul_mat, [1, 2, 2, 2] = [1, 1, 2, 2] * [1, 2, 2, 2] 880s x0: [1, 1, 2, 2] 880s -0.244 880s 880s x1: [1, 2, 2, 2] 880s -0.607 880s -0.313 880s 880s y: [1, 2, 2, 2] 880s 0.148 880s 0.077 880s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 4, 1, 1] 880s 0.125 880s -0.990 880s 0.095 880s 0.442 880s 880s x1: [1, 1, 1, 1] 880s -0.654 880s 880s y: [4, 1, 1, 1] 880s -0.082 0.647 -0.062 -0.289 880s testing: mul_mat, [1, 1, 2, 1] = [4, 1, 2, 1] * [4, 1, 2, 1] 880s x0: [4, 1, 2, 1] 880s 0.847 0.672 0.993 -0.759 880s 880s x1: [4, 1, 2, 1] 880s -0.186 0.211 0.597 -0.811 880s 880s y: [1, 1, 2, 1] 880s 1.193 880s testiggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ng: mul_mat, [1, 4, 2, 2] = [1, 1, 2, 2] * [1, 4, 2, 2] 880s x0: [1, 1, 2, 2] 880s -0.288 880s 880s x1: [1, 4, 2, 2] 880s -0.319 880s -0.722 880s -0.916 880s 0.917 880s 880s y: [1, 4, 2, 2] 880s 0.092 880s 0.208 880s 0.264 880s -0.264 880s test-mul-mat0: iter:28/500 880s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 3, 1, 1] 880s -0.901 0.335 -0.198 -0.703 880s 0.045 -0.958 0.938 0.568 880s -0.506 0.176 -0.721 -0.406 880s 880s x1: [4, 3, 1, 1] 880s 0.988 -0.927 -0.410 0.488 880s 0.713 -0.802 -0.831 0.992 880s -0.717 -0.914 0.026 0.636 880s 880s y: [3, 3, 1, 1] 880s -1.463 0.825 -0.566 880s -1.444 0.583 -0.306 880s -0.112 1.228 -0.074 880s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 2, 1, 1] 880s 0.875 -0.542 -0.563 -0.099 880s -0.906 0.094 -0.000 0.429 880s 880s x1: [4, 3, 1, 1] 880s -0.289 -0.735 0.155 -0.294 880s -0.903 0.387 -0.306 -0.830 880s 0.978 -0.818 0.884 -0.824 880s 880s y: [2, 3, 1, 1] 880s 0.087 0.067 880s -0.745 0.498 880s 0.883 -1.316 880s testing: mul_mat, [4, 2, 1, 3] = [4, 4, 1, 3] * [4, 2, 1, 3] 880s x0: [4, 4, 1, 3] 880s 0.493 0.513 -0.726 -0.414 880s -0.488 0.703 -0.518 0.810 880s -0.823 -0.580 0.045 -0.782 880s -0.148 0.456 -0.461 0.878 880s 880s x1: [4, 2, 1, 3] 880s 0.896 0.297 -0.526 0.938 880s 0.235 -0.959 -0.568 -0.589 880s 880s y: [4, 2, 1, 3] 880s 0.587 0.805 -1.666 1.069 880s 0.280 -0.971 0.797 -0.727 880s testing: mul_mat, [3, 4, 1, 1] = [4, 3, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 3, 1, 1] 880s 0.536 -0.214 -0.543 -0.021 880s -0.462 -0.619 0.268 0.612 880s 0.634 -0.136 -0.687 -0.699 880s 880s x1: [4, 4, 1, 1] 880s 0.113 -0.540 0.330 -0.080 880s -0.913 -0.740 0.580 0.261 880s 0.894 -0.024 -0.844 -0.416 880s -0.016 0.231 -0.233 -0.495 880s 880s y: [3, 4, 1, 1] 880s -0.002 0.322 -0.026 880s -0.651 1.195 -1.059 880s 0.951 -0.880 1.441 880s 0.079 -0.502 0.465 880s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 3, 1, 1] 880s -0.396 0.371 -0.093 880s 0.872 -0.623 -0.085 880s -0.425 0.156 -0.210 880s 880s x1: [3, 4, 1, 1] 880s 0.380 -0.465 0.413 880s 0.839 0.866 -0.667 880s 0.927 -0.874 0.912 880s 0.187 -0.980 -0.112 880s 880s y: [3, 4, 1, 1] 880s -0.361 0.585 -0.321 880s 0.052 0.250 -0.082 880s -0.776 1.275 -0.722 880s -0.428 0.783 -0.209 880s testing: mul_mat, [2, 3, 1, 3] = [3, 2, 1, 3] * [3, 3, 1, 3] 880s x0: [3, 2, 1, 3] 880s 0.605 0.325 0.273 880s 0.978 0.298 -0.021 880s 880s x1: [3, 3, 1, 3] 880s 0.693 0.295 0.654 880s -0.850 -0.437 0.967 880s 0.129 -0.824 -0.733 880s 880s y: [2, 3, 1, 3] 880s 0.694 0.752 880s -0.392 -0.982 880s -0.390 -0.104 880s test-mul-mat0: iter:29/500 880s testing: mul_mat, [3, 1, 1, 1] = [3, 3, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 3, 1, 1] 880s -0.667 0.356 -0.250 880s 0.397 0.892 -0.645 880s 0.375 0.217 0.654 880s 880s x1: [3, 1, 1, 1] 880s -0.275 0.647 0.976 880s 880s y: [3, 1, 1, 1] 880s 0.170 -0.162 0.675 880s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.248 -0.800 -0.480 880s -0.845 0.910 -0.478 880s 880s x1: [3, 3, 1, 1] 880s -0.352 -0.804 0.102 880s -0.515 0.141 0.680 880s -0.503 -0.646 -0.234 880s 880s y: [2, 3, 1, 1] 880s 0.507 -0.483 880s -0.567 0.239 880s 0.504 -0.050 880s testing: mul_mat, [1, 2, 1, 2] = [3, 1, 1, 2] * [3, 2, 1, 2] 880s x0: [3, 1, 1, 2] 880s 0.522 -0.943 -0.613 880s 880s x1: [3, 2, 1, 2] 880s -0.013 0.081 0.247 880s -0.366 0.057 0.298 880s 880s y: [1, 2, 1, 2] 880s -0.234 880s -0.428 880s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 1, 1, 1] 880s 0.605 -0.412 -0.539 880s 880s x1: [3, 1, 1, 1] 880s -0.141 -0.161 0.368 880s 880s y: [1, 1, 1, 1] 880s -0.218 880s testing: mul_mat, [4, 3, 1, 1] = [1, 4, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 4, 1, 1] 880s 0.630 880s 0.603 880s -0.204 880s -0.123 880s 880s x1: [1, 3, 1, 1] 880s 0.805 880s 0.108 880s -0.384 880s 880s y: [4, 3, 1, 1] 880s 0.507 0.485 -0.165 -0.099 880s 0.068 0.065 -0.022 -0.013 880s -0.242 -0.232 0.079 0.047 880s testing: mul_mat, [2, 1, 1, 2] = [4, 2, 1, 2] * [4, 1, 1, 2] 880s x0: [4, 2, 1, 2] 880s 0.306 -0.317 0.810 -0.437 880s -0.907 0.692 0.843 -0.331 880s 880s x1: [4, 1, 1, 2] 880s -0.764 0.853 -0.825 0.203 880s 880s y: [2, 1, 1, 2] 880s -1.261 0.520 880s test-mul-mat0: iter:30/500 880s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 4, 1, 1] 880s 0.293 0.158 880s -0.126 0.386 880s 0.841 -0.434 880s 0.196 0.684 880s 880s x1: [2, 4, 1, 1] 880s -0.397 -0.866 880s -0.202 -0.544 880s -0.691 -0.999 880s -0.278 -0.468 880s 880s y: [4, 4, 1, 1] 880s -0.253 -0.284 0.042 -0.670 880s -0.145 -0.184 0.066 -0.412 880s -0.360 -0.298 -0.148 -0.818 880s -0.155 -0.145 -0.031 -0.375 880s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 1, 1, 1] 880s 0.208 0.048 880s 880s x1: [2, 4, 1, 1] 880s 0.130 0.864 880s -0.634 0.061 880s 0.746 -0.347 880s 0.581 -0.911 880s 880s y: [1, 4, 1, 1] 880s 0.068 880s -0.129 880s 0.139 880s 0.077 880s testing: mul_mat, [3, 1, 1, 2] = [2, 3, 1, 2] * [2, 1, 1, 2] 880s x0: [2, 3, 1, 2] 880s -0.593 0.611 880s -0.091 -0.592 880s -0.667 0.441 880s 880s x1: [2, 1, 1, 2] 880s -0.448 -0.534 880s 880s y: [3, 1, 1, 2] 880s -0.060 0.357 0.064 880s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 1, 1, 1] 880s 0.817 0.143 880s 880s x1: [2, 3, 1, 1] 880s 0.416 -0.794 880s -0.876 -0.455 880s -0.929 -0.510 880s 880s y: [1, 3, 1, 1] 880s 0.227 880s -0.781 880s -0.832 880s testing: mul_mat, [2, 2, 1, 1] = [1, 2, 1, 1] * [1, 2, 1, 1] 880s x0: [1, 2, 1, 1] 880s -0.604 880s -0.045 880s 880s x1: [1, 2, 1, 1] 880s 0.187 880s 0.907 880s 880s y: [2, 2, 1, 1] 880s -0.113 -0.008 880s -0.548 -0.041 880s testing: mul_mat, [4, 1, 1, 2] = [2, 4, 1, 2] * [2, 1, 1, 2] 880s x0: [2, 4, 1, 2] 880s -0.030 0.538 880s 0.205 0.812 880s -0.629 0.354 880s 0.378 0.163 880s 880s x1: [2, 1, 1, 2] 880s -0.561 -0.138 880s 880s y: [4, 1, 1, 2] 880s -0.058 -0.227 0.304 -0.234 880s test-mul-mat0: iter:31/500 880s testing: mul_mat, [3, 1, 1, 1] = [3, 3, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.354 0.130 -0.065 880s -0.784 0.694 -0.471 880s 0.678 -0.336 0.734 880s 880s x1: [3, 1, 1, 1] 880s -0.392 -0.041 -0.825 880s 880s y: [3, 1, 1, 1] 880s -0.090 0.667 -0.858 880s testing: mul_mat, [2, 3, 3, 1] = [3, 2, 3, 1] * [3, 3, 3, 1] 880s x0: [3, 2, 3, 1] 880s 0.364 0.372 0.864 880s 0.413 0.414 0.136 880s 880s x1: [3, 3, 3, 1] 880s -0.951 -0.958 0.272 880s 0.861 0.396 -0.565 880s -0.728 0.600 -0.391 880s 880s y: [2, 3, 3, 1] 880s -0.467 -0.752 880s -0.028 0.443 880s -0.380 -0.106 880s testing: mul_mat, [3, 2, 3, 4] = [3, 3, 3, 4] * [3, 2, 3, 4] 880s x0: [3, 3, 3, 4] 880s -0.865 0.953 -0.520 880s -0.476 -0.135 0.724 880s 0.624 0.676 -0.541 880s 880s x1: [3, 2, 3, 4] 880s -0.668 -0.830 -0.681 880s -0.497 -0.715 -0.797 880s 880s y: [3, 2, 3, 4] 880s 0.141 -0.063 -0.610 880s 0.163 -0.244 -0.362 880s testing: mul_mat, [1, 3, 1, 1] = [3, 1, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 1, 1, 1] 880s 0.477 -0.755 0.040 880s 880s x1: [3, 3, 1, 1] 880s -0.923 0.834 -0.745 880s -0.654 -0.899 0.852 880s 0.088 0.776 -0.175 880s 880s y: [1, 3, 1, 1] 880s -1.101 880s 0.401 880s -0.551 880s testing: mul_mat, [3, 3, 3, 1] = [1, 3, 3, 1] * [1, 3, 3, 1] 880s x0: [1, 3, 3, 1] 880s -0.435 880s 0.270 880s 0.186 880s 880s x1: [1, 3, 3, 1] 880s -0.135 880s -0.012 880s 0.077 880s 880s y: [3, 3, 3, 1] 880s 0.059 -0.036 -0.025 880s 0.005 -0.003 -0.002 880s -0.034 0.021 0.014 880s testing: mul_mat, [1, 1, 3, 4] = [3, 1, 3, 4] * [3, 1, 3, 4] 880s x0: [3, 1, 3, 4] 880s -0.108 0.010 0.077 880s 880s x1: [3, 1, 3, 4] 880s 0.117 -0.742 -0.117 880s 880s y: [1, 1, 3, 4] 880s -0.029 880s test-mul-mat0: iter:32/500 880s testing: mul_mat, [1, 2, 1, 1] = [2, 1, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 1, 1, 1] 880s -0.138 -0.727 880s 880s x1: [2, 2, 1, 1] 880s 0.448 -0.384 880s -0.117 -0.295 880s 880s y: [1, 2, 1, 1] 880s 0.217 880s 0.231 880s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 4, 1, 1] 880s -0.434 -0.029 880s -0.640 -0.293 880s 0.584 0.876 880s 0.527 -0.837 880s 880s x1: [2, 1, 1, 1] 880s 0.987 -0.935 880s 880s y: [4, 1, 1, 1] 880s -0.401 -0.357 -0.243 1.303 880s testing: mul_mat, [3, 4, 1, 1] = [2, 3, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 3, 1, 1] 880s 0.489 -0.499 880s -0.456 -0.655 880s 0.949 0.160 880s 880s x1: [2, 4, 1, 1] 880s -0.231 -0.756 880s -0.184 0.009 880s 0.854 -0.520 880s -0.674 -0.173 880s 880s y: [3, 4, 1, 1] 880s 0.264 0.600 -0.341 880s -0.094 0.078 -0.173 880s 0.677 -0.049 0.727 880s -0.243 0.421 -0.668 880s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 1, 1, 1] 880s -0.735 0.400 880s 880s x1: [2, 3, 1, 1] 880s 0.228 -0.346 880s -0.153 -0.909 880s -0.073 -0.166 880s 880s y: [1, 3, 1, 1] 880s -0.306 880s -0.251 880s -0.013 880s testing: mul_mat, [4, 2, 1, 1] = [1, 4, 1, 1] * [1, 2, 1, 1] 880s x0: [1, 4, 1, 1] 880s -0.289 880s -0.499 880s 0.633 880s -0.127 880s 880s x1: [1, 2, 1, 1] 880s 0.126 880s -0.375 880s 880s y: [4, 2, 1, 1] 880s -0.037 -0.063 0.080 -0.016 880s 0.108 0.187 -0.237 0.047 880s testing: mul_mat, [3, 1, 1, 1] = [4, 3, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 3, 1, 1] 880s -0.831 -0.498 0.046 -0.794 880s -0.395 -0.906 -0.560 0.668 880s -0.442 0.059 0.008 0.663 880s 880s x1: [4, 1, 1, 1] 880s 0.270 0.877 0.690 -0.721 880s 880s y: [3, 1, 1, 1] 880s -0.056 -1.769 -0.540 880s test-mul-mat0: iter:33/500 880s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 1, 1, 1] 880s 0.393 880s 880s x1: [1, 4, 1, 1] 880s 0.914 880s -0.145 880s 0.286 880s 0.040 880s 880s y: [1, 4, 1, 1] 880s 0.359 880s -0.057 880s 0.112 880s 0.016 880s testing: mul_mat, [3, 1, 3, 1] = [1, 3, 3, 1] * [1, 1, 3, 1] 880s x0: [1, 3, 3, 1] 880s 0.251 880s -0.097 880s -0.686 880s 880s x1: [1, 1, 3, 1] 880s 0.751 880s 880s y: [3, 1, 3, 1] 880s 0.189 -0.073 -0.515 880s testing: mul_mat, [3, 3, 3, 1] = [1, 3, 3, 1] * [1, 3, 3ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s , 1] 880s x0: [1, 3, 3, 1] 880s 0.512 880s 0.177 880s -0.991 880s 880s x1: [1, 3, 3, 1] 880s -0.771 880s -0.749 880s -0.968 880s 880s y: [3, 3, 3, 1] 880s -0.394 -0.137 0.764 880s -0.383 -0.133 0.743 880s -0.496 -0.172 0.960 880s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 3, 1, 1] 880s 0.785 880s 0.737 880s -0.391 880s 880s x1: [1, 3, 1, 1] 880s 0.828 880s -0.118 880s 0.423 880s 880s y: [3, 3, 1, 1] 880s 0.650 0.610 -0.323 880s -0.093 -0.087 0.046 880s 0.332 0.312 -0.165 880s testing: mul_mat, [4, 1, 3, 1] = [3, 4, 3, 1] * [3, 1, 3, 1] 880s x0: [3, 4, 3, 1] 880s 0.476 -0.926 0.922 880s 0.979 0.556 -0.593 880s -0.956 0.944 0.132 880s -0.234 -0.917 0.769 880s 880s x1: [3, 1, 3, 1] 880s -0.581 -0.780 -0.256 880s 880s y: [4, 1, 3, 1] 880s 0.210 -0.851 -0.214 0.654 880s testing: mul_mat, [4, 3, 3, 1] = [4, 4, 3, 1] * [4, 3, 3, 1] 880s x0: [4, 4, 3, 1] 880s 0.998 0.754 0.840 0.961 880s 0.998 0.549 0.135 0.470 880s -0.651 -0.660 -0.525 0.561 880s -0.170 -0.561 0.442 0.791 880s 880s x1: [4, 3, 3, 1] 880s 0.544 0.940 -0.216 0.467 880s -0.653 0.915 0.236 -0.389 880s 0.846 -0.167 0.756 0.200 880s 880s y: [4, 3, 3, 1] 880s 1.518 1.249 -0.600 -0.346 880s -0.138 -0.300 -0.522 -0.606 880s 1.545 0.948 -0.725 0.442 880s test-mul-mat0: iter:34/500 880s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 1, 1, 1] 880s -0.696 0.363 0.754 880s 880s x1: [3, 2, 1, 1] 880s -0.448 0.345 -0.382 880s 0.727 0.444 0.296 880s 880s y: [1, 2, 1, 1] 880s 0.148 880s -0.121 880s testing: mul_mat, [2, 1, 2, 1] = [3, 2, 2, 1] * [3, 1, 2, 1] 880s x0: [3, 2, 2, 1] 880s 0.501 -0.466 -0.957 880s -0.768 0.979 0.167 880s 880s x1: [3, 1, 2, 1] 880s 0.163 0.426 -0.632 880s 880s y: [2, 1, 2, 1] 880s 0.488 0.187 880s testing: mul_mat, [1, 2, 2, 4] = [3, 1, 2, 4] * [3, 2, 2, 4] 880s x0: [3, 1, 2, 4] 880s -0.709 0.344 0.232 880s 880s x1: [3, 2, 2, 4] 880s -0.533 -0.596 0.815 880s 0.812 0.022 0.541 880s 880s y: [1, 2, 2, 4] 880s 0.362 880s -0.443 880s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.637 -0.696 -0.376 880s 0.280 0.928 0.536 880s 880s x1: [3, 1, 1, 1] 880s 0.960 0.801 0.788 880s 880s y: [2, 1, 1, 1] 880s -0.241 1.434 880s testing: mul_mat, [2, 3, 2, 1] = [2, 2, 2, 1] * [2, 3, 2, 1] 880s x0: [2, 2, 2, 1] 880s 0.713 -0.022 880s 0.280 0.713 880s 880s x1: [2, 3, 2, 1] 880s -0.285 -0.947 880s -0.036 0.166 880s -1.000 0.574 880s 880s y: [2, 3, 2, 1] 880s -0.182 -0.755 880s -0.029 0.109 880s -0.725 0.129 880s testing: mul_mat, [3, 2, 2, 4] = [2, 3, 2, 4] * [2, 2, 2, 4] 880s x0: [2, 3, 2, 4] 880s -0.759 0.850 880s 0.484 0.798 880s -0.211 0.407 880s 880s x1: [2, 2, 2, 4] 880s -0.281 0.053 880s -0.671 -0.990 880s 880s y: [3, 2, 2, 4] 880s 0.258 -0.093 0.081 880s -0.333 -1.114 -0.261 880s test-mul-mat0: iter:35/500 880s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 1, 1, 1] 880s 0.346 0.005 -0.682 -0.491 880s 880s x1: [4, 3, 1, 1] 880s 0.001 -0.027 -0.375 -0.427 880s -0.952 0.923 0.189 0.037 880s -0.305 0.904 -0.734 -0.049 880s 880s y: [1, 3, 1, 1] 880s 0.466 880s -0.472 880s 0.423 880s testing: mul_mat, [4, 1, 4, 1] = [4, 4, 4, 1] * [4, 1, 4, 1] 880s x0: [4, 4, 4, 1] 880s -0.453 0.214 0.022 0.737 880s -0.749 0.718 0.641 -0.483 880s -0.331 0.048 0.863 0.673 880s 0.366 -0.628 -0.621 0.336 880s 880s x1: [4, 1, 4, 1] 880s -0.294 0.970 -0.958 -0.131 880s 880s y: [4, 1, 4, 1] 880s 0.223 0.366 -0.771 -0.166 880s testing: mul_mat, [4, 4, 4, 3] = [4, 4, 4, 3] * [4, 4, 4, 3] 880s x0: [4, 4, 4, 3] 880s -0.176 0.174 -0.362 -0.758 880s -0.155 -0.788 -0.774 0.357 880s 0.415 0.698 0.995 0.590 880s 0.711 0.269 -0.808 0.400 880s 880s x1: [4, 4, 4, 3] 880s -0.244 0.320 -0.904 -0.473 880s -0.411 0.492 -0.033 -0.094 880s -0.249 0.080 -0.411 -0.617 880s 0.572 0.607 -0.572 0.313 880s 880s y: [4, 4, 4, 3] 880s 0.785 0.316 -1.056 0.454 880s 0.241 -0.332 0.085 -0.171 880s 0.675 0.073 -0.821 -0.070 880s -0.025 -0.013 0.277 1.158 880s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 2, 1, 1] 880s 0.279 0.936 0.918 0.773 880s -0.475 -0.201 0.952 -0.055 880s 880s x1: [4, 4, 1, 1] 880s 0.040 -0.821 0.262 0.146 880s -0.539 -0.937 -0.201 0.310 880s -0.812 -0.619 0.888 -0.804 880s 0.949 -0.253 -0.560 -0.301 880s 880s y: [2, 4, 1, 1] 880s -0.404 0.388 880s -0.973 0.236 880s -0.612 1.400 880s -0.719 -0.916 880s testing: mul_mat, [4, 4, 4, 1] = [2, 4, 4, 1] * [2, 4, 4, 1] 880s x0: [2, 4, 4, 1] 880s -0.303 0.009 880s -0.398 0.345 880s 0.122 -0.932 880s -0.248 -0.437 880s 880s x1: [2, 4, 4, 1] 880s -0.675 0.024 880s 0.928 0.368 880s -0.955 -0.945 880s -0.593 -0.777 880s 880s y: [4, 4, 4, 1] 880s 0.205 0.277 -0.105 0.157 880s -0.278 -0.243 -0.230 -0.391 880s 0.281 0.054 0.765 0.651 880s 0.172 -0.032 0.651 0.487 880s testing: mul_mat, [2, 2, 4, 3] = [4, 2, 4, 3] * [ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s 4, 2, 4, 3] 880s x0: [4, 2, 4, 3] 880s -0.092 0.009 -0.303 -0.002 880s -0.261 -0.002 0.730 0.028 880s 880s x1: [4, 2, 4, 3] 880s 0.257 -0.006 0.661 -0.734 880s -0.661 0.729 -0.171 -0.632 880s 880s y: [2, 2, 4, 3] 880s -0.222 0.394 880s 0.121 0.029 880s test-mul-mat0: iter:36/500 880s testing: mul_mat, [3, 2, 1, 1] = [3, 3, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.084 -0.671 0.317 880s 0.870 -0.249 0.021 880s 0.748 0.900 0.418 880s 880s x1: [3, 2, 1, 1] 880s 0.813 -0.358 -0.359 880s 0.784 0.647 -0.021 880s 880s y: [3, 2, 1, 1] 880s 0.194 0.789 0.137 880s -0.375 0.521 1.161 880s testing: mul_mat, [1, 3, 2, 1] = [3, 1, 2, 1] * [3, 3, 2, 1] 880s x0: [3, 1, 2, 1] 880s -0.301 -0.492 0.944 880s 880s x1: [3, 3, 2, 1] 880s 0.161 -0.369 0.545 880s -0.255 -0.172 -0.225 880s -0.900 0.735 0.389 880s 880s y: [1, 3, 2, 1] 880s 0.647 880s -0.052 880s 0.277 880s testing: mul_mat, [4, 1, 2, 1] = [3, 4, 2, 1] * [3, 1, 2, 1] 880s x0: [3, 4, 2, 1] 880s -0.882 0.245 0.308 880s -0.053 -0.980 0.408 880s -0.319 0.409 -0.030 880s -0.718 -0.924 0.138 880s 880s x1: [3, 1, 2, 1] 880s 0.653 0.588 -0.718 880s 880s y: [4, 1, 2, 1] 880s -0.653 -0.904 0.054 -1.110 880s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 3, 1, 1] 880s -0.342 -0.059 0.632 880s -0.255 -0.178 0.786 880s 0.526 -0.336 0.782 880s 880s x1: [3, 4, 1, 1] 880s 0.876 0.425 -0.628 880s 0.277 -0.088 -0.928 880s 0.840 0.030 0.317 880s 0.148 0.977 0.336 880s 880s y: [3, 4, 1, 1] 880s -0.722 -0.793 -0.174 880s -0.676 -0.784 -0.551 880s -0.089 0.029 0.680 880s 0.104 0.052 0.013 880s testing: mul_mat, [2, 3, 2, 1] = [3, 2, 2, 1] * [3, 3, 2, 1] 880s x0: [3, 2, 2, 1] 880s -0.540 -0.263 0.425 880s -0.321 0.118 -0.737 880s 880s x1: [3, 3, 2, 1] 880s 0.374 -0.722 -0.334 880s 0.815 0.278 0.494 880s -0.962 0.115 0.583 880s 880s y: [2, 3, 2, 1] 880s -0.154 0.041 880s -0.304 -0.592 880s 0.737 -0.108 880s testing: mul_mat, [1, 3, 2, 1] = [2, 1, 2, 1] * [2, 3, 2, 1] 880s x0: [2, 1, 2, 1] 880s -0.304 -0.568 880s 880s x1: [2, 3, 2, 1] 880s 0.064 -0.690 880s 0.375 -0.121 880s 0.588 -0.131 880s 880s y: [1, 3, 2, 1] 880s 0.372 880s -0.045 880s -0.104 880s test-mul-mat0: iter:37/500 880s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 1, 1, 1] 880s -0.874 0.163 -0.494 -0.843 880s 880s x1: [4, 1, 1, 1] 880s 0.974 -0.243 0.879 -0.763 880s 880s y: [1, 1, 1, 1] 880s -0.682 880s testing: mul_mat, [4, 1, 3, 1] = [4, 4, 3, 1] * [4, 1, 3, 1] 880s x0: [4, 4, 3, 1] 880s -0.777 -0.300 0.405 0.280 880s 0.879 0.181 -0.483 0.518 880s 0.636 -0.509 -0.724 0.515 880s -0.273 0.092 0.641 0.891 880s 880s x1: [4, 1, 3, 1] 880s 0.872 0.570 -0.533 0.247 880s 880s y: [4, 1, 3, 1] 880s -0.996 1.256 0.777 -0.307 880s testing: mul_mat, [2, 4, 3, 3] = [4, 2, 3, 3] * [4, 4, 3, 3] 880s x0: [4, 2, 3, 3] 880s 0.048 0.527 -0.941 -0.601 880s -0.239 -0.581 0.748 -0.731 880s 880s x1: [4, 4, 3, 3] 880s 0.842 -0.924 0.314 -0.892 880s 0.086 -0.068 -0.571 -0.787 880s 0.917 0.510 0.906 0.102 880s -0.406 -0.428 -0.491 -0.623 880s 880s y: [2, 4, 3, 3] 880s -0.206 1.223 880s 0.978 0.167 880s -0.600 0.088 880s 0.590 0.433 880s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 1, 1, 1] 880s 0.677 -0.040 -0.264 0.564 880s 880s x1: [4, 2, 1, 1] 880s -0.661 0.033 0.004 0.410 880s 0.920 0.481 -0.707 0.026 880s 880s y: [1, 2, 1, 1] 880s -0.219 880s 0.805 880s testing: mul_mat, [2, 4, 3, 1] = [1, 2, 3, 1] * [1, 4, 3, 1] 880s x0: [1, 2, 3, 1] 880s -0.485 880s -0.712 880s 880s x1: [1, 4, 3, 1] 880s 0.014 880s 0.395 880s -0.505 880s -0.414 880s 880s y: [2, 4, 3, 1] 880s -0.007 -0.010 880s -0.191 -0.281 880s 0.245 0.360 880s 0.201 0.295 880s testing: mul_mat, [2, 1, 3, 3] = [2, 2, 3, 3] * [2, 1, 3, 3] 880s x0: [2, 2, 3, 3] 880s -0.419 0.453 880s 0.695 -0.998 880s 880s x1: [2, 1, 3, 3] 880s 0.909 -0.749 880s 880s y: [2, 1, 3, 3] 880s -0.720 1.380 880s test-mul-mat0: iter:38/500 880s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 3, 1, 1] 880s -0.789 0.010 880s -0.182 -0.008 880s -0.005 -0.977 880s 880s x1: [2, 2, 1, 1] 880s -0.421 -0.338 880s -0.373 0.881 880s 880s y: [3, 2, 1, 1] 880s 0.329 0.079 0.332 880s 0.304 0.061 -0.858 880s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 1, 1, 1] 880s 0.459 -0.094 880s 880s x1: [2, 3, 1, 1] 880s -0.354 -0.207 880s -0.774 -0.262 880s 0.419 -0.180 880s 880s y: [1, 3, 1, 1] 880s -0.143 880s -0.331 880s 0.209 880s testing: mul_mat, [4, 1, 1, 2] = [2, 4, 1, 2] * [2, 1, 1, 2] 880s x0: [2, 4, 1, 2] 880s -0.759 0.173 880s -0.800 0.427 880s -0.126 -0.856 880s -0.994 0.536 880s 880s x1: [2, 1, 1, 2] 880s -0.374 -0.275 880s 880s y: [4, 1, 1, 2] 880s 0.236 0.181 0.282 0.224 880s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 1, 1, 1] 880s -0.798 0.747 880s 880s x1: [2, 4, 1, 1] 880s -0.749 -0.380 880s -0.315 -0.524 880s 0.358 -0.895 880s 0.296 0.120 880s 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s y: [1, 4, 1, 1] 880s 0.314 880s -0.140 880s -0.954 880s -0.147 880s testing: mul_mat, [2, 2, 1, 1] = [1, 2, 1, 1] * [1, 2, 1, 1] 880s x0: [1, 2, 1, 1] 880s 0.733 880s -0.983 880s 880s x1: [1, 2, 1, 1] 880s -0.712 880s 0.775 880s 880s y: [2, 2, 1, 1] 880s -0.522 0.700 880s 0.568 -0.762 880s testing: mul_mat, [1, 1, 1, 2] = [2, 1, 1, 2] * [2, 1, 1, 2] 880s x0: [2, 1, 1, 2] 880s 0.450 -0.485 880s 880s x1: [2, 1, 1, 2] 880s -0.894 0.933 880s 880s y: [1, 1, 1, 2] 880s -0.854 880s test-mul-mat0: iter:39/500 880s testing: mul_mat, [3, 1, 1, 1] = [3, 3, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 3, 1, 1] 880s -0.188 -0.120 0.782 880s -0.084 -0.823 -0.098 880s -0.520 -0.621 -0.351 880s 880s x1: [3, 1, 1, 1] 880s 0.804 -0.874 -0.596 880s 880s y: [3, 1, 1, 1] 880s -0.513 0.710 0.334 880s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.064 0.182 0.358 880s -0.599 0.108 0.188 880s 880s x1: [3, 3, 1, 1] 880s -0.232 -0.846 0.582 880s -0.499 -0.829 0.688 880s -0.567 0.614 -0.333 880s 880s y: [2, 3, 1, 1] 880s 0.039 0.157 880s 0.063 0.339 880s -0.044 0.343 880s testing: mul_mat, [4, 2, 1, 4] = [3, 4, 1, 4] * [3, 2, 1, 4] 880s x0: [3, 4, 1, 4] 880s -0.897 0.869 -0.461 880s -0.495 -0.023 0.727 880s 0.140 -0.763 -0.281 880s 0.902 0.902 -0.864 880s 880s x1: [3, 2, 1, 4] 880s -0.365 0.261 -0.008 880s -0.238 0.664 0.416 880s 880s y: [4, 2, 1, 4] 880s 0.557 0.169 -0.248 -0.087 880s 0.599 0.405 -0.657 0.024 880s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.919 -0.792 0.224 880s -0.369 0.217 -0.723 880s 880s x1: [3, 4, 1, 1] 880s -0.350 -0.994 -0.911 880s -0.164 -0.324 0.611 880s 0.354 0.583 -0.443 880s 0.891 0.161 -0.458 880s 880s y: [2, 4, 1, 1] 880s 0.262 0.572 880s 0.243 -0.452 880s -0.236 0.316 880s 0.589 0.037 880s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.701 0.080 880s 0.445 -0.225 880s 880s x1: [2, 3, 1, 1] 880s -0.758 -0.043 880s -0.417 0.986 880s -0.408 -0.209 880s 880s y: [2, 3, 1, 1] 880s -0.535 -0.328 880s -0.213 -0.408 880s -0.303 -0.135 880s testing: mul_mat, [2, 2, 1, 4] = [2, 2, 1, 4] * [2, 2, 1, 4] 880s x0: [2, 2, 1, 4] 880s 0.336 0.412 880s 0.921 0.578 880s 880s x1: [2, 2, 1, 4] 880s 0.267 0.731 880s -0.219 0.356 880s 880s y: [2, 2, 1, 4] 880s 0.391 0.668 880s 0.073 0.004 880s test-mul-mat0: iter:40/500 880s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 4, 1, 1] 880s 0.223 880s 0.347 880s -0.459 880s 0.423 880s 880s x1: [1, 1, 1, 1] 880s 0.822 880s 880s y: [4, 1, 1, 1] 880s 0.183 0.285 -0.377 0.348 880s testing: mul_mat, [4, 4, 3, 1] = [1, 4, 3, 1] * [1, 4, 3, 1] 880s x0: [1, 4, 3, 1] 880s -0.203 880s 0.451 880s 0.551 880s 0.029 880s 880s x1: [1, 4, 3, 1] 880s 0.414 880s 0.469 880s 0.543 880s -0.889 880s 880s y: [4, 4, 3, 1] 880s -0.084 0.186 0.228 0.012 880s -0.095 0.211 0.258 0.013 880s -0.110 0.245 0.299 0.016 880s 0.180 -0.401 -0.490 -0.026 880s testing: mul_mat, [4, 4, 3, 1] = [1, 4, 3, 1] * [1, 4, 3, 1] 880s x0: [1, 4, 3, 1] 880s -0.632 880s 0.933 880s 0.018 880s -0.621 880s 880s x1: [1, 4, 3, 1] 880s 0.204 880s 0.738 880s 0.630 880s -0.449 880s 880s y: [4, 4, 3, 1] 880s -0.129 0.191 0.004 -0.127 880s -0.467 0.689 0.013 -0.459 880s -0.398 0.588 0.011 -0.391 880s 0.284 -0.419 -0.008 0.279 880s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 1, 1, 1] 880s 0.076 880s 880s x1: [1, 4, 1, 1] 880s -0.268 880s 0.568 880s -0.331 880s -0.886 880s 880s y: [1, 4, 1, 1] 880s -0.020 880s 0.043 880s -0.025 880s -0.067 880s testing: mul_mat, [2, 1, 3, 1] = [1, 2, 3, 1] * [1, 1, 3, 1] 880s x0: [1, 2, 3, 1] 880s -0.405 880s 0.758 880s 880s x1: [1, 1, 3, 1] 880s 0.318 880s 880s y: [2, 1, 3, 1] 880s -0.129 0.242 880s testing: mul_mat, [2, 1, 3, 1] = [2, 2, 3, 1] * [2, 1, 3, 1] 880s x0: [2, 2, 3, 1] 880s 0.718 -0.382 880s -0.567 0.124 880s 880s x1: [2, 1, 3, 1] 880s -0.305 0.830 880s 880s y: [2, 1, 3, 1] 880s -0.536 0.275 880s test-mul-mat0: iter:41/500 880s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 4, 1, 1] 880s -0.779 -0.381 880s 0.645 0.985 880s 0.655 0.087 880s 0.703 -0.913 880s 880s x1: [2, 4, 1, 1] 880s -0.723 -0.143 880s 0.544 -0.964 880s -0.308 0.887 880s -0.609 -0.398 880s 880s y: [4, 4, 1, 1] 880s 0.617 -0.607 -0.485 -0.378 880s -0.057 -0.598 0.272 1.262 880s -0.098 0.675 -0.124 -1.026 880s 0.626 -0.786 -0.434 -0.065 880s testing: mul_mat, [1, 4, 4, 1] = [2, 1, 4, 1] * [2, 4, 4, 1] 880s x0: [2, 1, 4, 1] 880s -0.278 0.937 880s 880s x1: [2, 4, 4, 1] 880s 0.706 -0.173 880s 0.469 -0.471 880s 0.316 0.297 880s -0.916 0.374 880s 880s y: [1, 4, 4, 1] 880s -0.359 880s -0.572 880s 0.190 880s 0.605 880s testing: mul_mat, [4, 1, 4, 2] = [2, 4, 4, 2] * [2, 1, 4, 2] 880s x0: [2, 4, 4, 2] 880s 0.969 -0.364 880s 0.111 0.222 880s -0.956 0.916 880s 0.944 0.981 880s 880s x1: [2, 1, 4, 2] 880s 0.502 0.178 880s 880s y: [4, 1, 4, 2] 880s 0.422 0.095 -0.317 0.648 880s testing: mul_mat, [3, 4, 1, 1] = [2, 3, 1,ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s 1] * [2, 4, 1, 1] 880s x0: [2, 3, 1, 1] 880s 0.269 0.730 880s -0.248 0.476 880s 0.697 0.476 880s 880s x1: [2, 4, 1, 1] 880s -0.916 -0.053 880s -0.214 0.944 880s -0.018 -0.504 880s 0.841 -0.020 880s 880s y: [3, 4, 1, 1] 880s -0.285 0.202 -0.664 880s 0.632 0.503 0.300 880s -0.373 -0.235 -0.252 880s 0.211 -0.218 0.577 880s testing: mul_mat, [3, 2, 4, 1] = [3, 3, 4, 1] * [3, 2, 4, 1] 880s x0: [3, 3, 4, 1] 880s -0.062 -0.332 -0.647 880s -0.775 0.701 0.674 880s -0.765 0.711 0.368 880s 880s x1: [3, 2, 4, 1] 880s 0.684 0.973 0.657 880s -0.910 -0.258 -0.358 880s 880s y: [3, 2, 4, 1] 880s -0.790 0.594 0.410 880s 0.373 0.283 0.381 880s testing: mul_mat, [4, 3, 4, 2] = [3, 4, 4, 2] * [3, 3, 4, 2] 880s x0: [3, 4, 4, 2] 880s -0.619 0.003 -0.940 880s -0.262 0.910 -0.860 880s 0.832 0.892 -0.065 880s -0.382 -0.602 -0.166 880s 880s x1: [3, 3, 4, 2] 880s 0.039 0.568 0.087 880s -0.594 -0.989 0.503 880s -0.583 -0.250 -0.729 880s 880s y: [4, 3, 4, 2] 880s -0.104 0.432 0.533 -0.371 880s -0.109 -1.177 -1.408 0.739 880s 1.046 0.552 -0.661 0.494 880s test-mul-mat0: iter:42/500 880s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 2, 1, 1] 880s 0.110 -0.315 -0.224 0.451 880s 0.649 -0.962 0.996 0.800 880s 880s x1: [4, 4, 1, 1] 880s 0.103 -0.917 -0.438 -0.132 880s -0.603 -0.302 0.512 -0.303 880s -0.724 -0.351 0.851 -0.461 880s 0.157 -0.656 -0.052 -0.977 880s 880s y: [2, 4, 1, 1] 880s 0.338 0.407 880s -0.223 0.167 880s -0.368 0.346 880s -0.205 -0.100 880s testing: mul_mat, [2, 2, 3, 1] = [4, 2, 3, 1] * [4, 2, 3, 1] 880s x0: [4, 2, 3, 1] 880s 0.958 0.165 -0.233 0.996 880s 0.160 -0.433 0.002 0.760 880s 880s x1: [4, 2, 3, 1] 880s 0.005 -0.400 -0.355 0.120 880s 0.030 0.580 -0.777 0.113 880s 880s y: [2, 2, 3, 1] 880s 0.141 0.265 880s 0.418 -0.161 880s testing: mul_mat, [1, 2, 3, 3] = [4, 1, 3, 3] * [4, 2, 3, 3] 880s x0: [4, 1, 3, 3] 880s -0.437 0.808 0.855 0.442 880s 880s x1: [4, 2, 3, 3] 880s 0.702 -0.918 0.145 -0.865 880s 0.264 -0.142 0.253 0.221 880s 880s y: [1, 2, 3, 3] 880s -1.307 880s 0.085 880s testing: mul_mat, [3, 1, 1, 1] = [4, 3, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 3, 1, 1] 880s 0.010 -0.271 -0.008 -0.474 880s 0.732 -0.173 0.037 0.515 880s 0.141 -0.286 -0.393 0.253 880s 880s x1: [4, 1, 1, 1] 880s -0.633 0.235 0.651 -0.702 880s 880s y: [3, 1, 1, 1] 880s 0.257 -0.841 -0.590 880s testing: mul_mat, [4, 4, 3, 1] = [3, 4, 3, 1] * [3, 4, 3, 1] 880s x0: [3, 4, 3, 1] 880s 0.217 0.118 0.792 880s -0.948 -0.157 -0.178 880s -0.740 0.736 0.611 880s 0.886 0.539 0.040 880s 880s x1: [3, 4, 3, 1] 880s 0.203 0.548 -0.653 880s 0.754 0.678 -0.193 880s -0.070 -0.532 -0.233 880s -0.331 0.066 -0.417 880s 880s y: [4, 4, 3, 1] 880s -0.408 -0.163 -0.146 0.450 880s 0.090 -0.787 -0.177 1.026 880s -0.262 0.191 -0.482 -0.358 880s -0.394 0.377 0.039 -0.274 880s testing: mul_mat, [4, 3, 3, 3] = [4, 4, 3, 3] * [4, 3, 3, 3] 880s x0: [4, 4, 3, 3] 880s 0.952 -0.672 -0.698 -0.558 880s -0.500 -0.564 -0.978 -0.398 880s 0.166 0.882 -0.175 -0.298 880s -0.989 0.345 0.563 0.551 880s 880s x1: [4, 3, 3, 3] 880s -0.603 -0.705 0.228 -0.811 880s 0.117 -0.161 0.229 -0.437 880s 0.531 0.248 0.775 -0.965 880s 880s y: [4, 3, 3, 3] 880s 0.193 0.799 -0.521 0.035 880s 0.304 -0.018 -0.033 -0.283 880s 0.337 -0.780 0.459 -0.536 880s test-mul-mat0: iter:43/500 880s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.386 -0.427 0.803 880s -0.777 0.222 -0.052 880s 880s x1: [3, 3, 1, 1] 880s 0.238 0.024 -0.141 880s -0.895 0.908 -0.113 880s -0.162 -0.536 0.676 880s 880s y: [2, 3, 1, 1] 880s -0.032 -0.172 880s -0.823 0.903 880s 0.709 -0.028 880s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 1, 1, 1] 880s -0.304 0.561 -0.031 880s 880s x1: [3, 2, 1, 1] 880s 0.410 -0.618 0.647 880s -0.295 -0.464 0.598 880s 880s y: [1, 2, 1, 1] 880s -0.491 880s -0.189 880s testing: mul_mat, [3, 1, 1, 2] = [3, 3, 1, 2] * [3, 1, 1, 2] 880s x0: [3, 3, 1, 2] 880s 0.728 -0.534 -0.884 880s -0.385 0.303 -0.419 880s -0.710 0.449 0.966 880s 880s x1: [3, 1, 1, 2] 880s 0.084 0.675 0.123 880s 880s y: [3, 1, 1, 2] 880s -0.408 0.121 0.362 880s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.551 0.279 0.894 880s 0.496 0.962 0.265 880s -0.823 -0.707 -0.126 880s 880s x1: [3, 3, 1, 1] 880s 0.004 0.446 0.044 880s 0.700 0.007 -0.987 880s -0.216 -0.318 0.136 880s 880s y: [3, 3, 1, 1] 880s 0.166 0.442 -0.324 880s -0.495 0.092 -0.456 880s -0.086 -0.377 0.385 880s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 2, 1, 1] 880s -0.985 -0.395 0.976 880s -0.068 0.461 -0.695 880s 880s x1: [3, 3, 1, 1] 880s -0.816 -0.286 -0.160 880s 0.048 0.967 0.029 880s 0.134 -0.833 -0.372 880s 880s y: [2, 3, 1, 1] 880s 0.760 0.035 880s -0.400 0.422 880s -0.166 -0.135 880s testing: muggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s l_mat, [4, 3, 1, 2] = [2, 4, 1, 2] * [2, 3, 1, 2] 880s x0: [2, 4, 1, 2] 880s 0.848 0.242 880s -0.598 0.104 880s -0.943 -0.471 880s -0.437 -0.729 880s 880s x1: [2, 3, 1, 2] 880s -0.533 0.989 880s 0.089 0.149 880s 0.125 0.058 880s 880s y: [4, 3, 1, 2] 880s -0.212 0.422 0.037 -0.488 880s 0.112 -0.038 -0.154 -0.148 880s 0.120 -0.069 -0.145 -0.097 880s test-mul-mat0: iter:44/500 880s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 2, 1, 1] 880s 0.544 880s -0.869 880s 880s x1: [1, 4, 1, 1] 880s 0.839 880s 0.789 880s -0.077 880s -0.104 880s 880s y: [2, 4, 1, 1] 880s 0.456 -0.729 880s 0.429 -0.686 880s -0.042 0.067 880s -0.056 0.090 880s testing: mul_mat, [1, 2, 2, 1] = [1, 1, 2, 1] * [1, 2, 2, 1] 880s x0: [1, 1, 2, 1] 880s -0.939 880s 880s x1: [1, 2, 2, 1] 880s -0.532 880s -0.873 880s 880s y: [1, 2, 2, 1] 880s 0.500 880s 0.820 880s testing: mul_mat, [4, 1, 2, 4] = [1, 4, 2, 4] * [1, 1, 2, 4] 880s x0: [1, 4, 2, 4] 880s -0.524 880s -0.694 880s -0.976 880s -0.352 880s 880s x1: [1, 1, 2, 4] 880s -0.462 880s 880s y: [4, 1, 2, 4] 880s 0.242 0.320 0.451 0.162 880s testing: mul_mat, [4, 4, 1, 1] = [1, 4, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 4, 1, 1] 880s -0.735 880s 0.042 880s 0.001 880s 0.342 880s 880s x1: [1, 4, 1, 1] 880s 0.015 880s 0.617 880s -0.024 880s 0.130 880s 880s y: [4, 4, 1, 1] 880s -0.011 0.001 0.000 0.005 880s -0.453 0.026 0.001 0.211 880s 0.018 -0.001 -0.000 -0.008 880s -0.095 0.005 0.000 0.044 880s testing: mul_mat, [3, 1, 2, 1] = [4, 3, 2, 1] * [4, 1, 2, 1] 880s x0: [4, 3, 2, 1] 880s 0.243 -0.737 -0.649 -0.797 880s -0.196 0.465 0.700 -0.591 880s -0.907 -0.633 -0.619 0.974 880s 880s x1: [4, 1, 2, 1] 880s 0.876 -0.491 -0.091 -0.745 880s 880s y: [3, 1, 2, 1] 880s 1.227 -0.023 -1.154 880s testing: mul_mat, [2, 4, 2, 4] = [3, 2, 2, 4] * [3, 4, 2, 4] 880s x0: [3, 2, 2, 4] 880s -0.331 0.784 -0.972 880s 0.337 0.565 -0.548 880s 880s x1: [3, 4, 2, 4] 880s -0.977 0.727 0.639 880s -0.033 -0.511 0.084 880s -0.713 0.732 0.888 880s -0.620 0.995 0.352 880s 880s y: [2, 4, 2, 4] 880s 0.273 -0.269 880s -0.472 -0.346 880s -0.053 -0.313 880s 0.643 0.160 880s test-mul-mat0: iter:45/500 880s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.465 -0.568 0.137 880s -0.043 0.188 0.837 880s 880s x1: [3, 3, 1, 1] 880s 0.437 0.248 0.326 880s -0.795 -0.720 -0.687 880s -0.576 0.538 0.704 880s 880s y: [2, 3, 1, 1] 880s 0.107 0.301 880s -0.055 -0.676 880s -0.477 0.715 880s testing: mul_mat, [1, 2, 2, 1] = [3, 1, 2, 1] * [3, 2, 2, 1] 880s x0: [3, 1, 2, 1] 880s 0.840 -0.012 -0.543 880s 880s x1: [3, 2, 2, 1] 880s 0.359 0.773 0.534 880s -0.189 -0.851 0.775 880s 880s y: [1, 2, 2, 1] 880s 0.002 880s -0.569 880s testing: mul_mat, [3, 1, 2, 2] = [3, 3, 2, 2] * [3, 1, 2, 2] 880s x0: [3, 3, 2, 2] 880s 0.725 -0.518 0.497 880s 0.873 -0.743 -0.705 880s 0.772 -0.529 0.524 880s 880s x1: [3, 1, 2, 2] 880s 0.368 -0.372 0.086 880s 880s y: [3, 1, 2, 2] 880s 0.502 0.537 0.526 880s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 4, 1, 1] 880s 0.562 0.337 -0.529 880s 0.626 0.748 0.298 880s -0.001 -0.163 -0.814 880s -0.253 0.380 -0.694 880s 880s x1: [3, 3, 1, 1] 880s -0.733 -0.871 0.993 880s 0.791 0.017 0.773 880s -0.665 0.746 -0.458 880s 880s y: [4, 3, 1, 1] 880s -1.230 -0.815 -0.665 -0.834 880s 0.042 0.738 -0.633 -0.730 880s 0.119 0.005 0.252 0.770 880s testing: mul_mat, [3, 3, 2, 1] = [4, 3, 2, 1] * [4, 3, 2, 1] 880s x0: [4, 3, 2, 1] 880s 0.821 0.119 -0.794 0.767 880s -0.474 -0.288 0.656 0.083 880s -0.519 -0.213 0.875 -0.771 880s 880s x1: [4, 3, 2, 1] 880s 0.087 -0.056 0.089 0.562 880s 0.427 -0.647 0.534 0.783 880s 0.875 0.801 0.912 0.867 880s 880s y: [3, 3, 2, 1] 880s 0.425 0.080 -0.388 880s 0.451 0.399 -0.221 880s 0.754 0.026 -0.495 880s testing: mul_mat, [2, 4, 2, 2] = [3, 2, 2, 2] * [3, 4, 2, 2] 880s x0: [3, 2, 2, 2] 880s 0.282 -0.425 -0.318 880s 0.760 -0.090 -0.053 880s 880s x1: [3, 4, 2, 2] 880s 0.882 -0.378 0.603 880s -0.857 0.607 -0.252 880s -0.848 0.428 0.274 880s -0.367 -0.453 0.986 880s 880s y: [2, 4, 2, 2] 880s 0.218 0.673 880s -0.419 -0.693 880s -0.508 -0.698 880s -0.225 -0.290 880s test-mul-mat0: iter:46/500 880s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 4, 1, 1] 880s -0.302 0.209 0.723 0.344 880s 0.790 0.051 -0.749 0.066 880s 0.733 0.568 -0.536 0.044 880s 0.931 -0.469 -0.030 -0.244 880s 880s x1: [4, 3, 1, 1] 880s -0.416 0.429 0.165 0.866 880s 0.189 0.740 -0.224 0.871 880s -0.313 0.284 -0.833 0.715 880s 880s y: [4, 3, 1, 1] 880s 0.633 -0.373 -0.111 -0.804 880s 0.236 0.413 0.718 -0.377 880s -0.202 0.439 0.410 -0.573 880s testing: mul_mat, [4, 4, 4, 1] = [4, 4, 4, 1] * [4, 4, 4, 1] 880s x0: [4, 4, 4, 1] 880s -0.787 0.908 -0.581 0.837 880s -0.776 -0.828 0.969 0.285 880s 0.191 -0.225 -0.194 -0.521 880s -0.047 0.996 0.758 0.848 880s 880s x1: [4, 4, 4, 1] 880s -0.ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s 981 0.152 0.341 0.448 880s -0.683 0.207 -0.363 -0.943 880s 0.983 -0.493 -0.256 0.267 880s -0.326 -0.540 -0.732 0.372 880s 880s y: [4, 4, 4, 1] 880s 1.087 1.092 -0.521 0.835 880s 0.147 -0.263 0.385 -0.837 880s -0.849 -0.526 0.209 -0.504 880s 0.503 0.097 0.007 -0.762 880s testing: mul_mat, [3, 4, 4, 4] = [4, 3, 4, 4] * [4, 4, 4, 4] 880s x0: [4, 3, 4, 4] 880s -0.406 -0.789 -0.729 -0.089 880s 0.370 -0.192 0.631 0.140 880s 0.397 -0.766 -0.113 0.991 880s 880s x1: [4, 4, 4, 4] 880s -0.835 -0.259 0.245 -0.539 880s 0.657 -0.892 0.947 0.348 880s 0.778 0.913 0.992 0.443 880s 0.636 -0.517 0.754 0.438 880s 880s y: [3, 4, 4, 4] 880s 0.412 -0.181 -0.696 880s -0.284 1.061 1.182 880s -1.798 0.800 -0.064 880s -0.439 0.872 0.998 880s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 3, 1, 1] 880s 0.016 -0.898 0.063 -0.288 880s -0.768 0.826 -0.929 0.015 880s 0.556 -0.893 -0.281 -0.854 880s 880s x1: [4, 3, 1, 1] 880s -0.655 -0.401 0.877 -0.888 880s -0.011 0.539 -0.132 0.845 880s -0.070 0.426 0.762 -0.767 880s 880s y: [3, 3, 1, 1] 880s 0.660 -0.655 0.505 880s -0.735 0.588 -1.171 880s -0.116 -0.313 0.021 880s testing: mul_mat, [2, 4, 4, 1] = [3, 2, 4, 1] * [3, 4, 4, 1] 880s x0: [3, 2, 4, 1] 880s -0.877 -0.698 0.263 880s 0.256 0.864 0.730 880s 880s x1: [3, 4, 4, 1] 880s -0.632 0.556 0.811 880s -0.017 0.693 -0.208 880s 0.327 -0.708 -0.331 880s 0.439 0.281 -0.793 880s 880s y: [2, 4, 4, 1] 880s 0.379 0.911 880s -0.523 0.442 880s 0.121 -0.770 880s -0.789 -0.224 880s testing: mul_mat, [2, 3, 4, 4] = [2, 2, 4, 4] * [2, 3, 4, 4] 880s x0: [2, 2, 4, 4] 880s 0.265 0.579 880s 0.598 -0.320 880s 880s x1: [2, 3, 4, 4] 880s 0.010 0.959 880s 0.431 -0.661 880s 0.899 -0.217 880s 880s y: [2, 3, 4, 4] 880s 0.558 -0.301 880s -0.268 0.469 880s 0.112 0.607 880s test-mul-mat0: iter:47/500 880s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 1, 1, 1] 880s -0.113 -0.815 0.221 0.828 880s 880s x1: [4, 1, 1, 1] 880s 0.925 0.158 0.018 0.911 880s 880s y: [1, 1, 1, 1] 880s 0.524 880s testing: mul_mat, [2, 1, 2, 1] = [4, 2, 2, 1] * [4, 1, 2, 1] 880s x0: [4, 2, 2, 1] 880s -0.826 0.402 -0.393 -0.119 880s -0.834 -0.355 0.208 0.904 880s 880s x1: [4, 1, 2, 1] 880s -0.290 -0.975 0.424 -0.045 880s 880s y: [2, 1, 2, 1] 880s -0.313 0.636 880s testing: mul_mat, [3, 2, 2, 2] = [4, 3, 2, 2] * [4, 2, 2, 2] 880s x0: [4, 3, 2, 2] 880s 0.273 -0.275 -0.248 0.421 880s -0.247 -0.567 -0.007 0.736 880s -0.631 -0.693 0.841 -0.347 880s 880s x1: [4, 2, 2, 2] 880s 0.132 -0.566 -0.679 0.318 880s 0.655 -0.852 -0.972 0.680 880s 880s y: [3, 2, 2, 2] 880s 0.493 0.527 -0.374 880s 0.940 0.829 -0.877 880s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 3, 1, 1] 880s -0.301 -0.117 0.500 -0.513 880s 0.532 0.448 -0.642 0.575 880s -0.522 -0.363 -0.467 -0.730 880s 880s x1: [4, 3, 1, 1] 880s -0.831 -0.119 -0.210 -0.885 880s 0.261 0.543 0.115 -0.068 880s 0.814 -0.611 0.267 0.036 880s 880s y: [3, 3, 1, 1] 880s 0.613 -0.869 1.221 880s -0.050 0.270 -0.337 880s -0.059 0.008 -0.355 880s testing: mul_mat, [3, 4, 2, 1] = [3, 3, 2, 1] * [3, 4, 2, 1] 880s x0: [3, 3, 2, 1] 880s 0.811 -0.702 -0.576 880s 0.789 0.364 -0.824 880s -0.388 -0.118 0.011 880s 880s x1: [3, 4, 2, 1] 880s -0.874 -0.189 -0.872 880s -0.896 -0.613 0.268 880s -0.727 0.268 -0.942 880s -0.612 -0.471 0.601 880s 880s y: [3, 4, 2, 1] 880s -0.075 -0.040 0.353 880s -0.451 -1.151 0.423 880s -0.235 0.300 0.241 880s -0.512 -1.150 0.299 880s testing: mul_mat, [2, 3, 2, 2] = [3, 2, 2, 2] * [3, 3, 2, 2] 880s x0: [3, 2, 2, 2] 880s -0.834 0.066 0.941 880s 0.652 0.155 -0.716 880s 880s x1: [3, 3, 2, 2] 880s -0.307 0.822 0.352 880s -0.415 -0.449 0.803 880s 0.548 0.978 0.786 880s 880s y: [2, 3, 2, 2] 880s 0.642 -0.326 880s 1.073 -0.916 880s 0.348 -0.054 880s test-mul-mat0: iter:48/500 880s testing: mul_mat, [1, 1, 1, 1] = [2, 1, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 1, 1, 1] 880s -0.079 -0.991 880s 880s x1: [2, 1, 1, 1] 880s -0.245 -0.644 880s 880s y: [1, 1, 1, 1] 880s 0.657 880s testing: mul_mat, [3, 1, 3, 1] = [2, 3, 3, 1] * [2, 1, 3, 1] 880s x0: [2, 3, 3, 1] 880s -0.802 -0.567 880s 0.252 -0.341 880s 0.695 0.460 880s 880s x1: [2, 1, 3, 1] 880s -0.984 -0.924 880s 880s y: [3, 1, 3, 1] 880s 1.313 0.067 -1.109 880s testing: mul_mat, [3, 3, 3, 4] = [2, 3, 3, 4] * [2, 3, 3, 4] 880s x0: [2, 3, 3, 4] 880s -0.528 -0.137 880s -0.385 -0.297 880s 0.013 0.471 880s 880s x1: [2, 3, 3, 4] 880s -0.494 -0.931 880s 0.634 0.456 880s -0.010 0.642 880s 880s y: [3, 3, 3, 4] 880s 0.388 0.467 -0.445 880s -0.397 -0.379 0.223 880s -0.083 -0.187 0.302 880s testing: mul_mat, [3, 3, 1, 1] = [2, 3, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 3, 1, 1] 880s 0.481 -0.955 880s -0.386 -0.525 880s -0.524 0.622 880s 880s x1: [2, 3, 1, 1] 880s 0.137 -0.281 880s 0.719 0.133 880s -0.019 0.216 880s 880s y: [3, 3, 1, 1] 880s 0.335 0.095 -0.247 880s 0.219 -0.347 -0.294 880s -0.216 -0.106 0.145 880s testing: mul_mat, [3, 2, 3, 1] = [3, 3, 3, 1] * [3, 2, 3, 1] 880s x0: [3, 3, 3, 1] 880s -0.914 0.066 -0.453 880s -0.346 0.870 -0.515 880s -0.053 -0.208 0.269 880s 880s x1: [3, 2, 3, 1] 880s 0.427 0.998 -0.086 880s -0.217 -0.905 -0.389 880s 880s y: [3, 2, 3, 1] 880s -0.286 0.766 -0.253 880s 0.314 -0.512 0.095 880s testing: mul_mat, [1, 3, 3, 4] = [3, 1, 3, 4] * [3, 3, 3, 4] 880s x0: [3, 1, 3, 4] 880s 0.341 -0.576 0.745 880s 880s x1: [3, 3, 3, 4] 880s 0.448 -0.242 0.594 880s 0.262 -0.157 -0.752 880s -0.791 0.910 -0.881 880s 880s y: [1, 3, 3, 4] 880s 0.735 880s -0.381 880s -1.450 880s test-mul-mat0: iter:49/500 880s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 4, 1, 1] 880s -0.020 -0.660 0.251 -0.026 880s 0.386 0.840 0.554 -0.951 880s -0.626 0.475 0.137 -0.346 880s -0.426 -0.741 0.567 0.887 880s 880s x1: [4, 3, 1, 1] 880s -0.618 -0.523 0.061 0.514 880s -0.242 0.144 -0.239 -0.279 880s 0.511 -0.727 -0.972 0.337 880s 880s y: [4, 3, 1, 1] 880s 0.360 -1.133 -0.031 1.142 880s -0.143 0.160 0.284 -0.387 880s 0.218 -1.272 -0.914 0.068 880s testing: mul_mat, [2, 4, 3, 1] = [4, 2, 3, 1] * [4, 4, 3, 1] 880s x0: [4, 2, 3, 1] 880s 0.291 0.755 -0.626 0.989 880s -0.341 0.168 -0.838 -0.301 880s 880s x1: [4, 4, 3, 1] 880s 0.205 -0.923 -0.731 0.682 880s 0.138 0.783 -0.560 -0.719 880s -0.456 0.161 0.792 -0.183 880s 0.189 0.129 0.329 -0.831 880s 880s y: [2, 4, 3, 1] 880s 0.495 0.182 880s 0.270 0.770 880s -0.688 -0.426 880s -0.875 -0.069 880s testing: mul_mat, [2, 2, 3, 2] = [4, 2, 3, 2] * [4, 2, 3, 2] 880s x0: [4, 2, 3, 2] 880s -0.193 0.556 -0.785 0.057 880s -0.875 -0.385 -0.741 0.824 880s 880s x1: [4, 2, 3, 2] 880s -0.592 -0.916 0.197 0.783 880s -0.768 -0.903 0.738 0.523 880s 880s y: [2, 2, 3, 2] 880s -0.505 1.370 880s -0.903 0.904 880s testing: mul_mat, [3, 2, 1, 1] = [4, 3, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 3, 1, 1] 880s 0.556 -0.122 0.757 -0.240 880s 0.767 -0.061 -0.674 -0.673 880s -0.607 -0.443 0.777 0.256 880s 880s x1: [4, 2, 1, 1] 880s -0.242 0.446 -0.736 -0.376 880s 0.729 -0.041 0.665 -0.900 880s 880s y: [3, 2, 1, 1] 880s -0.656 0.537 -0.719 880s 1.129 0.719 -0.138 880s testing: mul_mat, [1, 4, 3, 1] = [3, 1, 3, 1] * [3, 4, 3, 1] 880s x0: [3, 1, 3, 1] 880s -0.040 -0.690 -0.493 880s 880s x1: [3, 4, 3, 1] 880s 0.588 -0.338 0.659 880s -0.699 0.153 0.416 880s 0.367 0.707 -0.846 880s 0.037 -0.535 0.600 880s 880s y: [1, 4, 3, 1] 880s -0.115 880s -0.282 880s -0.086 880s 0.072 880s testing: mul_mat, [2, 3, 3, 2] = [1, 2, 3, 2] * [1, 3, 3, 2] 880s x0: [1, 2, 3, 2] 880s 0.648 880s 0.498 880s 880s x1: [1, 3, 3, 2] 880s 0.555 880s -0.384 880s -0.135 880s 880s y: [2, 3, 3, 2] 880s 0.360 0.277 880s -0.248 -0.191 880s -0.088 -0.067 880s test-mul-mat0: iter:50/500 880s testing: mul_mat, [4, 2, 1, 1] = [1, 4, 1, 1] * [1, 2, 1, 1] 880s x0: [1, 4, 1, 1] 880s -0.899 880s -0.452 880s 0.631 880s 0.158 880s 880s x1: [1, 2, 1, 1] 880s 0.988 880s -0.649 880s 880s y: [4, 2, 1, 1] 880s -0.888 -0.447 0.624 0.156 880s 0.584 0.294 -0.410 -0.102 880s testing: mul_mat, [4, 4, 2, 1] = [1, 4, 2, 1] * [1, 4, 2, 1] 880s x0: [1, 4, 2, 1] 880s 0.045 880s -0.077 880s -0.771 880s -0.147 880s 880s x1: [1, 4, 2, 1] 880s 0.205 880s -0.561 880s 0.920 880s -0.349 880s 880s y: [4, 4, 2, 1] 880s 0.009 -0.016 -0.158 -0.030 880s -0.025 0.043 0.433 0.082 880s 0.041 -0.071 -0.709 -0.135 880s -0.016 0.027 0.269 0.051 880s testing: mul_mat, [3, 4, 2, 4] = [1, 3, 2, 4] * [1, 4, 2, 4] 880s x0: [1, 3, 2, 4] 880s -0.769 880s 0.692 880s -0.029 880s 880s x1: [1, 4, 2, 4] 880s -0.266 880s 0.882 880s -0.779 880s -0.810 880s 880s y: [3, 4, 2, 4] 880s 0.205 -0.184 0.008 880s -0.678 0.610 -0.025 880s 0.599 -0.539 0.022 880s 0.623 -0.560 0.023 880s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 3, 1, 1] 880s -0.021 880s 0.228 880s -0.554 880s 880s x1: [1, 3, 1, 1] 880s -0.728 880s -0.031 880s 0.726 880s 880s y: [3, 3, 1, 1] 880s 0.015 -0.166 0.404 880s 0.001 -0.007 0.017 880s -0.015 0.165 -0.403 880s testing: mul_mat, [4, 1, 2, 1] = [3, 4, 2, 1] * [3, 1, 2, 1] 880s x0: [3, 4, 2, 1] 880s -0.358 0.379 0.086 880s -0.745 -0.806 -0.070 880s -0.433 0.071 0.933 880s 0.149 -0.386 -0.209 880s 880s x1: [3, 1, 2, 1] 880s 0.210 -0.080 0.417 880s 880s y: [4, 1, 2, 1] 880s -0.070 -0.121 0.292 -0.025 880s testing: mul_mat, [4, 3, 2, 4] = [4, 4, 2, 4] * [4, 3, 2, 4] 880s x0: [4, 4, 2, 4] 880s -0.582 0.638 0.731 -0.137 880s 0.736 -0.733 0.989 -0.025 880s -0.492 -0.738 -0.181 -0.276 880s -0.009 0.842 0.439 0.847 880s 880s x1: [4, 3, 2, 4] 880s 0.187 0.788 -0.096 0.139 880s -0.103 -0.776 -0.307 0.539 880s -0.521 0.260 -0.312 0.858 880s 880s y: [4, 3, 2, 4] 880s 0.305 -0.538 -0.695 0.738 880s -0.734 0.176 0.531 -0.331 880s 0.124 -0.905 -0.116 0.814 880s test-mul-mat0: iter:51/500 880s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s , 1, 1] 880s x0: [1, 3, 1, 1] 880s 0.654 880s 0.527 880s -0.578 880s 880s x1: [1, 3, 1, 1] 880s -0.040 880s -0.557 880s -0.412 880s 880s y: [3, 3, 1, 1] 880s -0.026 -0.021 0.023 880s -0.364 -0.294 0.322 880s -0.269 -0.217 0.238 880s testing: mul_mat, [4, 3, 2, 1] = [1, 4, 2, 1] * [1, 3, 2, 1] 880s x0: [1, 4, 2, 1] 880s 0.825 880s 0.380 880s -0.512 880s -0.302 880s 880s x1: [1, 3, 2, 1] 880s 0.132 880s -0.412 880s 0.731 880s 880s y: [4, 3, 2, 1] 880s 0.109 0.050 -0.068 -0.040 880s -0.340 -0.157 0.211 0.124 880s 0.603 0.278 -0.374 -0.220 880s testing: mul_mat, [3, 4, 2, 1] = [1, 3, 2, 1] * [1, 4, 2, 1] 880s x0: [1, 3, 2, 1] 880s -0.563 880s -0.151 880s 0.512 880s 880s x1: [1, 4, 2, 1] 880s -0.942 880s 0.280 880s 0.164 880s -0.023 880s 880s y: [3, 4, 2, 1] 880s 0.530 0.143 -0.482 880s -0.158 -0.042 0.143 880s -0.092 -0.025 0.084 880s 0.013 0.003 -0.012 880s testing: mul_mat, [1, 3, 1, 1] = [1, 1, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 1, 1, 1] 880s 0.858 880s 880s x1: [1, 3, 1, 1] 880s 0.954 880s 0.635 880s 0.627 880s 880s y: [1, 3, 1, 1] 880s 0.819 880s 0.545 880s 0.538 880s testing: mul_mat, [1, 1, 2, 1] = [1, 1, 2, 1] * [1, 1, 2, 1] 880s x0: [1, 1, 2, 1] 880s 0.554 880s 880s x1: [1, 1, 2, 1] 880s 0.729 880s 880s y: [1, 1, 2, 1] 880s 0.404 880s testing: mul_mat, [3, 1, 2, 1] = [1, 3, 2, 1] * [1, 1, 2, 1] 880s x0: [1, 3, 2, 1] 880s -0.064 880s -0.044 880s -0.150 880s 880s x1: [1, 1, 2, 1] 880s 0.742 880s 880s y: [3, 1, 2, 1] 880s -0.047 -0.033 -0.111 880s test-mul-mat0: iter:52/500 880s testing: mul_mat, [3, 2, 1, 1] = [4, 3, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 3, 1, 1] 880s 0.942 -0.581 0.056 0.801 880s -0.852 0.930 0.054 0.702 880s -0.815 -0.204 0.954 -0.485 880s 880s x1: [4, 2, 1, 1] 880s -0.043 -0.758 0.886 0.805 880s 0.753 0.418 -0.692 -0.208 880s 880s y: [3, 2, 1, 1] 880s 1.094 -0.055 0.644 880s 0.262 -0.437 -1.259 880s testing: mul_mat, [1, 3, 2, 1] = [4, 1, 2, 1] * [4, 3, 2, 1] 880s x0: [4, 1, 2, 1] 880s 0.565 0.752 0.991 -0.639 880s 880s x1: [4, 3, 2, 1] 880s 0.732 -0.090 0.364 -0.274 880s 0.146 0.378 -0.304 0.698 880s -0.304 -0.855 -0.346 -0.062 880s 880s y: [1, 3, 2, 1] 880s 0.882 880s -0.379 880s -1.117 880s testing: mul_mat, [4, 1, 2, 2] = [4, 4, 2, 2] * [4, 1, 2, 2] 880s x0: [4, 4, 2, 2] 880s -0.947 -0.041 0.833 -0.436 880s 0.009 0.639 -0.871 -0.239 880s 0.630 -0.511 -0.532 -0.865 880s -0.418 0.084 -0.995 0.400 880s 880s x1: [4, 1, 2, 2] 880s 0.870 -0.182 0.763 0.248 880s 880s y: [4, 1, 2, 2] 880s -0.288 -0.833 0.020 -1.039 880s testing: mul_mat, [3, 4, 1, 1] = [4, 3, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 3, 1, 1] 880s -0.862 -0.293 -0.726 -0.816 880s -0.331 0.798 -0.423 -0.936 880s 0.168 -0.520 0.195 0.271 880s 880s x1: [4, 4, 1, 1] 880s 0.269 0.263 0.828 -0.898 880s -0.649 0.265 -0.435 0.928 880s 0.169 0.162 -0.914 -0.843 880s 0.764 0.646 0.994 0.619 880s 880s y: [3, 4, 1, 1] 880s -0.178 0.611 -0.173 880s 0.041 -0.258 -0.081 880s 1.158 1.249 -0.462 880s -2.074 -0.737 0.154 880s testing: mul_mat, [3, 4, 2, 1] = [3, 3, 2, 1] * [3, 4, 2, 1] 880s x0: [3, 3, 2, 1] 880s -0.251 -0.067 0.421 880s -0.204 -0.140 -0.924 880s -0.275 0.995 -0.465 880s 880s x1: [3, 4, 2, 1] 880s -0.511 0.217 -0.460 880s 0.752 0.045 -0.358 880s -0.897 -0.690 0.207 880s -0.970 0.479 -0.631 880s 880s y: [3, 4, 2, 1] 880s -0.080 0.499 0.571 880s -0.343 0.171 0.004 880s 0.359 0.089 -0.536 880s -0.055 0.714 1.037 880s testing: mul_mat, [3, 3, 2, 2] = [3, 3, 2, 2] * [3, 3, 2, 2] 880s x0: [3, 3, 2, 2] 880s -0.049 0.067 -0.288 880s 0.137 0.556 -0.276 880s 0.791 0.067 0.140 880s 880s x1: [3, 3, 2, 2] 880s 0.423 -0.454 -0.115 880s 0.185 -0.824 -0.363 880s 0.066 0.944 0.058 880s 880s y: [3, 3, 2, 2] 880s -0.018 -0.163 0.288 880s 0.041 -0.333 0.040 880s 0.043 0.518 0.124 880s test-mul-mat0: iter:53/500 880s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 3, 1, 1] 880s -0.956 880s -0.929 880s -0.520 880s 880s x1: [1, 3, 1, 1] 880s -0.311 880s 0.697 880s -0.009 880s 880s y: [3, 3, 1, 1] 880s 0.297 0.289 0.162 880s -0.666 -0.648 -0.362 880s 0.009 0.009 0.005 880s testing: mul_mat, [4, 3, 3, 1] = [1, 4, 3, 1] * [1, 3, 3, 1] 880s x0: [1, 4, 3, 1] 880s 0.556 880s -0.938 880s -0.736 880s 0.011 880s 880s x1: [1, 3, 3, 1] 880s 0.087 880s 0.281 880s -0.098 880s 880s y: [4, 3, 3, 1] 880s 0.048 -0.082 -0.064 0.001 880s 0.156 -0.264 -0.207 0.003 880s -0.055 0.092 0.072 -0.001 880s testing: mul_mat, [4, 4, 3, 1] = [1, 4, 3, 1] * [1, 4, 3, 1] 880s x0: [1, 4, 3, 1] 880s -0.211 880s -0.415 880s 0.946 880s -0.915 880s 880s x1: [1, 4, 3, 1] 880s -0.378 880s -0.606 880s 0.285 880s -0.681 880s 880s y: [4, 4, 3, 1] 880s 0.080 0.157 -0.357 0.346 880s 0.128 0.252 -0.573 0.554 880s -0.060 -0.118 0.270 -0.261 880s 0.143 0.283 -0.644 0.623 880s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 2, 1, 1] 880s 0.273 880s -0.373 880s 880s x1: [1, 4, 1, 1] 880s -0.746 880s 0.201 880s -0.323 880s -0.995 880s 880s y: [2, 4, 1, 1] 880s -0.204 0.278 880s 0.055 -0.075 880s -0.ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s 088 0.121 880s -0.272 0.372 880s testing: mul_mat, [3, 1, 3, 1] = [2, 3, 3, 1] * [2, 1, 3, 1] 880s x0: [2, 3, 3, 1] 880s 0.879 -0.319 880s -0.240 -0.687 880s -0.353 0.437 880s 880s x1: [2, 1, 3, 1] 880s 0.070 -0.442 880s 880s y: [3, 1, 3, 1] 880s 0.203 0.287 -0.218 880s testing: mul_mat, [2, 2, 3, 1] = [3, 2, 3, 1] * [3, 2, 3, 1] 880s x0: [3, 2, 3, 1] 880s 0.187 -0.332 0.879 880s -0.712 -0.089 0.832 880s 880s x1: [3, 2, 3, 1] 880s -0.049 0.552 -0.925 880s 0.627 -0.175 -0.299 880s 880s y: [2, 2, 3, 1] 880s -1.006 -0.784 880s -0.088 -0.679 880s test-mul-mat0: iter:54/500 880s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 1, 1, 1] 880s 0.412 0.133 0.917 -0.677 880s 880s x1: [4, 2, 1, 1] 880s 0.064 0.098 -0.606 -0.695 880s -0.185 -0.897 -0.774 -0.155 880s 880s y: [1, 2, 1, 1] 880s -0.046 880s -0.801 880s testing: mul_mat, [1, 1, 2, 1] = [4, 1, 2, 1] * [4, 1, 2, 1] 880s x0: [4, 1, 2, 1] 880s 0.514 -0.094 0.726 0.833 880s 880s x1: [4, 1, 2, 1] 880s 0.013 0.749 0.479 -0.921 880s 880s y: [1, 1, 2, 1] 880s -0.484 880s testing: mul_mat, [4, 1, 2, 2] = [4, 4, 2, 2] * [4, 1, 2, 2] 880s x0: [4, 4, 2, 2] 880s -0.477 -0.952 -0.269 -0.457 880s 0.162 -0.027 -0.943 -0.932 880s -0.300 0.890 0.467 -0.394 880s 0.427 0.007 0.845 -0.973 880s 880s x1: [4, 1, 2, 2] 880s 0.239 -0.400 -0.362 0.633 880s 880s y: [4, 1, 2, 2] 880s 0.075 -0.199 -0.846 -0.823 880s testing: mul_mat, [3, 4, 1, 1] = [4, 3, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 3, 1, 1] 880s 0.633 0.282 0.760 0.511 880s 0.949 0.514 -0.819 -0.397 880s 0.435 -0.223 0.593 -0.773 880s 880s x1: [4, 4, 1, 1] 880s 0.183 0.887 0.189 -0.294 880s -0.716 -0.778 -0.071 -0.429 880s -0.278 0.757 -0.715 -0.771 880s -0.569 -0.974 0.382 -0.417 880s 880s y: [3, 4, 1, 1] 880s 0.360 0.592 0.222 880s -0.946 -0.851 0.151 880s -0.900 1.017 -0.118 880s -0.558 -1.187 0.518 880s testing: mul_mat, [2, 4, 2, 1] = [3, 2, 2, 1] * [3, 4, 2, 1] 880s x0: [3, 2, 2, 1] 880s 0.247 -0.385 -0.935 880s 0.462 -0.242 -0.158 880s 880s x1: [3, 4, 2, 1] 880s 0.059 -0.173 0.411 880s -0.054 -0.983 -0.883 880s 0.230 -0.761 0.046 880s 0.801 -0.039 -0.197 880s 880s y: [2, 4, 2, 1] 880s -0.303 0.004 880s 1.191 0.352 880s 0.307 0.283 880s 0.397 0.410 880s testing: mul_mat, [2, 3, 2, 2] = [2, 2, 2, 2] * [2, 3, 2, 2] 880s x0: [2, 2, 2, 2] 880s 0.382 0.776 880s -0.263 -0.856 880s 880s x1: [2, 3, 2, 2] 880s 0.991 0.148 880s 0.415 0.792 880s -0.892 -0.782 880s 880s y: [2, 3, 2, 2] 880s 0.493 -0.387 880s 0.773 -0.788 880s -0.947 0.904 880s test-mul-mat0: iter:55/500 880s testing: mul_mat, [1, 1, 1, 1] = [2, 1, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 1, 1, 1] 880s 0.859 0.701 880s 880s x1: [2, 1, 1, 1] 880s 0.541 0.636 880s 880s y: [1, 1, 1, 1] 880s 0.910 880s testing: mul_mat, [3, 1, 3, 1] = [2, 3, 3, 1] * [2, 1, 3, 1] 880s x0: [2, 3, 3, 1] 880s -0.780 0.681 880s 0.602 0.866 880s 0.827 0.737 880s 880s x1: [2, 1, 3, 1] 880s -0.489 0.639 880s 880s y: [3, 1, 3, 1] 880s 0.816 0.259 0.067 880s testing: mul_mat, [2, 3, 3, 3] = [2, 2, 3, 3] * [2, 3, 3, 3] 880s x0: [2, 2, 3, 3] 880s -0.548 0.526 880s 0.044 0.908 880s 880s x1: [2, 3, 3, 3] 880s 0.972 -0.554 880s 0.490 0.676 880s -0.695 0.191 880s 880s y: [2, 3, 3, 3] 880s -0.824 -0.461 880s 0.087 0.636 880s 0.481 0.143 880s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 4, 1, 1] 880s -0.994 0.891 880s 0.564 -0.710 880s -0.284 0.980 880s -0.719 0.186 880s 880s x1: [2, 2, 1, 1] 880s 0.592 -0.314 880s -0.110 -0.221 880s 880s y: [4, 2, 1, 1] 880s -0.869 0.557 -0.476 -0.484 880s -0.088 0.095 -0.186 0.038 880s testing: mul_mat, [3, 2, 3, 1] = [4, 3, 3, 1] * [4, 2, 3, 1] 880s x0: [4, 3, 3, 1] 880s 0.226 0.515 -0.580 -0.861 880s 0.658 0.638 -0.537 -0.397 880s 0.456 -0.357 0.739 -0.127 880s 880s x1: [4, 2, 3, 1] 880s -0.095 -0.175 0.097 0.719 880s -0.860 0.134 0.717 -0.012 880s 880s y: [3, 2, 3, 1] 880s -0.787 -0.512 -0.001 880s -0.531 -0.861 0.092 880s testing: mul_mat, [3, 4, 3, 3] = [3, 3, 3, 3] * [3, 4, 3, 3] 880s x0: [3, 3, 3, 3] 880s -0.731 -0.772 -0.397 880s 0.887 -0.402 -0.994 880s -0.951 0.291 -0.651 880s 880s x1: [3, 4, 3, 3] 880s 0.577 0.934 -0.093 880s 0.316 -0.927 0.511 880s -0.811 0.930 0.102 880s -0.467 0.059 -0.211 880s 880s y: [3, 4, 3, 3] 880s -1.106 0.228 -0.216 880s 0.281 0.146 -0.903 880s -0.165 -1.195 0.975 880s 0.380 -0.228 0.599 880s test-mul-mat0: iter:56/500 880s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 2, 1, 1] 880s -0.384 -0.647 0.775 0.177 880s -0.869 0.252 -0.860 0.407 880s 880s x1: [4, 4, 1, 1] 880s 0.441 -0.627 -0.060 -0.468 880s -0.624 -0.293 0.367 0.987 880s 0.891 0.833 0.255 -0.283 880s -0.473 0.470 0.790 0.456 880s 880s y: [2, 4, 1, 1] 880s 0.106 -0.680 880s 0.889 0.554 880s -0.734 -0.898 880s 0.571 0.035 880s testing: mul_mat, [1, 2, 4, 1] = [4, 1, 4, 1] * [4, 2, 4, 1] 880s x0: [4, 1, 4, 1] 880s 0.180 0.875 0ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s .745 0.661 880s 880s x1: [4, 2, 4, 1] 880s -0.868 -0.012 -0.838 0.972 880s -0.702 -0.520 0.414 -0.329 880s 880s y: [1, 2, 4, 1] 880s -0.149 880s -0.491 880s testing: mul_mat, [1, 1, 4, 4] = [4, 1, 4, 4] * [4, 1, 4, 4] 880s x0: [4, 1, 4, 4] 880s 0.414 0.038 0.993 0.374 880s 880s x1: [4, 1, 4, 4] 880s 0.366 -0.451 -0.511 -0.506 880s 880s y: [1, 1, 4, 4] 880s -0.563 880s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 4, 1, 1] 880s -0.067 0.535 -0.239 0.676 880s 0.471 -0.267 0.394 0.757 880s 0.098 0.782 -0.940 0.377 880s 0.460 -0.894 0.093 -0.561 880s 880s x1: [4, 1, 1, 1] 880s 0.028 -0.081 0.355 0.055 880s 880s y: [4, 1, 1, 1] 880s -0.093 0.216 -0.374 0.088 880s testing: mul_mat, [4, 4, 4, 1] = [4, 4, 4, 1] * [4, 4, 4, 1] 880s x0: [4, 4, 4, 1] 880s 0.090 -0.103 -0.061 0.182 880s -0.025 -0.465 0.277 -0.626 880s 0.980 0.986 -0.667 0.807 880s -0.625 -0.002 0.663 0.161 880s 880s x1: [4, 4, 4, 1] 880s 0.534 -0.778 0.618 -0.498 880s 0.164 -0.356 0.086 0.152 880s -0.071 -0.674 -0.820 0.848 880s 0.681 0.234 -0.679 -0.386 880s 880s y: [4, 4, 4, 1] 880s 0.000 0.831 -1.058 -0.002 880s 0.074 0.090 -0.125 -0.020 880s 0.268 -0.443 0.498 -0.362 880s 0.008 -0.072 1.039 -0.939 880s testing: mul_mat, [3, 4, 4, 4] = [4, 3, 4, 4] * [4, 4, 4, 4] 880s x0: [4, 3, 4, 4] 880s 0.052 0.331 0.096 -0.870 880s -0.512 -0.876 -0.506 -0.331 880s 0.130 -0.513 -0.122 -0.524 880s 880s x1: [4, 4, 4, 4] 880s 0.684 -0.568 -0.327 -0.990 880s -0.716 -0.119 0.274 0.412 880s 0.421 -0.797 0.723 -0.664 880s -0.062 0.194 0.906 0.776 880s 880s y: [3, 4, 4, 4] 880s 0.677 0.640 0.939 880s -0.409 0.196 -0.282 880s 0.405 0.337 0.724 880s -0.527 -0.854 -0.625 880s test-mul-mat0: iter:57/500 880s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 1, 1, 1] 880s 0.607 -0.385 880s 880s x1: [2, 3, 1, 1] 880s -0.162 0.510 880s 0.988 0.570 880s -0.602 0.217 880s 880s y: [1, 3, 1, 1] 880s -0.295 880s 0.380 880s -0.449 880s testing: mul_mat, [1, 1, 2, 1] = [2, 1, 2, 1] * [2, 1, 2, 1] 880s x0: [2, 1, 2, 1] 880s 0.978 0.485 880s 880s x1: [2, 1, 2, 1] 880s -0.839 0.035 880s 880s y: [1, 1, 2, 1] 880s -0.803 880s testing: mul_mat, [2, 1, 2, 4] = [2, 2, 2, 4] * [2, 1, 2, 4] 880s x0: [2, 2, 2, 4] 880s 0.142 -0.761 880s 0.601 -0.251 880s 880s x1: [2, 1, 2, 4] 880s 0.213 -0.215 880s 880s y: [2, 1, 2, 4] 880s 0.194 0.182 880s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 4, 1, 1] 880s 0.836 -0.533 880s 0.401 -0.222 880s -0.803 0.798 880s -0.040 -0.327 880s 880s x1: [2, 2, 1, 1] 880s 0.458 0.340 880s -0.948 -0.687 880s 880s y: [4, 2, 1, 1] 880s 0.202 0.108 -0.097 -0.129 880s -0.426 -0.227 0.212 0.263 880s testing: mul_mat, [4, 2, 2, 1] = [4, 4, 2, 1] * [4, 2, 2, 1] 880s x0: [4, 4, 2, 1] 880s -0.231 -0.239 0.551 0.415 880s -0.552 0.353 0.617 0.302 880s -0.748 -0.342 -0.371 -0.101 880s -0.179 -0.838 -0.672 -0.164 880s 880s x1: [4, 2, 2, 1] 880s -0.429 0.421 -0.343 -0.744 880s -0.889 0.706 -0.588 0.976 880s 880s y: [4, 2, 2, 1] 880s -0.499 -0.051 0.379 0.076 880s 0.118 0.672 0.543 -0.198 880s testing: mul_mat, [4, 4, 2, 4] = [4, 4, 2, 4] * [4, 4, 2, 4] 880s x0: [4, 4, 2, 4] 880s -0.401 -0.883 -0.558 0.762 880s 0.909 -0.011 -0.175 -0.370 880s -0.266 0.944 0.924 -0.809 880s -0.666 0.076 -0.637 0.859 880s 880s x1: [4, 4, 2, 4] 880s -0.648 0.311 0.916 0.113 880s -0.335 -0.426 0.275 -0.785 880s -0.809 0.904 -0.456 0.607 880s 0.206 0.443 -0.557 0.166 880s 880s y: [4, 4, 2, 4] 880s -0.440 -0.795 1.222 -0.031 880s -0.241 -0.059 0.576 -0.658 880s 0.244 -0.889 0.156 1.419 880s -0.037 0.219 -0.285 0.394 880s test-mul-mat0: iter:58/500 880s testing: mul_mat, [4, 3, 1, 1] = [2, 4, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 4, 1, 1] 880s -0.094 -0.879 880s -0.285 -0.037 880s 0.643 -0.185 880s -0.780 -0.818 880s 880s x1: [2, 3, 1, 1] 880s 0.769 -0.829 880s 0.736 0.785 880s -0.638 0.017 880s 880s y: [4, 3, 1, 1] 880s 0.657 -0.188 0.647 0.079 880s -0.759 -0.239 0.328 -1.215 880s 0.045 0.181 -0.413 0.484 880s testing: mul_mat, [3, 4, 1, 1] = [2, 3, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 3, 1, 1] 880s 0.116 0.202 880s 0.708 -0.942 880s -0.024 -0.611 880s 880s x1: [2, 4, 1, 1] 880s -0.800 0.624 880s -0.115 -0.968 880s 0.939 0.228 880s -0.423 -0.248 880s 880s y: [3, 4, 1, 1] 880s 0.034 -1.155 -0.362 880s -0.209 0.830 0.595 880s 0.154 0.451 -0.162 880s -0.099 -0.066 0.162 880s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.893 0.827 880s -0.104 -0.392 880s 880s x1: [2, 3, 1, 1] 880s -0.977 -0.256 880s -0.441 0.758 880s -0.471 -0.079 880s 880s y: [2, 3, 1, 1] 880s -1.084 0.202 880s 0.234 -0.252 880s -0.485 0.080 880s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.623 0.607 880s -0.365 0.655 880s 880s x1: [2, 2, 1, 1] 880s -0.210 -0.461 880s 0.423 0.011 880s 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s y: [2, 2, 1, 1] 880s -0.410 -0.226 880s 0.270 -0.147 880s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.348 -0.844 880s -0.472 0.406 880s 880s x1: [2, 2, 1, 1] 880s -0.425 -0.166 880s -0.767 0.326 880s 880s y: [2, 2, 1, 1] 880s -0.008 0.133 880s -0.543 0.494 880s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 3, 1, 1] 880s 0.187 0.962 880s -0.222 -0.329 880s -0.975 0.852 880s 880s x1: [2, 2, 1, 1] 880s 0.504 -0.456 880s 0.429 -0.751 880s 880s y: [3, 2, 1, 1] 880s -0.344 0.038 -0.880 880s -0.643 0.152 -1.059 880s test-mul-mat0: iter:59/500 880s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.810 0.007 0.561 880s -0.282 0.163 -0.033 880s -0.777 0.707 -0.604 880s -0.529 -0.189 0.583 880s 880s x1: [3, 3, 1, 1] 880s -0.299 0.654 -0.881 880s -0.675 -0.711 0.726 880s 0.980 -0.136 -0.440 880s 880s y: [4, 3, 1, 1] 880s -0.248 0.220 1.227 -0.479 880s 0.949 0.050 -0.417 0.914 880s -1.042 -0.284 -0.593 -0.749 880s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.274 0.645 -0.795 880s -0.863 -0.795 -0.583 880s -0.673 0.212 0.978 880s 880s x1: [3, 4, 1, 1] 880s 0.249 -0.164 0.545 880s 0.920 -0.312 -0.597 880s 0.198 0.331 0.801 880s 0.900 -0.015 0.919 880s 880s y: [3, 4, 1, 1] 880s -0.471 -0.402 0.331 880s 0.525 -0.198 -1.269 880s -0.369 -0.901 0.720 880s -0.495 -1.300 0.291 880s testing: mul_mat, [2, 3, 1, 2] = [3, 2, 1, 2] * [3, 3, 1, 2] 880s x0: [3, 2, 1, 2] 880s -0.266 -0.409 -0.770 880s -0.993 -0.764 -0.565 880s 880s x1: [3, 3, 1, 2] 880s 0.046 -0.625 -0.055 880s 0.268 -0.917 0.341 880s 0.739 -0.106 -0.077 880s 880s y: [2, 3, 1, 2] 880s 0.286 0.463 880s 0.041 0.242 880s -0.094 -0.609 880s testing: mul_mat, [3, 2, 1, 1] = [3, 3, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.005 -0.266 0.152 880s -0.961 -0.493 0.379 880s 0.514 -0.578 -0.471 880s 880s x1: [3, 2, 1, 1] 880s 0.518 -0.971 0.776 880s -0.214 -0.889 0.116 880s 880s y: [3, 2, 1, 1] 880s 0.379 0.275 0.462 880s 0.253 0.687 0.349 880s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 4, 1, 1] 880s 0.319 -0.117 -0.437 880s -0.947 0.777 0.112 880s -0.741 0.001 0.775 880s -0.028 -0.341 0.089 880s 880s x1: [3, 3, 1, 1] 880s -0.099 -0.950 -0.465 880s 0.635 -0.359 -0.235 880s 0.642 -0.122 0.200 880s 880s y: [4, 3, 1, 1] 880s 0.283 -0.696 -0.288 0.286 880s 0.347 -0.906 -0.653 0.084 880s 0.132 -0.681 -0.321 0.042 880s testing: mul_mat, [2, 3, 1, 2] = [4, 2, 1, 2] * [4, 3, 1, 2] 880s x0: [4, 2, 1, 2] 880s -0.437 -0.433 0.423 0.792 880s 0.648 0.675 0.656 0.238 880s 880s x1: [4, 3, 1, 2] 880s -0.883 0.814 -0.397 -0.148 880s -0.678 0.025 -0.996 0.700 880s 0.554 -0.095 0.751 -0.912 880s 880s y: [2, 3, 1, 2] 880s -0.252 -0.318 880s 0.419 -0.909 880s -0.606 0.570 880s test-mul-mat0: iter:60/500 880s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.759 0.262 0.938 880s -0.532 -0.177 -0.073 880s -0.528 0.386 -0.424 880s 0.039 0.061 0.999 880s 880s x1: [3, 1, 1, 1] 880s 0.778 -0.727 -0.327 880s 880s y: [4, 1, 1, 1] 880s -1.088 -0.262 -0.552 -0.341 880s testing: mul_mat, [1, 4, 2, 1] = [3, 1, 2, 1] * [3, 4, 2, 1] 880s x0: [3, 1, 2, 1] 880s 0.242 -0.387 0.049 880s 880s x1: [3, 4, 2, 1] 880s -0.306 -0.147 0.237 880s -0.047 0.497 -0.204 880s -0.764 -0.856 -0.944 880s 0.073 0.696 0.067 880s 880s y: [1, 4, 2, 1] 880s -0.005 880s -0.214 880s 0.101 880s -0.249 880s testing: mul_mat, [2, 1, 2, 1] = [3, 2, 2, 1] * [3, 1, 2, 1] 880s x0: [3, 2, 2, 1] 880s -0.833 0.136 0.237 880s 0.863 -0.797 0.235 880s 880s x1: [3, 1, 2, 1] 880s -0.025 -0.474 -0.716 880s 880s y: [2, 1, 2, 1] 880s -0.214 0.188 880s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 2, 1, 1] 880s -0.940 -0.693 -0.267 880s 0.686 0.734 0.282 880s 880s x1: [3, 2, 1, 1] 880s -0.986 -0.780 -0.836 880s -0.216 0.447 -0.363 880s 880s y: [2, 2, 1, 1] 880s 1.691 -1.484 880s -0.010 0.078 880s testing: mul_mat, [3, 3, 2, 1] = [2, 3, 2, 1] * [2, 3, 2, 1] 880s x0: [2, 3, 2, 1] 880s -0.892 0.924 880s -0.144 0.475 880s 0.677 0.691 880s 880s x1: [2, 3, 2, 1] 880s -0.740 0.017 880s 0.210 -0.717 880s -0.903 0.374 880s 880s y: [3, 3, 2, 1] 880s 0.675 0.114 -0.489 880s -0.850 -0.370 -0.353 880s 1.151 0.307 -0.353 880s testing: mul_mat, [2, 2, 2, 1] = [3, 2, 2, 1] * [3, 2, 2, 1] 880s x0: [3, 2, 2, 1] 880s 0.040 -0.309 0.130 880s 0.284 -0.524 -0.344 880s 880s x1: [3, 2, 2, 1] 880s 0.651 0.961 -0.193 880s 0.385 -0.306 -0.911 880s 880s y: [2, 2, 2, 1] 880s -0.296 -0.253 880s -0.008 0.583 880s test-mul-mat0: iter:61/500 880s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 4, 1, 1] 880s 0.741 0.194 880s -0.149 0.548 880s 0.867 0.229 880s -0.412 0.151 880s 880s x1: [2, 4, 1, 1] 880s 0.375 -0.001 880s 0.894 0.336 880s 0.806 0.279 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s -0.970 0.895 880s 880s y: [4, 4, 1, 1] 880s 0.277 -0.056 0.325 -0.154 880s 0.727 0.051 0.852 -0.318 880s 0.651 0.033 0.763 -0.290 880s -0.545 0.635 -0.636 0.535 880s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 2, 1, 1] 880s -0.066 -0.192 880s -0.174 0.433 880s 880s x1: [2, 4, 1, 1] 880s 0.921 0.064 880s -0.720 -0.423 880s 0.648 -0.734 880s -0.090 -0.844 880s 880s y: [2, 4, 1, 1] 880s -0.073 -0.133 880s 0.128 -0.058 880s 0.098 -0.431 880s 0.168 -0.350 880s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 2, 1, 1] 880s -0.761 -0.776 880s 0.566 -0.866 880s 880s x1: [2, 2, 1, 1] 880s 0.051 -0.800 880s -0.567 -0.055 880s 880s y: [2, 2, 1, 1] 880s 0.582 0.721 880s 0.474 -0.274 880s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 3, 1, 1] 880s 0.195 -0.654 880s 0.228 0.149 880s 0.443 -0.493 880s 880s x1: [2, 2, 1, 1] 880s 0.147 0.307 880s 0.328 0.999 880s 880s y: [3, 2, 1, 1] 880s -0.172 0.079 -0.086 880s -0.589 0.223 -0.347 880s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.570 0.445 -0.829 880s 0.120 0.004 0.645 880s 880s x1: [3, 2, 1, 1] 880s -0.373 0.726 -0.845 880s -0.107 -0.364 -0.689 880s 880s y: [2, 2, 1, 1] 880s 0.811 -0.587 880s 0.348 -0.459 880s testing: mul_mat, [4, 3, 1, 1] = [2, 4, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 4, 1, 1] 880s -0.513 -0.091 880s -0.946 -0.666 880s 0.139 0.714 880s -0.514 -0.648 880s 880s x1: [2, 3, 1, 1] 880s 0.437 0.116 880s 0.181 0.676 880s 0.340 -0.252 880s 880s y: [4, 3, 1, 1] 880s -0.234 -0.490 0.143 -0.299 880s -0.154 -0.622 0.508 -0.531 880s -0.151 -0.153 -0.133 -0.011 880s test-mul-mat0: iter:62/500 880s testing: mul_mat, [1, 4, 1, 1] = [4, 1, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 1, 1, 1] 880s 0.839 -0.280 0.558 0.893 880s 880s x1: [4, 4, 1, 1] 880s -0.412 0.014 -0.800 0.224 880s 0.325 -0.181 -0.206 -0.555 880s -0.736 0.798 -0.384 0.909 880s 0.234 0.731 0.091 -0.090 880s 880s y: [1, 4, 1, 1] 880s -0.596 880s -0.287 880s -0.243 880s -0.038 880s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 2, 1, 1] 880s 0.155 -0.126 -0.380 -0.986 880s -0.819 0.208 0.028 -0.619 880s 880s x1: [4, 1, 1, 1] 880s 0.859 -0.956 -0.198 -0.807 880s 880s y: [2, 1, 1, 1] 880s 1.124 -0.409 880s testing: mul_mat, [2, 2, 1, 2] = [4, 2, 1, 2] * [4, 2, 1, 2] 880s x0: [4, 2, 1, 2] 880s -0.804 0.748 0.818 -0.002 880s 0.941 0.576 -0.847 -0.185 880s 880s x1: [4, 2, 1, 2] 880s -0.568 -0.647 0.200 0.226 880s -0.202 0.464 0.024 0.414 880s 880s y: [2, 2, 1, 2] 880s 0.136 -1.118 880s 0.528 -0.019 880s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 1, 1, 1] 880s 0.877 0.513 0.122 -0.820 880s 880s x1: [4, 2, 1, 1] 880s -0.424 0.063 -0.654 -0.960 880s -0.913 0.760 0.413 -0.655 880s 880s y: [1, 2, 1, 1] 880s 0.368 880s 0.177 880s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 2, 1, 1] 880s 0.374 880s 0.090 880s 880s x1: [1, 4, 1, 1] 880s 0.402 880s -0.103 880s -0.624 880s 0.149 880s 880s y: [2, 4, 1, 1] 880s 0.150 0.036 880s -0.039 -0.009 880s -0.233 -0.056 880s 0.056 0.013 880s testing: mul_mat, [4, 1, 1, 2] = [2, 4, 1, 2] * [2, 1, 1, 2] 880s x0: [2, 4, 1, 2] 880s -0.100 0.730 880s 0.893 -0.564 880s -0.112 -0.148 880s -0.724 -0.206 880s 880s x1: [2, 1, 1, 2] 880s -0.708 0.527 880s 880s y: [4, 1, 1, 2] 880s 0.456 -0.930 0.001 0.404 880s test-mul-mat0: iter:63/500 880s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.637 -0.716 -0.038 880s 0.331 0.184 -0.145 880s 880s x1: [3, 2, 1, 1] 880s 0.467 -0.263 0.290 880s -0.818 -0.889 -0.620 880s 880s y: [2, 2, 1, 1] 880s 0.475 0.064 880s 0.139 -0.344 880s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 4, 1, 1] 880s 0.203 -0.740 0.978 880s 0.507 0.486 0.426 880s 0.688 -0.775 0.996 880s -0.951 -0.634 0.463 880s 880s x1: [3, 2, 1, 1] 880s -0.782 0.460 -0.414 880s -0.346 -0.688 0.379 880s 880s y: [4, 2, 1, 1] 880s -0.904 -0.349 -1.307 0.260 880s 0.810 -0.348 0.673 0.942 880s testing: mul_mat, [2, 4, 1, 3] = [3, 2, 1, 3] * [3, 4, 1, 3] 880s x0: [3, 2, 1, 3] 880s 0.937 -0.752 0.881 880s 0.801 0.751 0.818 880s 880s x1: [3, 4, 1, 3] 880s -0.214 0.655 0.645 880s -0.104 -0.965 -0.881 880s -0.466 -0.681 0.081 880s 0.865 0.503 0.937 880s 880s y: [2, 4, 1, 3] 880s -0.125 0.848 880s -0.147 -1.528 880s 0.147 -0.819 880s 1.256 1.837 880s testing: mul_mat, [3, 2, 1, 1] = [3, 3, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 3, 1, 1] 880s -0.649 0.753 0.690 880s -0.943 0.985 -0.767 880s 0.808 -0.392 -0.512 880s 880s x1: [3, 2, 1, 1] 880s -0.480 -0.876 0.925 880s -0.908 0.844 -0.837 880s 880s y: [3, 2, 1, 1] 880s 0.289 -1.120 -0.517 880s 0.648 2.330 -0.637 880s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.158 0.609 0.733 880s 0.292 0.344 0.269 880s 0.734 -0.746 ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s -0.654 880s 880s x1: [3, 3, 1, 1] 880s 0.491 0.985 -0.694 880s 0.376 -0.300 -0.354 880s 0.997 -0.310 -0.112 880s 880s y: [3, 3, 1, 1] 880s 0.169 0.296 0.080 880s -0.383 -0.089 0.731 880s -0.114 0.154 1.036 880s testing: mul_mat, [1, 3, 1, 3] = [3, 1, 1, 3] * [3, 3, 1, 3] 880s x0: [3, 1, 1, 3] 880s 0.582 -0.877 0.359 880s 880s x1: [3, 3, 1, 3] 880s 0.577 0.432 0.474 880s 0.928 0.488 0.282 880s 0.682 0.474 0.889 880s 880s y: [1, 3, 1, 3] 880s 0.127 880s 0.212 880s 0.300 880s test-mul-mat0: iter:64/500 880s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 3, 1, 1] 880s 0.243 880s -0.694 880s -0.606 880s 880s x1: [1, 4, 1, 1] 880s -0.777 880s -0.619 880s 0.004 880s 0.915 880s 880s y: [3, 4, 1, 1] 880s -0.189 0.539 0.471 880s -0.150 0.429 0.375 880s 0.001 -0.003 -0.002 880s 0.222 -0.634 -0.554 880s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 3, 1, 1] 880s 0.627 880s -0.878 880s 0.262 880s 880s x1: [1, 3, 1, 1] 880s 0.478 880s 0.388 880s -0.388 880s 880s y: [3, 3, 1, 1] 880s 0.300 -0.420 0.125 880s 0.243 -0.341 0.102 880s -0.243 0.341 -0.102 880s testing: mul_mat, [1, 3, 1, 3] = [1, 1, 1, 3] * [1, 3, 1, 3] 880s x0: [1, 1, 1, 3] 880s 0.695 880s 880s x1: [1, 3, 1, 3] 880s 0.476 880s -0.024 880s -0.725 880s 880s y: [1, 3, 1, 3] 880s 0.331 880s -0.017 880s -0.504 880s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 1, 1, 1] 880s -0.787 880s 880s x1: [1, 1, 1, 1] 880s -0.789 880s 880s y: [1, 1, 1, 1] 880s 0.621 880s testing: mul_mat, [3, 1, 1, 1] = [1, 3, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 3, 1, 1] 880s 0.217 880s -0.652 880s -0.970 880s 880s x1: [1, 1, 1, 1] 880s -0.566 880s 880s y: [3, 1, 1, 1] 880s -0.123 0.369 0.549 880s testing: mul_mat, [2, 1, 1, 3] = [3, 2, 1, 3] * [3, 1, 1, 3] 880s x0: [3, 2, 1, 3] 880s 0.146 0.978 0.232 880s -0.866 -0.797 0.612 880s 880s x1: [3, 1, 1, 3] 880s -0.541 -0.345 -0.576 880s 880s y: [2, 1, 1, 3] 880s -0.550 0.391 880s test-mul-mat0: iter:65/500 880s testing: mul_mat, [4, 4, 1, 1] = [4, 4, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 4, 1, 1] 880s 0.435 -0.569 0.691 -0.904 880s -0.774 0.481 0.342 -0.766 880s 0.705 0.553 0.459 -0.722 880s 0.795 0.143 0.505 -0.539 880s 880s x1: [4, 4, 1, 1] 880s 0.227 -0.334 0.622 0.270 880s 0.702 -0.938 -0.060 -0.140 880s 0.765 -0.914 -0.006 0.743 880s -0.711 -0.774 0.355 -0.291 880s 880s y: [4, 4, 1, 1] 880s 0.474 -0.331 0.066 0.301 880s 0.924 -0.907 0.050 0.469 880s 0.176 -1.602 -0.505 0.073 880s 0.640 0.522 -0.556 -0.339 880s testing: mul_mat, [2, 4, 3, 1] = [4, 2, 3, 1] * [4, 4, 3, 1] 880s x0: [4, 2, 3, 1] 880s -0.149 -0.112 0.739 0.414 880s -0.420 -0.030 0.531 0.731 880s 880s x1: [4, 4, 3, 1] 880s -0.235 -0.224 -0.838 -0.173 880s 0.716 0.022 -0.408 0.802 880s -0.984 -0.665 -0.909 -0.758 880s 0.690 -0.200 -0.647 0.125 880s 880s y: [2, 4, 3, 1] 880s -0.630 -0.465 880s -0.079 0.069 880s -0.763 -0.603 880s -0.506 -0.535 880s testing: mul_mat, [2, 2, 3, 3] = [4, 2, 3, 3] * [4, 2, 3, 3] 880s x0: [4, 2, 3, 3] 880s -0.904 0.667 0.260 0.861 880s 0.948 -0.707 0.976 -0.414 880s 880s x1: [4, 2, 3, 3] 880s 0.294 -0.231 0.694 0.463 880s -0.602 0.124 0.676 0.249 880s 880s y: [2, 2, 3, 3] 880s 0.159 0.928 880s 1.018 -0.102 880s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 4, 1, 1] 880s -0.020 -0.325 -0.654 0.584 880s 0.955 -0.716 -0.674 0.379 880s 0.077 -0.370 -0.571 0.884 880s -0.601 -0.917 0.126 -0.737 880s 880s x1: [4, 2, 1, 1] 880s -0.359 -0.819 0.410 -0.792 880s 0.333 0.116 -0.429 -0.294 880s 880s y: [4, 2, 1, 1] 880s -0.457 -0.333 -0.659 1.601 880s 0.065 0.412 -0.032 -0.144 880s testing: mul_mat, [4, 4, 3, 1] = [4, 4, 3, 1] * [4, 4, 3, 1] 880s x0: [4, 4, 3, 1] 880s 0.853 -0.638 -0.932 -0.929 880s -0.561 0.304 -0.054 0.057 880s 0.666 -0.916 -0.555 -0.723 880s -0.195 -0.286 -0.284 0.538 880s 880s x1: [4, 4, 3, 1] 880s -0.119 -0.087 -0.659 0.522 880s -0.010 0.344 -0.837 0.171 880s -0.247 -0.629 -0.496 0.870 880s -0.058 0.210 0.931 0.922 880s 880s y: [4, 4, 3, 1] 880s 0.083 0.106 -0.012 0.516 880s 0.394 0.166 0.019 0.234 880s -0.154 0.024 0.059 0.837 880s -1.908 0.099 -1.414 0.183 880s testing: mul_mat, [2, 4, 3, 3] = [4, 2, 3, 3] * [4, 4, 3, 3] 880s x0: [4, 2, 3, 3] 880s 0.233 0.024 -0.862 0.606 880s -0.531 -0.322 -0.264 0.225 880s 880s x1: [4, 4, 3, 3] 880s 0.944 -0.058 -0.495 -0.507 880s -0.145 -0.340 0.267 -0.799 880s -0.575 0.476 -0.087 0.721 880s -0.716 0.997 -0.129 0.530 880s 880s y: [2, 4, 3, 3] 880s 0.338 -0.466 880s -0.756 -0.064 880s 0.390 0.338 880s 0.290 0.213 880s test-mul-mat0: iter:66/500 880s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 4, 1, 1] 880s 0.222 -0.767 0.203 880s 0.093 0.139 -0.973 880s 0.963 0.206 0.983 880s -0.433 -0.958 -0.701 880s 880s x1: [3, 4, 1, 1] 880s -0.241 -0.366 -0.274 880s -0.170 -0.321 -0.790 880s -0.390 -0.400 -0ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s .361 880s -0.925 0.325 0.811 880s 880s y: [4, 4, 1, 1] 880s 0.171 0.193 -0.577 0.646 880s 0.048 0.709 -1.007 0.934 880s 0.147 0.259 -0.813 0.805 880s -0.290 -0.830 -0.027 -0.479 880s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 2, 1, 1] 880s -0.654 -0.804 -0.490 880s 0.671 -0.994 0.886 880s 880s x1: [3, 4, 1, 1] 880s -0.961 0.502 0.054 880s -0.765 -0.567 -0.380 880s -0.006 0.067 0.347 880s 0.825 0.746 0.557 880s 880s y: [2, 4, 1, 1] 880s 0.198 -1.096 880s 1.142 -0.285 880s -0.220 0.237 880s -1.412 0.305 880s testing: mul_mat, [2, 2, 1, 3] = [3, 2, 1, 3] * [3, 2, 1, 3] 880s x0: [3, 2, 1, 3] 880s 0.628 -0.580 0.610 880s 0.374 0.977 -0.954 880s 880s x1: [3, 2, 1, 3] 880s -0.107 -0.761 0.089 880s 0.987 0.378 0.116 880s 880s y: [2, 2, 1, 3] 880s 0.429 -0.868 880s 0.471 0.627 880s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 1, 1, 1] 880s 0.134 0.596 -0.916 880s 880s x1: [3, 2, 1, 1] 880s -0.779 -0.620 0.745 880s -0.262 0.006 0.144 880s 880s y: [1, 2, 1, 1] 880s -1.157 880s -0.163 880s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 3, 1, 1] 880s 0.089 880s 0.480 880s -0.303 880s 880s x1: [1, 3, 1, 1] 880s 0.695 880s -0.331 880s 0.870 880s 880s y: [3, 3, 1, 1] 880s 0.062 0.334 -0.211 880s -0.030 -0.159 0.100 880s 0.078 0.418 -0.264 880s testing: mul_mat, [3, 1, 1, 3] = [3, 3, 1, 3] * [3, 1, 1, 3] 880s x0: [3, 3, 1, 3] 880s -0.947 -0.301 -0.556 880s -0.637 -0.416 0.321 880s 0.631 -0.990 0.016 880s 880s x1: [3, 1, 1, 3] 880s 0.066 0.526 -0.584 880s 880s y: [3, 1, 1, 3] 880s 0.103 -0.449 -0.488 880s test-mul-mat0: iter:67/500 880s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.762 0.151 -1.000 880s -0.669 -0.681 -0.057 880s 880s x1: [3, 4, 1, 1] 880s -0.768 -0.457 -0.870 880s -0.069 0.126 -0.859 880s 0.375 -0.552 0.156 880s -0.037 0.062 0.306 880s 880s y: [2, 4, 1, 1] 880s 0.215 0.875 880s 0.826 0.009 880s 0.046 0.116 880s -0.325 -0.035 880s testing: mul_mat, [3, 2, 4, 1] = [3, 3, 4, 1] * [3, 2, 4, 1] 880s x0: [3, 3, 4, 1] 880s 0.672 0.249 0.440 880s -0.997 0.568 -0.616 880s -0.617 0.752 0.336 880s 880s x1: [3, 2, 4, 1] 880s -0.620 -0.817 -0.048 880s 0.956 -0.536 -0.800 880s 880s y: [3, 2, 4, 1] 880s -0.641 0.183 -0.247 880s 0.156 -0.765 -1.262 880s testing: mul_mat, [4, 3, 4, 1] = [3, 4, 4, 1] * [3, 3, 4, 1] 880s x0: [3, 4, 4, 1] 880s 0.954 -0.459 -0.842 880s -0.164 -0.438 -0.008 880s 0.482 -0.651 -0.401 880s 0.386 -0.142 0.380 880s 880s x1: [3, 3, 4, 1] 880s 0.610 0.989 -0.647 880s -0.054 -0.672 -0.432 880s -0.519 -0.722 0.335 880s 880s y: [4, 3, 4, 1] 880s 0.673 -0.529 -0.091 -0.151 880s 0.621 0.307 0.584 -0.090 880s -0.446 0.399 0.085 0.029 880s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 2, 1, 1] 880s -0.632 -0.626 -0.576 880s -0.302 0.740 -0.126 880s 880s x1: [3, 4, 1, 1] 880s 0.444 -0.615 -0.528 880s -0.300 -0.378 0.408 880s 0.901 -0.239 -0.129 880s 0.037 0.153 0.807 880s 880s y: [2, 4, 1, 1] 880s 0.408 -0.523 880s 0.191 -0.240 880s -0.346 -0.433 880s -0.584 0.000 880s testing: mul_mat, [2, 3, 4, 1] = [2, 2, 4, 1] * [2, 3, 4, 1] 880s x0: [2, 2, 4, 1] 880s 0.744 0.829 880s 0.269 0.484 880s 880s x1: [2, 3, 4, 1] 880s -0.896 0.632 880s 0.724 0.068 880s -0.773 0.432 880s 880s y: [2, 3, 4, 1] 880s -0.142 0.064 880s 0.595 0.228 880s -0.217 0.001 880s testing: mul_mat, [2, 2, 4, 1] = [2, 2, 4, 1] * [2, 2, 4, 1] 880s x0: [2, 2, 4, 1] 880s 0.765 -0.889 880s -0.856 -0.532 880s 880s x1: [2, 2, 4, 1] 880s 0.014 0.432 880s 0.391 0.854 880s 880s y: [2, 2, 4, 1] 880s -0.374 -0.242 880s -0.461 -0.789 880s test-mul-mat0: iter:68/500 880s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 4, 1, 1] 880s 0.274 0.910 880s 0.269 -0.809 880s 0.058 0.205 880s -0.148 0.475 880s 880s x1: [2, 1, 1, 1] 880s 0.733 0.335 880s 880s y: [4, 1, 1, 1] 880s 0.506 -0.073 0.111 0.050 880s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 4, 1, 1] 880s 0.089 0.575 880s 0.558 0.262 880s -0.960 -0.880 880s -0.132 0.896 880s 880s x1: [2, 4, 1, 1] 880s -0.030 -0.004 880s 0.585 0.438 880s -0.271 0.022 880s -0.666 0.118 880s 880s y: [4, 4, 1, 1] 880s -0.005 -0.018 0.032 0.001 880s 0.304 0.442 -0.948 0.315 880s -0.012 -0.146 0.241 0.055 880s 0.009 -0.341 0.536 0.194 880s testing: mul_mat, [1, 4, 1, 2] = [2, 1, 1, 2] * [2, 4, 1, 2] 880s x0: [2, 1, 1, 2] 880s -0.102 -0.689 880s 880s x1: [2, 4, 1, 2] 880s -0.464 -0.609 880s 0.226 -0.731 880s 0.725 -0.984 880s 0.543 0.636 880s 880s y: [1, 4, 1, 2] 880s 0.467 880s 0.480 880s 0.604 880s -0.493 880s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.004 0.583 880s -0.656 0.044 880s 880s x1: [2, 1, 1, 1] 880s 0.429 -0.214 880s 880s y: [2, 1, 1, 1] 880s -0.123 -0.291 880s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 3, 1,ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s 1] 880s -0.834 -0.818 880s 0.269 -0.188 880s -0.433 -0.797 880s 880s x1: [2, 2, 1, 1] 880s -0.536 -0.550 880s -0.061 -0.001 880s 880s y: [3, 2, 1, 1] 880s 0.897 -0.040 0.671 880s 0.051 -0.016 0.027 880s testing: mul_mat, [1, 2, 1, 2] = [3, 1, 1, 2] * [3, 2, 1, 2] 880s x0: [3, 1, 1, 2] 880s -0.546 -0.620 0.866 880s 880s x1: [3, 2, 1, 2] 880s 0.467 -0.454 -0.104 880s 0.957 0.132 0.065 880s 880s y: [1, 2, 1, 2] 880s -0.063 880s -0.548 880s test-mul-mat0: iter:69/500 880s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 1, 1, 1] 880s 0.307 0.789 -0.733 880s 880s x1: [3, 1, 1, 1] 880s -0.813 -0.259 0.607 880s 880s y: [1, 1, 1, 1] 880s -0.899 880s testing: mul_mat, [1, 1, 2, 1] = [3, 1, 2, 1] * [3, 1, 2, 1] 880s x0: [3, 1, 2, 1] 880s -0.372 -0.341 -0.710 880s 880s x1: [3, 1, 2, 1] 880s 0.510 0.255 -0.187 880s 880s y: [1, 1, 2, 1] 880s -0.144 880s testing: mul_mat, [4, 1, 2, 3] = [3, 4, 2, 3] * [3, 1, 2, 3] 880s x0: [3, 4, 2, 3] 880s -0.809 -0.102 0.667 880s 0.403 -0.157 0.138 880s -0.969 0.503 0.428 880s -0.027 0.187 0.311 880s 880s x1: [3, 1, 2, 3] 880s -0.512 -0.861 -0.736 880s 880s y: [4, 1, 2, 3] 880s 0.011 -0.173 -0.252 -0.377 880s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 3, 1, 1] 880s -0.260 0.957 -0.823 880s -0.735 0.011 -0.463 880s 0.954 0.578 -0.963 880s 880s x1: [3, 4, 1, 1] 880s -0.451 -0.112 -0.087 880s 0.311 -0.215 0.433 880s -0.004 0.245 0.573 880s 0.704 -0.558 0.036 880s 880s y: [3, 4, 1, 1] 880s 0.081 0.371 -0.412 880s -0.643 -0.432 -0.245 880s -0.237 -0.260 -0.414 880s -0.746 -0.541 0.315 880s testing: mul_mat, [4, 3, 2, 1] = [3, 4, 2, 1] * [3, 3, 2, 1] 880s x0: [3, 4, 2, 1] 880s 0.986 0.130 0.345 880s -0.135 0.767 -0.141 880s 0.813 -0.317 -0.322 880s -0.274 -0.859 0.382 880s 880s x1: [3, 3, 2, 1] 880s -0.702 -0.493 -0.265 880s 0.164 -0.939 0.701 880s -0.722 -0.374 0.911 880s 880s y: [4, 3, 2, 1] 880s -0.848 -0.246 -0.329 0.514 880s 0.282 -0.840 0.205 1.029 880s -0.446 -0.317 -0.761 0.866 880s testing: mul_mat, [3, 3, 2, 3] = [4, 3, 2, 3] * [4, 3, 2, 3] 880s x0: [4, 3, 2, 3] 880s 0.903 -0.677 -0.803 -0.291 880s 0.492 0.217 -0.215 -0.687 880s -0.483 0.840 0.803 -0.271 880s 880s x1: [4, 3, 2, 3] 880s 0.566 0.686 -0.191 -0.300 880s 0.229 -0.361 0.538 0.215 880s 0.504 0.351 0.941 -0.366 880s 880s y: [3, 3, 2, 3] 880s 0.287 0.674 0.230 880s -0.043 -0.229 -0.040 880s -0.432 0.373 0.906 880s test-mul-mat0: iter:70/500 880s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 1, 1, 1] 880s -0.933 0.951 0.655 880s 880s x1: [3, 2, 1, 1] 880s -0.319 -0.850 0.881 880s -0.492 -0.237 0.392 880s 880s y: [1, 2, 1, 1] 880s 0.065 880s 0.490 880s testing: mul_mat, [1, 1, 2, 1] = [3, 1, 2, 1] * [3, 1, 2, 1] 880s x0: [3, 1, 2, 1] 880s -0.919 -0.736 0.405 880s 880s x1: [3, 1, 2, 1] 880s -0.750 -0.194 -0.648 880s 880s y: [1, 1, 2, 1] 880s 0.570 880s testing: mul_mat, [2, 1, 2, 3] = [3, 2, 2, 3] * [3, 1, 2, 3] 880s x0: [3, 2, 2, 3] 880s 0.560 0.325 0.754 880s 0.776 -0.284 -0.925 880s 880s x1: [3, 1, 2, 3] 880s -0.654 0.946 0.509 880s 880s y: [2, 1, 2, 3] 880s 0.324 -1.246 880s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 2, 1, 1] 880s -0.835 -0.890 -0.537 880s 0.497 -0.255 -0.974 880s 880s x1: [3, 2, 1, 1] 880s -0.573 -0.344 0.219 880s 0.908 0.238 -0.614 880s 880s y: [2, 2, 1, 1] 880s 0.666 -0.411 880s -0.639 0.989 880s testing: mul_mat, [2, 3, 2, 1] = [2, 2, 2, 1] * [2, 3, 2, 1] 880s x0: [2, 2, 2, 1] 880s -0.936 -0.238 880s -0.457 -0.136 880s 880s x1: [2, 3, 2, 1] 880s 0.074 -0.041 880s 0.684 0.302 880s -0.134 0.593 880s 880s y: [2, 3, 2, 1] 880s -0.059 -0.028 880s -0.712 -0.354 880s -0.016 -0.019 880s testing: mul_mat, [1, 2, 2, 3] = [2, 1, 2, 3] * [2, 2, 2, 3] 880s x0: [2, 1, 2, 3] 880s 0.218 0.716 880s 880s x1: [2, 2, 2, 3] 880s 0.097 -0.242 880s -0.612 -0.649 880s 880s y: [1, 2, 2, 3] 880s -0.153 880s -0.598 880s test-mul-mat0: iter:71/500 880s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 1, 1, 1] 880s 0.649 0.573 880s 880s x1: [2, 4, 1, 1] 880s 0.325 -0.138 880s -0.026 0.368 880s 0.035 -0.262 880s 0.603 -0.216 880s 880s y: [1, 4, 1, 1] 880s 0.132 880s 0.194 880s -0.127 880s 0.268 880s testing: mul_mat, [1, 1, 4, 1] = [2, 1, 4, 1] * [2, 1, 4, 1] 880s x0: [2, 1, 4, 1] 880s -0.072 0.233 880s 880s x1: [2, 1, 4, 1] 880s -0.077 -0.895 880s 880s y: [1, 1, 4, 1] 880s -0.203 880s testing: mul_mat, [3, 1, 4, 3] = [2, 3, 4, 3] * [2, 1, 4, 3] 880s x0: [2, 3, 4, 3] 880s -0.420 -0.596 880s -0.703 -0.131 880s 0.857 0.699 880s 880s x1: [2, 1, 4, 3] 880s -0.126 0.008 880s 880s y: [3, 1, 4, 3] 880s 0.048 0.088 -0.103 880s testing: mul_mat, [3, 3, 1, 1] = [2, 3, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 3, 1, 1] 880s -0.747 -0.258 880s -0.457 0.100 880s -0.438 0.298 880s 880s x1: [2, 3, 1, 1] 880s 0.729 -0.676 880s 0.767 -0.747 880s -0.833 -0.593 880s 880s y: [3, 3, 1, 1] 880s -0.371 -0.401 -0.521 880s -0.381 -0.426 -0.559 880s 0.775 0.322ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s 0.188 880s testing: mul_mat, [1, 2, 4, 1] = [3, 1, 4, 1] * [3, 2, 4, 1] 880s x0: [3, 1, 4, 1] 880s 0.754 0.104 0.488 880s 880s x1: [3, 2, 4, 1] 880s -0.133 -0.674 -0.853 880s 0.854 -0.803 -0.682 880s 880s y: [1, 2, 4, 1] 880s -0.586 880s 0.227 880s testing: mul_mat, [3, 3, 4, 3] = [1, 3, 4, 3] * [1, 3, 4, 3] 880s x0: [1, 3, 4, 3] 880s -0.571 880s -0.811 880s 0.905 880s 880s x1: [1, 3, 4, 3] 880s 0.075 880s -0.352 880s 0.640 880s 880s y: [3, 3, 4, 3] 880s -0.043 -0.061 0.068 880s 0.201 0.285 -0.318 880s -0.365 -0.519 0.579 880s test-mul-mat0: iter:72/500 880s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 4, 1, 1] 880s -0.266 0.799 0.691 0.498 880s -0.109 -0.942 -0.655 -0.801 880s -0.121 0.622 0.695 0.606 880s -0.493 -0.635 -0.924 -0.253 880s 880s x1: [4, 3, 1, 1] 880s -0.846 0.471 -0.500 0.557 880s -0.645 -0.549 0.935 0.309 880s -0.734 -0.925 0.696 -0.527 880s 880s y: [4, 3, 1, 1] 880s 0.532 -0.470 0.385 0.438 880s 0.534 -0.273 0.574 -0.276 880s -0.324 0.917 -0.321 0.440 880s testing: mul_mat, [1, 4, 3, 1] = [4, 1, 3, 1] * [4, 4, 3, 1] 880s x0: [4, 1, 3, 1] 880s -0.517 0.904 -0.397 -0.585 880s 880s x1: [4, 4, 3, 1] 880s -0.475 -0.716 -0.099 0.996 880s -0.216 -0.542 -0.650 0.235 880s -0.607 0.659 0.501 -0.532 880s 0.355 0.974 0.379 -0.911 880s 880s y: [1, 4, 3, 1] 880s -0.944 880s -0.259 880s 1.022 880s 1.079 880s testing: mul_mat, [1, 1, 3, 1] = [4, 1, 3, 1] * [4, 1, 3, 1] 880s x0: [4, 1, 3, 1] 880s 0.191 0.591 -0.296 -0.912 880s 880s x1: [4, 1, 3, 1] 880s -0.162 0.695 0.883 -0.454 880s 880s y: [1, 1, 3, 1] 880s 0.532 880s testing: mul_mat, [3, 1, 1, 1] = [4, 3, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 3, 1, 1] 880s 0.526 0.409 -0.952 -0.321 880s 0.822 -0.632 0.080 -0.649 880s -0.231 0.907 -0.161 0.223 880s 880s x1: [4, 1, 1, 1] 880s -0.046 -0.174 0.615 -0.016 880s 880s y: [3, 1, 1, 1] 880s -0.676 0.132 -0.250 880s testing: mul_mat, [4, 4, 3, 1] = [3, 4, 3, 1] * [3, 4, 3, 1] 880s x0: [3, 4, 3, 1] 880s 0.409 0.466 0.159 880s 0.151 -0.854 0.114 880s -0.913 -0.171 -0.503 880s 0.369 0.174 0.157 880s 880s x1: [3, 4, 3, 1] 880s -0.197 -0.351 -0.585 880s -0.605 0.352 -0.498 880s 0.225 0.532 0.073 880s 0.959 -0.685 -0.941 880s 880s y: [4, 4, 3, 1] 880s -0.337 0.203 0.534 -0.225 880s -0.163 -0.449 0.743 -0.240 880s 0.352 -0.412 -0.334 0.187 880s -0.077 0.623 -0.284 0.087 880s testing: mul_mat, [4, 3, 3, 1] = [4, 4, 3, 1] * [4, 3, 3, 1] 880s x0: [4, 4, 3, 1] 880s -0.547 -0.749 -0.996 0.499 880s 0.588 -0.081 0.191 -0.229 880s 0.743 -0.065 -0.642 -0.347 880s 0.112 0.550 0.986 -0.874 880s 880s x1: [4, 3, 3, 1] 880s 0.365 -0.525 0.730 -0.476 880s 0.589 -0.773 0.682 -0.937 880s 0.572 -0.342 -0.337 0.176 880s 880s y: [4, 3, 3, 1] 880s -0.771 0.506 0.001 0.888 880s -0.890 0.754 0.376 1.132 880s 0.366 0.260 0.603 -0.610 880s test-mul-mat0: iter:73/500 880s testing: mul_mat, [1, 2, 1, 1] = [2, 1, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 1, 1, 1] 880s 0.038 -0.703 880s 880s x1: [2, 2, 1, 1] 880s 0.226 0.560 880s 0.639 0.790 880s 880s y: [1, 2, 1, 1] 880s -0.385 880s -0.531 880s testing: mul_mat, [2, 1, 3, 1] = [2, 2, 3, 1] * [2, 1, 3, 1] 880s x0: [2, 2, 3, 1] 880s 0.242 -0.265 880s 0.640 0.557 880s 880s x1: [2, 1, 3, 1] 880s -0.478 0.964 880s 880s y: [2, 1, 3, 1] 880s -0.371 0.231 880s testing: mul_mat, [2, 2, 3, 1] = [2, 2, 3, 1] * [2, 2, 3, 1] 880s x0: [2, 2, 3, 1] 880s -0.621 0.605 880s 0.709 -0.144 880s 880s x1: [2, 2, 3, 1] 880s -0.581 0.564 880s 0.316 -0.869 880s 880s y: [2, 2, 3, 1] 880s 0.701 -0.492 880s -0.721 0.349 880s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 4, 1, 1] 880s -0.484 -0.890 880s 0.978 0.917 880s -0.821 -0.626 880s 0.832 0.800 880s 880s x1: [2, 2, 1, 1] 880s 0.886 0.223 880s -0.923 0.559 880s 880s y: [4, 2, 1, 1] 880s -0.628 1.071 -0.867 0.915 880s -0.051 -0.391 0.408 -0.321 880s testing: mul_mat, [3, 2, 3, 1] = [4, 3, 3, 1] * [4, 2, 3, 1] 880s x0: [4, 3, 3, 1] 880s 0.508 0.425 -0.282 -0.573 880s -0.428 -0.055 0.651 -0.014 880s -0.768 -0.968 0.178 -0.809 880s 880s x1: [4, 2, 3, 1] 880s 0.294 -0.294 0.146 0.709 880s 0.335 0.013 -0.912 -0.060 880s 880s y: [3, 2, 3, 1] 880s -0.423 -0.024 -0.489 880s 0.467 -0.737 -0.383 880s testing: mul_mat, [2, 4, 3, 1] = [3, 2, 3, 1] * [3, 4, 3, 1] 880s x0: [3, 2, 3, 1] 880s -0.622 -0.630 0.667 880s -0.462 -0.026 0.371 880s 880s x1: [3, 4, 3, 1] 880s -0.684 -0.274 -0.778 880s 0.493 0.154 0.208 880s 0.684 0.669 0.134 880s -0.404 0.128 0.001 880s 880s y: [2, 4, 3, 1] 880s 0.079 0.034 880s -0.265 -0.155 880s -0.757 -0.284 880s 0.171 0.183 880s test-mul-mat0: iter:74/500 880s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 4, 1, 1] 880s 0.970 0.797 0.213 0.268 880s 0.711 -0.447 0.397 0.201 880s -0.864 -0.855 -0.988 -0.724 880s 0.442 0.222 -0.974 0.339 880s 880s x1: [4, 3, 1, 1] 880s -0.218 0.180 -0.208 0ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s .982 880s -0.104 -0.180 -0.324 0.765 880s -0.944 0.054 -0.697 -0.574 880s 880s y: [4, 3, 1, 1] 880s 0.150 -0.121 -0.472 0.479 880s -0.109 0.031 0.010 0.489 880s -1.175 -1.087 1.874 0.078 880s testing: mul_mat, [4, 4, 4, 1] = [4, 4, 4, 1] * [4, 4, 4, 1] 880s x0: [4, 4, 4, 1] 880s 0.030 -0.675 0.692 0.575 880s 0.830 0.814 0.375 -0.984 880s 0.768 -0.269 0.309 0.582 880s 0.616 0.507 -0.531 0.295 880s 880s x1: [4, 4, 4, 1] 880s 0.413 0.012 -0.879 -0.407 880s 0.805 -0.897 0.489 -0.375 880s -0.221 0.254 -0.319 0.832 880s 0.557 0.108 0.860 0.528 880s 880s y: [4, 4, 4, 1] 880s -0.838 0.424 -0.195 0.607 880s 0.752 0.490 0.792 -0.330 880s 0.080 -0.916 0.148 0.408 880s 0.842 0.353 0.972 0.097 880s testing: mul_mat, [4, 4, 4, 4] = [4, 4, 4, 4] * [4, 4, 4, 4] 880s x0: [4, 4, 4, 4] 880s 0.271 0.256 -0.850 0.715 880s -0.806 0.583 0.184 -0.461 880s -0.363 0.938 0.351 -0.963 880s 0.373 0.658 -0.665 0.091 880s 880s x1: [4, 4, 4, 4] 880s -0.758 -0.710 -0.508 -0.430 880s 0.165 -0.958 -0.333 -0.229 880s -0.100 -0.654 0.993 0.266 880s -0.507 -0.490 0.952 0.621 880s 880s y: [4, 4, 4, 4] 880s -0.263 0.302 -0.155 -0.452 880s -0.082 -0.648 -0.856 -0.369 880s -0.849 -0.241 -0.484 -1.104 880s -0.628 0.011 -0.540 -1.088 880s testing: mul_mat, [4, 4, 1, 1] = [4, 4, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 4, 1, 1] 880s -0.298 -0.397 0.156 -0.057 880s -0.807 -0.764 -0.308 -0.646 880s -0.228 -0.206 -0.388 0.142 880s -0.217 -0.859 0.169 -0.592 880s 880s x1: [4, 4, 1, 1] 880s 0.973 0.465 0.623 -0.787 880s 0.151 -0.225 0.438 0.937 880s 0.083 -0.831 0.918 -0.472 880s 0.540 -0.968 0.234 -0.214 880s 880s y: [4, 4, 1, 1] 880s -0.332 -0.824 -0.671 -0.040 880s 0.059 -0.691 -0.025 -0.320 880s 0.475 0.590 -0.272 1.130 880s 0.271 0.369 -0.046 0.880 880s testing: mul_mat, [2, 4, 4, 1] = [4, 2, 4, 1] * [4, 4, 4, 1] 880s x0: [4, 2, 4, 1] 880s -0.699 0.414 0.714 -0.828 880s -0.883 -0.945 0.800 0.816 880s 880s x1: [4, 4, 4, 1] 880s 0.976 0.356 0.559 -0.249 880s -0.206 0.496 0.835 -0.037 880s 0.413 -0.638 -0.496 0.446 880s 0.596 0.289 -0.991 -0.702 880s 880s y: [2, 4, 4, 1] 880s 0.070 -0.954 880s 0.976 0.351 880s -1.276 0.204 880s -0.423 -2.165 880s testing: mul_mat, [3, 4, 4, 4] = [2, 3, 4, 4] * [2, 4, 4, 4] 880s x0: [2, 3, 4, 4] 880s -0.231 -0.207 880s 0.828 0.398 880s 0.740 -0.950 880s 880s x1: [2, 4, 4, 4] 880s 0.944 0.352 880s -0.804 0.658 880s 0.152 -0.633 880s 0.474 -0.216 880s 880s y: [3, 4, 4, 4] 880s -0.291 0.922 0.364 880s 0.050 -0.404 -1.221 880s 0.096 -0.126 0.714 880s -0.065 0.306 0.556 880s test-mul-mat0: iter:75/500 880s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 4, 1, 1] 880s 0.256 -0.673 -0.023 880s -0.059 -0.159 0.299 880s 0.050 0.195 0.333 880s -0.854 0.372 -0.972 880s 880s x1: [3, 3, 1, 1] 880s 0.823 0.537 -0.410 880s -0.867 0.633 -0.574 880s 0.901 -0.761 -0.513 880s 880s y: [4, 3, 1, 1] 880s -0.142 -0.257 0.009 -0.104 880s -0.635 -0.221 -0.111 1.534 880s 0.755 -0.085 -0.274 -0.554 880s testing: mul_mat, [3, 4, 2, 1] = [3, 3, 2, 1] * [3, 4, 2, 1] 880s x0: [3, 3, 2, 1] 880s -0.420 -0.521 0.909 880s -0.274 0.851 0.938 880s 0.905 0.229 0.040 880s 880s x1: [3, 4, 2, 1] 880s 0.179 0.378 0.102 880s -0.075 -0.036 0.064 880s 0.560 0.926 -0.386 880s 0.383 0.463 0.204 880s 880s y: [3, 4, 2, 1] 880s -0.179 0.369 0.253 880s 0.109 0.050 -0.074 880s -1.068 0.273 0.704 880s -0.216 0.480 0.461 880s testing: mul_mat, [3, 3, 2, 4] = [3, 3, 2, 4] * [3, 3, 2, 4] 880s x0: [3, 3, 2, 4] 880s 0.400 -0.776 0.471 880s -0.506 0.472 0.885 880s 0.314 0.426 0.643 880s 880s x1: [3, 3, 2, 4] 880s 0.289 -0.324 0.552 880s -0.295 -0.989 0.669 880s 0.006 -0.399 -0.888 880s 880s y: [3, 3, 2, 4] 880s 0.626 0.189 0.307 880s 0.965 0.274 -0.084 880s -0.106 -0.977 -0.739 880s testing: mul_mat, [1, 3, 1, 1] = [3, 1, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 1, 1, 1] 880s -0.477 0.807 -0.809 880s 880s x1: [3, 3, 1, 1] 880s 0.237 0.622 -0.772 880s -0.832 -0.126 0.329 880s 0.893 0.023 0.669 880s 880s y: [1, 3, 1, 1] 880s 1.014 880s 0.029 880s -0.948 880s testing: mul_mat, [4, 3, 2, 1] = [1, 4, 2, 1] * [1, 3, 2, 1] 880s x0: [1, 4, 2, 1] 880s -0.243 880s -0.892 880s -0.873 880s 0.609 880s 880s x1: [1, 3, 2, 1] 880s -0.836 880s 0.838 880s 0.774 880s 880s y: [4, 3, 2, 1] 880s 0.203 0.745 0.730 -0.509 880s -0.204 -0.748 -0.732 0.511 880s -0.188 -0.690 -0.676 0.471 880s testing: mul_mat, [1, 1, 2, 4] = [4, 1, 2, 4] * [4, 1, 2, 4] 880s x0: [4, 1, 2, 4] 880s 0.766 -0.619 0.544 -0.481 880s 880s x1: [4, 1, 2, 4] 880s -0.042 -0.132 -0.475 -0.806 880s 880s y: [1, 1, 2, 4] 880s 0.178 880s test-mul-mat0: iter:76/500 880s testing: mul_mat, [3, 3, 1, 1] = [2, 3, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 3, 1, 1] 880s 0.884 -0.654 880s 0.065 0.191 880s 0.317 0.489 880s 880s x1: [2, 3, 1, 1] 880s -0.719 0.622 880s -0ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s .659 0.577 880s 0.974 -0.388 880s 880s y: [3, 3, 1, 1] 880s -1.042 0.072 0.076 880s -0.959 0.067 0.073 880s 1.115 -0.011 0.119 880s testing: mul_mat, [2, 3, 3, 1] = [2, 2, 3, 1] * [2, 3, 3, 1] 880s x0: [2, 2, 3, 1] 880s -0.709 -0.625 880s 0.149 -0.735 880s 880s x1: [2, 3, 3, 1] 880s 0.885 0.472 880s 0.039 -0.277 880s -0.080 0.841 880s 880s y: [2, 3, 3, 1] 880s -0.922 -0.215 880s 0.145 0.210 880s -0.469 -0.631 880s testing: mul_mat, [1, 2, 3, 2] = [2, 1, 3, 2] * [2, 2, 3, 2] 880s x0: [2, 1, 3, 2] 880s 0.626 0.752 880s 880s x1: [2, 2, 3, 2] 880s 0.224 -0.878 880s -0.475 0.947 880s 880s y: [1, 2, 3, 2] 880s -0.519 880s 0.415 880s testing: mul_mat, [1, 1, 1, 1] = [2, 1, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 1, 1, 1] 880s -0.250 0.623 880s 880s x1: [2, 1, 1, 1] 880s 0.399 0.020 880s 880s y: [1, 1, 1, 1] 880s -0.087 880s testing: mul_mat, [2, 2, 3, 1] = [1, 2, 3, 1] * [1, 2, 3, 1] 880s x0: [1, 2, 3, 1] 880s -0.284 880s -0.990 880s 880s x1: [1, 2, 3, 1] 880s 0.161 880s -0.980 880s 880s y: [2, 2, 3, 1] 880s -0.046 -0.159 880s 0.278 0.971 880s testing: mul_mat, [4, 1, 3, 2] = [2, 4, 3, 2] * [2, 1, 3, 2] 880s x0: [2, 4, 3, 2] 880s 0.805 0.976 880s -0.648 -0.072 880s 0.767 -0.425 880s -0.445 0.851 880s 880s x1: [2, 1, 3, 2] 880s -0.786 0.113 880s 880s y: [4, 1, 3, 2] 880s -0.522 0.501 -0.651 0.446 880s test-mul-mat0: iter:77/500 880s testing: mul_mat, [3, 1, 1, 1] = [4, 3, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 3, 1, 1] 880s -0.685 -0.939 0.132 0.609 880s 0.979 -0.051 0.254 0.665 880s 0.321 -0.125 -0.722 -0.732 880s 880s x1: [4, 1, 1, 1] 880s -0.556 0.515 -0.646 0.079 880s 880s y: [3, 1, 1, 1] 880s -0.141 -0.682 0.166 880s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 1, 1, 1] 880s -0.465 -0.427 0.857 -0.112 880s 880s x1: [4, 3, 1, 1] 880s -0.653 -0.999 -0.629 -0.534 880s -0.925 -0.652 0.243 0.170 880s -0.804 -0.836 -0.624 -0.871 880s 880s y: [1, 3, 1, 1] 880s 0.251 880s 0.898 880s 0.294 880s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 2, 1, 1] 880s -0.188 0.692 0.059 -0.934 880s 0.356 -0.620 -0.059 0.634 880s 880s x1: [4, 1, 1, 1] 880s 0.652 -0.023 0.204 0.713 880s 880s y: [2, 1, 1, 1] 880s -0.792 0.686 880s testing: mul_mat, [2, 2, 1, 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 2, 1, 1] 880s 0.562 0.257 0.504 -0.736 880s 0.162 0.691 0.792 0.361 880s 880s x1: [4, 2, 1, 1] 880s -0.353 0.288 0.635 0.019 880s 0.755 0.711 0.367 -0.002 880s 880s y: [2, 2, 1, 1] 880s 0.182 0.652 880s 0.793 0.902 880s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 2, 1, 1] 880s -0.238 0.886 880s -0.324 0.704 880s 880s x1: [2, 4, 1, 1] 880s -0.320 0.916 880s -0.662 0.884 880s 0.628 -0.552 880s -0.304 0.320 880s 880s y: [2, 4, 1, 1] 880s 0.888 0.748 880s 0.941 0.836 880s -0.639 -0.592 880s 0.356 0.324 880s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 2, 1, 1] 880s -0.253 -0.081 880s 0.657 0.745 880s 880s x1: [2, 2, 1, 1] 880s -0.690 -0.466 880s -0.008 0.946 880s 880s y: [2, 2, 1, 1] 880s 0.212 -0.800 880s -0.074 0.699 880s test-mul-mat0: iter:78/500 880s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 2, 1, 1] 880s -0.564 0.770 880s -0.732 -0.648 880s 880s x1: [2, 1, 1, 1] 880s -0.828 -0.590 880s 880s y: [2, 1, 1, 1] 880s 0.012 0.989 880s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.301 -0.937 880s 0.610 0.977 880s 880s x1: [2, 2, 1, 1] 880s -0.891 -0.848 880s 0.980 -0.444 880s 880s y: [2, 2, 1, 1] 880s 0.527 -1.373 880s 0.710 0.165 880s testing: mul_mat, [2, 2, 1, 2] = [2, 2, 1, 2] * [2, 2, 1, 2] 880s x0: [2, 2, 1, 2] 880s -0.045 -0.202 880s -0.573 0.437 880s 880s x1: [2, 2, 1, 2] 880s 0.949 0.314 880s -0.713 -0.517 880s 880s y: [2, 2, 1, 2] 880s -0.106 -0.406 880s 0.136 0.182 880s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.993 -0.622 880s -0.771 -0.451 880s 880s x1: [2, 2, 1, 1] 880s 0.809 -0.955 880s 0.884 -0.923 880s 880s y: [2, 2, 1, 1] 880s 1.398 -0.194 880s 1.453 -0.266 880s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 2, 1, 1] 880s -0.438 0.942 880s -0.999 0.045 880s 880s x1: [2, 2, 1, 1] 880s 0.077 0.678 880s -0.388 -0.313 880s 880s y: [2, 2, 1, 1] 880s 0.605 -0.046 880s -0.125 0.373 880s testing: mul_mat, [2, 2, 1, 2] = [2, 2, 1, 2] * [2, 2, 1, 2] 880s x0: [2, 2, 1, 2] 880s 0.862 0.082 880s -0.285 0.976 880s 880s x1: [2, 2, 1, 2] 880s -0.693 -0.825 880s 0.081 0.360 880s 880s y: [2, 2, 1, 2] 880s -0.665 -0.607 880s 0.099 0.328 880s test-mul-mat0: iter:79/500 880s testing: mul_mat, [1, 3, 1, 1] = [3, 1, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 1, 1, 1] 880s -0.099 0.353 0.701 880s 880s x1: [3, 3, 1, 1] 880s -0.598 0.906 0.580 880s 0.014 -0.407 0.235 880s 0.576 -0.406 0.177 880s 880s y: [1, 3, 1, 1] 880s 0.786 880s 0.020 880s -0.076 880s testing: mul_mat, [3, 1, 1, 1] = [3, 3, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.910 0.211 -0.71ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s 9 880s -0.375 -0.707 -0.743 880s 0.149 0.421 -0.883 880s 880s x1: [3, 1, 1, 1] 880s -0.739 0.418 0.738 880s 880s y: [3, 1, 1, 1] 880s -1.115 -0.567 -0.586 880s testing: mul_mat, [3, 3, 1, 2] = [3, 3, 1, 2] * [3, 3, 1, 2] 880s x0: [3, 3, 1, 2] 880s 0.611 0.021 0.255 880s 0.521 -0.768 0.536 880s -0.853 -0.475 0.793 880s 880s x1: [3, 3, 1, 2] 880s 0.750 -0.055 0.970 880s -0.420 -0.152 -0.628 880s -0.514 -0.572 0.386 880s 880s y: [3, 3, 1, 2] 880s 0.705 0.953 0.154 880s -0.420 -0.439 -0.068 880s -0.227 0.378 1.016 880s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 3, 1, 1] 880s -0.137 -0.525 0.996 880s 0.463 -0.516 -0.284 880s -0.723 0.532 0.068 880s 880s x1: [3, 3, 1, 1] 880s -0.887 -0.833 0.637 880s 0.193 0.830 -0.402 880s -0.134 -0.330 -0.820 880s 880s y: [3, 3, 1, 1] 880s 1.193 -0.162 0.242 880s -0.863 -0.225 0.275 880s -0.625 0.341 -0.134 880s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.819 0.986 0.816 880s -0.099 -0.462 0.136 880s -0.776 0.417 -0.717 880s 880s x1: [3, 3, 1, 1] 880s -0.857 0.242 -0.139 880s 0.438 0.188 -0.230 880s -0.515 -0.920 -0.350 880s 880s y: [3, 3, 1, 1] 880s -0.578 -0.046 0.866 880s 0.357 -0.162 -0.097 880s -1.616 0.429 0.267 880s testing: mul_mat, [2, 3, 1, 2] = [3, 2, 1, 2] * [3, 3, 1, 2] 880s x0: [3, 2, 1, 2] 880s -0.510 0.998 0.782 880s -0.119 -0.691 -0.778 880s 880s x1: [3, 3, 1, 2] 880s -0.513 0.316 0.724 880s 0.350 -0.221 -1.000 880s 0.825 0.263 0.532 880s 880s y: [2, 3, 1, 2] 880s 1.142 -0.720 880s -1.181 0.889 880s 0.257 -0.694 880s test-mul-mat0: iter:80/500 880s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 2, 1, 1] 880s -0.986 0.237 880s -0.041 -0.576 880s 880s x1: [2, 4, 1, 1] 880s 0.550 0.645 880s 0.774 -0.187 880s 0.177 0.594 880s -0.207 0.776 880s 880s y: [2, 4, 1, 1] 880s -0.390 -0.394 880s -0.807 0.076 880s -0.034 -0.350 880s 0.388 -0.439 880s testing: mul_mat, [3, 2, 3, 1] = [2, 3, 3, 1] * [2, 2, 3, 1] 880s x0: [2, 3, 3, 1] 880s -0.375 -0.772 880s -0.325 -0.730 880s -0.823 -0.775 880s 880s x1: [2, 2, 3, 1] 880s 0.467 -0.154 880s -0.087 -0.652 880s 880s y: [3, 2, 3, 1] 880s -0.056 -0.039 -0.265 880s 0.536 0.504 0.577 880s testing: mul_mat, [2, 3, 3, 2] = [2, 2, 3, 2] * [2, 3, 3, 2] 880s x0: [2, 2, 3, 2] 880s 0.930 -0.909 880s 0.792 -0.572 880s 880s x1: [2, 3, 3, 2] 880s 0.571 -0.092 880s 0.182 0.919 880s 0.751 -0.596 880s 880s y: [2, 3, 3, 2] 880s 0.615 0.505 880s -0.666 -0.382 880s 1.240 0.936 880s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 3, 1, 1] 880s -0.598 -0.507 880s 0.357 0.149 880s -0.630 -0.202 880s 880s x1: [2, 2, 1, 1] 880s -0.980 -0.442 880s 0.127 0.486 880s 880s y: [3, 2, 1, 1] 880s 0.810 -0.416 0.707 880s -0.322 0.118 -0.178 880s testing: mul_mat, [1, 2, 3, 1] = [3, 1, 3, 1] * [3, 2, 3, 1] 880s x0: [3, 1, 3, 1] 880s -0.299 0.766 0.807 880s 880s x1: [3, 2, 3, 1] 880s -0.896 -0.508 0.628 880s 0.860 0.614 0.535 880s 880s y: [1, 2, 3, 1] 880s 0.385 880s 0.645 880s testing: mul_mat, [2, 3, 3, 2] = [1, 2, 3, 2] * [1, 3, 3, 2] 880s x0: [1, 2, 3, 2] 880s 0.439 880s 0.372 880s 880s x1: [1, 3, 3, 2] 880s -0.276 880s 0.759 880s 0.414 880s 880s y: [2, 3, 3, 2] 880s -0.121 -0.103 880s 0.333 0.282 880s 0.182 0.154 880s test-mul-mat0: iter:81/500 880s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 2, 1, 1] 880s -0.559 880s -0.125 880s 880s x1: [1, 1, 1, 1] 880s 0.010 880s 880s y: [2, 1, 1, 1] 880s -0.006 -0.001 880s testing: mul_mat, [2, 2, 4, 1] = [1, 2, 4, 1] * [1, 2, 4, 1] 880s x0: [1, 2, 4, 1] 880s 0.856 880s -0.557 880s 880s x1: [1, 2, 4, 1] 880s 0.296 880s 0.159 880s 880s y: [2, 2, 4, 1] 880s 0.254 -0.165 880s 0.136 -0.089 880s testing: mul_mat, [3, 2, 4, 1] = [1, 3, 4, 1] * [1, 2, 4, 1] 880s x0: [1, 3, 4, 1] 880s -0.571 880s 0.527 880s 0.738 880s 880s x1: [1, 2, 4, 1] 880s 0.553 880s -0.243 880s 880s y: [3, 2, 4, 1] 880s -0.315 0.291 0.408 880s 0.138 -0.128 -0.179 880s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 3, 1, 1] 880s -0.053 880s -0.034 880s 0.011 880s 880s x1: [1, 3, 1, 1] 880s -0.467 880s 0.504 880s 0.738 880s 880s y: [3, 3, 1, 1] 880s 0.025 0.016 -0.005 880s -0.026 -0.017 0.006 880s -0.039 -0.025 0.008 880s testing: mul_mat, [1, 1, 4, 1] = [3, 1, 4, 1] * [3, 1, 4, 1] 880s x0: [3, 1, 4, 1] 880s 0.892 0.626 0.813 880s 880s x1: [3, 1, 4, 1] 880s -0.469 0.332 -0.554 880s 880s y: [1, 1, 4, 1] 880s -0.661 880s testing: mul_mat, [2, 3, 4, 1] = [1, 2, 4, 1] * [1, 3, 4, 1] 880s x0: [1, 2, 4, 1] 880s -0.833 880s 0.640 880s 880s x1: [1, 3, 4, 1] 880s 0.131 880s 0.932 880s 0.788 880s 880s y: [2, 3, 4, 1] 880s -0.109 0.084 880s -0.776 0.596 880s -0.656 0.504 880s test-mul-mat0: iter:82/500 880s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 1, 1, 1] 880s -0.749 -0.479 880s 880s x1: [2, 3, 1, 1] 880s -0.154 -0.875 880s 0.922 -0.527 880s -0.681 -0.267 880s 880s y: [1, 3, 1, 1] 880s 0.535 880s -0.438 880s 0.638 880s testing: mul_mat, [3, 1,ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s 4, 1] = [2, 3, 4, 1] * [2, 1, 4, 1] 880s x0: [2, 3, 4, 1] 880s 0.516 0.864 880s -0.026 -0.440 880s -0.604 -0.498 880s 880s x1: [2, 1, 4, 1] 880s 0.682 0.148 880s 880s y: [3, 1, 4, 1] 880s 0.480 -0.083 -0.486 880s testing: mul_mat, [1, 3, 4, 2] = [2, 1, 4, 2] * [2, 3, 4, 2] 880s x0: [2, 1, 4, 2] 880s 0.737 0.914 880s 880s x1: [2, 3, 4, 2] 880s -0.515 -0.568 880s -0.309 0.049 880s 0.417 -0.611 880s 880s y: [1, 3, 4, 2] 880s -0.898 880s -0.183 880s -0.251 880s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.461 -0.551 880s 0.152 -0.165 880s 880s x1: [2, 1, 1, 1] 880s 0.445 0.532 880s 880s y: [2, 1, 1, 1] 880s -0.088 -0.020 880s testing: mul_mat, [2, 2, 4, 1] = [2, 2, 4, 1] * [2, 2, 4, 1] 880s x0: [2, 2, 4, 1] 880s 0.152 -0.602 880s 0.435 -0.411 880s 880s x1: [2, 2, 4, 1] 880s -0.734 -0.600 880s -0.930 0.418 880s 880s y: [2, 2, 4, 1] 880s 0.250 -0.072 880s -0.393 -0.576 880s testing: mul_mat, [1, 2, 4, 2] = [2, 1, 4, 2] * [2, 2, 4, 2] 880s x0: [2, 1, 4, 2] 880s -0.275 -0.834 880s 880s x1: [2, 2, 4, 2] 880s 0.057 0.612 880s -0.499 0.879 880s 880s y: [1, 2, 4, 2] 880s -0.527 880s -0.596 880s test-mul-mat0: iter:83/500 880s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 4, 1, 1] 880s -0.529 -0.607 -0.037 -0.363 880s -0.200 -0.574 0.240 -0.353 880s -0.332 0.154 0.005 0.772 880s -0.936 -0.921 -0.439 0.122 880s 880s x1: [4, 2, 1, 1] 880s -0.288 -0.701 -0.526 -0.046 880s -0.670 -0.986 -0.816 -0.445 880s 880s y: [4, 2, 1, 1] 880s 0.614 0.350 -0.050 1.141 880s 1.144 0.661 -0.277 1.839 880s testing: mul_mat, [2, 4, 4, 1] = [4, 2, 4, 1] * [4, 4, 4, 1] 880s x0: [4, 2, 4, 1] 880s -0.430 0.506 -0.782 -0.826 880s 0.741 -0.662 0.372 -0.768 880s 880s x1: [4, 4, 4, 1] 880s 0.304 0.431 -0.084 0.926 880s 0.628 -0.867 -0.362 0.927 880s -0.394 0.593 -0.743 -0.380 880s 0.777 -0.188 -0.142 -0.752 880s 880s y: [2, 4, 4, 1] 880s -0.611 -0.803 880s -1.191 0.193 880s 1.364 -0.669 880s 0.302 1.224 880s testing: mul_mat, [1, 2, 4, 3] = [4, 1, 4, 3] * [4, 2, 4, 3] 880s x0: [4, 1, 4, 3] 880s -0.755 0.194 0.565 -0.961 880s 880s x1: [4, 2, 4, 3] 880s -0.158 -0.476 0.462 -0.417 880s -0.138 -0.166 -0.185 0.942 880s 880s y: [1, 2, 4, 3] 880s 0.688 880s -0.938 880s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 4, 1, 1] 880s 0.867 -0.938 -0.823 0.690 880s -0.320 0.672 -0.348 0.135 880s 0.522 0.652 -0.661 0.610 880s -0.102 -0.397 -0.753 -0.546 880s 880s x1: [4, 1, 1, 1] 880s 0.682 0.479 0.368 0.075 880s 880s y: [4, 1, 1, 1] 880s -0.109 -0.014 0.471 -0.577 880s testing: mul_mat, [2, 4, 4, 1] = [4, 2, 4, 1] * [4, 4, 4, 1] 880s x0: [4, 2, 4, 1] 880s 0.331 -0.689 -0.678 -0.420 880s -0.081 -0.250 -0.533 0.372 880s 880s x1: [4, 4, 4, 1] 880s 0.439 -0.086 0.246 -0.294 880s -0.324 0.467 0.731 0.966 880s -0.548 -0.589 0.648 0.931 880s 0.779 -0.277 -0.383 0.645 880s 880s y: [2, 4, 4, 1] 880s 0.160 -0.255 880s -1.330 -0.120 880s -0.605 0.192 880s 0.438 0.450 880s testing: mul_mat, [2, 4, 4, 3] = [2, 2, 4, 3] * [2, 4, 4, 3] 880s x0: [2, 2, 4, 3] 880s -0.164 0.258 880s 0.947 0.936 880s 880s x1: [2, 4, 4, 3] 880s 0.400 -0.263 880s 0.173 0.722 880s 0.204 0.754 880s 0.094 0.759 880s 880s y: [2, 4, 4, 3] 880s -0.134 0.133 880s 0.158 0.840 880s 0.161 0.899 880s 0.180 0.799 880s test-mul-mat0: iter:84/500 880s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 3, 1, 1] 880s -0.852 0.157 880s 0.691 0.340 880s 0.756 -0.172 880s 880s x1: [2, 2, 1, 1] 880s -0.047 -0.847 880s 0.268 -0.688 880s 880s y: [3, 2, 1, 1] 880s -0.093 -0.321 0.110 880s -0.336 -0.049 0.321 880s testing: mul_mat, [1, 3, 2, 1] = [2, 1, 2, 1] * [2, 3, 2, 1] 880s x0: [2, 1, 2, 1] 880s -0.453 -0.027 880s 880s x1: [2, 3, 2, 1] 880s -0.521 0.613 880s 0.901 0.019 880s 0.709 -0.646 880s 880s y: [1, 3, 2, 1] 880s 0.220 880s -0.408 880s -0.304 880s testing: mul_mat, [2, 1, 2, 4] = [2, 2, 2, 4] * [2, 1, 2, 4] 880s x0: [2, 2, 2, 4] 880s 0.255 0.428 880s 0.177 -0.125 880s 880s x1: [2, 1, 2, 4] 880s 0.929 -0.140 880s 880s y: [2, 1, 2, 4] 880s 0.177 0.182 880s testing: mul_mat, [1, 2, 1, 1] = [2, 1, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 1, 1, 1] 880s -0.214 -0.593 880s 880s x1: [2, 2, 1, 1] 880s 0.207 -0.320 880s 0.139 0.692 880s 880s y: [1, 2, 1, 1] 880s 0.145 880s -0.440 880s testing: mul_mat, [1, 2, 2, 1] = [1, 1, 2, 1] * [1, 2, 2, 1] 880s x0: [1, 1, 2, 1] 880s -0.005 880s 880s x1: [1, 2, 2, 1] 880s 0.573 880s 0.091 880s 880s y: [1, 2, 2, 1] 880s -0.003 880s -0.000 880s testing: mul_mat, [3, 1, 2, 4] = [1, 3, 2, 4] * [1, 1, 2, 4] 880s x0: [1, 3, 2, 4] 880s 0.414 880s 0.706 880s -0.284 880s 880s x1: [1, 1, 2, 4] 880s -0.272 880s 880s y: [3, 1, 2, 4] 880s -0.113 -0.192 0.077 880s test-mul-mat0: iter:85/500 880s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 3, 1, 1] 880s -0.652 880s 0.850 880s -0.641 880s 880s x1: [1, 3, 1, 1] 880s -0.151 880s -0.359 880s 0.134 880s 880s y: [3, 3, 1, 1] 880s 0.098 -0.128 0.097 880s 0.234 -0.305 0.230 880s -0.088 0.114 -0.086 880s testing: mul_mat, [2, 3, 3, 1] = [1, 2, 3, 1] * [1, 3, 3, 1] 880s x0: [1, 2, 3, 1] 880s 0.240 880s -0.733 880s 880s x1: [1, 3, 3, 1] 880s 0.063 880s -0.656 880s 0.958 880s 880s y: [2, 3, 3, 1] 880s 0.015 -0.046 880s -0.157 0.480 880s 0.229 -0.702 880s testing: mul_mat, [1, 2, 3, 2] = [1, 1, 3, 2] * [1, 2, 3, 2] 880s x0: [1, 1, 3, 2] 880s -0.858 880s 880s x1: [1, 2, 3, 2] 880s 0.173 880s -0.092 880s 880s y: [1, 2, 3, 2] 880s -0.149 880s 0.079 880s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 2, 1, 1] 880s -0.078 880s 0.971 880s 880s x1: [1, 1, 1, 1] 880s -0.468 880s 880s y: [2, 1, 1, 1] 880s 0.036 -0.455 880s testing: mul_mat, [4, 1, 3, 1] = [2, 4, 3, 1] * [2, 1, 3, 1] 880s x0: [2, 4, 3, 1] 880s -0.417 -0.946 880s 0.093 0.158 880s -0.522 0.255 880s 0.322 -0.279 880s 880s x1: [2, 1, 3, 1] 880s 0.064 0.987 880s 880s y: [4, 1, 3, 1] 880s -0.961 0.161 0.219 -0.254 880s testing: mul_mat, [1, 2, 3, 2] = [4, 1, 3, 2] * [4, 2, 3, 2] 880s x0: [4, 1, 3, 2] 880s 0.258 -0.918 0.189 0.977 880s 880s x1: [4, 2, 3, 2] 880s -0.486 -0.921 -0.053 0.846 880s 0.126 -0.134 0.416 0.709 880s 880s y: [1, 2, 3, 2] 880s 1.537 880s 0.927 880s test-mul-mat0: iter:86/500 880s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 4, 1, 1] 880s -0.435 0.467 0.166 0.204 880s 0.868 -0.327 0.401 0.594 880s 0.447 -0.427 -0.645 0.448 880s -0.387 0.182 -0.312 -0.428 880s 880s x1: [4, 3, 1, 1] 880s 0.245 0.852 0.950 -0.497 880s 0.934 0.139 -0.520 0.913 880s 0.188 -0.589 0.640 0.267 880s 880s y: [4, 3, 1, 1] 880s 0.348 0.019 -1.090 -0.024 880s -0.241 1.099 1.103 -0.565 880s -0.196 0.771 0.043 -0.494 880s testing: mul_mat, [1, 4, 1, 1] = [4, 1, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 1, 1, 1] 880s -0.561 0.468 -0.221 0.112 880s 880s x1: [4, 4, 1, 1] 880s -0.641 0.900 0.817 -0.789 880s 0.850 -0.680 -0.854 -0.011 880s -0.200 -0.941 -0.823 0.212 880s 0.699 0.444 0.273 -0.736 880s 880s y: [1, 4, 1, 1] 880s 0.511 880s -0.607 880s -0.123 880s -0.327 880s testing: mul_mat, [3, 1, 1, 3] = [4, 3, 1, 3] * [4, 1, 1, 3] 880s x0: [4, 3, 1, 3] 880s -0.520 -0.555 -0.236 -0.374 880s 0.434 0.564 -0.316 0.610 880s -0.224 -0.617 0.054 -0.951 880s 880s x1: [4, 1, 1, 3] 880s -0.131 -0.627 -0.441 0.442 880s 880s y: [3, 1, 1, 3] 880s 0.355 -0.002 -0.028 880s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 4, 1, 1] 880s 0.944 0.670 0.980 0.513 880s 0.825 0.118 0.961 0.947 880s -0.049 -0.148 0.445 0.340 880s -0.629 0.423 0.437 0.829 880s 880s x1: [4, 3, 1, 1] 880s 0.880 0.812 -0.054 -0.344 880s -0.805 -1.000 -0.295 -0.157 880s 0.966 0.192 -0.041 -0.291 880s 880s y: [4, 3, 1, 1] 880s 1.145 0.445 -0.304 -0.519 880s -1.799 -1.215 0.002 -0.176 880s 0.850 0.504 -0.193 -0.786 880s testing: mul_mat, [1, 4, 1, 1] = [4, 1, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 1, 1, 1] 880s -0.994 0.197 0.792 0.124 880s 880s x1: [4, 4, 1, 1] 880s 0.075 -0.324 0.710 -0.113 880s 0.622 -0.634 0.082 0.622 880s 0.070 0.925 0.588 -0.737 880s -0.117 -0.703 0.055 -0.173 880s 880s y: [1, 4, 1, 1] 880s 0.410 880s -0.602 880s 0.487 880s -0.001 880s testing: mul_mat, [4, 4, 1, 3] = [1, 4, 1, 3] * [1, 4, 1, 3] 880s x0: [1, 4, 1, 3] 880s -0.431 880s -0.822 880s 0.428 880s 0.651 880s 880s x1: [1, 4, 1, 3] 880s -0.950 880s 0.215 880s 0.104 880s -0.990 880s 880s y: [4, 4, 1, 3] 880s 0.409 0.781 -0.407 -0.619 880s -0.093 -0.177 0.092 0.140 880s -0.045 -0.085 0.044 0.067 880s 0.426 0.814 -0.424 -0.645 880s test-mul-mat0: iter:87/500 880s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 1, 1, 1] 880s -0.457 880s 880s x1: [1, 4, 1, 1] 880s 0.599 880s -0.055 880s -0.467 880s -0.186 880s 880s y: [1, 4, 1, 1] 880s -0.273 880s 0.025 880s 0.213 880s 0.085 880s testing: mul_mat, [1, 1, 4, 1] = [1, 1, 4, 1] * [1, 1, 4, 1] 880s x0: [1, 1, 4, 1] 880s 0.419 880s 880s x1: [1, 1, 4, 1] 880s 0.475 880s 880s y: [1, 1, 4, 1] 880s 0.199 880s testing: mul_mat, [4, 1, 4, 2] = [1, 4, 4, 2] * [1, 1, 4, 2] 880s x0: [1, 4, 4, 2] 880s -0.978 880s -0.924 880s -0.015 880s 0.308 880s 880s x1: [1, 1, 4, 2] 880s -0.665 880s 880s y: [4, 1, 4, 2] 880s 0.650 0.615 0.010 -0.205 880s testing: mul_mat, [4, 4, 1, 1] = [1, 4, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 4, 1, 1] 880s -0.377 880s 0.922 880s 0.865 880s -0.053 880s 880s x1: [1, 4, 1, 1] 880s -0.442 880s 0.307 880s -0.168 880s -0.559 880s 880s y: [4, 4, 1, 1] 880s 0.166 -0.407 -0.382 0.023 880s -0.116 0.283 0.266 -0.016 880s 0.063 -0.155 -0.145 0.009 880s 0.211 -0.516 -0.483 0.030 880s testing: mul_mat, [1, 1, 4, 1] = [4, 1, 4, 1] * [4, 1, 4, 1] 880s x0: [4, 1, 4, 1] 880s 0.836 -0.142 -0.541 -0.842 880s 880s x1: [4, 1, 4, 1] 880s -0.782 0.664 0.815 0.988 880s 880s y: [1, 1, 4, 1] 880s -2.021 880s testing: mul_mat, [3, 4, 4, 2] = [1, 3, 4, 2] * [1, 4, 4, 2] 880s x0: [1, 3, 4, 2] 880s 0.823 880s -0.319 880s 0.661 880s 880s x1: [1, 4, 4, 2] 880s -0.401 880s 0.587 880s 0.161 880s 0.348 880s 880s y: [3, 4, 4, 2] 880s -0.330 0.128 -0.265 880s 0.483 -0.187 0.388 880s 0.133 -0.051 0.106 880s 0.286 -0.111 0.230 880s test-mul-mat0: iter:88/500 880s testing: mul_mat, [3, 1, 1, 1] = [1, 3, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 3, 1, 1] 880s 0.243 880s 0.246 880s -0.267 880s 880s x1: [1, 1, 1, 1] 880s 0.423 880s 880s y: [3, 1, 1, 1] 880s 0.103 0.104 -0.113 880s testing: mul_mat, [1, 3, 3, 1] = [1, 1, 3, 1] * [1, 3, 3, 1] 880s x0: [1, 1, 3, 1] 880s 0.682 880s 880s x1: [1, 3, 3, 1] 880s -0.096 880s 0.142 880s 0.865 880s 880s y: [1, 3, 3, 1] 880s -0.066 880s 0.097 880s 0.590 880s testing: mul_mat, [1, 1, 3, 2] = [1, 1, 3, 2] * [1, 1, 3, 2] 880s x0: [1, 1, 3, 2] 880s -0.649 880s 880s x1: [1, 1, 3, 2] 880s 0.126 880s 880s y: [1, 1, 3, 2] 880s -0.082 880s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 1, 1, 1] 880s -0.286 880s 880s x1: [1, 1, 1, 1] 880s -0.464 880s 880s y: [1, 1, 1, 1] 880s 0.133 880s testing: mul_mat, [1, 1, 3, 1] = [1, 1, 3, 1] * [1, 1, 3, 1] 880s x0: [1, 1, 3, 1] 880s 0.148 880s 880s x1: [1, 1, 3, 1] 880s 0.782 880s 880s y: [1, 1, 3, 1] 880s 0.116 880s testing: mul_mat, [2, 1, 3, 2] = [1, 2, 3, 2] * [1, 1, 3, 2] 880s x0: [1, 2, 3, 2] 880s -0.970 880s 0.072 880s 880s x1: [1, 1, 3, 2] 880s -0.466 880s 880s y: [2, 1, 3, 2] 880s 0.452 -0.033 880s test-mul-mat0: iter:89/500 880s testing: mul_mat, [4, 3, 1, 1] = [2, 4, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 4, 1, 1] 880s -0.280 0.386 880s -0.347 -0.017 880s 0.899 -0.157 880s -0.071 0.891 880s 880s x1: [2, 3, 1, 1] 880s -0.411 -0.039 880s 0.703 -0.793 880s 0.886 -0.149 880s 880s y: [4, 3, 1, 1] 880s 0.100 0.143 -0.363 -0.005 880s -0.503 -0.231 0.756 -0.756 880s -0.306 -0.305 0.820 -0.195 880s testing: mul_mat, [4, 4, 4, 1] = [2, 4, 4, 1] * [2, 4, 4, 1] 880s x0: [2, 4, 4, 1] 880s 0.570 0.181 880s 0.790 -0.323 880s 0.435 0.583 880s 0.315 0.059 880s 880s x1: [2, 4, 4, 1] 880s -0.127 -0.999 880s 0.928 -0.335 880s 0.849 0.107 880s -0.746 0.793 880s 880s y: [4, 4, 4, 1] 880s -0.253 0.222 -0.638 -0.099 880s 0.468 0.841 0.208 0.272 880s 0.503 0.636 0.432 0.274 880s -0.282 -0.845 0.138 -0.188 880s testing: mul_mat, [2, 4, 4, 2] = [2, 2, 4, 2] * [2, 4, 4, 2] 880s x0: [2, 2, 4, 2] 880s 0.551 0.384 880s -0.481 -0.595 880s 880s x1: [2, 4, 4, 2] 880s 0.668 -0.509 880s -0.959 0.103 880s -0.926 0.356 880s -0.838 0.855 880s 880s y: [2, 4, 4, 2] 880s 0.172 -0.019 880s -0.488 0.400 880s -0.373 0.233 880s -0.133 -0.105 880s testing: mul_mat, [1, 2, 1, 1] = [2, 1, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 1, 1, 1] 880s 0.436 0.025 880s 880s x1: [2, 2, 1, 1] 880s 0.209 0.631 880s 0.463 -0.626 880s 880s y: [1, 2, 1, 1] 880s 0.107 880s 0.186 880s testing: mul_mat, [3, 2, 4, 1] = [1, 3, 4, 1] * [1, 2, 4, 1] 880s x0: [1, 3, 4, 1] 880s 0.025 880s 0.973 880s 0.220 880s 880s x1: [1, 2, 4, 1] 880s -0.729 880s -0.844 880s 880s y: [3, 2, 4, 1] 880s -0.018 -0.709 -0.161 880s -0.021 -0.821 -0.186 880s testing: mul_mat, [2, 1, 4, 2] = [3, 2, 4, 2] * [3, 1, 4, 2] 880s x0: [3, 2, 4, 2] 880s 0.740 0.798 -0.714 880s -0.586 -0.602 -0.443 880s 880s x1: [3, 1, 4, 2] 880s -0.517 -0.016 0.817 880s 880s y: [2, 1, 4, 2] 880s -0.978 -0.050 880s test-mul-mat0: iter:90/500 880s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.099 -0.437 -0.019 880s 0.689 -0.377 -0.702 880s 880s x1: [3, 3, 1, 1] 880s -0.631 0.449 -0.883 880s -0.159 -0.547 0.428 880s 0.257 -0.094 -0.211 880s 880s y: [2, 3, 1, 1] 880s -0.241 0.016 880s 0.215 -0.204 880s 0.071 0.361 880s testing: mul_mat, [2, 2, 3, 1] = [3, 2, 3, 1] * [3, 2, 3, 1] 880s x0: [3, 2, 3, 1] 880s -0.683 0.494 0.395 880s 0.223 -0.717 0.614 880s 880s x1: [3, 2, 3, 1] 880s 0.161 0.953 0.989 880s 0.207 0.170 0.333 880s 880s y: [2, 2, 3, 1] 880s 0.751 -0.040 880s 0.074 0.129 880s testing: mul_mat, [1, 2, 3, 3] = [3, 1, 3, 3] * [3, 2, 3, 3] 880s x0: [3, 1, 3, 3] 880s -0.383 0.019 0.420 880s 880s x1: [3, 2, 3, 3] 880s -0.729 -0.506 0.260 880s -0.423 -0.448 0.260 880s 880s y: [1, 2, 3, 3] 880s 0.380 880s 0.263 880s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 1, 1, 1] 880s -0.609 -0.567 0.119 880s 880s x1: [3, 1, 1, 1] 880s 0.067 0.775 0.414 880s 880s y: [1, 1, 1, 1] 880s -0.431 880s testing: mul_mat, [2, 3, 3, 1] = [1, 2, 3, 1] * [1, 3, 3, 1] 880s x0: [1, 2, 3, 1] 880s 0.929 880s -0.812 880s 880s x1: [1, 3, 3, 1] 880s 0.328 880s 0.708 880s 0.919 880s 880s y: [2, 3, 3, 1] 880s 0.305 -0.266 880s 0.658 -0.574 880s 0.854 -0.746 880s testing: mul_mat, [1, 1, 3, 3] = [2, 1, 3, 3] * [2, 1, 3, 3] 880s x0: [2, 1, 3, 3] 880s -0.829 0.737 880s 880s x1: [2, 1, 3, 3] 880s -0.871 -0.001 880s 880s y: [1, 1, 3, 3] 880s 0.720 880s test-mul-mat0: iter:91/500 880s testing: mul_mat, [2, 2, 1, 1] = [1, 2, 1, 1] * [1, 2, 1, 1] 880s x0: [1, 2, 1, 1] 880s -0.059 880s -0.885 880s 880s x1: [1, 2, 1, 1] 880s 0.987 880s -0.109 880s 880s y: [2, 2, 1, 1] 880s -0.058 -0.873 880s 0.006 0.097 880s testing: mul_mat, [1, 2, 1, 1] = [1, 1, 1, 1] * [1, 2, 1, 1] 880s x0: [1, 1, 1, 1] 880s -0.252 880s 880s x1: [1, 2, 1, 1] 880s 0.768 880s 0.603 880s 880s y: [1, 2, 1, 1] 880s -0.194 880s -0.152 880s testing: mul_mat, [2, 1, 1, 2] = [1, 2, 1, 2] * [1, 1, 1, 2] 880s x0: [1, 2, 1, 2] 880s -0.640 880s -0.572 880s 880s x1: [1, 1, 1, 2] 880s 0.774 880s 880s y: [2, 1, 1, 2] 880s -0.495 -0.443 880s testing: mul_mat, [3, 2, 1, 1] = [1, 3, 1, 1] * [1, 2, 1, 1] 880s x0: [1, 3, 1, 1] 880s -0.425 880s 0.126 880s 0.744 880s 880s x1: [1, 2, 1, 1] 880s -0.952 880s 0.625 880s 880s y: [3, 2, 1, 1] 880s 0.404 -0.120 -0.708 880s -0.265 0.079 0.465 880s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.459 -0.655 -0.145 880s 0.609 0.914 -0.668 880s 0.927 -0.783 0.702 880s 0.326 0.933 -0.543 880s 880s x1: [3, 1, 1, 1] 880s -0.899 -0.836 0.333 880s 880s y: [4, 1, 1, 1] 880s 0.912 -1.534 0.056 -1.255 880s testing: mul_mat, [1, 3, 1, 2] = [4, 1, 1, 2] * [4, 3, 1, 2] 880s x0: [4, 1, 1, 2] 880s -0.538 -0.324 -0.105 -0.206 880s 880s x1: [4, 3, 1, 2] 880s 0.591 -0.551 -0.769 -0.412 880s -0.759 -0.409 0.016 0.464 880s 0.467 0.064 0.088 0.712 880s 880s y: [1, 3, 1, 2] 880s 0.026 880s 0.443 880s -0.427 880s test-mul-mat0: iter:92/500 880s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.620 0.353 -0.308 880s 0.835 -0.191 -0.568 880s 880s x1: [3, 1, 1, 1] 880s 0.989 -0.468 0.641 880s 880s y: [2, 1, 1, 1] 880s 0.251 0.551 880s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 1, 1, 1] 880s -0.261 -0.201 0.839 880s 880s x1: [3, 2, 1, 1] 880s -0.788 0.599 -0.368 880s 0.131 0.997 -0.242 880s 880s y: [1, 2, 1, 1] 880s -0.224 880s -0.438 880s testing: mul_mat, [1, 1, 1, 4] = [3, 1, 1, 4] * [3, 1, 1, 4] 880s x0: [3, 1, 1, 4] 880s -0.554 0.036 0.841 880s 880s x1: [3, 1, 1, 4] 880s 0.415 0.694 -0.366 880s 880s y: [1, 1, 1, 4] 880s -0.514 880s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.651 -0.131 -0.228 880s -0.765 -0.849 0.718 880s -0.642 0.089 0.881 880s -0.957 0.888 -0.540 880s 880s x1: [3, 1, 1, 1] 880s -0.797 0.550 -0.604 880s 880s y: [4, 1, 1, 1] 880s 0.584 -0.291 0.028 1.577 880s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 4, 1, 1] 880s 0.578 0.502 0.086 -0.391 880s 0.106 -0.273 0.520 -0.506 880s 0.785 0.477 -0.394 -0.991 880s 0.128 0.737 0.237 -0.673 880s 880s x1: [4, 3, 1, 1] 880s -0.478 0.863 -0.319 -0.032 880s -0.101 -0.478 -0.233 -0.374 880s -0.802 0.375 -0.214 -0.914 880s 880s y: [4, 3, 1, 1] 880s 0.142 -0.435 0.195 0.521 880s -0.172 0.187 0.155 -0.169 880s 0.064 0.165 0.540 0.738 880s testing: mul_mat, [4, 4, 1, 4] = [4, 4, 1, 4] * [4, 4, 1, 4] 880s x0: [4, 4, 1, 4] 880s 0.528 0.742 -0.398 -0.362 880s -0.934 -0.795 0.050 0.359 880s 0.767 0.589 -0.208 0.800 880s -0.322 0.288 -0.462 0.481 880s 880s x1: [4, 4, 1, 4] 880s -0.625 -0.531 -0.151 -0.762 880s 0.150 0.817 0.136 0.673 880s -0.416 0.763 0.871 0.959 880s -0.451 0.957 0.540 -0.873 880s 880s y: [4, 4, 1, 4] 880s -0.388 0.725 -1.371 -0.248 880s 0.388 -0.542 1.107 0.448 880s -0.349 0.170 0.716 0.413 880s 0.572 -0.626 -0.592 -0.248 880s test-mul-mat0: iter:93/500 880s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.034 -0.547 880s 0.353 -0.427 880s 880s x1: [2, 4, 1, 1] 880s 0.189 -0.417 880s 0.213 0.552 880s -0.863 0.689 880s -0.447 0.050 880s 880s y: [2, 4, 1, 1] 880s 0.235 0.245 880s -0.294 -0.161 880s -0.406 -0.599 880s -0.043 -0.179 880s testing: mul_mat, [3, 2, 4, 1] = [2, 3, 4, 1] * [2, 2, 4, 1] 880s x0: [2, 3, 4, 1] 880s -0.538 0.280 880s 0.972 0.676 880s -0.168 -0.891 880s 880s x1: [2, 2, 4, 1] 880s 0.779 -0.692 880s -0.005 0.366 880s 880s y: [3, 2, 4, 1] 880s -0.612 0.290 0.486 880s 0.105 0.243 -0.325 880s testing: mul_mat, [4, 3, 4, 2] = [2, 4, 4, 2] * [2, 3, 4, 2] 880s x0: [2, 4, 4, 2] 880s -0.949 0.553 880s -0.902 0.724 880s 0.902 0.071 880s -0.846 -0.104 880s 880s x1: [2, 3, 4, 2] 880s -0.529 -0.339 880s 0.990 -0.960 880s 0.847 -0.919 880s 880s y: [4, 3, 4, 2] 880s 0.314 0.231 -0.501 0.482 880s -1.470 -1.588 0.825 -0.737 880s -1.312 -1.430 0.699 -0.621 880s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 4, 1, 1] 880s -0.309 -0.261 880s -0.080 -0.705 880s 0.993 0.408 880s -0.092 -0.330 880s 880s x1: [2, 4, 1, 1] 880s 0.067 -0.440 880s 0.778 0.466 880s 0.743 -0.530 880s -0.114 0.224 880s 880s y: [4, 4, 1, 1] 880s 0.094 0.305 -0.113 0.139 880s -0.362 -0.391 0.963 -0.226 880s -0.092 0.314 0.522 0.106 880s -0.023 -0.149 -0.022 -0.063 880s testing: mul_mat, [3, 2, 4, 1] = [4, 3, 4, 1] * [4, 2, 4, 1] 880s x0: [4, 3, 4, 1] 880s 0.527 0.978 -0.060 0.640 880s -0.096 -0.243 -0.164 -0.554 880s 0.473 0.191 -0.003 -0.216 880s 880s x1: [4, 2, 4, 1] 880s 0.395 -0.626 -0.940 -0.118 880s 0.972 -0.548 0.853 0.718 880s 880s y: [3, 2, 4, 1] 880s -0.423 0.333 0.095 880s 0.384 -0.498 0.197 880s testing: mul_mat, [4, 4, 4, 2] = [3, 4, 4, 2] * [3ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s , 4, 4, 2] 880s x0: [3, 4, 4, 2] 880s -0.002 0.301 0.278 880s 0.816 0.071 0.830 880s -0.814 -0.864 -0.927 880s -0.449 -0.014 0.686 880s 880s x1: [3, 4, 4, 2] 880s 0.097 -0.118 0.461 880s 0.821 -0.546 0.183 880s -0.123 0.744 0.703 880s -0.439 -0.840 0.689 880s 880s y: [4, 4, 4, 2] 880s 0.092 0.452 -0.403 0.274 880s -0.115 0.783 -0.366 -0.235 880s 0.419 0.536 -1.194 0.527 880s -0.060 0.154 0.445 0.681 880s test-mul-mat0: iter:94/500 880s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 2, 1, 1] 880s -0.655 -0.917 880s -0.411 0.628 880s 880s x1: [2, 3, 1, 1] 880s -0.104 0.125 880s -0.413 0.089 880s -0.722 0.317 880s 880s y: [2, 3, 1, 1] 880s -0.046 0.121 880s 0.188 0.226 880s 0.182 0.495 880s testing: mul_mat, [1, 2, 2, 1] = [2, 1, 2, 1] * [2, 2, 2, 1] 880s x0: [2, 1, 2, 1] 880s 0.728 0.800 880s 880s x1: [2, 2, 2, 1] 880s 0.004 -0.989 880s -0.808 0.564 880s 880s y: [1, 2, 2, 1] 880s -0.788 880s -0.137 880s testing: mul_mat, [1, 1, 2, 1] = [2, 1, 2, 1] * [2, 1, 2, 1] 880s x0: [2, 1, 2, 1] 880s 0.856 0.467 880s 880s x1: [2, 1, 2, 1] 880s 0.378 0.297 880s 880s y: [1, 1, 2, 1] 880s 0.462 880s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.372 0.603 880s -0.679 0.454 880s 880s x1: [2, 1, 1, 1] 880s -0.945 0.885 880s 880s y: [2, 1, 1, 1] 880s 0.183 1.043 880s testing: mul_mat, [4, 2, 2, 1] = [2, 4, 2, 1] * [2, 2, 2, 1] 880s x0: [2, 4, 2, 1] 880s 0.812 0.484 880s -0.679 0.564 880s -0.710 -0.010 880s 0.612 -0.219 880s 880s x1: [2, 2, 2, 1] 880s -0.089 0.231 880s -0.542 -0.078 880s 880s y: [4, 2, 2, 1] 880s 0.039 0.191 0.061 -0.105 880s -0.478 0.324 0.386 -0.315 880s testing: mul_mat, [3, 2, 2, 1] = [4, 3, 2, 1] * [4, 2, 2, 1] 880s x0: [4, 3, 2, 1] 880s -0.772 0.792 -0.129 0.806 880s -0.084 0.906 -0.544 -0.528 880s -0.099 0.682 -0.637 0.839 880s 880s x1: [4, 2, 2, 1] 880s 0.362 0.174 0.443 0.683 880s -0.223 -0.103 -0.406 -0.992 880s 880s y: [3, 2, 2, 1] 880s 0.352 -0.475 0.374 880s -0.657 0.669 -0.622 880s test-mul-mat0: iter:95/500 880s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 4, 1, 1] 880s 0.069 -0.430 -0.750 880s 0.542 -0.591 0.971 880s -0.915 0.611 -0.134 880s -0.928 -0.504 -0.463 880s 880s x1: [3, 3, 1, 1] 880s 0.553 0.791 0.624 880s 0.781 -0.293 -0.475 880s 0.573 -0.386 -0.793 880s 880s y: [4, 3, 1, 1] 880s -0.770 0.438 -0.106 -1.200 880s 0.536 0.134 -0.829 -0.357 880s 0.801 -0.231 -0.653 0.030 880s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.957 0.684 0.715 880s -0.429 0.891 -0.842 880s 0.640 -0.539 -0.592 880s 880s x1: [3, 4, 1, 1] 880s -0.044 0.975 -0.783 880s 0.194 -0.029 0.328 880s -0.191 0.458 0.535 880s -0.639 0.250 0.159 880s 880s y: [3, 4, 1, 1] 880s 0.066 1.547 -0.090 880s 0.401 -0.386 -0.054 880s 0.513 0.040 -0.686 880s -0.327 0.363 -0.637 880s testing: mul_mat, [3, 3, 1, 4] = [3, 3, 1, 4] * [3, 3, 1, 4] 880s x0: [3, 3, 1, 4] 880s -0.414 -0.613 -0.512 880s -0.119 0.339 -0.513 880s -0.630 0.263 0.814 880s 880s x1: [3, 3, 1, 4] 880s 0.181 -0.130 -0.621 880s 0.267 -0.519 0.245 880s 0.339 -0.023 0.782 880s 880s y: [3, 3, 1, 4] 880s 0.322 0.253 -0.654 880s 0.082 -0.333 -0.105 880s -0.526 -0.449 0.417 880s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.009 0.141 -0.808 880s 0.170 0.654 0.143 880s 880s x1: [3, 3, 1, 1] 880s 0.545 -0.117 0.388 880s 0.360 0.552 -0.968 880s -0.963 -0.121 0.307 880s 880s y: [2, 3, 1, 1] 880s -0.326 0.072 880s 0.863 0.283 880s -0.273 -0.199 880s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 2, 1, 1] 880s -0.141 0.809 880s -0.407 -0.509 880s 880s x1: [2, 3, 1, 1] 880s -0.027 -0.988 880s 0.204 0.375 880s 0.812 -0.806 880s 880s y: [2, 3, 1, 1] 880s -0.795 0.514 880s 0.274 -0.274 880s -0.767 0.080 880s testing: mul_mat, [4, 2, 1, 4] = [2, 4, 1, 4] * [2, 2, 1, 4] 880s x0: [2, 4, 1, 4] 880s -0.019 0.607 880s 0.595 -0.966 880s -0.170 0.329 880s 0.954 0.419 880s 880s x1: [2, 2, 1, 4] 880s 0.380 -0.133 880s 0.004 0.506 880s 880s y: [4, 2, 1, 4] 880s -0.088 0.354 -0.108 0.307 880s 0.307 -0.486 0.166 0.216 880s test-mul-mat0: iter:96/500 880s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 1, 1, 1] 880s 0.583 0.199 880s 880s x1: [2, 4, 1, 1] 880s -0.667 0.011 880s 0.784 -0.439 880s 0.579 -0.939 880s 0.382 0.956 880s 880s y: [1, 4, 1, 1] 880s -0.387 880s 0.370 880s 0.151 880s 0.413 880s testing: mul_mat, [2, 1, 2, 1] = [2, 2, 2, 1] * [2, 1, 2, 1] 880s x0: [2, 2, 2, 1] 880s -0.716 0.364 880s 0.417 0.522 880s 880s x1: [2, 1, 2, 1] 880s -0.397 0.290 880s 880s y: [2, 1, 2, 1] 880s 0.390 -0.014 880s testing: mul_mat, [2, 2, 2, 1] = [2, 2, 2, 1] * [2, 2, 2, 1] 880s x0: [2, 2, 2, 1] 880s 0.247 0.968 880s 0.532 0.308 880s 880s x1: [2, 2, 2, 1] 880s -0.661 -0.225 880s 0.259 0.098 880s 880s y: [2, 2, 2, 1] 880s -0.381 -0.421 880s 0.158 0.168 880s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 2, 1, 1] 880s -0.887 -0.631 880s 0.825 0.530 880s 880s x1: [2, 2, 1, 1] 880s -0.314 -0.259 880s -0.777 0.542 880s 880s y: [2, 2, 1, 1] 880s 0.442 -0.396 880s 0.347 -0.354 880s testing: mul_mat, [1, 2, 2, 1] = [2, 1, 2, 1] * [2, 2, 2, 1] 880s x0: [2, 1, 2, 1] 880s -0.499 -0.117 880s 880s x1: [2, 2, 2, 1] 880s 0.347 0.153 880s -0.293 -0.348 880s 880s y: [1, 2, 2, 1] 880s -0.191 880s 0.187 880s testing: mul_mat, [2, 2, 2, 1] = [1, 2, 2, 1] * [1, 2, 2, 1] 880s x0: [1, 2, 2, 1] 880s 0.225 880s 0.855 880s 880s x1: [1, 2, 2, 1] 880s -0.164 880s -0.147 880s 880s y: [2, 2, 2, 1] 880s -0.037 -0.140 880s -0.033 -0.126 880s test-mul-mat0: iter:97/500 880s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 1, 1, 1] 880s 0.255 880s 880s x1: [1, 4, 1, 1] 880s -0.955 880s -0.505 880s 0.908 880s -0.586 880s 880s y: [1, 4, 1, 1] 880s -0.244 880s -0.129 880s 0.232 880s -0.150 880s testing: mul_mat, [3, 1, 3, 1] = [1, 3, 3, 1] * [1, 1, 3, 1] 880s x0: [1, 3, 3, 1] 880s 0.778 880s 0.335 880s -0.808 880s 880s x1: [1, 1, 3, 1] 880s 0.566 880s 880s y: [3, 1, 3, 1] 880s 0.440 0.190 -0.458 880s testing: mul_mat, [3, 3, 3, 2] = [1, 3, 3, 2] * [1, 3, 3, 2] 880s x0: [1, 3, 3, 2] 880s -0.364 880s 0.829 880s -0.845 880s 880s x1: [1, 3, 3, 2] 880s -0.619 880s 0.620 880s -0.721 880s 880s y: [3, 3, 3, 2] 880s 0.226 -0.513 0.523 880s -0.226 0.514 -0.524 880s 0.263 -0.598 0.609 880s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 3, 1, 1] 880s 0.598 880s 0.178 880s 0.432 880s 880s x1: [1, 3, 1, 1] 880s 0.625 880s -0.787 880s -0.580 880s 880s y: [3, 3, 1, 1] 880s 0.374 0.111 0.270 880s -0.471 -0.140 -0.340 880s -0.346 -0.103 -0.250 880s testing: mul_mat, [2, 1, 3, 1] = [3, 2, 3, 1] * [3, 1, 3, 1] 880s x0: [3, 2, 3, 1] 880s -0.243 -0.234 0.601 880s -0.821 0.757 -0.372 880s 880s x1: [3, 1, 3, 1] 880s 0.477 0.843 0.772 880s 880s y: [2, 1, 3, 1] 880s 0.150 -0.041 880s testing: mul_mat, [2, 3, 3, 2] = [2, 2, 3, 2] * [2, 3, 3, 2] 880s x0: [2, 2, 3, 2] 880s 0.317 0.057 880s -0.115 -0.922 880s 880s x1: [2, 3, 3, 2] 880s 0.884 0.890 880s -0.781 0.361 880s 0.733 0.991 880s 880s y: [2, 3, 3, 2] 880s 0.331 -0.923 880s -0.227 -0.243 880s 0.289 -0.998 880s test-mul-mat0: iter:98/500 880s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 2, 1, 1] 880s -0.263 880s 0.976 880s 880s x1: [1, 1, 1, 1] 880s -0.341 880s 880s y: [2, 1, 1, 1] 880s 0.090 -0.333 880s testing: mul_mat, [4, 2, 2, 1] = [1, 4, 2, 1] * [1, 2, 2, 1] 880s x0: [1, 4, 2, 1] 880s 0.323 880s 0.034 880s -0.411 880s -0.451 880s 880s x1: [1, 2, 2, 1] 880s 0.236 880s 0.794 880s 880s y: [4, 2, 2, 1] 880s 0.076 0.008 -0.097 -0.106 880s 0.256 0.027 -0.326 -0.358 880s testing: mul_mat, [3, 4, 2, 1] = [1, 3, 2, 1] * [1, 4, 2, 1] 880s x0: [1, 3, 2, 1] 880s 0.189 880s -0.276 880s -0.003 880s 880s x1: [1, 4, 2, 1] 880s 0.840 880s 0.627 880s -0.505 880s -0.666 880s 880s y: [3, 4, 2, 1] 880s 0.159 -0.232 -0.003 880s 0.119 -0.173 -0.002 880s -0.096 0.139 0.002 880s -0.126 0.184 0.002 880s testing: mul_mat, [4, 3, 1, 1] = [1, 4, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 4, 1, 1] 880s -0.398 880s 0.685 880s -0.048 880s -0.046 880s 880s x1: [1, 3, 1, 1] 880s -0.496 880s -0.192 880s 0.631 880s 880s y: [4, 3, 1, 1] 880s 0.197 -0.339 0.024 0.023 880s 0.076 -0.131 0.009 0.009 880s -0.251 0.432 -0.031 -0.029 880s testing: mul_mat, [2, 1, 2, 1] = [4, 2, 2, 1] * [4, 1, 2, 1] 880s x0: [4, 2, 2, 1] 880s 0.543 0.824 -0.025 -0.677 880s -0.064 0.549 0.703 0.887 880s 880s x1: [4, 1, 2, 1] 880s 0.007 0.986 0.543 0.556 880s 880s y: [2, 1, 2, 1] 880s 0.426 1.415 880s testing: mul_mat, [4, 4, 2, 1] = [2, 4, 2, 1] * [2, 4, 2, 1] 880s x0: [2, 4, 2, 1] 880s 0.991 0.750 880s 0.426 -0.086 880s 0.028 -0.413 880s 0.694 -0.203 880s 880s x1: [2, 4, 2, 1] 880s -0.830 0.291 880s -0.471 0.854 880s -0.758 0.483 880s -0.138 -0.772 880s 880s y: [4, 4, 2, 1] 880s -0.604 -0.379 -0.143 -0.635 880s 0.174 -0.274 -0.366 -0.501 880s -0.389 -0.364 -0.220 -0.624 880s -0.716 0.007 0.315 0.061 880s test-mul-mat0: iter:99/500 880s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 4, 1, 1] 880s -0.121 880s 0.640 880s 0.621 880s 0.555 880s 880s x1: [1, 1, 1, 1] 880s 0.222 880s 880s y: [4, 1, 1, 1] 880s -0.027 0.142 0.138 0.123 880s testing: mul_mat, [1, 4, 2, 1] = [1, 1, 2, 1] * [1, 4, 2, 1] 880s x0: [1, 1, 2, 1] 880s 0.027 880s 880s x1: [1, 4, 2, 1] 880s 0.974 880s -0.263 880s 0.721 880s 0.852 880s 880s y: [1, 4, 2, 1] 880s 0.027 880s -0.007 880s 0.020 880s 0.023 880s testing: mul_mat, [3, 1, 2, 4] = [1, 3, 2, 4] * [1, 1, 2, 4] 880s x0: [1, 3, 2, 4] 880s 0.284 880s 0.952 880s 0.730 880s 880s x1: [1, 1, 2, 4] 880s 0.993 880s 880s y: [3, 1, 2, 4] 880s 0.282 0.945 0.725 880s testing: mul_mat, [4, 3, 1, 1] = [1, 4, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 4, 1, 1] 880s 0.505 880s 0.869 880s 0.405 880s -0.212 880s 880s x1: [1, 3, 1, 1] 880s 0.170 880s -0.305 880s -0.713 880s 880s y: [4, 3, 1, 1] 880s 0.086 0.148 0.069 -0.036 880s -0.154 -0.265 -0.124 0.064 880s -0.360 -0ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s .619 -0.289 0.151 880s testing: mul_mat, [3, 1, 2, 1] = [4, 3, 2, 1] * [4, 1, 2, 1] 880s x0: [4, 3, 2, 1] 880s -0.454 -0.494 -0.823 0.253 880s 0.246 0.664 0.359 -0.918 880s 0.605 0.698 0.608 -0.750 880s 880s x1: [4, 1, 2, 1] 880s 0.820 0.136 -0.211 0.625 880s 880s y: [3, 1, 2, 1] 880s -0.107 -0.357 -0.006 880s testing: mul_mat, [1, 4, 2, 4] = [3, 1, 2, 4] * [3, 4, 2, 4] 880s x0: [3, 1, 2, 4] 880s 0.003 -0.138 0.991 880s 880s x1: [3, 4, 2, 4] 880s -0.900 -0.769 -0.915 880s 0.835 -0.460 -0.235 880s -0.863 0.086 -0.989 880s 0.742 0.591 0.675 880s 880s y: [1, 4, 2, 4] 880s -0.804 880s -0.167 880s -0.995 880s 0.589 880s test-mul-mat0: iter:100/500 880s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 1, 1, 1] 880s -0.241 -0.852 0.192 880s 880s x1: [3, 1, 1, 1] 880s -0.470 0.897 -0.843 880s 880s y: [1, 1, 1, 1] 880s -0.814 880s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 2, 1, 1] 880s -0.048 -0.746 -0.447 880s 0.714 0.220 0.641 880s 880s x1: [3, 1, 1, 1] 880s 0.272 0.929 0.008 880s 880s y: [2, 1, 1, 1] 880s -0.710 0.405 880s testing: mul_mat, [4, 2, 1, 3] = [3, 4, 1, 3] * [3, 2, 1, 3] 880s x0: [3, 4, 1, 3] 880s -0.356 -0.589 0.746 880s -0.427 0.419 -0.252 880s 0.525 0.673 0.301 880s 0.240 -0.107 -0.058 880s 880s x1: [3, 2, 1, 3] 880s -0.986 0.545 0.732 880s -0.635 0.226 -0.046 880s 880s y: [4, 2, 1, 3] 880s 0.576 0.465 0.069 -0.337 880s 0.058 0.377 -0.195 -0.174 880s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 4, 1, 1] 880s 0.973 0.088 -0.327 880s 0.882 0.321 0.628 880s 0.655 0.995 -0.324 880s -0.924 -0.366 0.244 880s 880s x1: [3, 4, 1, 1] 880s -0.399 -0.216 -0.926 880s 0.902 -0.976 -0.033 880s -0.156 0.277 0.405 880s 0.517 -0.105 0.069 880s 880s y: [4, 4, 1, 1] 880s -0.104 -1.003 -0.176 0.222 880s 0.803 0.462 -0.369 -0.485 880s -0.260 0.206 0.042 0.142 880s 0.471 0.466 0.212 -0.423 880s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 2, 1, 1] 880s -0.294 -0.547 -0.857 0.575 880s 0.073 0.602 0.597 -0.975 880s 880s x1: [4, 3, 1, 1] 880s 0.735 0.362 -0.774 0.588 880s -0.379 -0.373 -0.810 0.405 880s -0.299 -0.908 0.429 0.668 880s 880s y: [2, 3, 1, 1] 880s 0.589 -0.764 880s 1.243 -1.131 880s 0.601 -0.963 880s testing: mul_mat, [2, 4, 1, 3] = [2, 2, 1, 3] * [2, 4, 1, 3] 880s x0: [2, 2, 1, 3] 880s -0.732 -0.074 880s 0.549 0.865 880s 880s x1: [2, 4, 1, 3] 880s 0.252 0.651 880s 0.113 -0.427 880s 0.646 0.747 880s 0.246 0.274 880s 880s y: [2, 4, 1, 3] 880s -0.232 0.701 880s -0.051 -0.308 880s -0.528 1.001 880s -0.200 0.372 880s test-mul-mat0: iter:101/500 880s testing: mul_mat, [3, 3, 1, 1] = [2, 3, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 3, 1, 1] 880s -0.763 0.525 880s -0.033 0.478 880s 0.235 -0.217 880s 880s x1: [2, 3, 1, 1] 880s -0.296 -0.980 880s -0.860 0.301 880s -0.574 0.482 880s 880s y: [3, 3, 1, 1] 880s -0.288 -0.459 0.143 880s 0.814 0.172 -0.268 880s 0.691 0.249 -0.240 880s testing: mul_mat, [4, 3, 1, 1] = [2, 4, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 4, 1, 1] 880s 0.896 -0.967 880s -0.127 0.960 880s 0.723 0.294 880s 0.771 -0.341 880s 880s x1: [2, 3, 1, 1] 880s 0.425 -0.981 880s 0.051 -0.026 880s -0.055 -0.084 880s 880s y: [4, 3, 1, 1] 880s 1.329 -0.996 0.019 0.663 880s 0.071 -0.032 0.029 0.048 880s 0.032 -0.073 -0.064 -0.014 880s testing: mul_mat, [2, 4, 1, 2] = [2, 2, 1, 2] * [2, 4, 1, 2] 880s x0: [2, 2, 1, 2] 880s -0.514 0.882 880s -0.434 0.431 880s 880s x1: [2, 4, 1, 2] 880s -0.012 -0.994 880s -0.916 0.692 880s -0.974 -0.776 880s -0.007 -0.548 880s 880s y: [2, 4, 1, 2] 880s -0.871 -0.423 880s 1.082 0.695 880s -0.183 0.088 880s -0.480 -0.233 880s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.214 0.064 880s 0.820 0.298 880s 880s x1: [2, 2, 1, 1] 880s 0.538 0.287 880s -0.445 -0.169 880s 880s y: [2, 2, 1, 1] 880s 0.133 0.526 880s -0.106 -0.415 880s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.228 -0.769 880s 0.586 -0.540 880s 880s x1: [2, 2, 1, 1] 880s 0.512 0.090 880s 0.522 -0.495 880s 880s y: [2, 2, 1, 1] 880s 0.048 0.251 880s 0.500 0.573 880s testing: mul_mat, [1, 2, 1, 2] = [2, 1, 1, 2] * [2, 2, 1, 2] 880s x0: [2, 1, 1, 2] 880s 0.146 0.221 880s 880s x1: [2, 2, 1, 2] 880s 0.634 -0.602 880s -0.074 -0.535 880s 880s y: [1, 2, 1, 2] 880s -0.040 880s -0.129 880s test-mul-mat0: iter:102/500 880s testing: mul_mat, [1, 4, 1, 1] = [4, 1, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 1, 1, 1] 880s 0.112 -0.171 -0.408 0.459 880s 880s x1: [4, 4, 1, 1] 880s -0.485 0.747 0.410 0.579 880s 0.044 -0.079 -0.331 -0.434 880s 0.426 -0.790 0.794 0.012 880s -0.559 -0.746 -0.354 -0.161 880s 880s y: [1, 4, 1, 1] 880s -0.083 880s -0.046 880s -0.136 880s 0.135 880s testing: mul_mat, [1, 1, 4, 1] = [4, 1, 4, 1] * [4, 1, 4, 1] 880s x0: [4, 1, 4, 1] 880s -0.527 -0.287 -0.965 -0.733 880s 880s x1: [4, 1, 4, 1] 880s 0.386 0.435 -0.395 ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s -0.393 880s 880s y: [1, 1, 4, 1] 880s 0.340 880s testing: mul_mat, [1, 1, 4, 3] = [4, 1, 4, 3] * [4, 1, 4, 3] 880s x0: [4, 1, 4, 3] 880s 0.639 -0.130 0.211 -0.085 880s 880s x1: [4, 1, 4, 3] 880s 0.547 -0.862 0.332 -0.870 880s 880s y: [1, 1, 4, 3] 880s 0.605 880s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 4, 1, 1] 880s 0.931 0.731 0.128 -0.208 880s -0.741 -0.094 -0.149 -0.363 880s -0.566 0.824 -0.817 0.981 880s -0.564 0.522 -0.439 -0.375 880s 880s x1: [4, 1, 1, 1] 880s 0.681 -0.167 0.821 -0.720 880s 880s y: [4, 1, 1, 1] 880s 0.767 -0.351 -1.900 -0.562 880s testing: mul_mat, [1, 4, 4, 1] = [4, 1, 4, 1] * [4, 4, 4, 1] 880s x0: [4, 1, 4, 1] 880s 0.481 0.055 0.944 -0.716 880s 880s x1: [4, 4, 4, 1] 880s 0.287 -0.691 0.618 0.480 880s 0.996 -0.234 -0.890 0.633 880s -0.603 -0.799 0.315 0.230 880s -0.978 0.595 0.956 0.953 880s 880s y: [1, 4, 4, 1] 880s 0.339 880s -0.827 880s -0.202 880s -0.217 880s testing: mul_mat, [4, 4, 4, 3] = [1, 4, 4, 3] * [1, 4, 4, 3] 880s x0: [1, 4, 4, 3] 880s 0.027 880s 0.814 880s -0.135 880s 0.357 880s 880s x1: [1, 4, 4, 3] 880s -0.090 880s 0.122 880s -0.761 880s 0.268 880s 880s y: [4, 4, 4, 3] 880s -0.002 -0.073 0.012 -0.032 880s 0.003 0.099 -0.017 0.044 880s -0.020 -0.620 0.103 -0.272 880s 0.007 0.218 -0.036 0.096 880s test-mul-mat0: iter:103/500 880s testing: mul_mat, [4, 3, 1, 1] = [1, 4, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 4, 1, 1] 880s 0.561 880s 0.479 880s 0.673 880s 0.586 880s 880s x1: [1, 3, 1, 1] 880s 0.505 880s 0.520 880s -0.633 880s 880s y: [4, 3, 1, 1] 880s 0.283 0.242 0.340 0.296 880s 0.292 0.249 0.350 0.305 880s -0.355 -0.303 -0.426 -0.371 880s testing: mul_mat, [4, 4, 2, 1] = [1, 4, 2, 1] * [1, 4, 2, 1] 880s x0: [1, 4, 2, 1] 880s -0.223 880s 0.389 880s 0.803 880s -0.664 880s 880s x1: [1, 4, 2, 1] 880s 0.871 880s -0.518 880s -0.362 880s -0.922 880s 880s y: [4, 4, 2, 1] 880s -0.195 0.339 0.700 -0.578 880s 0.116 -0.201 -0.416 0.344 880s 0.081 -0.141 -0.291 0.240 880s 0.206 -0.359 -0.741 0.612 880s testing: mul_mat, [1, 4, 2, 2] = [1, 1, 2, 2] * [1, 4, 2, 2] 880s x0: [1, 1, 2, 2] 880s -0.741 880s 880s x1: [1, 4, 2, 2] 880s -0.076 880s -0.398 880s -0.495 880s -0.750 880s 880s y: [1, 4, 2, 2] 880s 0.056 880s 0.295 880s 0.367 880s 0.556 880s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 1, 1, 1] 880s 0.847 880s 880s x1: [1, 1, 1, 1] 880s -0.813 880s 880s y: [1, 1, 1, 1] 880s -0.689 880s testing: mul_mat, [1, 1, 2, 1] = [1, 1, 2, 1] * [1, 1, 2, 1] 880s x0: [1, 1, 2, 1] 880s -0.620 880s 880s x1: [1, 1, 2, 1] 880s 0.576 880s 880s y: [1, 1, 2, 1] 880s -0.357 880s testing: mul_mat, [4, 1, 2, 2] = [1, 4, 2, 2] * [1, 1, 2, 2] 880s x0: [1, 4, 2, 2] 880s -0.226 880s -0.143 880s 0.117 880s -0.471 880s 880s x1: [1, 1, 2, 2] 880s -0.052 880s 880s y: [4, 1, 2, 2] 880s 0.012 0.007 -0.006 0.025 880s test-mul-mat0: iter:104/500 880s testing: mul_mat, [1, 4, 1, 1] = [4, 1, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 1, 1, 1] 880s 0.209 -0.085 -0.478 0.204 880s 880s x1: [4, 4, 1, 1] 880s 0.618 -0.982 -0.643 0.802 880s -0.602 0.958 -0.250 -0.454 880s -0.518 -0.700 0.198 0.256 880s 0.157 -0.685 0.784 -0.159 880s 880s y: [1, 4, 1, 1] 880s 0.684 880s -0.181 880s -0.091 880s -0.316 880s testing: mul_mat, [3, 1, 1, 1] = [4, 3, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 3, 1, 1] 880s -0.494 0.945 0.434 -0.052 880s 0.382 0.053 -0.034 0.740 880s -0.145 0.364 0.698 0.605 880s 880s x1: [4, 1, 1, 1] 880s -0.037 -0.037 -0.749 0.624 880s 880s y: [3, 1, 1, 1] 880s -0.374 0.471 -0.153 880s testing: mul_mat, [3, 3, 1, 4] = [4, 3, 1, 4] * [4, 3, 1, 4] 880s x0: [4, 3, 1, 4] 880s -0.989 0.342 -0.999 -0.721 880s -0.320 -0.268 0.600 -0.015 880s 0.203 0.776 -0.347 0.371 880s 880s x1: [4, 3, 1, 4] 880s 0.910 -0.820 0.905 0.108 880s 0.436 0.062 0.423 0.220 880s 0.903 0.686 -0.571 -0.182 880s 880s y: [3, 3, 1, 4] 880s -2.163 0.470 -0.726 880s -0.991 0.095 0.071 880s 0.043 -0.813 0.846 880s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 3, 1, 1] 880s -0.439 -0.433 0.304 0.244 880s 0.480 0.777 0.673 0.346 880s 0.248 0.228 -0.807 0.811 880s 880s x1: [4, 3, 1, 1] 880s -0.466 -0.541 0.799 0.042 880s 0.098 -0.375 -0.018 0.203 880s 0.456 0.989 -0.444 0.463 880s 880s y: [3, 3, 1, 1] 880s 0.692 -0.091 -0.850 880s 0.164 -0.186 0.118 880s -0.650 0.849 1.072 880s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 4, 1, 1] 880s 0.807 0.775 0.329 880s -0.688 -0.918 0.492 880s -0.866 -0.075 0.031 880s -0.637 0.255 -0.443 880s 880s x1: [3, 4, 1, 1] 880s -0.444 -0.424 0.661 880s 0.587 0.047 -0.569 880s -0.879 0.506 -0.769 880s 0.163 -0.396 -0.144 880s 880s y: [4, 4, 1, 1] 880s -0.469 1.020 0.437 -0.118 880s 0.322 -0.727 -0.529 -0.110 880s -0.570 -0.239 0.698 1.029 880s -0.223 0.181 -0.116 -0.141 880s testing: mul_mat, [1, 3, 1, 4] = [4, 1, 1, 4] * [4, 3, 1, 4] 880s x0: [4, 1, 1, 4] 880s 0.372 0.726 -0.424 0.118 880s 880s x1: [4, ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s 3, 1, 4] 880s -0.204 -0.295 -0.250 -0.959 880s -0.949 -0.439 -0.404 -0.373 880s -0.778 -0.817 0.674 -0.347 880s 880s y: [1, 3, 1, 4] 880s -0.298 880s -0.545 880s -1.209 880s test-mul-mat0: iter:105/500 880s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 3, 1, 1] 880s 0.392 -0.703 880s 0.246 0.151 880s -0.569 0.771 880s 880s x1: [2, 2, 1, 1] 880s -0.223 -0.441 880s 0.099 0.947 880s 880s y: [3, 2, 1, 1] 880s 0.223 -0.121 -0.213 880s -0.627 0.167 0.673 880s testing: mul_mat, [2, 3, 3, 1] = [2, 2, 3, 1] * [2, 3, 3, 1] 880s x0: [2, 2, 3, 1] 880s 0.847 -0.699 880s -0.329 -0.054 880s 880s x1: [2, 3, 3, 1] 880s -0.477 -0.843 880s -0.653 0.641 880s 0.838 -0.250 880s 880s y: [2, 3, 3, 1] 880s 0.185 0.203 880s -1.001 0.181 880s 0.884 -0.263 880s testing: mul_mat, [3, 2, 3, 2] = [2, 3, 3, 2] * [2, 2, 3, 2] 880s x0: [2, 3, 3, 2] 880s 0.992 -0.833 880s -0.463 0.480 880s 0.142 -0.315 880s 880s x1: [2, 2, 3, 2] 880s -0.989 0.133 880s -0.505 0.652 880s 880s y: [3, 2, 3, 2] 880s -1.091 0.521 -0.183 880s -1.045 0.547 -0.278 880s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 2, 1, 1] 880s -0.850 -0.712 880s -0.730 -0.532 880s 880s x1: [2, 3, 1, 1] 880s -0.216 0.622 880s -0.588 0.963 880s -0.234 0.657 880s 880s y: [2, 3, 1, 1] 880s -0.259 -0.173 880s -0.186 -0.083 880s -0.269 -0.178 880s testing: mul_mat, [1, 2, 3, 1] = [2, 1, 3, 1] * [2, 2, 3, 1] 880s x0: [2, 1, 3, 1] 880s -0.885 0.584 880s 880s x1: [2, 2, 3, 1] 880s 0.581 0.496 880s -0.766 0.018 880s 880s y: [1, 2, 3, 1] 880s -0.225 880s 0.688 880s testing: mul_mat, [2, 2, 3, 2] = [1, 2, 3, 2] * [1, 2, 3, 2] 880s x0: [1, 2, 3, 2] 880s -0.399 880s -0.001 880s 880s x1: [1, 2, 3, 2] 880s 0.246 880s -0.532 880s 880s y: [2, 2, 3, 2] 880s -0.098 -0.000 880s 0.213 0.001 880s test-mul-mat0: iter:106/500 880s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 1, 1, 1] 880s -0.126 -0.990 0.250 0.979 880s 880s x1: [4, 2, 1, 1] 880s 0.158 -0.814 -0.306 0.626 880s 0.567 -0.649 -0.944 0.714 880s 880s y: [1, 2, 1, 1] 880s 1.322 880s 1.034 880s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 1, 1, 1] 880s 0.433 0.021 -0.902 -0.630 880s 880s x1: [4, 1, 1, 1] 880s -0.225 -0.585 -0.420 0.774 880s 880s y: [1, 1, 1, 1] 880s -0.219 880s testing: mul_mat, [4, 1, 1, 3] = [4, 4, 1, 3] * [4, 1, 1, 3] 880s x0: [4, 4, 1, 3] 880s -0.104 0.426 0.094 -0.390 880s 0.091 0.968 -0.380 -0.659 880s 0.947 0.395 -0.243 -0.474 880s 0.169 0.356 0.960 -0.809 880s 880s x1: [4, 1, 1, 3] 880s -0.111 0.918 -0.406 -0.221 880s 880s y: [4, 1, 1, 3] 880s 0.451 1.178 0.461 0.097 880s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 2, 1, 1] 880s -0.497 0.853 0.451 0.265 880s 0.280 0.186 0.422 -0.622 880s 880s x1: [4, 4, 1, 1] 880s -0.615 -0.482 0.109 0.308 880s -0.623 0.196 0.164 -0.511 880s -0.574 0.401 -0.987 -0.015 880s -0.287 -0.425 -0.561 0.451 880s 880s y: [2, 4, 1, 1] 880s 0.026 -0.408 880s 0.416 0.248 880s 0.178 -0.493 880s -0.353 -0.677 880s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 1, 1, 1] 880s 0.265 -0.154 880s 880s x1: [2, 4, 1, 1] 880s 0.517 -0.487 880s -0.339 0.573 880s 0.951 -0.602 880s 0.957 -0.531 880s 880s y: [1, 4, 1, 1] 880s 0.212 880s -0.178 880s 0.344 880s 0.335 880s testing: mul_mat, [3, 2, 1, 3] = [1, 3, 1, 3] * [1, 2, 1, 3] 880s x0: [1, 3, 1, 3] 880s -0.459 880s 0.923 880s -0.447 880s 880s x1: [1, 2, 1, 3] 880s -0.297 880s -0.572 880s 880s y: [3, 2, 1, 3] 880s 0.136 -0.274 0.133 880s 0.262 -0.528 0.256 880s test-mul-mat0: iter:107/500 880s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 3, 1, 1] 880s -0.483 0.972 880s -0.839 -0.526 880s -0.559 0.668 880s 880s x1: [2, 1, 1, 1] 880s -0.055 -0.980 880s 880s y: [3, 1, 1, 1] 880s -0.925 0.562 -0.624 880s testing: mul_mat, [4, 3, 4, 1] = [2, 4, 4, 1] * [2, 3, 4, 1] 880s x0: [2, 4, 4, 1] 880s -0.058 -0.035 880s 0.551 0.416 880s 0.406 0.220 880s 0.155 -0.307 880s 880s x1: [2, 3, 4, 1] 880s 0.739 0.287 880s -0.628 -0.833 880s -0.377 0.501 880s 880s y: [4, 3, 4, 1] 880s -0.053 0.527 0.363 0.026 880s 0.066 -0.693 -0.438 0.159 880s 0.004 0.000 -0.043 -0.212 880s testing: mul_mat, [3, 4, 4, 1] = [2, 3, 4, 1] * [2, 4, 4, 1] 880s x0: [2, 3, 4, 1] 880s -0.495 -0.089 880s -0.683 0.202 880s 0.508 -0.592 880s 880s x1: [2, 4, 4, 1] 880s -0.502 0.280 880s 0.542 0.904 880s -0.501 -0.303 880s -0.403 -0.909 880s 880s y: [3, 4, 4, 1] 880s 0.224 0.399 -0.421 880s -0.349 -0.188 -0.259 880s 0.275 0.281 -0.075 880s 0.280 0.092 0.333 880s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 1, 1, 1] 880s 0.452 -0.037 880s 880s x1: [2, 3, 1, 1] 880s -0.637 0.985 880s 0.044 -0.099 880s -0.053 -0.948 880s 880s y: [1, 3, 1, 1] 880s -0.324 880s 0.024 880s 0.011 880s testing: mul_mat, [4, 2, 4, 1] = [1, 4, 4, 1] * [1, 2, 4, 1] 880s x0: [1, 4, 4, 1] 880s -0.481 880s -0.823 880s 0.513 880s -0.229 880s 880s x1: [1, 2, 4, 1] 880s 0.035 880s -0.347 880s 880s y:ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s [4, 2, 4, 1] 880s -0.017 -0.029 0.018 -0.008 880s 0.167 0.285 -0.178 0.079 880s testing: mul_mat, [3, 1, 4, 1] = [4, 3, 4, 1] * [4, 1, 4, 1] 880s x0: [4, 3, 4, 1] 880s -0.566 -0.444 -0.823 -0.053 880s -0.081 -0.188 -0.736 -0.750 880s 0.785 -0.635 -0.502 -0.006 880s 880s x1: [4, 1, 4, 1] 880s 0.397 0.701 0.082 -0.885 880s 880s y: [3, 1, 4, 1] 880s -0.557 0.439 -0.170 880s test-mul-mat0: iter:108/500 880s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 1, 1, 1] 880s -0.420 -0.234 -0.415 880s 880s x1: [3, 1, 1, 1] 880s -0.610 0.247 -0.113 880s 880s y: [1, 1, 1, 1] 880s 0.246 880s testing: mul_mat, [2, 1, 2, 1] = [3, 2, 2, 1] * [3, 1, 2, 1] 880s x0: [3, 2, 2, 1] 880s 0.498 0.837 -0.813 880s 0.520 -0.076 -0.445 880s 880s x1: [3, 1, 2, 1] 880s -0.166 -0.207 0.024 880s 880s y: [2, 1, 2, 1] 880s -0.275 -0.082 880s testing: mul_mat, [3, 2, 2, 1] = [3, 3, 2, 1] * [3, 2, 2, 1] 880s x0: [3, 3, 2, 1] 880s 0.540 -0.444 0.763 880s 0.280 0.804 -0.471 880s -0.223 0.641 -0.284 880s 880s x1: [3, 2, 2, 1] 880s 0.259 -0.542 -0.876 880s 0.582 0.094 -0.486 880s 880s y: [3, 2, 2, 1] 880s -0.288 0.049 -0.156 880s -0.098 0.467 0.069 880s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.330 -0.248 0.663 880s 0.107 -0.130 -0.485 880s 880s x1: [3, 3, 1, 1] 880s -0.308 0.508 -0.886 880s -0.726 -0.398 -0.371 880s 0.102 -0.417 0.575 880s 880s y: [2, 3, 1, 1] 880s -0.815 0.330 880s -0.387 0.154 880s 0.518 -0.214 880s testing: mul_mat, [2, 3, 2, 1] = [2, 2, 2, 1] * [2, 3, 2, 1] 880s x0: [2, 2, 2, 1] 880s -0.706 0.438 880s -0.084 0.802 880s 880s x1: [2, 3, 2, 1] 880s -0.850 0.467 880s 0.045 -0.072 880s 0.108 0.761 880s 880s y: [2, 3, 2, 1] 880s 0.804 0.446 880s -0.063 -0.062 880s 0.257 0.601 880s testing: mul_mat, [2, 2, 2, 1] = [2, 2, 2, 1] * [2, 2, 2, 1] 880s x0: [2, 2, 2, 1] 880s -0.409 0.471 880s 0.338 0.636 880s 880s x1: [2, 2, 2, 1] 880s -0.186 -0.013 880s 0.235 0.324 880s 880s y: [2, 2, 2, 1] 880s 0.070 -0.071 880s 0.057 0.285 880s test-mul-mat0: iter:109/500 880s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 3, 1, 1] 880s -0.012 880s 0.593 880s 0.300 880s 880s x1: [1, 3, 1, 1] 880s 0.353 880s 0.550 880s 0.791 880s 880s y: [3, 3, 1, 1] 880s -0.004 0.210 0.106 880s -0.006 0.326 0.165 880s -0.009 0.469 0.238 880s testing: mul_mat, [2, 3, 2, 1] = [1, 2, 2, 1] * [1, 3, 2, 1] 880s x0: [1, 2, 2, 1] 880s 0.301 880s -0.464 880s 880s x1: [1, 3, 2, 1] 880s -0.300 880s 0.997 880s -0.040 880s 880s y: [2, 3, 2, 1] 880s -0.090 0.139 880s 0.300 -0.463 880s -0.012 0.019 880s testing: mul_mat, [3, 2, 2, 4] = [1, 3, 2, 4] * [1, 2, 2, 4] 880s x0: [1, 3, 2, 4] 880s -0.077 880s 0.723 880s 0.888 880s 880s x1: [1, 2, 2, 4] 880s -0.460 880s 0.505 880s 880s y: [3, 2, 2, 4] 880s 0.035 -0.333 -0.408 880s -0.039 0.365 0.448 880s testing: mul_mat, [4, 3, 1, 1] = [1, 4, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 4, 1, 1] 880s -0.685 880s -0.968 880s 0.942 880s 0.238 880s 880s x1: [1, 3, 1, 1] 880s 0.036 880s -0.620 880s 0.902 880s 880s y: [4, 3, 1, 1] 880s -0.025 -0.035 0.034 0.009 880s 0.425 0.600 -0.584 -0.148 880s -0.618 -0.873 0.849 0.215 880s testing: mul_mat, [4, 1, 2, 1] = [4, 4, 2, 1] * [4, 1, 2, 1] 880s x0: [4, 4, 2, 1] 880s -0.814 0.416 -0.206 0.408 880s 0.298 0.259 0.414 0.829 880s -0.135 0.002 -0.734 0.902 880s -0.319 0.546 -0.134 0.788 880s 880s x1: [4, 1, 2, 1] 880s 0.756 0.830 -0.138 0.476 880s 880s y: [4, 1, 2, 1] 880s -0.047 0.777 0.430 0.605 880s testing: mul_mat, [2, 4, 2, 4] = [4, 2, 2, 4] * [4, 4, 2, 4] 880s x0: [4, 2, 2, 4] 880s -0.556 -0.396 -0.664 0.449 880s 0.754 -0.437 -0.287 -0.574 880s 880s x1: [4, 4, 2, 4] 880s -0.278 0.042 -0.957 -0.862 880s -0.699 0.044 0.684 0.095 880s -0.542 0.950 0.961 0.867 880s 0.778 0.863 0.655 0.509 880s 880s y: [2, 4, 2, 4] 880s 0.387 0.542 880s -0.041 -0.797 880s -0.325 -1.596 880s -0.982 -0.270 880s test-mul-mat0: iter:110/500 880s testing: mul_mat, [3, 3, 1, 1] = [2, 3, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 3, 1, 1] 880s -0.792 0.563 880s -0.563 -0.376 880s -0.371 0.355 880s 880s x1: [2, 3, 1, 1] 880s 0.092 0.179 880s -0.886 -0.255 880s 0.979 0.153 880s 880s y: [3, 3, 1, 1] 880s 0.028 -0.119 0.029 880s 0.558 0.595 0.238 880s -0.690 -0.609 -0.309 880s testing: mul_mat, [4, 3, 4, 1] = [2, 4, 4, 1] * [2, 3, 4, 1] 880s x0: [2, 4, 4, 1] 880s -0.856 -0.153 880s 0.765 -0.233 880s 0.476 0.120 880s -0.530 -0.322 880s 880s x1: [2, 3, 4, 1] 880s 0.703 0.202 880s 0.118 -0.296 880s 0.937 0.744 880s 880s y: [4, 3, 4, 1] 880s -0.633 0.491 0.359 -0.437 880s -0.056 0.159 0.021 0.032 880s -0.916 0.544 0.535 -0.736 880s testing: mul_mat, [4, 4, 4, 3] = [2, 4, 4, 3] * [2, 4, 4, 3] 880s x0: [2, 4, 4, 3] 880s 0.847 0.948 880s -0.692 0.272 880s -0.006 -0.222 880s 0.419 -0.605 880s 880s x1: [2, 4, 4, 3] 880s -0.258 0.496 880s -0.389 -0.782 880s -0.384 0.081 880s -0.104 -0.146 880s 880s y: [4, 4, 4, 3] 880s 0.252 0.314 -0.109 -0.408 880s -1.070 0.0ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s 56 0.176 0.310 880s -0.248 0.288 -0.016 -0.210 880s -0.226 0.032 0.033 0.045 880s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 1, 1, 1] 880s 0.434 -0.633 880s 880s x1: [2, 4, 1, 1] 880s 0.185 0.121 880s -0.935 -0.439 880s -0.244 0.800 880s -0.654 0.065 880s 880s y: [1, 4, 1, 1] 880s 0.004 880s -0.129 880s -0.612 880s -0.326 880s testing: mul_mat, [3, 2, 4, 1] = [1, 3, 4, 1] * [1, 2, 4, 1] 880s x0: [1, 3, 4, 1] 880s 0.975 880s 0.824 880s -0.654 880s 880s x1: [1, 2, 4, 1] 880s -0.342 880s -0.877 880s 880s y: [3, 2, 4, 1] 880s -0.334 -0.282 0.224 880s -0.855 -0.722 0.573 880s testing: mul_mat, [4, 1, 4, 3] = [3, 4, 4, 3] * [3, 1, 4, 3] 880s x0: [3, 4, 4, 3] 880s 0.420 -0.280 0.936 880s 0.106 -0.787 0.556 880s -0.464 0.090 -0.707 880s -0.501 -0.433 0.124 880s 880s x1: [3, 1, 4, 3] 880s -0.110 -0.761 0.774 880s 880s y: [4, 1, 4, 3] 880s 0.892 1.018 -0.565 0.480 880s test-mul-mat0: iter:111/500 880s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 2, 1, 1] 880s -0.093 -0.529 -0.654 -0.424 880s -0.914 -0.711 -0.504 -0.587 880s 880s x1: [4, 3, 1, 1] 880s -0.855 0.586 0.746 0.154 880s 0.296 -0.482 -0.519 0.243 880s -0.906 -0.513 -0.095 -0.734 880s 880s y: [2, 3, 1, 1] 880s -0.782 -0.101 880s 0.463 0.191 880s 0.729 1.672 880s testing: mul_mat, [4, 2, 3, 1] = [4, 4, 3, 1] * [4, 2, 3, 1] 880s x0: [4, 4, 3, 1] 880s -0.520 -0.547 -0.533 -0.231 880s -0.051 -0.119 0.116 0.343 880s 0.364 0.236 -0.779 -0.390 880s 0.462 0.269 -0.449 -0.252 880s 880s x1: [4, 2, 3, 1] 880s -0.653 -0.606 -0.517 -0.880 880s -0.122 0.246 -0.774 0.049 880s 880s y: [4, 2, 3, 1] 880s 1.149 -0.256 0.365 -0.012 880s 0.330 -0.096 0.598 0.346 880s testing: mul_mat, [1, 4, 3, 3] = [4, 1, 3, 3] * [4, 4, 3, 3] 880s x0: [4, 1, 3, 3] 880s -0.207 0.234 0.732 -0.926 880s 880s x1: [4, 4, 3, 3] 880s 0.218 -0.319 0.322 -0.884 880s -0.987 -0.038 0.080 -0.012 880s 0.917 0.015 0.874 0.006 880s -0.172 0.142 -0.425 0.093 880s 880s y: [1, 4, 3, 3] 880s 0.934 880s 0.265 880s 0.448 880s -0.328 880s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 1, 1, 1] 880s 0.711 -0.365 0.712 0.168 880s 880s x1: [4, 1, 1, 1] 880s 0.526 0.518 -0.056 -0.502 880s 880s y: [1, 1, 1, 1] 880s 0.061 880s testing: mul_mat, [2, 4, 3, 1] = [1, 2, 3, 1] * [1, 4, 3, 1] 880s x0: [1, 2, 3, 1] 880s -0.376 880s -0.215 880s 880s x1: [1, 4, 3, 1] 880s -0.648 880s -0.860 880s -0.248 880s 0.099 880s 880s y: [2, 4, 3, 1] 880s 0.244 0.139 880s 0.324 0.185 880s 0.093 0.053 880s -0.037 -0.021 880s testing: mul_mat, [4, 1, 3, 3] = [2, 4, 3, 3] * [2, 1, 3, 3] 880s x0: [2, 4, 3, 3] 880s 0.672 -0.140 880s 0.172 0.196 880s -0.083 0.684 880s 0.060 0.645 880s 880s x1: [2, 1, 3, 3] 880s 0.978 -0.361 880s 880s y: [4, 1, 3, 3] 880s 0.708 0.097 -0.329 -0.174 880s test-mul-mat0: iter:112/500 880s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.587 -0.562 -0.181 880s -0.623 -0.932 -0.604 880s 880s x1: [3, 4, 1, 1] 880s -0.792 -0.179 -0.749 880s -0.607 -0.613 -0.606 880s -0.598 0.194 0.705 880s -0.624 -0.769 -0.793 880s 880s y: [2, 4, 1, 1] 880s -0.228 1.112 880s 0.098 1.316 880s -0.587 -0.234 880s 0.210 1.584 880s testing: mul_mat, [3, 2, 3, 1] = [3, 3, 3, 1] * [3, 2, 3, 1] 880s x0: [3, 3, 3, 1] 880s -0.849 0.052 0.612 880s -0.618 0.259 -0.186 880s 0.968 0.697 0.634 880s 880s x1: [3, 2, 3, 1] 880s 0.520 -0.842 -0.205 880s -0.790 -0.608 0.659 880s 880s y: [3, 2, 3, 1] 880s -0.611 -0.502 -0.214 880s 1.043 0.208 -0.771 880s testing: mul_mat, [4, 3, 3, 2] = [3, 4, 3, 2] * [3, 3, 3, 2] 880s x0: [3, 4, 3, 2] 880s 0.470 -0.598 0.706 880s 0.331 0.861 0.189 880s -0.360 -0.743 -0.850 880s -0.253 -0.175 -0.261 880s 880s x1: [3, 3, 3, 2] 880s -0.063 0.478 -0.203 880s -0.827 0.628 0.848 880s 0.785 -0.990 0.108 880s 880s y: [4, 3, 3, 2] 880s -0.459 0.352 -0.160 -0.014 880s -0.166 0.428 -0.891 -0.122 880s 1.037 -0.573 0.362 -0.054 880s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 4, 1, 1] 880s 0.980 0.324 -0.659 880s 0.832 0.721 -0.234 880s 0.977 0.266 -0.560 880s -0.529 -0.709 0.168 880s 880s x1: [3, 4, 1, 1] 880s -0.463 -0.698 0.561 880s 0.364 -0.099 0.621 880s -0.678 0.415 0.312 880s -0.402 -0.255 0.073 880s 880s y: [4, 4, 1, 1] 880s -1.049 -1.020 -0.952 0.834 880s -0.084 0.087 -0.018 -0.018 880s -0.735 -0.337 -0.726 0.116 880s -0.525 -0.536 -0.502 0.406 880s testing: mul_mat, [4, 3, 3, 1] = [4, 4, 3, 1] * [4, 3, 3, 1] 880s x0: [4, 4, 3, 1] 880s -0.336 -0.963 0.889 -0.045 880s -0.170 0.289 -0.226 0.373 880s -0.999 -0.768 -0.974 0.068 880s -0.713 -0.341 0.562 0.112 880s 880s x1: [4, 3, 3, 1] 880s -0.167 -0.039 -0.160 0.722 880s -0.997 -0.894 -0.741 -0.695 880s 0.667 0.623 0.207 0.288 880s 880s y: [4, 3, 3, 1] 880s -0.081 0.323 0.402 0.123 880s 0.569 -0.181 2.357 0.522 880s -0.653 0.127 -1.326 -0.540 880s testing: mul_mat, [2, 4, 3, 2] = [4, 2, 3, 2] * [4, 4, 3, 2] 880s x0: [4, 2, 3, 2] 880s 0.283 0.653 0.980 -0.385 880s -0.706 -0.906 0.784 0.715 880s 880s x1: [4, 4, 3, 2] 880s -0.134 -0.772 -0.982 0.884 880s 0.438 -0.855 -0.537 -0.965 880s -0.045 0.563 -0.140 -0.830 880s 0.188 -0.981 -0.950 0.644 880s 880s y: [2, 4, 3, 2] 880s -1.845 0.656 880s -0.590 -0.646 880s 0.537 -1.181 880s -1.767 0.470 880s test-mul-mat0: iter:113/500 880s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 3, 1, 1] 880s -0.862 0.819 0.567 -0.428 880s 0.359 0.714 0.871 -0.555 880s 0.941 0.192 0.223 -0.807 880s 880s x1: [4, 3, 1, 1] 880s 0.401 0.031 0.532 0.618 880s -0.848 -0.402 -0.514 -0.406 880s -0.851 -0.517 -0.681 0.994 880s 880s y: [3, 3, 1, 1] 880s -0.283 0.287 0.003 880s 0.285 -0.814 -0.662 880s -0.502 -1.820 -1.854 880s testing: mul_mat, [1, 3, 4, 1] = [4, 1, 4, 1] * [4, 3, 4, 1] 880s x0: [4, 1, 4, 1] 880s 0.960 0.621 0.365 -0.187 880s 880s x1: [4, 3, 4, 1] 880s -0.775 -0.237 -0.719 0.164 880s -0.154 -0.655 -0.434 0.877 880s 0.878 -0.816 -0.971 -0.524 880s 880s y: [1, 3, 4, 1] 880s -1.184 880s -0.877 880s 0.079 880s testing: mul_mat, [1, 1, 4, 3] = [4, 1, 4, 3] * [4, 1, 4, 3] 880s x0: [4, 1, 4, 3] 880s -0.165 0.299 -0.764 0.251 880s 880s x1: [4, 1, 4, 3] 880s -0.453 -0.825 -0.106 0.216 880s 880s y: [1, 1, 4, 3] 880s -0.037 880s testing: mul_mat, [3, 1, 1, 1] = [4, 3, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 3, 1, 1] 880s 0.515 -0.224 0.835 0.730 880s -0.109 -0.264 -0.264 -0.965 880s 0.182 0.754 0.024 -0.469 880s 880s x1: [4, 1, 1, 1] 880s 0.794 -0.999 0.827 0.327 880s 880s y: [3, 1, 1, 1] 880s 1.563 -0.357 -0.741 880s testing: mul_mat, [3, 4, 4, 1] = [3, 3, 4, 1] * [3, 4, 4, 1] 880s x0: [3, 3, 4, 1] 880s 0.535 -0.289 0.641 880s 0.517 0.454 0.226 880s -0.732 -0.918 -0.369 880s 880s x1: [3, 4, 4, 1] 880s -0.125 -0.428 0.217 880s 0.574 0.789 -0.104 880s -0.284 0.576 -0.518 880s -0.240 -0.631 0.065 880s 880s y: [3, 4, 4, 1] 880s 0.196 -0.210 0.405 880s 0.012 0.631 -1.106 880s -0.650 -0.002 -0.130 880s 0.096 -0.396 0.731 880s testing: mul_mat, [4, 3, 4, 3] = [3, 4, 4, 3] * [3, 3, 4, 3] 880s x0: [3, 4, 4, 3] 880s -0.496 -0.966 -0.399 880s -0.531 -0.822 0.109 880s -0.814 0.187 -0.615 880s 0.112 -0.083 -0.732 880s 880s x1: [3, 3, 4, 3] 880s -0.849 0.060 0.994 880s -0.217 -0.344 -0.277 880s 0.999 -0.263 0.828 880s 880s y: [4, 3, 4, 3] 880s -0.034 0.509 0.090 -0.827 880s 0.551 0.368 0.283 0.207 880s -0.571 -0.223 -1.371 -0.472 880s test-mul-mat0: iter:114/500 880s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 2, 1, 1] 880s 0.938 880s 0.822 880s 880s x1: [1, 4, 1, 1] 880s 0.886 880s 0.706 880s -0.432 880s 0.615 880s 880s y: [2, 4, 1, 1] 880s 0.831 0.728 880s 0.662 0.581 880s -0.405 -0.355 880s 0.577 0.506 880s testing: mul_mat, [3, 2, 2, 1] = [1, 3, 2, 1] * [1, 2, 2, 1] 880s x0: [1, 3, 2, 1] 880s -0.560 880s -0.906 880s 0.587 880s 880s x1: [1, 2, 2, 1] 880s 0.842 880s 0.036 880s 880s y: [3, 2, 2, 1] 880s -0.472 -0.763 0.494 880s -0.020 -0.032 0.021 880s testing: mul_mat, [1, 3, 2, 1] = [1, 1, 2, 1] * [1, 3, 2, 1] 880s x0: [1, 1, 2, 1] 880s -0.395 880s 880s x1: [1, 3, 2, 1] 880s 0.784 880s 0.809 880s -0.160 880s 880s y: [1, 3, 2, 1] 880s -0.310 880s -0.320 880s 0.063 880s testing: mul_mat, [3, 1, 1, 1] = [1, 3, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 3, 1, 1] 880s -0.316 880s 0.406 880s -0.593 880s 880s x1: [1, 1, 1, 1] 880s -0.966 880s 880s y: [3, 1, 1, 1] 880s 0.305 -0.392 0.572 880s testing: mul_mat, [4, 1, 2, 1] = [3, 4, 2, 1] * [3, 1, 2, 1] 880s x0: [3, 4, 2, 1] 880s -0.053 0.453 -0.824 880s 0.013 0.179 0.274 880s 0.085 0.957 -0.619 880s -0.629 -0.403 0.985 880s 880s x1: [3, 1, 2, 1] 880s -0.609 0.974 -0.977 880s 880s y: [4, 1, 2, 1] 880s 1.279 -0.101 1.486 -0.972 880s testing: mul_mat, [3, 3, 2, 1] = [4, 3, 2, 1] * [4, 3, 2, 1] 880s x0: [4, 3, 2, 1] 880s -0.176 -0.749 -0.931 0.042 880s -0.762 0.941 0.617 0.205 880s 0.653 0.029 -0.572 0.671 880s 880s x1: [4, 3, 2, 1] 880s 0.818 0.676 0.398 -0.027 880s 0.588 0.243 0.970 -0.465 880s -0.743 0.055 -0.094 0.710 880s 880s y: [3, 3, 2, 1] 880s -1.023 0.254 0.308 880s -1.208 0.284 -0.476 880s 0.207 0.706 0.046 880s test-mul-mat0: iter:115/500 880s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 3, 1, 1] 880s 0.252 -0.188 0.178 -0.007 880s -0.260 0.246 0.639 -0.623 880s 0.548 0.148 -0.679 -0.739 880s 880s x1: [4, 3, 1, 1] 880s 0.979 -0.749 -0.969 -0.197 880s -0.511 0.684 0.054 -0.570 880s -0.287 0.123 -0.953 0.141 880s 880s y: [3, 3, 1, 1] 880s 0.217 -0.935 1.230 880s -0.244 0.691 0.206 880s -0.266 -0.592 0.404 880s testing: mul_mat, [1, 3, 2, 1] = [4, 1, 2, 1] * [4, 3, 2, 1] 880s x0: [4, 1, 2, 1] 880s -0.639 -0.787 0.789 0.509 880s 880s x1: [4, 3, 2, 1] 880s -0.205 -0.048 0.369 -0.307 880s 0.262 -0.285 -0.328 0.512 880s -0.254 0.475 -0.999 -0.570 880s 880s y: [1, 3, 2, 1] 880s 0.304 880s 0.059 880s -1.290 880s testing: mul_mat, [3, 1, 2, 4] = [4, 3, 2, 4] * [4, 1, 2, 4] 880s x0: [4, 3, 2, 4] 880s -0.346 0.772 -0.535 -0.085 880s -0.347 0.610 0.176 0.963 880s 0.284 0.869 -0.582 -0.390 880s 880s x1: [4, 1, 2, 4] 880s 0.419 -0.002 -0.252 -0.865 880s 880s y: [3, 1, 2, 4] 880s 0.062 -1.025 0.602 880s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 1, 1, 1] 880s -0.801 0.356 0.146 0.077 880s 880s x1: [4, 3, 1, 1] 880s 0.180 0.682 0.148 0.075 880s -0.953 0.013 -0.541 0.512 880s -0.522 -0.632 0.231 0.295 880s 880s y: [1, 3, 1, 1] 880s 0.125 880s 0.729 880s 0.250 880s testing: mul_mat, [1, 4, 2, 1] = [1, 1, 2, 1] * [1, 4, 2, 1] 880s x0: [1, 1, 2, 1] 880s 0.616 880s 880s x1: [1, 4, 2, 1] 880s -0.071 880s 0.120 880s -0.583 880s -0.547 880s 880s y: [1, 4, 2, 1] 880s -0.044 880s 0.074 880s -0.360 880s -0.337 880s testing: mul_mat, [4, 1, 2, 4] = [1, 4, 2, 4] * [1, 1, 2, 4] 880s x0: [1, 4, 2, 4] 880s 0.877 880s -0.145 880s 0.745 880s -0.755 880s 880s x1: [1, 1, 2, 4] 880s 0.163 880s 880s y: [4, 1, 2, 4] 880s 0.143 -0.024 0.121 -0.123 880s test-mul-mat0: iter:116/500 880s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 2, 1, 1] 880s -0.702 -0.215 -0.222 0.822 880s 0.169 -0.628 -0.548 0.897 880s 880s x1: [4, 3, 1, 1] 880s -0.064 0.258 0.457 0.332 880s 0.393 -0.180 -0.343 -0.351 880s -0.402 -0.565 0.608 0.563 880s 880s y: [2, 3, 1, 1] 880s 0.161 -0.126 880s -0.450 0.053 880s 0.731 0.458 880s testing: mul_mat, [2, 2, 1, 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 2, 1, 1] 880s 0.753 -0.681 0.592 0.210 880s 0.651 -0.015 -0.970 -0.692 880s 880s x1: [4, 2, 1, 1] 880s 0.261 0.336 -0.957 0.493 880s 0.487 -0.934 -0.420 0.382 880s 880s y: [2, 2, 1, 1] 880s -0.495 0.752 880s 0.834 0.473 880s testing: mul_mat, [4, 2, 1, 2] = [4, 4, 1, 2] * [4, 2, 1, 2] 880s x0: [4, 4, 1, 2] 880s 0.446 -0.365 -0.849 -0.974 880s -0.982 -0.515 0.778 -0.663 880s -0.923 -0.011 0.987 0.063 880s 0.018 -0.705 -0.303 0.646 880s 880s x1: [4, 2, 1, 2] 880s 0.634 -0.372 -0.257 0.242 880s -0.809 0.542 0.540 -0.024 880s 880s y: [4, 2, 1, 2] 880s 0.401 -0.792 -0.819 0.508 880s -0.993 0.952 1.272 -0.576 880s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 2, 1, 1] 880s -0.717 -0.026 0.590 -0.945 880s -0.309 0.097 0.814 0.181 880s 880s x1: [4, 4, 1, 1] 880s -0.563 0.236 0.014 -0.545 880s 0.721 -0.207 -0.209 0.799 880s 0.781 -0.221 -0.139 -0.201 880s 0.074 0.558 -0.554 -0.888 880s 880s y: [2, 4, 1, 1] 880s 0.921 0.110 880s -1.389 -0.268 880s -0.446 -0.412 880s 0.446 -0.580 880s testing: mul_mat, [3, 4, 1, 1] = [2, 3, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 3, 1, 1] 880s 0.160 0.952 880s 0.097 -0.104 880s 0.519 0.300 880s 880s x1: [2, 4, 1, 1] 880s -0.533 0.055 880s -0.276 -0.732 880s -0.860 0.712 880s -0.294 0.376 880s 880s y: [3, 4, 1, 1] 880s -0.033 -0.058 -0.260 880s -0.741 0.049 -0.363 880s 0.540 -0.158 -0.233 880s 0.311 -0.068 -0.040 880s testing: mul_mat, [2, 2, 1, 2] = [3, 2, 1, 2] * [3, 2, 1, 2] 880s x0: [3, 2, 1, 2] 880s 0.600 0.378 0.035 880s 0.980 -0.934 -0.899 880s 880s x1: [3, 2, 1, 2] 880s -0.270 0.757 -0.901 880s 0.804 0.315 -0.455 880s 880s y: [2, 2, 1, 2] 880s 0.093 -0.161 880s 0.585 0.903 880s test-mul-mat0: iter:117/500 880s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 1, 1, 1] 880s 0.456 -0.313 0.617 880s 880s x1: [3, 1, 1, 1] 880s -0.456 0.357 -0.117 880s 880s y: [1, 1, 1, 1] 880s -0.392 880s testing: mul_mat, [2, 1, 3, 1] = [3, 2, 3, 1] * [3, 1, 3, 1] 880s x0: [3, 2, 3, 1] 880s -0.217 0.879 0.769 880s -0.116 -0.786 -0.056 880s 880s x1: [3, 1, 3, 1] 880s -0.999 -0.398 -0.572 880s 880s y: [2, 1, 3, 1] 880s -0.573 0.460 880s testing: mul_mat, [3, 2, 3, 2] = [3, 3, 3, 2] * [3, 2, 3, 2] 880s x0: [3, 3, 3, 2] 880s 0.097 -0.093 0.342 880s -0.210 0.302 0.402 880s 0.218 -0.520 0.341 880s 880s x1: [3, 2, 3, 2] 880s -0.892 0.177 0.606 880s -0.891 0.780 -0.966 880s 880s y: [3, 2, 3, 2] 880s 0.104 0.484 -0.080 880s -0.489 0.034 -0.929 880s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.337 0.816 -0.671 880s 0.974 0.849 0.197 880s 0.911 0.576 -0.118 880s 0.449 0.072 -0.888 880s 880s x1: [3, 3, 1, 1] 880s 0.713 -0.714 -0.222 880s 0.045 0.235 0.515 880s -0.705 0.278 0.349 880s 880s y: [4, 3, 1, 1] 880s -0.674 0.045 0.264 0.466 880s -0.170 0.345 0.115 -0.421 880s 0.230 -0.381 -0.523 -0.607 880s testing: mul_mat, [4, 3, 3, 1] = [4, 4, 3, 1] * [4, 3, 3, 1] 880s x0: [4, 4, 3, 1] 880s -0.823 -0.052 -0.461 0.307 880s -0.582 0.184 0.276 0.462 880s 0.345 0.698 0.756 0.919 880s -0.586 -0.287 0.242 0.248 880s 880s x1: [4, 3, 3, 1] 880s 0.719 0.881 0.192 -0.995 880s 0.846 -0.138 -0.476 -0.755 880s 0.061 -0.762 -0.469 0.839 880s 880s y: [4, 3, 3, 1] 880s -1.032 -0.663 0.094 -0.874 880s -0.702 -0.998 -0.858 -0.758 880s 0.463 0.082 -0.095 0.277 880s testing: mul_mat, [1, 4, 3, 2] = [4, 1, 3, 2] * [4, 4, 3, 2] 880s x0: [4, 1, 3, 2] 880s 0.581 0.967 0.284 0.576 880s 880s x1: [4, 4, 3, 2] 880s 0.738 0.732 -0.819 0.714 880s -0.110 0.456 -0.864 -0.722 880s -0.622 -0.115 0.090 -0.048 880s 0.806 -0.814 0.703 0.145 880s 880s y: [1, 4, 3, 2] 880s 1.315 880s -0.285 880s -0.475 880s -0.036 880s test-mul-mat0: iter:118/500 880s testing: mul_mat, [4, 3, 1, 1] = [2, 4, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 4, 1, 1] 880s 0.319 -0.367 880s 0.044 -0.829 880s 0.957 -0.206 880s -0.018 0.589 880s 880s x1: [2, 3, 1, 1] 880s -0.683 0.548 880s 0.758 0.898 880s 0.515 0.042 880s 880s y: [4, 3, 1, 1] 880s -0.419 -0.484 -0.767 0.335 880s -0.088 -0.711 0.541 0.515 880s 0.149 -0.012 0.484 0.015 880s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 4, 1, 1] 880s 0.881 0.552 880s 0.764 -0.614 880s 0.372 -0.074 880s 0.543 0.689 880s 880s x1: [2, 4, 1, 1] 880s 0.538 0.533 880s 0.531 -0.660 880s 0.337 -0.777 880s -0.315 0.777 880s 880s y: [4, 4, 1, 1] 880s 0.768 0.083 0.160 0.659 880s 0.104 0.812 0.247 -0.166 880s -0.132 0.735 0.183 -0.352 880s 0.152 -0.718 -0.175 0.364 880s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.478 -0.067 880s -0.102 -0.541 880s 880s x1: [2, 4, 1, 1] 880s -0.527 0.301 880s 0.586 0.988 880s -0.657 0.060 880s 0.307 -0.024 880s 880s y: [2, 4, 1, 1] 880s -0.272 -0.109 880s 0.214 -0.594 880s -0.318 0.035 880s 0.148 -0.018 880s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.330 0.461 880s -0.724 0.107 880s 880s x1: [2, 2, 1, 1] 880s -0.478 -0.564 880s 0.992 -0.947 880s 880s y: [2, 2, 1, 1] 880s -0.418 0.286 880s -0.109 -0.820 880s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 4, 1, 1] 880s -0.968 -0.272 880s -0.745 -0.428 880s 0.271 0.307 880s 0.720 -0.284 880s 880s x1: [2, 2, 1, 1] 880s 0.565 0.342 880s -0.340 0.329 880s 880s y: [4, 2, 1, 1] 880s -0.640 -0.567 0.258 0.309 880s 0.240 0.113 0.009 -0.339 880s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 1, 1, 1] 880s 0.346 0.804 0.952 0.399 880s 880s x1: [4, 2, 1, 1] 880s -0.084 -0.634 -0.977 0.892 880s -0.530 0.500 -0.175 0.368 880s 880s y: [1, 2, 1, 1] 880s -1.113 880s 0.199 880s test-mul-mat0: iter:119/500 880s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 2, 1, 1] 880s -0.177 -0.617 -0.200 880s 0.551 -0.045 -0.893 880s 880s x1: [3, 2, 1, 1] 880s -0.611 0.240 0.383 880s 0.049 -0.431 0.111 880s 880s y: [2, 2, 1, 1] 880s -0.117 -0.690 880s 0.235 -0.053 880s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.412 -0.688 0.749 880s 0.934 -0.884 0.701 880s 0.333 -0.304 -0.018 880s 0.008 -0.263 0.371 880s 880s x1: [3, 2, 1, 1] 880s -0.733 0.871 -0.527 880s -0.711 0.763 -0.056 880s 880s y: [4, 2, 1, 1] 880s -0.691 -1.823 -0.500 -0.431 880s -0.274 -1.377 -0.468 -0.227 880s testing: mul_mat, [4, 4, 1, 2] = [3, 4, 1, 2] * [3, 4, 1, 2] 880s x0: [3, 4, 1, 2] 880s -0.942 -0.050 0.969 880s 0.067 0.687 0.340 880s 0.315 0.807 0.760 880s -0.868 0.039 0.260 880s 880s x1: [3, 4, 1, 2] 880s -0.752 -0.879 -0.580 880s -0.183 0.232 0.500 880s 0.640 0.615 -0.699 880s 0.191 -0.430 -0.592 880s 880s y: [4, 4, 1, 2] 880s 0.190 -0.851 -1.388 0.467 880s 0.645 0.317 0.510 0.298 880s -1.311 0.227 0.166 -0.714 880s -0.732 -0.484 -0.737 -0.337 880s testing: mul_mat, [1, 4, 1, 1] = [3, 1, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 1, 1, 1] 880s 0.826 0.509 0.047 880s 880s x1: [3, 4, 1, 1] 880s 0.706 -0.340 -0.188 880s -0.008 0.386 0.065 880s 0.978 0.444 -0.985 880s 0.947 -0.489 0.702 880s 880s y: [1, 4, 1, 1] 880s 0.401 880s 0.193 880s 0.987 880s 0.566 880s testing: mul_mat, [4, 3, 1, 1] = [1, 4, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 4, 1, 1] 880s 0.202 880s 0.869 880s -0.308 880s -0.573 880s 880s x1: [1, 3, 1, 1] 880s 0.957 880s -0.452 880s -0.692 880s 880s y: [4, 3, 1, 1] 880s 0.194 0.832 -0.295 -0.549 880s -0.091 -0.393 0.139 0.259 880s -0.140 -0.601 0.213 0.397 880s testing: mul_mat, [4, 1, 1, 2] = [4, 4, 1, 2] * [4, 1, 1, 2] 880s x0: [4, 4, 1, 2] 880s -0.825 0.958 0.962 -0.534 880s 0.980 0.451 -0.043 -0.760 880s -0.993 0.255 0.192 0.746 880s 0.240 0.905 0.788 0.014 880s 880s x1: [4, 1, 1, 2] 880s 0.838 0.114 -0.908 -0.312 880s 880s y: [4, 1, 1, 2] 880s -1.290 1.149 -1.210 -0.416 880s test-mul-mat0: iter:120/500 880s testing: mul_mat, [1, 3, 1, 1] = [1, 1, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 1, 1, 1] 880s 0.934 880s 880s x1: [1, 3, 1, 1] 880s -0.223 880s -0.023 880s 0.395 880s 880s y: [1, 3, 1, 1] 880s -0.208 880s -0.022 880s 0.369 880s testing: mul_mat, [1, 1, 4, 1] = [1, 1, 4, 1] * [1, 1, 4, 1] 880s x0: [1, 1, 4, 1] 880s 0.414 880s 880s x1: [1, 1, 4, 1] 880s -0.413 880s 880s y: [1, 1, 4, 1] 880s -0.171 880s testing: mul_mat, [2, 1, 4, 1] = [1, 2, 4, 1] * [1, 1, 4, 1] 880s x0: [1, 2, 4, 1] 880s 0.010 880s -0.011 880s 880s x1: [1, 1, 4, 1] 880s 0.792 880s 880s y: [2, 1, 4, 1] 880s 0.008 -0.009 880s testing: mul_mat, [3, 2, 1, 1] = [1, 3, 1, 1] * [1, 2, 1, 1] 880s x0: [1, 3, 1, 1] 880s -0.668 880s 0.127 880s -0.527 880s 880s x1: [1, 2, 1, 1] 880s -0.157 880s -0.650 880s 880s y: [3, 2, 1, 1] 880s 0.105 -0.020 0.083 880s 0.434 -0.082 0.342 880s testing: mul_mat, [4, 1, 4, 1] = [3, 4, 4, 1] * [3, 1, 4, 1] 880s x0: [3, 4, 4, 1] 880s -0.671 0.135 -0.361 880s -0.875 0.344 -0.405 880s -0.645 0.284 0.866 880s 0.525 -0.159 -0.586 880s 880s x1: [3, 1, 4, 1] 880s 0.728 0.866 -0.593 880s 880s y: [4, 1, 4, 1] 880s -0.157 -0.098 -0.737 0.592 880s testing: mul_mat, [2, 3, 4, 1] = [4, 2, 4, 1] * [4, 3, 4, 1] 880s x0: [4, 2, 4, 1] 880s 0.165 -0.678 0.570 0.197 880s -0.301 -0.637 0.067 -0.028 880s 880s x1: [4, 3, 4, 1] 880s -0.173 0.122 -0.597 0.618 880s -0.065 -0.995 0.216 0.889 880s -0.727 0.034 0.360 -0.043 880s 880s y: [2, 3, 4, 1] 880s -0.330 -0.083 880s 0.962 0.643 880s 0.054 0.222 880s test-mul-mat0: iter:121/500 880s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 1, 1, 1] 880s -0.924 0.210 0.155 0.596 880s 880s x1: [4, 2, 1, 1] 880s 0.279 0.519 0.079 -0.054 880s 0.402 0.895 -0.162 0.165 880s 880s y: [1, 2, 1, 1] 880s -0.168 880s -0.110 880s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 4, 1, 1] 880s -0.494 -0.152 0.306 -0.601 880s -0.255 -0.900 -0.763 -0.364 880s -0.940 -0.226 -0.152 -0.618 880s -0.118 -0.873 0.902 0.961 880s 880s x1: [4, 1, 1, 1] 880s 0.543 -0.567 0.286 -0.563 880s 880s y: [4, 1, 1, 1] 880s 0.244 0.359 -0.079 0.147 880s testing: mul_mat, [2, 4, 1, 2] = [4, 2, 1, 2] * [4, 4, 1, 2] 880s x0: [4, 2, 1, 2] 880s 0.482 0.846 -0.403 -0.282 880s 0.832 -0.724 0.110 0.414 880s 880s x1: [4, 4, 1, 2] 880s 0.073 0.304 0.856 0.910 880s -0.531 0.236 0.986 0.679 880s -0.609 0.582 0.223 -0.175 880s -0.133 0.659 0.074 0.374 880s 880s y: [2, 4, 1, 2] 880s -0.310 0.311 880s -0.646 -0.224 880s 0.158 -0.976 880s 0.359 -0.425 880s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 1, 1, 1] 880s 0.885 -0.595 -0.847 0.512 880s 880s x1: [4, 2, 1, 1] 880s 0.427 -0.082 0.868 0.436 880s -0.194 -0.592 -0.780 0.542 880s 880s y: [1, 2, 1, 1] 880s -0.085 880s 1.120 880s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 3, 1, 1] 880s 0.712 880s -0.982 880s 0.153 880s 880s x1: [1, 4, 1, 1] 880s -0.659 880s -0.720 880s -0.006 880s -0.814 880s 880s y: [3, 4, 1, 1] 880s -0.470 0.648 -0.101 880s -0.513 0.707 -0.110 880s -0.004 0.006 -0.001 880s -0.580 0.799 -0.124 880s testing: mul_mat, [4, 1, 1, 2] = [3, 4, 1, 2] * [3, 1, 1, 2] 880s x0: [3, 4, 1, 2] 880s 0.367 -0.607 0.428 880s 0.475 0.021 -0.550 880s 0.153 -0.770 -0.258 880s -0.206 0.200 -0.242 880s 880s x1: [3, 1, 1, 2] 880s -0.178 0.431 -0.348 880s 880s y: [4, 1, 1, 2] 880s -0.476 0.116 -0.270 0.207 880s test-mul-mat0: iter:122/500 880s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 4, 1, 1] 880s 0.550 -0.930 0.226 880s -0.250 0.498 0.676 880s 0.492 -0.744 -0.989 880s -0.361 0.167 0.858 880s 880s x1: [3, 2, 1, 1] 880s -0.164 0.682 0.896 880s -0.798 0.157 0.049 880s 880s y: [4, 2, 1, 1] 880s -0.522 0.987 -1.475 0.942 880s -0.574 0.311 -0.558 0.356 880s testing: mul_mat, [3, 4, 2, 1] = [3, 3, 2, 1] * [3, 4, 2, 1] 880s x0: [3, 3, 2, 1] 880s -0.850 0.528 0.563 880s -0.211 -0.305 0.421 880s -0.084 0.053 -0.880 880s 880s x1: [3, 4, 2, 1] 880s -0.873 -0.642 -0.301 880s 0.801 -0.574 0.252 880s 0.492 0.647 -0.373 880s -0.384 0.519 -0.242 880s 880s y: [3, 4, 2, 1] 880s 0.233 0.253 0.304 880s -0.842 0.112 -0.319 880s -0.287 -0.458 0.321 880s 0.465 -0.179 0.272 880s testing: mul_mat, [2, 3, 2, 2] = [3, 2, 2, 2] * [3, 3, 2, 2] 880s x0: [3, 2, 2, 2] 880s 0.166 -0.146 -0.340 880s 0.623 0.627 -0.178 880s 880s x1: [3, 3, 2, 2] 880s -0.724 0.847 0.412 880s 0.930 -0.795 0.116 880s -0.420 -0.886 -0.110 880s 880s y: [2, 3, 2, 2] 880s -0.384 0.006 880s 0.231 0.061 880s 0.097 -0.797 880s testing: mul_mat, [3, 2, 1, 1] = [3, 3, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 3, 1, 1] 880s -0.424 -0.801 0.831 880s -0.884 0.743 -0.708 880s -0.391 0.431 0.944 880s 880s x1: [3, 2, 1, 1] 880s 0.524 -0.176 -0.478 880s -0.822 0.409 0.262 880s 880s y: [3, 2, 1, 1] 880s -0.478 -0.255 -0.732 880s 0.238 0.845 0.746 880s testing: mul_mat, [4, 3, 2, 1] = [3, 4, 2, 1] * [3, 3, 2, 1] 880s x0: [3, 4, 2, 1] 880s 0.867 0.619 -0.687 880s 0.503 -0.118 -0.528 880s -0.727 -0.359 0.654 880s -0.558 0.362 -0.744 880s 880s x1: [3, 3, 2, 1] 880s 0.035 -0.447 -0.158 880s 0.233 0.443 0.179 880s -0.455 0.677 -0.566 880s 880s y: [4, 3, 2, 1] 880s -0.137 0.154 0.031 -0.064 880s 0.353 -0.030 -0.211 -0.103 880s 0.414 -0.010 -0.282 0.919 880s testing: mul_mat, [2, 3, 2, 2] = [4, 2, 2, 2] * [4, 3, 2, 2] 880s x0: [4, 2, 2, 2] 880s -0.646 0.158 -0.711 0.724 880s 0.734 0.900 0.557 -0.870 880s 880s x1: [4, 3, 2, 2] 880s -0.749 -0.573 -0.837 -0.602 880s -0.115 -0.596 -0.277 -0.249 880s 0.907 -0.004 0.194 0.526 880s 880s y: [2, 3, 2, 2] 880s 0.552 -1.008 880s -0.003 -0.559 880s -0.344 0.313 880s test-mul-mat0: iter:123/500 880s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 3, 1, 1] 880s 0.975 880s -0.775 880s -0.548 880s 880s x1: [1, 3, 1, 1] 880s 0.365 880s 0.714 880s -0.374 880s 880s y: [3, 3, 1, 1] 880s 0.355 -0.283 -0.200 880s 0.696 -0.554 -0.392 880s -0.365 0.290 0.205 880s testing: mul_mat, [2, 3, 2, 1] = [1, 2, 2, 1] * [1, 3, 2, 1] 880s x0: [1, 2, 2, 1] 880s 0.258 880s -0.051 880s 880s x1: [1, 3, 2, 1] 880s 0.217 880s 0.544 880s 0.202 880s 880s y: [2, 3, 2, 1] 880s 0.056 -0.011 880s 0.140 -0.028 880s 0.052 -0.010 880s testing: mul_mat, [4, 2, 2, 1] = [1, 4, 2, 1] * [1, 2, 2, 1] 880s x0: [1, 4, 2, 1] 880s -0.060 880s -0.082 880s -0.813 880s 0.031 880s 880s x1: [1, 2, 2, 1] 880s 0.891 880s 0.299 880s 880s y: [4, 2, 2, 1] 880s -0.053 -0.073 -0.724 0.027 880s -0.018 -0.025 -0.243 0.009 880s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 2, 1, 1] 880s -0.529 880s -0.541 880s 880s x1: [1, 4, 1, 1] 880s 0.359 880s -0.265 880s -0.754 880s -0.993 880s 880s y: [2, 4, 1, 1] 880s -0.190 -0.194 880s 0.140 0.143 880s 0.399 0.408 880s 0.526 0.537 880s testing: mul_mat, [4, 1, 2, 1] = [2, 4, 2, 1] * [2, 1, 2, 1] 880s x0: [2, 4, 2, 1] 880s 0.445 0.995 880s 0.067 0.626 880s -0.747 -0.714 880s -0.606 0.293 880s 880s x1: [2, 1, 2, 1] 880s 0.926 -0.986 880s 880s y: [4, 1, 2, 1] 880s -0.568 -0.555 0.012 -0.850 880s testing: mul_mat, [4, 2, 2, 1] = [4, 4, 2, 1] * [4, 2, 2, 1] 880s x0: [4, 4, 2, 1] 880s -0.630 -0.375 0.908 0.396 880s -0.661 0.077 -0.404 0.775 880s 0.783 -0.174 0.003 0.940 880s 0.996 0.355 -0.073 0.217 880s 880s x1: [4, 2, 2, 1] 880s 0.013 0.821 -0.608 -0.627 880s -0.444 -0.363 -0.620 0.266 880s 880s y: [4, 2, 2, 1] 880s -1.117 -0.186 -0.725 0.213 880s -0.042 0.723 -0.036 -0.468 880s test-mul-mat0: iter:124/500 880s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.061 0.298 -0.564 880s -0.665 0.074 -0.623 880s 0.552 0.241 -0.310 880s 0.325 -0.955 0.382 880s 880s x1: [3, 1, 1, 1] 880s 0.601 -0.656 -0.705 880s 880s y: [4, 1, 1, 1] 880s 0.165 -0.009 0.392 0.552 880s testing: mul_mat, [4, 4, 3, 1] = [3, 4, 3, 1] * [3, 4, 3, 1] 880s x0: [3, 4, 3, 1] 880s -0.819 0.221 0.587 880s -0.531 -0.115 -0.177 880s 0.282 -0.933 -0.001 880s 0.695 0.593 0.469 880s 880s x1: [3, 4, 3, 1] 880s -0.637 0.957 -0.086 880s -0.406 -0.008 -0.055 880s 0.547 -0.857 -0.079 880s -0.712 0.664 0.236 880s 880s y: [4, 4, 3, 1] 880s 0.683 0.243 -1.072 0.084 880s 0.298 0.226 -0.106 -0.313 880s -0.684 -0.178 0.954 -0.165 880s 0.868 0.260 -0.820 0.010 880s testing: mul_mat, [3, 4, 3, 2] = [3, 3, 3, 2] * [3, 4, 3, 2] 880s x0: [3, 3, 3, 2] 880s -0.115 -0.832 0.964 880s -0.402 0.035 -0.980 880s -0.451 0.893 -0.347 880s 880s x1: [3, 4, 3, 2] 880s 0.743 0.490 0.633 880s -0.259 0.186 0.226 880s -0.789 -0.196 0.284 880s 0.449 -0.219 -0.858 880s 880s y: [3, 4, 3, 2] 880s 0.117 -0.901 -0.117 880s 0.093 -0.111 0.204 880s 0.528 0.032 0.083 880s -0.698 0.653 -0.100 880s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.337 0.913 0.878 880s -0.754 0.795 -0.348 880s 880s x1: [3, 3, 1, 1] 880s -0.035 0.494 0.079 880s 0.611 -0.573 -0.321 880s 0.255 -0.100 0.163 880s 880s y: [2, 3, 1, 1] 880s 0.508 0.392 880s -0.599 -0.805 880s 0.138 -0.328 880s testing: mul_mat, [4, 3, 3, 1] = [2, 4, 3, 1] * [2, 3, 3, 1] 880s x0: [2, 4, 3, 1] 880s -0.073 -0.482 880s 0.355 0.300 880s -0.767 0.164 880s 0.826 -0.236 880s 880s x1: [2, 3, 3, 1] 880s -0.094 0.170 880s -0.485 -0.453 880s 0.955 -0.436 880s 880s y: [4, 3, 3, 1] 880s -0.075 0.018 0.100 -0.118 880s 0.254 -0.308 0.298 -0.294 880s 0.140 0.208 -0.804 0.892 880s testing: mul_mat, [2, 2, 3, 2] = [4, 2, 3, 2] * [4, 2, 3, 2] 880s x0: [4, 2, 3, 2] 880s 0.873 0.088 0.376 -0.086 880s -0.408 0.922 0.133 0.741 880s 880s x1: [4, 2, 3, 2] 880s -0.289 0.131 -0.489 -0.921 880s 0.631 0.187 0.057 -0.442 880s 880s y: [2, 2, 3, 2] 880s -0.345 -0.509 880s 0.627 -0.406 880s test-mul-mat0: iter:125/500 880s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.484 0.581 -0.497 880s 0.485 -0.829 -0.036 880s 880s x1: [3, 3, 1, 1] 880s -0.574 0.199 0.340 880s 0.509 0.199 0.743 880s -0.063 0.405 -0.634 880s 880s y: [2, 3, 1, 1] 880s -0.331 -0.455 880s -0.007 0.055 880s 0.520 -0.343 880s testing: mul_mat, [4, 2, 4, 1] = [3, 4, 4, 1] * [3, 2, 4, 1] 880s x0: [3, 4, 4, 1] 880s -0.500 0.935 -0.698 880s 0.985 -0.894 0.266 880s -0.616 -0.391 -0.841 880s -0.078 0.860 -0.090 880s 880s x1: [3, 2, 4, 1] 880s -0.600 -0.497 -0.107 880s -0.462 0.251 -0.248 880s 880s y: [4, 2, 4, 1] 880s -0.089 -0.175 0.654 -0.371 880s 0.639 -0.745 0.396 0.274 880s testing: mul_mat, [1, 4, 4, 3] = [3, 1, 4, 3] * [3, 4, 4, 3] 880s x0: [3, 1, 4, 3] 880s -0.802 -0.071 0.527 880s 880s x1: [3, 4, 4, 3] 880s -0.060 -0.035 -0.282 880s 0.144 -0.179 0.614 880s 0.595 -0.638 -0.880 880s -0.966 -0.474 0.642 880s 880s y: [1, 4, 4, 3] 880s -0.098 880s 0.221 880s -0.896 880s 1.146 880s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.167 -0.846 0.559 880s 0.003 0.919 -0.384 880s 880s x1: [3, 1, 1, 1] 880s -0.322 0.181 -0.458 880s 880s y: [2, 1, 1, 1] 880s -0.463 0.341 880s testing: mul_mat, [3, 3, 4, 1] = [2, 3, 4, 1] * [2, 3, 4, 1] 880s x0: [2, 3, 4, 1] 880s 0.676 0.595 880s 0.172 -0.269 880s -0.283 0.333 880s 880s x1: [2, 3, 4, 1] 880s -0.345 0.899 880s 0.012 0.239 880s 0.970 0.202 880s 880s y: [3, 3, 4, 1] 880s 0.302 -0.301 0.397 880s 0.150 -0.062 0.076 880s 0.777 0.113 -0.207 880s testing: mul_mat, [4, 2, 4, 3] = [3, 4, 4, 3] * [3, 2, 4, 3] 880s x0: [3, 4, 4, 3] 880s 0.359 0.206 -0.696 880s -0.763 0.697 0.702 880s -0.932 -0.614 -0.195 880s 0.382 0.621 -0.136 880s 880s x1: [3, 2, 4, 3] 880s 0.697 0.360 0.677 880s -0.059 0.869 0.846 880s 880s y: [4, 2, 4, 3] 880s -0.146 0.194 -1.002 0.398 880s -0.431 1.245 -0.643 0.402 880s test-mul-mat0: iter:126/500 880s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 3, 1, 1] 880s 0.906 0.888 880s 0.061 -0.208 880s 0.238 0.671 880s 880s x1: [2, 2, 1, 1] 880s 0.483 0.611 880s 0.662 -0.582 880s 880s y: [3, 2, 1, 1] 880s 0.980 -0.098 0.525 880s 0.083 0.161 -0.233 880s testing: mul_mat, [2, 3, 2, 1] = [2, 2, 2, 1] * [2, 3, 2, 1] 880s x0: [2, 2, 2, 1] 880s 0.101 -0.329 880s -0.054 0.025 880s 880s x1: [2, 3, 2, 1] 880s -0.728 0.925 880s 0.572 -0.710 880s 0.823 -0.339 880s 880s y: [2, 3, 2, 1] 880s -0.378 0.063 880s 0.291 -0.049 880s 0.195 -0.053 880s testing: mul_mat, [3, 2, 2, 2] = [2, 3, 2, 2] * [2, 2, 2, 2] 880s x0: [2, 3, 2, 2] 880s 0.685 0.698 880s 0.747 -0.727 880s -0.481 0.585 880s 880s x1: [2, 2, 2, 2] 880s 0.242 0.162 880s -0.274 -0.078 880s 880s y: [3, 2, 2, 2] 880s 0.279 0.063 -0.021 880s -0.242 -0.148 0.086 880s testing: mul_mat, [4, 3, 1, 1] = [2, 4, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 4, 1, 1] 880s -0.420 0.293 880s -0.226 0.882 880s -0.703 -0.773 880s -0.147 -0.517 880s 880s x1: [2, 3, 1, 1] 880s -0.878 0.294 880s 0.189 -0.456 880s -0.105 0.076 880s 880s y: [4, 3, 1, 1] 880s 0.456 0.458 0.390 -0.023 880s -0.213 -0.445 0.219 0.208 880s 0.067 0.091 0.015 -0.024 880s testing: mul_mat, [3, 2, 2, 1] = [4, 3, 2, 1] * [4, 2, 2, 1] 880s x0: [4, 3, 2, 1] 880s -0.650 0.244 0.824 -0.323 880s 0.766 -0.158 0.616 -0.090 880s -0.627 -0.077 0.219 0.101 880s 880s x1: [4, 2, 2, 1] 880s -0.587 -0.373 -0.501 -0.425 880s -0.474 -0.177 0.978 0.588 880s 880s y: [3, 2, 2, 1] 880s 0.016 -0.661 0.244 880s 0.881 0.215 0.584 880s testing: mul_mat, [2, 4, 2, 2] = [3, 2, 2, 2] * [3, 4, 2, 2] 880s x0: [3, 2, 2, 2] 880s -0.855 0.159 -0.340 880s -0.050 -0.826 0.138 880s 880s x1: [3, 4, 2, 2] 880s 0.363 -0.874 -0.016 880s 0.070 0.181 0.561 880s -0.100 0.531 0.326 880s 0.273 -0.225 -0.832 880s 880s y: [2, 4, 2, 2] 880s -0.444 0.702 880s -0.222 -0.076 880s 0.059 -0.389 880s 0.013 0.058 880s test-mul-mat0: iter:127/500 880s testing: mul_mat, [3, 3, 1, 1] = [2, 3, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 3, 1, 1] 880s 0.999 -0.571 880s 0.678 -0.573 880s -0.740 -0.848 880s 880s x1: [2, 3, 1, 1] 880s 0.070 -0.611 880s -0.868 0.215 880s 0.339 0.291 880s 880s y: [3, 3, 1, 1] 880s 0.419 0.397 0.466 880s -0.990 -0.712 0.460 880s 0.172 0.063 -0.498 880s testing: mul_mat, [2, 3, 3, 1] = [2, 2, 3, 1] * [2, 3, 3, 1] 880s x0: [2, 2, 3, 1] 880s -0.632 0.374 880s 0.507 0.707 880s 880s x1: [2, 3, 3, 1] 880s -0.732 0.373 880s 0.179 -0.115 880s 0.893 0.745 880s 880s y: [2, 3, 3, 1] 880s 0.602 -0.108 880s -0.156 0.010 880s -0.286 0.980 880s testing: mul_mat, [1, 2, 3, 1] = [2, 1, 3, 1] * [2, 2, 3, 1] 880s x0: [2, 1, 3, 1] 880s -0.785 -0.262 880s 880s x1: [2, 2, 3, 1] 880s -0.598 -0.274 880s 0.905 0.287 880s 880s y: [1, 2, 3, 1] 880s 0.542 880s -0.786 880s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 3, 1, 1] 880s -0.732 0.974 880s -0.982 0.112 880s -0.110 0.464 880s 880s x1: [2, 1, 1, 1] 880s -0.439 0.353 880s 880s y: [3, 1, 1, 1] 880s 0.665 0.471 0.212 880s testing: mul_mat, [4, 2, 3, 1] = [3, 4, 3, 1] * [3, 2, 3, 1] 880s x0: [3, 4, 3, 1] 880s 0.697 0.432 -0.845 880s -0.922 0.265 -0.500 880s 0.271 -0.474 -0.439 880s -0.742 0.450 0.618 880s 880s x1: [3, 2, 3, 1] 880s -0.892 -0.535 -0.810 880s -0.490 0.192 -0.905 880s 880s y: [4, 2, 3, 1] 880s -0.168 1.086 0.367 -0.080 880s 0.506 0.955 0.173 -0.110 880s testing: mul_mat, [2, 3, 3, 1] = [4, 2, 3, 1] * [4, 3, 3, 1] 880s x0: [4, 2, 3, 1] 880s -0.777 -0.648 -0.854 0.032 880s 0.180 -0.324 0.030 -0.975 880s 880s x1: [4, 3, 3, 1] 880s -0.128 -0.260 -0.129 0.026 880s 0.240 0.432 -0.355 0.848 880s -0.541 -0.547 -0.034 -0.322 880s 880s y: [2, 3, 3, 1] 880s 0.379 0.032 880s -0.136 -0.934 880s 0.793 0.392 880s test-mul-mat0: iter:128/500 880s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 4, 1, 1] 880s -0.013 0.289 0.171 -0.977 880s -0.463 0.082 0.514 0.971 880s 0.919 0.272 0.142 0.940 880s 0.209 -0.614 0.100 0.315 880s 880s x1: [4, 2, 1, 1] 880s 0.056 0.937 -0.950 0.279 880s 0.117 -0.598 0.955 0.263 880s 880s y: [4, 2, 1, 1] 880s -0.166 -0.166 0.434 -0.570 880s -0.268 0.643 0.328 0.570 880s testing: mul_mat, [3, 4, 2, 1] = [4, 3, 2, 1] * [4, 4, 2, 1] 880s x0: [4, 3, 2, 1] 880s -0.590 0.960 -0.906 0.577 880s 0.804 -0.752 -0.206 -0.072 880s 0.520 -0.587 -0.994 -0.945 880s 880s x1: [4, 4, 2, 1] 880s 0.311 0.613 0.011 -0.211 880s -0.783 0.167 0.844 -0.846 880s 0.217 0.123 0.272 0.619 880s 0.079 -0.465 0.052 -0.934 880s 880s y: [3, 4, 2, 1] 880s 0.273 -0.197 -0.009 880s -0.630 -0.868 -0.546 880s 0.101 -0.019 -0.815 880s -1.078 0.470 1.145 880s testing: mul_mat, [3, 3, 2, 3] = [4, 3, 2, 3] * [4, 3, 2, 3] 880s x0: [4, 3, 2, 3] 880s -0.844 0.991 0.467 -0.045 880s 0.072 0.576 0.528 0.592 880s -0.537 0.947 0.430 0.816 880s 880s x1: [4, 3, 2, 3] 880s 0.380 -0.277 0.905 0.644 880s -0.878 0.076 0.521 -0.469 880s 0.036 0.615 -0.892 -0.160 880s 880s y: [3, 3, 2, 3] 880s -0.202 0.727 0.448 880s 1.081 -0.022 0.385 880s 0.169 -0.209 0.049 880s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 1, 1, 1] 880s 0.036 0.279 -0.974 -0.658 880s 880s x1: [4, 3, 1, 1] 880s 0.297 -0.727 -0.674 0.847 880s 0.584 0.743 -0.251 -0.995 880s -0.661 0.700 0.077 0.853 880s 880s y: [1, 3, 1, 1] 880s -0.093 880s 1.128 880s -0.464 880s testing: mul_mat, [2, 4, 2, 1] = [1, 2, 2, 1] * [1, 4, 2, 1] 880s x0: [1, 2, 2, 1] 880s 0.905 880s -0.580 880s 880s x1: [1, 4, 2, 1] 880s 0.396 880s 0.460 880s 0.287 880s -0.124 880s 880s y: [2, 4, 2, 1] 880s 0.358 -0.230 880s 0.416 -0.267 880s 0.260 -0.167 880s -0.112 0.072 880s testing: mul_mat, [1, 1, 2, 3] = [2, 1, 2, 3] * [2, 1, 2, 3] 880s x0: [2, 1, 2, 3] 880s -0.211 0.467 880s 880s x1: [2, 1, 2, 3] 880s 0.898 -0.989 880s 880s y: [1, 1, 2, 3] 880s -0.652 880s test-mul-mat0: iter:129/500 880s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.234 0.444 0.597 880s 0.197 0.604 -0.614 880s -0.335 0.042 0.201 880s -0.526 0.876 0.476 880s 880s x1: [3, 3, 1, 1] 880s -0.186 -0.704 -0.054 880s -0.176 -0.938 0.170 880s -0.318 0.412 0.138 880s 880s y: [4, 3, 1, 1] 880s -0.301 -0.428 0.022 -0.544 880s -0.274 -0.705 0.054 -0.648 880s 0.339 0.101 0.151 0.593 880s testing: mul_mat, [3, 4, 2, 1] = [3, 3, 2, 1] * [3, 4, 2, 1] 880s x0: [3, 3, 2, 1] 880s 0.686 0.709 0.423 880s -0.840 0.585 -0.101 880s 0.730 0.172 0.589 880s 880s x1: [3, 4, 2, 1] 880s 0.570 0.587 -0.311 880s -0.470 -0.515 0.318 880s -0.370 0.288 -0.635 880s 0.443 0.584 0.312 880s 880s y: [3, 4, 2, 1] 880s 0.676 -0.104 0.334 880s -0.554 0.061 -0.245 880s -0.318 0.543 -0.594 880s 0.851 -0.062 0.608 880s testing: mul_mat, [2, 3, 2, 1] = [3, 2, 2, 1] * [3, 3, 2, 1] 880s x0: [3, 2, 2, 1] 880s 0.656 -0.118 0.759 880s 0.916 -0.462 -0.335 880s 880s x1: [3, 3, 2, 1] 880s -0.399 -0.825 0.065 880s 0.551 0.233 -0.315 880s 0.791 0.057 0.749 880s 880s y: [2, 3, 2, 1] 880s -0.115 -0.006 880s 0.095 0.503 880s 1.080 0.447 880s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.524 -0.566 0.828 880s -0.981 -0.443 -0.753 880s -0.291 -0.679 -0.731 880s -0.444 -0.441 0.998 880s 880s x1: [3, 2, 1, 1] 880s 0.740 -0.814 0.585 880s 0.291 0.419 -0.730 880s 880s y: [4, 2, 1, 1] 880s 0.557 -0.806 -0.090 0.614 880s -0.994 0.078 0.164 -1.043 880s testing: mul_mat, [3, 3, 2, 1] = [4, 3, 2, 1] * [4, 3, 2, 1] 880s x0: [4, 3, 2, 1] 880s -0.859 -0.613 -0.710 -0.891 880s 0.745 -0.985 0.166 -0.144 880s -0.029 -0.031 -0.274 0.148 880s 880s x1: [4, 3, 2, 1] 880s 0.903 0.151 0.757 0.819 880s 0.689 -0.578 -0.375 0.172 880s -0.642 0.764 0.183 -0.122 880s 880s y: [3, 3, 2, 1] 880s -2.136 0.531 -0.117 880s -0.125 0.996 0.126 880s 0.063 -1.183 -0.073 880s testing: mul_mat, [4, 4, 2, 1] = [3, 4, 2, 1] * [3, 4, 2, 1] 880s x0: [3, 4, 2, 1] 880s -0.393 -0.520 -0.135 880s 0.114 -0.466 0.337 880s -0.046 -0.922 0.559 880s 0.404 0.110 -0.618 880s 880s x1: [3, 4, 2, 1] 880s -0.326 -0.370 -0.669 880s 0.260 -0.551 0.517 880s -0.598 -0.410 0.262 880s 0.373 -0.023 0.277 880s 880s y: [4, 4, 2, 1] 880s 0.411 -0.090 -0.018 0.241 880s 0.115 0.461 0.785 -0.275 880s 0.412 0.212 0.552 -0.449 880s -0.172 0.147 0.159 -0.023 880s test-mul-mat0: iter:130/500 880s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.229 -0.074 0.910 880s -0.212 0.308 0.650 880s 880s x1: [3, 3, 1, 1] 880s -0.516 -0.475 -0.427 880s 0.091 0.639 0.527 880s -0.505 -0.881 -0.939 880s 880s y: [2, 3, 1, 1] 880s -0.471 -0.314 880s 0.454 0.520 880s -0.905 -0.774 880s testing: mul_mat, [4, 2, 4, 1] = [3, 4, 4, 1] * [3, 2, 4, 1] 880s x0: [3, 4, 4, 1] 880s 0.402 0.040 -0.347 880s -0.810 -0.652 -0.697 880s 0.126 0.301 0.066 880s 0.189 0.174 0.284 880s 880s x1: [3, 2, 4, 1] 880s -0.063 -0.048 -0.237 880s -0.937 0.874 -0.781 880s 880s y: [4, 2, 4, 1] 880s 0.055 0.248 -0.038 -0.088 880s -0.071 0.735 0.093 -0.247 880s testing: mul_mat, [4, 4, 4, 2] = [3, 4, 4, 2] * [3, 4, 4, 2] 880s x0: [3, 4, 4, 2] 880s -0.205 0.704 -0.579 880s -0.496 0.865 0.213 880s -0.569 -0.653 -0.391 880s 0.139 -0.507 0.461 880s 880s x1: [3, 4, 4, 2] 880s 0.992 -0.765 -0.046 880s 0.459 -0.212 0.573 880s 0.451 0.018 -0.753 880s 0.496 0.418 0.093 880s 880s y: [4, 4, 4, 2] 880s -0.715 -1.164 -0.047 0.504 880s -0.574 -0.288 -0.347 0.435 880s 0.356 -0.369 0.027 -0.294 880s 0.139 0.135 -0.591 -0.100 880s testing: mul_mat, [1, 4, 1, 1] = [3, 1, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 1, 1, 1] 880s -0.980 0.723 0.856 880s 880s x1: [3, 4, 1, 1] 880s 0.847 0.154 0.091 880s 0.702 0.996 0.141 880s 0.403 0.306 -0.487 880s 0.933 0.163 0.377 880s 880s y: [1, 4, 1, 1] 880s -0.641 880s 0.153 880s -0.591 880s -0.473 880s testing: mul_mat, [1, 3, 4, 1] = [1, 1, 4, 1] * [1, 3, 4, 1] 880s x0: [1, 1, 4, 1] 880s -0.219 880s 880s x1: [1, 3, 4, 1] 880s 0.582 880s 0.736 880s 0.388 880s 880s y: [1, 3, 4, 1] 880s -0.128 880s -0.161 880s -0.085 880s testing: mul_mat, [1, 1, 4, 2] = [1, 1, 4, 2] * [1, 1, 4, 2] 880s x0: [1, 1, 4, 2] 880s 0.213 880s 880s x1: [1, 1, 4, 2] 880s -0.818 880s 880s y: [1, 1, 4, 2] 880s -0.174 880s test-mul-mat0: iter:131/500 880s testing: mul_mat, [4, 3, 1, 1] = [1, 4, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 4, 1, 1] 880s -0.139 880s 0.240 880s 0.279 880s 0.952 880s 880s x1: [1, 3, 1, 1] 880s -0.996 880s -0.330 880s 0.813 880s 880s y: [4, 3, 1, 1] 880s 0.139 -0.239 -0.278 -0.948 880s 0.046 -0.079 -0.092 -0.314 880s -0.113 0.195 0.227 0.774 880s testing: mul_mat, [1, 4, 2, 1] = [1, 1, 2, 1] * [1, 4, 2, 1] 880s x0: [1, 1, 2, 1] 880s -0.819 880s 880s x1: [1, 4, 2, 1] 880s -0.486 880s 0.214 880s -0.866 880s 0.490 880s 880s y: [1, 4, 2, 1] 880s 0.399 880s -0.175 880s 0.710 880s -0.401 880s testing: mul_mat, [4, 1, 2, 3] = [1, 4, 2, 3] * [1, 1, 2, 3] 880s x0: [1, 4, 2, 3] 880s 0.377 880s 0.387 880s -0.331 880s 0.820 880s 880s x1: [1, 1, 2, 3] 880s -0.494 880s 880s y: [4, 1, 2, 3] 880s -0.186 -0.191 0.164 -0.405 880s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 2, 1, 1] 880s -0.620 880s -0.753 880s 880s x1: [1, 4, 1, 1] 880s -0.309 880s 0.791 880s 0.175 880s -0.867 880s 880s y: [2, 4, 1, 1] 880s 0.192 0.233 880s -0.491 -0.596 880s -0.109 -0.132 880s 0.538 0.653 880s testing: mul_mat, [2, 1, 2, 1] = [2, 2, 2, 1] * [2, 1, 2, 1] 880s x0: [2, 2, 2, 1] 880s 0.588 -0.476 880s 0.578 -0.241 880s 880s x1: [2, 1, 2, 1] 880s 0.268 0.766 880s 880s y: [2, 1, 2, 1] 880s -0.207 -0.030 880s testing: mul_mat, [1, 2, 2, 3] = [2, 1, 2, 3] * [2, 2, 2, 3] 880s x0: [2, 1, 2, 3] 880s -0.072 0.077 880s 880s x1: [2, 2, 2, 3] 880s -0.913 -0.128 880s 0.667 0.616 880s 880s y: [1, 2, 2, 3] 880s 0.056 880s -0.000 880s test-mul-mat0: iter:132/500 880s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 3, 1, 1] 880s -0.060 880s -0.024 880s 0.158 880s 880s x1: [1, 4, 1, 1] 880s -0.443 880s 0.774 880s -0.333 880s -0.423 880s 880s y: [3, 4, 1, 1] 880s 0.027 0.010 -0.070 880s -0.046 -0.018 0.123 880s 0.020 0.008 -0.053 880s 0.025 0.010 -0.067 880s testing: mul_mat, [2, 3, 3, 1] = [1, 2, 3, 1] * [1, 3, 3, 1] 880s x0: [1, 2, 3, 1] 880s 0.452 880s 0.571 880s 880s x1: [1, 3, 3, 1] 880s -0.518 880s 0.142 880s 0.889 880s 880s y: [2, 3, 3, 1] 880s -0.234 -0.296 880s 0.064 0.081 880s 0.402 0.508 880s testing: mul_mat, [1, 2, 3, 4] = [1, 1, 3, 4] * [1, 2, 3, 4] 880s x0: [1, 1, 3, 4] 880s -0.992 880s 880s x1: [1, 2, 3, 4] 880s 0.197 880s 0.569 880s 880s y: [1, 2, 3, 4] 880s -0.195 880s -0.565 880s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 2, 1, 1] 880s -0.371 880s 0.629 880s 880s x1: [1, 1, 1, 1] 880s 0.787 880s 880s y: [2, 1, 1, 1] 880s -0.292 0.495 880s testing: mul_mat, [2, 1, 3, 1] = [2, 2, 3, 1] * [2, 1, 3, 1] 880s x0: [2, 2, 3, 1] 880s -0.449 0.856 880s -0.951 0.711 880s 880s x1: [2, 1, 3, 1] 880s 0.850 -0.466 880s 880s y: [2, 1, 3, 1] 880s -0.781 -1.140 880s testing: mul_mat, [4, 2, 3, 4] = [2, 4, 3, 4] * [2, 2, 3, 4] 880s x0: [2, 4, 3, 4] 880s 0.250 -0.915 880s 0.027 0.748 880s 0.178 0.640 880s -0.761 0.300 880s 880s x1: [2, 2, 3, 4] 880s -0.080 -0.309 880s -0.334 0.236 880s 880s y: [4, 2, 3, 4] 880s 0.262 -0.233 -0.212 -0.032 880s -0.299 0.167 0.091 0.325 880s test-mul-mat0: iter:133/500 880s testing: mul_mat, [1, 4, 1, 1] = [3, 1, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 1, 1, 1] 880s -0.439 -0.610 -0.220 880s 880s x1: [3, 4, 1, 1] 880s -0.390 0.736 0.010 880s -0.487 0.525 -0.823 880s -0.676 0.065 -0.745 880s -0.415 -0.108 0.612 880s 880s y: [1, 4, 1, 1] 880s -0.280 880s 0.074 880s 0.420 880s 0.113 880s testing: mul_mat, [3, 1, 3, 1] = [3, 3, 3, 1] * [3, 1, 3, 1] 880s x0: [3, 3, 3, 1] 880s -0.776 0.230 -0.273 880s -0.627 -0.671 0.337 880s -0.890 0.339 0.851 880s 880s x1: [3, 1, 3, 1] 880s 0.316 0.199 -0.401 880s 880s y: [3, 1, 3, 1] 880s -0.090 -0.467 -0.554 880s testing: mul_mat, [2, 3, 3, 4] = [3, 2, 3, 4] * [3, 3, 3, 4] 880s x0: [3, 2, 3, 4] 880s -0.722 -0.961 -0.617 880s 0.828 0.335 -0.483 880s 880s x1: [3, 3, 3, 4] 880s 0.022 0.189 0.485 880s -0.866 0.413 -0.284 880s -0.139 0.786 0.045 880s 880s y: [2, 3, 3, 4] 880s -0.497 -0.153 880s 0.405 -0.442 880s -0.682 0.127 880s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 1, 1, 1] 880s 0.259 -0.616 0.223 880s 880s x1: [3, 2, 1, 1] 880s 0.396 0.936 0.558 880s 0.494 0.306 -0.448 880s 880s y: [1, 2, 1, 1] 880s -0.350 880s -0.160 880s testing: mul_mat, [3, 3, 3, 1] = [1, 3, 3, 1] * [1, 3, 3, 1] 880s x0: [1, 3, 3, 1] 880s -0.592 880s -0.373 880s 0.514 880s 880s x1: [1, 3, 3, 1] 880s 0.365 880s -0.748 880s 0.473 880s 880s y: [3, 3, 3, 1] 880s -0.216 -0.136 0.188 880s 0.442 0.279 -0.384 880s -0.280 -0.177 0.243 880s testing: mul_mat, [2, 1, 3, 4] = [3, 2, 3, 4] * [3, 1, 3, 4] 880s x0: [3, 2, 3, 4] 880s -0.564 -0.962 -0.848 880s 0.802 -0.946 0.002 880s 880s x1: [3, 1, 3, 4] 880s 0.117 0.856 0.985 880s 880s y: [2, 1, 3, 4] 880s -1.724 -0.713 880s test-mul-mat0: iter:134/500 880s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 1, 1, 1] 880s 0.743 880s 880s x1: [1, 1, 1, 1] 880s -0.467 880s 880s y: [1, 1, 1, 1] 880s -0.347 880s testing: mul_mat, [4, 1, 4, 1] = [1, 4, 4, 1] * [1, 1, 4, 1] 880s x0: [1, 4, 4, 1] 880s -0.097 880s -0.539 880s 0.561 880s -0.409 880s 880s x1: [1, 1, 4, 1] 880s -0.242 880s 880s y: [4, 1, 4, 1] 880s 0.024 0.131 -0.136 0.099 880s testing: mul_mat, [4, 4, 4, 3] = [1, 4, 4, 3] * [1, 4, 4, 3] 880s x0: [1, 4, 4, 3] 880s -0.479 880s -0.317 880s 0.554 880s -0.142 880s 880s x1: [1, 4, 4, 3] 880s -0.956 880s 0.598 880s 0.932 880s -0.303 880s 880s y: [4, 4, 4, 3] 880s 0.458 0.303 -0.529 0.136 880s -0.286 -0.189 0.331 -0.085 880s -0.446 -0.295 0.516 -0.132 880s 0.145 0.096 -0.168 0.043 880s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 3, 1, 1] 880s 0.041 880s 0.437 880s -0.249 880s 880s x1: [1, 4, 1, 1] 880s -0.685 880s 0.644 880s -0.910 880s -0.866 880s 880s y: [3, 4, 1, 1] 880s -0.028 -0.299 0.171 880s 0.026 0.281 -0.161 880s -0.037 -0.397 0.227 880s -0.035 -0.378 0.216 880s testing: mul_mat, [3, 1, 4, 1] = [3, 3, 4, 1] * [3, 1, 4, 1] 880s x0: [3, 3, 4, 1] 880s -0.572 0.711 0.829 880s 0.431 0.698 0.773 880s -0.917 -0.514 0.228 880s 880s x1: [3, 1, 4, 1] 880s -0.366 -0.993 0.574 880s 880s y: [3, 1, 4, 1] 880s -0.020 -0.407 0.977 880s testing: mul_mat, [3, 3, 4, 3] = [3, 3, 4, 3] * [3, 3, 4, 3] 880s x0: [3, 3, 4, 3] 880s -0.934 -0.293 0.499 880s -0.652 -0.985 0.987 880s -0.029 -0.985 0.226 880s 880s x1: [3, 3, 4, 3] 880s 0.318 0.166 -0.610 880s -0.454 -0.961 -0.523 880s 0.419 0.001 -0.301 880s 880s y: [3, 3, 4, 3] 880s -0.651 -0.973 -0.311 880s 0.445 0.727 0.842 880s -0.542 -0.572 -0.081 880s test-mul-mat0: iter:135/500 880s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.267 0.658 880s 0.564 0.346 880s 880s x1: [2, 1, 1, 1] 880s 0.819 -0.818 880s 880s y: [2, 1, 1, 1] 880s -0.319 0.179 880s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 4, 1, 1] 880s -0.737 -0.136 880s 0.022 0.341 880s -0.516 -0.764 880s -0.039 -0.966 880s 880s x1: [2, 2, 1, 1] 880s 0.846 0.475 880s 0.756 0.786 880s 880s y: [4, 2, 1, 1] 880s -0.688 0.181 -0.800 -0.492 880s -0.664 0.285 -0.991 -0.789 880s testing: mul_mat, [4, 4, 1, 2] = [2, 4, 1, 2] * [2, 4, 1, 2] 880s x0: [2, 4, 1, 2] 880s 0.561 -0.865 880s -0.487 0.317 880s 0.921 -0.786 880s 0.580 -0.215 880s 880s x1: [2, 4, 1, 2] 880s 0.363 0.393 880s -0.629 0.033 880s -0.313 0.352 880s -0.335 -0.140 880s 880s y: [4, 4, 1, 2] 880s -0.136 -0.052 0.025 0.126 880s -0.382 0.317 -0.606 -0.372 880s -0.480 0.264 -0.565 -0.257 880s -0.067 0.119 -0.198 -0.164 880s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.102 0.610 880s 0.611 0.316 880s 880s x1: [2, 4, 1, 1] 880s 0.674 0.868 880s 0.962 0.026 880s -0.467 -0.178 880s -0.846 -0.019 880s 880s y: [2, 4, 1, 1] 880s 0.598 0.687 880s 0.114 0.596 880s -0.156 -0.342 880s -0.097 -0.523 880s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 4, 1, 1] 880s -0.806 -0.259 880s 0.956 -0.389 880s 0.375 -0.306 880s 0.773 -0.989 880s 880s x1: [2, 2, 1, 1] 880s 0.900 -0.352 880s -0.123 -0.965 880s 880s y: [4, 2, 1, 1] 880s -0.635 0.997 0.445 1.044 880s 0.349 0.258 0.249 0.860 880s testing: mul_mat, [4, 2, 1, 2] = [4, 4, 1, 2] * [4, 2, 1, 2] 880s x0: [4, 4, 1, 2] 880s -0.414 0.844 0.566 -0.876 880s 0.809 -0.809 0.430 -0.448 880s 0.030 0.657 -0.459 -0.741 880s 0.463 0.800 -0.823 -0.299 880s 880s x1: [4, 2, 1, 2] 880s -0.917 -0.424 0.314 -0.083 880s 0.851 -0.012 -0.216 0.812 880s 880s y: [4, 2, 1, 2] 880s 0.272 -0.227 -0.388 -0.997 880s -1.196 0.241 -0.485 0.319 880s test-mul-mat0: iter:136/500 880s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 1, 1, 1] 880s 0.479 880s 880s x1: [1, 4, 1, 1] 880s 0.372 880s -0.988 880s -0.062 880s -0.062 880s 880s y: [1, 4, 1, 1] 880s 0.178 880s -0.474 880s -0.030 880s -0.030 880s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 4, 1, 1] 880s -0.969 880s 0.374 880s -0.733 880s 0.897 880s 880s x1: [1, 1, 1, 1] 880s 0.115 880s 880s y: [4, 1, 1, 1] 880s -0.111 0.043 -0.084 0.103 880s testing: mul_mat, [4, 4, 1, 4] = [1, 4, 1, 4] * [1, 4, 1, 4] 880s x0: [1, 4, 1, 4] 880s -0.793 880s 0.279 880s -0.345 880s -0.781 880s 880s x1: [1, 4, 1, 4] 880s -0.450 880s -0.181 880s -0.250 880s -0.490 880s 880s y: [4, 4, 1, 4] 880s 0.357 -0.126 0.155 0.352 880s 0.144 -0.051 0.063 0.142 880s 0.198 -0.070 0.086 0.195 880s 0.388 -0.137 0.169 0.383 880s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 3, 1, 1] 880s 0.676 880s -0.354 880s -0.709 880s 880s x1: [1, 4, 1, 1] 880s -0.626 880s 0.141 880s 0.496 880s 0.992 880s 880s y: [3, 4, 1, 1] 880s -0.423 0.221 0.444 880s 0.096 -0.050 -0.100 880s 0.335 -0.175 -0.351 880s 0.671 -0.351 -0.703 880s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 1, 1, 1] 880s 0.458 0.279 0.010 880s 880s x1: [3, 1, 1, 1] 880s 0.627 0.606 0.138 880s 880s y: [1, 1, 1, 1] 880s 0.458 880s testing: mul_mat, [2, 3, 1, 4] = [1, 2, 1, 4] * [1, 3, 1, 4] 880s x0: [1, 2, 1, 4] 880s 0.823 880s -0.322 880s 880s x1: [1, 3, 1, 4] 880s 0.665 880s -0.441 880s 0.665 880s 880s y: [2, 3, 1, 4] 880s 0.548 -0.214 880s -0.364 0.142 880s 0.547 -0.214 880s test-mul-mat0: iter:137/500 880s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 2, 1, 1] 880s -0.985 0.783 0.451 880s -0.209 0.040 -0.004 880s 880s x1: [3, 2, 1, 1] 880s -0.557 -0.823 0.108 880s 0.901 0.457 -0.882 880s 880s y: [2, 2, 1, 1] 880s -0.046 0.083 880s -0.927 -0.167 880s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.730 -0.381 0.483 880s 0.549 -0.304 0.673 880s 880s x1: [3, 2, 1, 1] 880s 0.263 0.005 0.770 880s 0.462 0.052 -0.962 880s 880s y: [2, 2, 1, 1] 880s 0.562 0.661 880s -0.148 -0.410 880s testing: mul_mat, [4, 2, 1, 3] = [3, 4, 1, 3] * [3, 2, 1, 3] 880s x0: [3, 4, 1, 3] 880s -0.117 -0.364 -0.499 880s 0.935 -0.327 0.787 880s 0.665 0.292 0.270 880s 0.213 0.988 -0.057 880s 880s x1: [3, 2, 1, 3] 880s -0.703 -0.551 -0.491 880s 0.336 0.829 -0.048 880s 880s y: [4, 2, 1, 3] 880s 0.527 -0.864 -0.761 -0.667 880s -0.317 0.005 0.452 0.894 880s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.385 -0.287 0.027 880s -0.090 0.168 -0.736 880s -0.255 -0.579 0.968 880s 880s x1: [3, 4, 1, 1] 880s 0.417 0.457 0.014 880s -0.313 -0.329 0.002 880s 0.630 -0.819 0.439 880s 0.083 0.026 0.706 880s 880s y: [3, 4, 1, 1] 880s 0.029 0.029 -0.357 880s -0.026 -0.028 0.272 880s 0.489 -0.518 0.739 880s 0.043 -0.523 0.647 880s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.654 0.166 -0.309 880s -0.836 0.608 -0.777 880s 0.978 0.197 -0.689 880s 0.016 -0.395 -0.822 880s 880s x1: [3, 3, 1, 1] 880s 0.116 0.238 -0.620 880s -0.755 0.096 0.428 880s 0.875 -0.112 0.150 880s 880s y: [4, 3, 1, 1] 880s 0.155 0.529 0.588 0.417 880s 0.378 0.357 -1.015 -0.402 880s -0.637 -0.916 0.730 -0.065 880s testing: mul_mat, [1, 3, 1, 3] = [4, 1, 1, 3] * [4, 3, 1, 3] 880s x0: [4, 1, 1, 3] 880s 0.933 -0.222 0.908 -0.603 880s 880s x1: [4, 3, 1, 3] 880s 0.608 0.221 -0.077 -0.679 880s -0.611 0.344 0.347 -0.347 880s 0.312 -0.536 0.891 0.693 880s 880s y: [1, 3, 1, 3] 880s 0.858 880s -0.122 880s 0.802 880s test-mul-mat0: iter:138/500 880s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 4, 1, 1] 880s -0.654 0.485 880s -0.397 -0.068 880s 0.610 0.199 880s -0.135 -0.612 880s 880s x1: [2, 4, 1, 1] 880s -0.010 0.027 880s -0.127 0.424 880s 0.232 0.024 880s -0.919 0.748 880s 880s y: [4, 4, 1, 1] 880s 0.020 0.002 -0.001 -0.015 880s 0.288 0.021 0.007 -0.242 880s -0.140 -0.094 0.146 -0.046 880s 0.964 0.314 -0.412 -0.334 880s testing: mul_mat, [1, 4, 3, 1] = [2, 1, 3, 1] * [2, 4, 3, 1] 880s x0: [2, 1, 3, 1] 880s -0.117 0.770 880s 880s x1: [2, 4, 3, 1] 880s 0.107 0.262 880s 0.057 -0.292 880s -0.633 -0.287 880s 0.429 -0.945 880s 880s y: [1, 4, 3, 1] 880s 0.189 880s -0.232 880s -0.147 880s -0.777 880s testing: mul_mat, [1, 1, 3, 4] = [2, 1, 3, 4] * [2, 1, 3, 4] 880s x0: [2, 1, 3, 4] 880s -0.036 0.850 880s 880s x1: [2, 1, 3, 4] 880s 0.189 0.897 880s 880s y: [1, 1, 3, 4] 880s 0.755 880s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 4, 1, 1] 880s -0.710 0.589 880s -0.375 -0.460 880s -0.310 -0.205 880s -0.933 -0.365 880s 880s x1: [2, 1, 1, 1] 880s 0.854 -0.456 880s 880s y: [4, 1, 1, 1] 880s -0.875 -0.111 -0.171 -0.631 880s testing: mul_mat, [3, 2, 3, 1] = [4, 3, 3, 1] * [4, 2, 3, 1] 880s x0: [4, 3, 3, 1] 880s 0.390 0.930 -0.045 -0.638 880s 0.954 -0.252 0.610 -0.066 880s 0.294 0.929 -0.200 0.803 880s 880s x1: [4, 2, 3, 1] 880s 0.025 -0.139 -0.129 0.407 880s 0.324 0.003 -0.947 -0.103 880s 880s y: [3, 2, 3, 1] 880s -0.373 -0.047 0.231 880s 0.238 -0.262 0.205 880s testing: mul_mat, [2, 4, 3, 4] = [3, 2, 3, 4] * [3, 4, 3, 4] 880s x0: [3, 2, 3, 4] 880s 0.455 0.053 -0.939 880s 0.191 -0.327 -0.410 880s 880s x1: [3, 4, 3, 4] 880s 0.690 0.540 0.939 880s -0.510 0.902 -0.127 880s -0.707 -0.684 -0.296 880s -0.443 0.749 -0.642 880s 880s y: [2, 4, 3, 4] 880s -0.538 -0.430 880s -0.065 -0.340 880s -0.080 0.210 880s 0.440 -0.066 880s test-mul-mat0: iter:139/500 880s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 3, 1, 1] 880s 0.928 -0.217 -0.205 -0.499 880s -0.431 -0.544 0.294 0.582 880s 0.977 -0.104 -0.717 -0.645 880s 880s x1: [4, 3, 1, 1] 880s -0.974 0.790 0.851 0.838 880s 0.419 0.073 0.868 -0.707 880s 0.412 0.676 -0.480 0.549 880s 880s y: [3, 3, 1, 1] 880s -1.668 0.728 -2.184 880s 0.547 -0.376 0.235 880s 0.060 -0.367 0.323 880s testing: mul_mat, [1, 3, 4, 1] = [4, 1, 4, 1] * [4, 3, 4, 1] 880s x0: [4, 1, 4, 1] 880s -0.921 0.304 -0.546 -0.621 880s 880s x1: [4, 3, 4, 1] 880s 0.129 0.314 0.429 0.165 880s 0.950 -0.288 0.191 0.740 880s -0.437 0.029 0.160 0.636 880s 880s y: [1, 3, 4, 1] 880s -0.360 880s -1.525 880s -0.072 880s testing: mul_mat, [2, 1, 4, 1] = [4, 2, 4, 1] * [4, 1, 4, 1] 880s x0: [4, 2, 4, 1] 880s 0.927 -0.179 -0.410 -0.670 880s 0.955 0.992 -0.736 0.822 880s 880s x1: [4, 1, 4, 1] 880s -0.397 0.953 -0.300 -0.320 880s 880s y: [2, 1, 4, 1] 880s -0.201 0.524 880s testing: mul_mat, [3, 2, 1, 1] = [4, 3, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 3, 1, 1] 880s -0.260 -0.013 0.934 0.373 880s 0.272 0.578 -0.435 0.337 880s -0.349 0.492 0.103 0.462 880s 880s x1: [4, 2, 1, 1] 880s -0.813 0.590 0.487 -0.858 880s 0.582 0.751 0.964 0.569 880s 880s y: [3, 2, 1, 1] 880s 0.339 -0.381 0.228 880s 0.952 0.365 0.529 880s testing: mul_mat, [2, 4, 4, 1] = [3, 2, 4, 1] * [3, 4, 4, 1] 880s x0: [3, 2, 4, 1] 880s 0.705 0.794 -0.697 880s -0.903 -0.762 -0.500 880s 880s x1: [3, 4, 4, 1] 880s -0.267 0.897 0.998 880s -0.509 -0.180 -0.885 880s 0.722 -0.814 0.115 880s -0.121 -0.627 -0.295 880s 880s y: [2, 4, 4, 1] 880s -0.171 -0.941 880s 0.116 1.039 880s -0.218 -0.089 880s -0.378 0.734 880s testing: mul_mat, [1, 3, 4, 1] = [2, 1, 4, 1] * [2, 3, 4, 1] 880s x0: [2, 1, 4, 1] 880s 0.686 -0.773 880s 880s x1: [2, 3, 4, 1] 880s 0.612 -0.556 880s -0.993 -0.578 880s -0.732 -0.115 880s 880s y: [1, 3, 4, 1] 880s 0.850 880s -0.234 880s -0.413 880s test-mul-mat0: iter:140/500 880s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.954 -0.582 0.275 880s -0.540 0.245 0.907 880s -0.854 0.472 -0.737 880s -0.079 -0.563 -0.773 880s 880s x1: [3, 4, 1, 1] 880s 0.898 0.650 0.836 880s -0.219 0.861 0.121 880s -0.836 -0.665 -0.152 880s 0.319 -0.800 -0.468 880s 880s y: [4, 4, 1, 1] 880s -1.005 0.434 -1.076 -1.084 880s -0.259 0.439 0.504 -0.561 880s 1.143 0.151 0.512 0.558 880s 0.032 -0.793 -0.305 0.787 880s testing: mul_mat, [3, 4, 3, 1] = [3, 3, 3, 1] * [3, 4, 3, 1] 880s x0: [3, 3, 3, 1] 880s -0.415 0.506 -0.056 880s 0.381 -0.458 -0.047 880s -0.658 0.913 0.810 880s 880s x1: [3, 4, 3, 1] 880s 0.308 0.847 -0.803 880s -0.883 0.524 0.052 880s -0.985 0.174 -0.112 880s -0.204 0.036 -0.991 880s 880s y: [3, 4, 3, 1] 880s 0.346 -0.232 -0.080 880s 0.629 -0.579 1.102 880s 0.503 -0.450 0.717 880s 0.159 -0.047 -0.636 880s testing: mul_mat, [2, 3, 3, 2] = [3, 2, 3, 2] * [3, 3, 3, 2] 880s x0: [3, 2, 3, 2] 880s 0.964 -0.510 0.577 880s -0.009 -0.694 -0.069 880s 880s x1: [3, 3, 3, 2] 880s -0.133 0.568 -0.315 880s -0.078 -0.847 -0.808 880s 0.866 0.534 -0.266 880s 880s y: [2, 3, 3, 2] 880s -0.599 -0.371 880s -0.109 0.645 880s 0.409 -0.360 880s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 2, 1, 1] 880s -0.450 0.548 0.219 880s -0.137 -0.763 0.568 880s 880s x1: [3, 2, 1, 1] 880s 0.232 0.417 -0.797 880s 0.084 0.855 -0.202 880s 880s y: [2, 2, 1, 1] 880s -0.050 -0.802 880s 0.386 -0.778 880s testing: mul_mat, [1, 3, 3, 1] = [2, 1, 3, 1] * [2, 3, 3, 1] 880s x0: [2, 1, 3, 1] 880s -0.268 0.425 880s 880s x1: [2, 3, 3, 1] 880s 0.563 -0.506 880s -0.430 -0.343 880s -0.483 -0.198 880s 880s y: [1, 3, 3, 1] 880s -0.366 880s -0.030 880s 0.046 880s testing: mul_mat, [4, 2, 3, 2] = [1, 4, 3, 2] * [1, 2, 3, 2] 880s x0: [1, 4, 3, 2] 880s 0.440 880s -0.222 880s -0.399 880s 0.656 880s 880s x1: [1, 2, 3, 2] 880s 0.137 880s 0.085 880s 880s y: [4, 2, 3, 2] 880s 0.060 -0.030 -0.055 0.090 880s 0.038 -0.019 -0.034 0.056 880s test-mul-mat0ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s : iter:141/500 880s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.500 -0.637 -0.512 880s -0.970 0.334 0.047 880s -0.217 0.914 -0.257 880s 880s x1: [3, 4, 1, 1] 880s -0.745 0.855 0.947 880s 0.695 -0.367 -0.452 880s 0.350 -0.490 -0.872 880s 0.275 -0.606 -0.011 880s 880s y: [3, 4, 1, 1] 880s -1.402 1.053 0.700 880s 0.812 -0.817 -0.370 880s 0.933 -0.544 -0.300 880s 0.529 -0.470 -0.611 880s testing: mul_mat, [2, 3, 3, 1] = [3, 2, 3, 1] * [3, 3, 3, 1] 880s x0: [3, 2, 3, 1] 880s 0.577 -0.351 0.072 880s -0.802 -0.700 -0.114 880s 880s x1: [3, 3, 3, 1] 880s -0.379 0.651 0.814 880s -0.237 -0.886 0.822 880s 0.704 0.871 0.424 880s 880s y: [2, 3, 3, 1] 880s -0.389 -0.245 880s 0.234 0.716 880s 0.130 -1.222 880s testing: mul_mat, [1, 2, 3, 4] = [3, 1, 3, 4] * [3, 2, 3, 4] 880s x0: [3, 1, 3, 4] 880s -0.994 -0.409 0.414 880s 880s x1: [3, 2, 3, 4] 880s 0.279 0.238 0.470 880s 0.557 -0.499 -0.494 880s 880s y: [1, 2, 3, 4] 880s -0.180 880s -0.554 880s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.413 -0.696 -0.891 880s 0.256 -0.380 0.442 880s 0.027 -0.629 0.296 880s 0.642 0.090 0.131 880s 880s x1: [3, 1, 1, 1] 880s -0.634 -0.090 0.684 880s 880s y: [4, 1, 1, 1] 880s -0.285 0.174 0.242 -0.325 880s testing: mul_mat, [1, 3, 3, 1] = [4, 1, 3, 1] * [4, 3, 3, 1] 880s x0: [4, 1, 3, 1] 880s -0.506 0.972 0.505 0.852 880s 880s x1: [4, 3, 3, 1] 880s -0.241 -0.925 -0.119 -0.273 880s 0.088 -0.095 0.017 0.900 880s -0.619 0.358 -0.282 -0.552 880s 880s y: [1, 3, 3, 1] 880s -1.069 880s 0.639 880s 0.048 880s testing: mul_mat, [2, 4, 3, 4] = [1, 2, 3, 4] * [1, 4, 3, 4] 880s x0: [1, 2, 3, 4] 880s -0.086 880s 0.544 880s 880s x1: [1, 4, 3, 4] 880s -0.107 880s -0.786 880s -0.590 880s -0.630 880s 880s y: [2, 4, 3, 4] 880s 0.009 -0.058 880s 0.068 -0.428 880s 0.051 -0.321 880s 0.054 -0.343 880s test-mul-mat0: iter:142/500 880s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 3, 1, 1] 880s -0.670 0.946 880s -0.992 -0.500 880s -0.468 0.525 880s 880s x1: [2, 2, 1, 1] 880s 0.259 0.415 880s 0.328 -0.827 880s 880s y: [3, 2, 1, 1] 880s 0.219 -0.464 0.097 880s -1.002 0.088 -0.588 880s testing: mul_mat, [2, 3, 3, 1] = [2, 2, 3, 1] * [2, 3, 3, 1] 880s x0: [2, 2, 3, 1] 880s 0.439 -0.016 880s -0.274 -0.233 880s 880s x1: [2, 3, 3, 1] 880s 0.569 0.161 880s 0.955 0.214 880s -0.975 -0.810 880s 880s y: [2, 3, 3, 1] 880s 0.247 -0.193 880s 0.416 -0.311 880s -0.415 0.456 880s testing: mul_mat, [2, 2, 3, 2] = [2, 2, 3, 2] * [2, 2, 3, 2] 880s x0: [2, 2, 3, 2] 880s -0.242 -0.387 880s 0.496 -0.645 880s 880s x1: [2, 2, 3, 2] 880s 0.166 -0.205 880s 0.174 -0.620 880s 880s y: [2, 2, 3, 2] 880s 0.039 0.215 880s 0.198 0.486 880s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 3, 1, 1] 880s 0.713 -0.674 880s -0.101 -0.606 880s -0.202 0.153 880s 880s x1: [2, 2, 1, 1] 880s -0.021 -0.622 880s -0.798 0.096 880s 880s y: [3, 2, 1, 1] 880s 0.405 0.379 -0.091 880s -0.633 0.023 0.176 880s testing: mul_mat, [1, 2, 3, 1] = [3, 1, 3, 1] * [3, 2, 3, 1] 880s x0: [3, 1, 3, 1] 880s -0.369 0.801 -0.216 880s 880s x1: [3, 2, 3, 1] 880s -0.427 0.109 -0.327 880s 0.615 -0.002 -0.426 880s 880s y: [1, 2, 3, 1] 880s 0.316 880s -0.136 880s testing: mul_mat, [4, 3, 3, 2] = [1, 4, 3, 2] * [1, 3, 3, 2] 880s x0: [1, 4, 3, 2] 880s 0.100 880s -0.701 880s -0.540 880s 0.731 880s 880s x1: [1, 3, 3, 2] 880s -0.128 880s -0.810 880s -0.226 880s 880s y: [4, 3, 3, 2] 880s -0.013 0.090 0.069 -0.094 880s -0.081 0.568 0.438 -0.593 880s -0.023 0.159 0.122 -0.166 880s test-mul-mat0: iter:143/500 880s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 3, 1, 1] 880s -0.641 0.896 0.945 880s 0.340 -0.643 0.372 880s 0.619 -0.735 0.859 880s 880s x1: [3, 4, 1, 1] 880s 0.455 -0.825 -0.561 880s -0.445 -0.526 -0.101 880s -0.713 -0.427 -0.858 880s -0.149 0.568 -0.791 880s 880s y: [3, 4, 1, 1] 880s -1.561 0.476 0.407 880s -0.282 0.150 0.025 880s -0.736 -0.287 -0.864 880s -0.143 -0.710 -1.189 880s testing: mul_mat, [3, 3, 3, 1] = [3, 3, 3, 1] * [3, 3, 3, 1] 880s x0: [3, 3, 3, 1] 880s 0.452 -0.269 -0.257 880s 0.647 -0.075 0.624 880s -0.033 0.572 0.761 880s 880s x1: [3, 3, 3, 1] 880s -0.804 -0.806 -0.119 880s 0.319 -0.354 -0.863 880s -0.543 0.344 -0.478 880s 880s y: [3, 3, 3, 1] 880s -0.116 -0.535 -0.524 880s 0.461 -0.306 -0.870 880s -0.215 -0.676 -0.149 880s testing: mul_mat, [4, 3, 3, 2] = [3, 4, 3, 2] * [3, 3, 3, 2] 880s x0: [3, 4, 3, 2] 880s -0.257 0.781 -0.765 880s -0.940 -0.684 -0.135 880s -0.197 -0.906 -0.055 880s 0.440 -0.998 0.926 880s 880s x1: [3, 3, 3, 2] 880s 0.244 0.450 0.893 880s -0.871 -0.099 -0.376 880s -0.128 -0.451 0.549 880s 880s y: [4, 3, 3, 2] 880s -0.395 -0.658 -0.504 0.486 880s 0.434 0.937 0.281 -0.633 880s -0.740 0.355 0.404 0.902 880s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 4, 1, 1] 880s 0.269 0.837 0.507 880s -0.553 -0.622 0.190 880s -0.744 -0.370 -0.949 880s 0.917 0.627 0.703 880s 880s x1: [3, 4, 1, 1] 880s 0.236 -0.260 0.385 880s -0.724 -0.433 -0.094 880s 0.371 -0.105 0.867 880s -0.947 0.709 -0.765 880s 880s y: [4, 4, 1, 1] 880s 0.042 0.104 -0.445 0.325 880s -0.606 0.652 0.789 -1.002 880s 0.451 0.025 -1.059 0.883 880s -0.050 -0.062 1.169 -0.962 880s testing: mul_mat, [3, 3, 3, 1] = [4, 3, 3, 1] * [4, 3, 3, 1] 880s x0: [4, 3, 3, 1] 880s -0.812 -0.410 -0.638 -0.813 880s -0.096 -0.444 0.102 0.356 880s 0.506 -0.060 -0.759 0.776 880s 880s x1: [4, 3, 3, 1] 880s 0.911 0.976 0.768 -0.110 880s -0.655 0.143 -0.874 0.085 880s -0.472 -0.598 0.652 0.434 880s 880s y: [3, 3, 3, 1] 880s -1.541 -0.483 -0.267 880s 0.961 -0.059 0.390 880s -0.140 0.532 -0.360 880s testing: mul_mat, [1, 4, 3, 2] = [3, 1, 3, 2] * [3, 4, 3, 2] 880s x0: [3, 1, 3, 2] 880s -0.485 0.236 -0.782 880s 880s x1: [3, 4, 3, 2] 880s 0.425 -0.971 -0.623 880s 0.666 -0.784 0.733 880s 0.442 -0.375 -0.386 880s 0.201 0.511 -0.064 880s 880s y: [1, 4, 3, 2] 880s 0.052 880s -1.081 880s -0.001 880s 0.073 880s test-mul-mat0: iter:144/500 880s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 4, 1, 1] 880s -0.555 -0.835 -0.585 -0.370 880s -0.232 -0.915 -0.222 -0.147 880s -0.626 0.031 -0.074 0.059 880s -0.752 0.978 0.584 -0.434 880s 880s x1: [4, 3, 1, 1] 880s 0.931 0.928 -0.159 0.790 880s -0.067 -0.843 -0.491 -0.337 880s 0.298 0.025 0.899 0.516 880s 880s y: [4, 3, 1, 1] 880s -1.491 -1.146 -0.496 -0.228 880s 1.153 0.945 0.033 -0.914 880s -0.903 -0.367 -0.223 0.100 880s testing: mul_mat, [2, 4, 4, 1] = [4, 2, 4, 1] * [4, 4, 4, 1] 880s x0: [4, 2, 4, 1] 880s -0.421 -0.778 -0.650 -0.426 880s -0.670 0.477 -0.480 -0.844 880s 880s x1: [4, 4, 4, 1] 880s -0.573 0.422 -0.503 -0.645 880s -0.737 -0.713 0.288 -0.580 880s -0.204 0.951 0.718 0.821 880s 0.849 0.234 -0.513 -0.706 880s 880s y: [2, 4, 4, 1] 880s 0.514 1.371 880s 0.925 0.505 880s -1.471 -0.448 880s 0.094 0.385 880s testing: mul_mat, [4, 2, 4, 4] = [4, 4, 4, 4] * [4, 2, 4, 4] 880s x0: [4, 4, 4, 4] 880s -0.200 0.394 0.239 -0.144 880s 0.304 -0.730 -0.901 0.207 880s -0.671 -0.871 -0.520 -0.406 880s 0.425 0.315 -0.715 -0.078 880s 880s x1: [4, 2, 4, 4] 880s 0.177 -0.645 -0.907 0.506 880s 0.832 -0.387 0.662 -0.742 880s 880s y: [4, 2, 4, 4] 880s -0.579 1.447 0.709 0.481 880s -0.053 -0.215 -0.264 -0.184 880s testing: mul_mat, [4, 4, 1, 1] = [4, 4, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 4, 1, 1] 880s 0.408 -0.425 -0.870 -0.159 880s -0.308 0.425 0.383 0.833 880s -0.521 0.941 0.641 -0.434 880s -0.195 0.842 0.430 0.206 880s 880s x1: [4, 4, 1, 1] 880s 0.135 -0.635 -0.031 0.230 880s 0.844 0.909 -0.646 -0.807 880s 0.885 0.005 -0.632 -0.042 880s 0.066 -0.129 0.419 0.575 880s 880s y: [4, 4, 1, 1] 880s 0.316 -0.132 -0.788 -0.527 880s 0.648 -0.793 0.352 0.156 880s 0.915 -0.548 -0.844 -0.450 880s -0.373 0.564 -0.137 0.177 880s testing: mul_mat, [2, 4, 4, 1] = [4, 2, 4, 1] * [4, 4, 4, 1] 880s x0: [4, 2, 4, 1] 880s 0.367 -0.425 0.724 0.893 880s -0.177 0.853 0.036 -0.153 880s 880s x1: [4, 4, 4, 1] 880s -0.199 0.796 0.050 -0.289 880s -0.850 0.243 -0.404 0.155 880s 0.611 0.554 -0.779 -0.518 880s -0.028 0.796 0.771 -0.620 880s 880s y: [2, 4, 4, 1] 880s -0.633 0.760 880s -0.570 0.320 880s -1.039 0.415 880s -0.344 0.806 880s testing: mul_mat, [3, 4, 4, 4] = [2, 3, 4, 4] * [2, 4, 4, 4] 880s x0: [2, 3, 4, 4] 880s -0.260 0.620 880s -0.918 0.848 880s -0.102 -0.214 880s 880s x1: [2, 4, 4, 4] 880s 0.788 -0.227 880s 0.923 0.512 880s 0.809 0.816 880s -0.641 0.787 880s 880s y: [3, 4, 4, 4] 880s -0.346 -0.916 -0.032 880s 0.077 -0.414 -0.204 880s 0.296 -0.051 -0.257 880s 0.655 1.256 -0.103 880s test-mul-mat0: iter:145/500 880s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.420 -0.973 -0.403 880s -0.725 -0.230 0.558 880s -0.112 0.576 -0.102 880s 0.169 -0.156 0.924 880s 880s x1: [3, 1, 1, 1] 880s -0.225 -0.896 -0.931 880s 880s y: [4, 1, 1, 1] 880s 1.342 -0.151 -0.396 -0.759 880s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.710 -0.488 -0.281 880s 0.814 -0.419 0.797 880s -0.606 -0.393 -0.605 880s 880s x1: [3, 4, 1, 1] 880s 0.103 -0.391 0.038 880s -0.411 -0.873 -0.681 880s -0.631 0.411 -0.414 880s 0.263 0.373 -0.723 880s 880s y: [3, 4, 1, 1] 880s 0.253 0.279 0.068 880s 0.326 -0.511 1.004 880s -0.532 -1.017 0.472 880s 0.208 -0.518 0.131 880s testing: mul_mat, [3, 3, 1, 2] = [3, 3, 1, 2] * [3, 3, 1, 2] 880s x0: [3, 3, 1, 2] 880s 0.317 -0.004 0.994 880s 0.027 0.508 -0.287 880s -0.158 -0.912 -0.490 880s 880s x1: [3, 3, 1, 2] 880s -0.331 0.377 0.953 880s 0.557 -0.047 -0.149 880s -0.273 0.797 -0.224 880s 880s y: [3, 3, 1, 2] 880s 0.841 -0.091 -0.758 880s 0.029 0.034 0.027 880s -0.313 0.462 -0.574 880s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 4, 1, 1] 880s 0.706 -0.853 0.983 880s 0.151 0.271 -0.108 880s 0.276 -0.950 0.743 880s 0.023 0.654 -0.506 880s 880s x1: [3, 3, 1, 1] 880s -0.811 -0.177 -0.515 880s -0.982 0.230 -0.701 880s -0.563 0.762 -0.568 880s 880s y: [4, 3, 1, 1] 880s -0.928 -0.114 -0.439 0.127 880s -1.578 -0.010 -1.010 0.483 880s -1.606 0.183 -1.301 0.773 880s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 2, 1, 1] 880s 0.590 -0.862 0.184 0.058 880s -0.249 0.352 0.364 -0.666 880s 880s x1: [4, 3, 1, 1] 880s -0.873 -0.562 0.399 -0.968 880s -0.489 -0.743 0.494 0.537 880s 0.967 0.683 -0.640 -0.548 880s 880s y: [2, 3, 1, 1] 880s -0.014 0.809 880s 0.473 -0.318 880s -0.167 0.132 880s testing: mul_mat, [4, 4, 1, 2] = [2, 4, 1, 2] * [2, 4, 1, 2] 880s x0: [2, 4, 1, 2] 880s -0.636 -0.538 880s -0.990 -0.941 880s -0.642 -0.686 880s -0.276 0.213 880s 880s x1: [2, 4, 1, 2] 880s -0.360 -0.919 880s -0.519 0.910 880s -0.869 -0.865 880s 0.894 0.023 880s 880s y: [4, 4, 1, 2] 880s 0.724 1.221 0.861 -0.097 880s -0.160 -0.342 -0.291 0.337 880s 1.018 1.674 1.151 0.055 880s -0.580 -0.906 -0.589 -0.241 880s test-mul-mat0: iter:146/500 880s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 2, 1, 1] 880s -0.710 -0.631 -0.777 880s -0.034 -0.267 -0.767 880s 880s x1: [3, 2, 1, 1] 880s -0.995 0.959 0.225 880s -0.607 0.108 0.542 880s 880s y: [2, 2, 1, 1] 880s -0.074 -0.395 880s -0.058 -0.423 880s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.438 -0.695 0.143 880s -0.837 0.036 0.344 880s 880s x1: [3, 2, 1, 1] 880s 0.324 0.457 -0.305 880s 0.384 0.772 0.908 880s 880s y: [2, 2, 1, 1] 880s -0.219 -0.360 880s -0.238 0.020 880s testing: mul_mat, [1, 2, 1, 3] = [3, 1, 1, 3] * [3, 2, 1, 3] 880s x0: [3, 1, 1, 3] 880s -0.079 0.186 -0.065 880s 880s x1: [3, 2, 1, 3] 880s -0.032 -0.752 0.456 880s 0.904 0.596 0.461 880s 880s y: [1, 2, 1, 3] 880s -0.167 880s 0.009 880s testing: mul_mat, [3, 1, 1, 1] = [3, 3, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 3, 1, 1] 880s -0.316 -0.720 0.101 880s -0.540 0.922 0.776 880s 0.166 0.029 -1.000 880s 880s x1: [3, 1, 1, 1] 880s -0.706 -0.564 0.004 880s 880s y: [3, 1, 1, 1] 880s 0.629 -0.136 -0.138 880s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.744 0.273 -0.509 880s 0.975 -0.162 0.404 880s 880s x1: [3, 3, 1, 1] 880s 0.464 0.810 0.261 880s -0.805 -0.932 0.488 880s 0.659 -0.442 0.040 880s 880s y: [2, 3, 1, 1] 880s 0.433 0.426 880s -1.102 -0.436 880s 0.349 0.730 880s testing: mul_mat, [4, 3, 1, 3] = [2, 4, 1, 3] * [2, 3, 1, 3] 880s x0: [2, 4, 1, 3] 880s 0.191 -0.054 880s 0.906 -0.365 880s -0.945 0.492 880s 0.748 -0.080 880s 880s x1: [2, 3, 1, 3] 880s -0.763 -0.849 880s 0.699 -0.327 880s 0.155 -0.039 880s 880s y: [4, 3, 1, 3] 880s -0.099 -0.382 0.303 -0.503 880s 0.151 0.752 -0.821 0.549 880s 0.032 0.155 -0.166 0.119 880s test-mul-mat0: iter:147/500 880s testing: mul_mat, [3, 2, 1, 1] = [3, 3, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.682 -0.740 -0.265 880s -0.398 -0.832 0.064 880s -0.986 0.481 0.631 880s 880s x1: [3, 2, 1, 1] 880s -0.696 -0.170 0.570 880s 0.494 -0.264 0.625 880s 880s y: [3, 2, 1, 1] 880s -0.500 0.455 0.965 880s 0.367 0.063 -0.220 880s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.818 -0.327 0.673 880s -0.770 0.846 0.977 880s 880s x1: [3, 3, 1, 1] 880s 0.576 -0.508 -0.129 880s -0.707 -0.536 0.467 880s -0.376 -0.437 0.701 880s 880s y: [2, 3, 1, 1] 880s 0.550 -1.000 880s -0.089 0.547 880s 0.308 0.604 880s testing: mul_mat, [4, 2, 1, 3] = [3, 4, 1, 3] * [3, 2, 1, 3] 880s x0: [3, 4, 1, 3] 880s 0.906 -0.518 -0.414 880s -0.531 -0.817 0.078 880s -0.712 -0.144 -0.248 880s -0.482 -0.297 -0.271 880s 880s x1: [3, 2, 1, 3] 880s 0.060 0.416 0.471 880s 0.796 0.041 -0.286 880s 880s y: [4, 2, 1, 3] 880s -0.357 -0.335 -0.219 -0.280 880s 0.818 -0.478 -0.502 -0.319 880s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.305 -0.765 0.257 880s -0.998 0.933 -0.186 880s 0.275 -0.848 0.173 880s 880s x1: [3, 4, 1, 1] 880s -0.683 -0.013 0.558 880s 0.069 0.505 -0.739 880s 0.797 0.082 -0.620 880s 0.998 -0.545 0.540 880s 880s y: [3, 4, 1, 1] 880s -0.055 0.566 -0.080 880s -0.556 0.539 -0.537 880s 0.021 -0.604 0.042 880s 0.861 -1.605 0.831 880s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.056 -0.440 0.642 880s 0.989 -0.750 -0.370 880s 0.226 0.023 0.021 880s 880s x1: [3, 3, 1, 1] 880s 0.949 0.138 0.210 880s -0.398 0.369 0.729 880s 0.840 -0.932 -0.569 880s 880s y: [3, 3, 1, 1] 880s 0.127 0.757 0.222 880s 0.284 -0.940 -0.066 880s 0.091 1.740 0.156 880s testing: mul_mat, [4, 3, 1, 3] = [3, 4, 1, 3] * [3, 3, 1, 3] 880s x0: [3, 4, 1, 3] 880s -0.247 -0.056 0.854 880s -0.227 -0.354 0.890 880s -0.287 -0.747 0.221 880s -0.149 0.116 0.215 880s 880s x1: [3, 3, 1, 3] 880s -0.903 -0.830 0.933 880s 0.402 -0.828 0.209 880s 0.637 -0.895 0.361 880s 880s y: [4, 3, 1, 3] 880s 1.066 1.329 1.085 0.239 880s 0.125 0.388 0.549 -0.111 880s 0.201 0.494 0.565 -0.121 880s test-mul-mat0: iter:148/500 880s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 4, 1, 1] 880s 0.101 -0.663 -0.058 880s 0.299 0.831 -0.985 880s -0.075 -0.203 -0.628 880s -0.955 0.666 -0.827 880s 880s x1: [3, 1, 1, 1] 880s -0.471 -0.040 -0.412 880s 880s y: [4, 1, 1, 1] 880s 0.003 0.232 0.302 0.763 880s testing: mul_mat, [4, 4, 2, 1] = [3, 4, 2, 1] * [3, 4, 2, 1] 880s x0: [3, 4, 2, 1] 880s 0.658 -0.881 -0.768 880s 0.704 0.785 -0.596 880s 0.420 -0.786 0.046 880s 0.918 0.547 0.847 880s 880s x1: [3, 4, 2, 1] 880s 0.716 -0.572 -0.359 880s -0.501 0.334 -0.198 880s -0.966 0.013 0.675 880s -0.580 0.635 0.193 880s 880s y: [4, 4, 2, 1] 880s 1.251 0.269 0.734 0.041 880s -0.472 0.028 -0.482 -0.446 880s -1.165 -1.072 -0.385 -0.309 880s -1.090 -0.024 -0.734 -0.021 880s testing: mul_mat, [4, 4, 2, 2] = [3, 4, 2, 2] * [3, 4, 2, 2] 880s x0: [3, 4, 2, 2] 880s 0.459 -0.933 -0.412 880s -0.863 0.164 -0.738 880s 0.530 0.693 0.504 880s -0.795 0.242 -0.349 880s 880s x1: [3, 4, 2, 2] 880s 0.924 0.338 0.929 880s 0.768 -0.929 0.252 880s 0.628 0.730 0.371 880s 0.859 0.433 0.157 880s 880s y: [4, 4, 2, 2] 880s -0.274 -1.428 1.192 -0.977 880s 1.114 -1.002 -0.109 -0.924 880s -0.546 -0.696 1.026 -0.452 880s -0.075 -0.787 0.835 -0.633 880s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.714 0.484 -0.027 880s -0.471 -0.262 -0.675 880s 880s x1: [3, 4, 1, 1] 880s -0.735 -0.497 0.272 880s -0.760 -0.183 -0.173 880s 0.934 0.952 0.373 880s -0.801 0.278 -0.778 880s 880s y: [2, 4, 1, 1] 880s -0.773 0.292 880s -0.626 0.523 880s 1.118 -0.941 880s -0.416 0.829 880s testing: mul_mat, [4, 3, 2, 1] = [2, 4, 2, 1] * [2, 3, 2, 1] 880s x0: [2, 4, 2, 1] 880s -0.021 0.669 880s -0.158 0.267 880s 0.333 0.748 880s 0.796 0.042 880s 880s x1: [2, 3, 2, 1] 880s -0.977 -0.053 880s -0.142 -0.187 880s 0.127 -0.367 880s 880s y: [4, 3, 2, 1] 880s -0.015 0.140 -0.365 -0.780 880s -0.122 -0.028 -0.187 -0.121 880s -0.248 -0.118 -0.232 0.085 880s testing: mul_mat, [2, 2, 2, 2] = [4, 2, 2, 2] * [4, 2, 2, 2] 880s x0: [4, 2, 2, 2] 880s 0.207 0.614 0.199 -0.912 880s -0.642 0.521 -0.812 -0.338 880s 880s x1: [4, 2, 2, 2] 880s -0.233 0.084 0.501 0.714 880s 0.942 -0.686 -0.159 -0.425 880s 880s y: [2, 2, 2, 2] 880s -0.549 -0.454 880s 0.130 -0.689 880s test-mul-mat0: iter:149/500 880s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.070 0.738 -0.465 880s -0.493 0.924 -0.520 880s 880s x1: [3, 4, 1, 1] 880s 0.329 -0.061 -0.315 880s 0.216 -0.578 0.286 880s -0.630 -0.606 -0.194 880s 0.633 -0.813 0.040 880s 880s y: [2, 4, 1, 1] 880s 0.124 -0.055 880s -0.544 -0.789 880s -0.401 -0.148 880s -0.574 -1.084 880s testing: mul_mat, [4, 2, 3, 1] = [3, 4, 3, 1] * [3, 2, 3, 1] 880s x0: [3, 4, 3, 1] 880s -0.605 -0.610 -0.155 880s -0.417 0.430 0.519 880s 0.652 0.168 -0.946 880s -0.841 0.092 -0.466 880s 880s x1: [3, 2, 3, 1] 880s 0.774 -0.661 -0.698 880s 0.629 -0.539 -0.232 880s 880s y: [4, 2, 3, 1] 880s 0.043 -0.970 1.054 -0.386 880s -0.016 -0.615 0.539 -0.470 880s testing: mul_mat, [2, 4, 3, 2] = [3, 2, 3, 2] * [3, 4, 3, 2] 880s x0: [3, 2, 3, 2] 880s 0.122 -0.171 0.402 880s 0.142 -0.201 0.370 880s 880s x1: [3, 4, 3, 2] 880s 0.061 -0.465 0.681 880s 0.115 -0.306 -0.227 880s 0.649 -0.373 -0.796 880s -0.515 -0.811 -0.904 880s 880s y: [2, 4, 3, 2] 880s 0.361 0.354 880s -0.025 -0.006 880s -0.178 -0.128 880s -0.288 -0.244 880s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.779 0.779 -0.162 880s 0.421 0.439 0.213 880s 880s x1: [3, 2, 1, 1] 880s 0.577 -0.028 -0.989 880s 0.479 -0.641 -0.863 880s 880s y: [2, 2, 1, 1] 880s 0.588 0.020 880s 0.013 -0.264 880s testing: mul_mat, [2, 3, 3, 1] = [2, 2, 3, 1] * [2, 3, 3, 1] 880s x0: [2, 2, 3, 1] 880s -0.027 0.115 880s 0.858 0.332 880s 880s x1: [2, 3, 3, 1] 880s -0.569 0.878 880s 0.614 -0.621 880s 0.709 0.448 880s 880s y: [2, 3, 3, 1] 880s 0.116 -0.197 880s -0.088 0.321 880s 0.032 0.757 880s testing: mul_mat, [1, 2, 3, 2] = [2, 1, 3, 2] * [2, 2, 3, 2] 880s x0: [2, 1, 3, 2] 880s 0.545 -0.881 880s 880s x1: [2, 2, 3, 2] 880s -0.019 -0.868 880s 0.686 0.360 880s 880s y: [1, 2, 3, 2] 880s 0.755 880s 0.057 880s test-mul-mat0: iter:150/500 880s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 3, 1, 1] 880s 0.193 -0.441 880s 0.620 -0.250 880s 0.661 -0.671 880s 880s x1: [2, 2, 1, 1] 880s -0.699 0.765 880s 0.271 -0.570 880s 880s y: [3, 2, 1, 1] 880s -0.472 -0.625 -0.976 880s 0.304 0.311 0.562 880s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.737 0.524 880s -0.537 -0.186 880s 880s x1: [2, 3, 1, 1] 880s 0.422 -0.905 880s 0.032 0.173 880s 0.641 0.151 880s 880s y: [2, 3, 1, 1] 880s -0.163 -0.058 880s 0.114 -0.049 880s 0.551 -0.372 880s testing: mul_mat, [2, 2, 1, 2] = [2, 2, 1, 2] * [2, 2, 1, 2] 880s x0: [2, 2, 1, 2] 880s 0.136 0.553 880s 0.330 0.565 880s 880s x1: [2, 2, 1, 2] 880s -0.972 -0.171 880s 0.251 -0.627 880s 880s y: [2, 2, 1, 2] 880s -0.226 -0.418 880s -0.312 -0.271 880s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 4, 1, 1] 880s 0.303 0.651 880s 0.868 0.258 880s 0.522 0.977 880s -0.546 -0.886 880s 880s x1: [2, 2, 1, 1] 880s 0.273 0.785 880s -0.729 -0.305 880s 880s y: [4, 2, 1, 1] 880s 0.594 0.440 0.910 -0.845 880s -0.419 -0.711 -0.678 0.668 880s testing: mul_mat, [2, 2, 1, 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 2, 1, 1] 880s -0.968 -0.388 0.914 0.885 880s -0.651 0.362 -0.066 -0.962 880s 880s x1: [4, 2, 1, 1] 880s -0.928 -0.995 -0.057 0.323 880s -0.622 0.272 0.973 0.213 880s 880s y: [2, 2, 1, 1] 880s 1.518 -0.063 880s 1.575 0.234 880s testing: mul_mat, [4, 4, 1, 2] = [2, 4, 1, 2] * [2, 4, 1, 2] 880s x0: [2, 4, 1, 2] 880s 0.879 -0.521 880s 0.266 -0.240 880s 0.701 0.884 880s 0.852 -0.171 880s 880s x1: [2, 4, 1, 2] 880s -0.123 0.158 880s 0.824 0.148 880s 0.853 0.704 880s -0.549 0.721 880s 880s y: [4, 4, 1, 2] 880s -0.191 -0.071 0.053 -0.132 880s 0.647 0.184 0.708 0.676 880s 0.383 0.058 1.221 0.606 880s -0.858 -0.319 0.253 -0.591 880s test-mul-mat0: iter:151/500 880s testing: mul_mat, [1, 3, 1, 1] = [1, 1, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 1, 1, 1] 880s 0.236 880s 880s x1: [1, 3, 1, 1] 880s -0.218 880s 0.864 880s 0.535 880s 880s y: [1, 3, 1, 1] 880s -0.051 880s 0.204 880s 0.126 880s testing: mul_mat, [1, 1, 4, 1] = [1, 1, 4, 1] * [1, 1, 4, 1] 880s x0: [1, 1, 4, 1] 880s 0.151 880s 880s x1: [1, 1, 4, 1] 880s -0.981 880s 880s y: [1, 1, 4, 1] 880s -0.148 880s testing: mul_mat, [3, 1, 4, 1] = [1, 3, 4, 1] * [1, 1, 4, 1] 880s x0: [1, 3, 4, 1] 880s -0.016 880s 0.437 880s -0.226 880s 880s x1: [1, 1, 4, 1] 880s -0.694 880s 880s y: [3, 1, 4, 1] 880s 0.011 -0.303 0.157 880s testing: mul_mat, [4, 3, 1, 1] = [1, 4, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 4, 1, 1] 880s -0.900 880s -0.978 880s 0.843 880s -0.178 880s 880s x1: [1, 3, 1, 1] 880s 0.586 880s -0.136 880s -0.997 880s 880s y: [4, 3, 1, 1] 880s -0.527 -0.573 0.494 -0.104 880s 0.123 0.133 -0.115 0.024 880s 0.897 0.974 -0.840 0.178 880s testing: mul_mat, [1, 1, 4, 1] = [4, 1, 4, 1] * [4, 1, 4, 1] 880s x0: [4, 1, 4, 1] 880s 0.182 -0.589 0.748 0.986 880s 880s x1: [4, 1, 4, 1] 880s -0.403 0.301 0.973 0.052 880s 880s y: [1, 1, 4, 1] 880s 0.528 880s testing: mul_mat, [1, 4, 4, 1] = [1, 1, 4, 1] * [1, 4, 4, 1] 880s x0: [1, 1, 4, 1] 880s 0.870 880s 880s x1: [1, 4, 4, 1] 880s 0.169 880s 0.892 880s -0.987 880s 0.994 880s 880s y: [1, 4, 4, 1] 880s 0.147 880s 0.776 880s -0.858 880s 0.864 880s test-mul-mat0: iter:152/500 880s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 3, 1, 1] 880s -0.469 -0.924 -0.401 880s 0.330 -0.424 -0.552 880s 0.114 0.109 0.007 880s 880s x1: [3, 3, 1, 1] 880s 0.810 -0.450 -0.168 880s 0.007 0.389 -0.179 880s -0.824 0.281 -0.165 880s 880s y: [3, 3, 1, 1] 880s 0.104 0.551 0.042 880s -0.291 -0.065 0.042 880s 0.193 -0.300 -0.065 880s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.495 0.918 -0.393 880s -0.956 -0.250 0.613 880s 0.434 0.572 0.789 880s 880s x1: [3, 3, 1, 1] 880s -0.548 0.231 -0.252 880s -0.799 0.101 -0.525 880s 0.361 0.188 0.774 880s 880s y: [3, 3, 1, 1] 880s 0.040 0.312 -0.304 880s -0.096 0.417 -0.703 880s 0.046 0.082 0.874 880s testing: mul_mat, [3, 3, 1, 3] = [3, 3, 1, 3] * [3, 3, 1, 3] 880s x0: [3, 3, 1, 3] 880s -0.342 0.250 -0.491 880s 0.373 0.656 -0.532 880s -0.381 -0.861 0.026 880s 880s x1: [3, 3, 1, 3] 880s -0.285 -0.594 0.959 880s 0.246 -0.518 -0.442 880s -0.424 0.058 0.006 880s 880s y: [3, 3, 1, 3] 880s -0.521 -1.006 0.645 880s 0.004 -0.013 0.341 880s 0.157 -0.124 0.112 880s testing: mul_mat, [1, 3, 1, 1] = [3, 1, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 1, 1, 1] 880s 0.999 -0.294 0.401 880s 880s x1: [3, 3, 1, 1] 880s -0.321 0.349 0.661 880s 0.399 -0.993 -0.089 880s 0.908 0.380 -0.432 880s 880s y: [1, 3, 1, 1] 880s -0.158 880s 0.654 880s 0.622 880s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 3, 1, 1] 880s 0.266 880s 0.479 880s 0.102 880s 880s x1: [1, 3, 1, 1] 880s 0.193 880s 0.903 880s 0.433 880s 880s y: [3, 3, 1, 1] 880s 0.051 0.092 0.020 880s 0.240 0.432 0.092 880s 0.115 0.207 0.044 880s testing: mul_mat, [1, 1, 1, 3] = [3, 1, 1, 3] * [3, 1, 1, 3] 880s x0: [3, 1, 1, 3] 880s 0.367 0.793 0.704 880s 880s x1: [3, 1, 1, 3] 880s -0.973 -0.714 -0.530 880s 880s y: [1, 1, 1, 3] 880s -1.297 880s test-mul-mat0: iter:153/500 880s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 3, 1, 1] 880s 0.475 0.925 0.ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s 788 -0.411 880s -0.720 0.156 -0.618 0.984 880s -0.816 0.574 0.695 -0.876 880s 880s x1: [4, 3, 1, 1] 880s -0.123 -0.717 0.117 -0.690 880s -0.640 -0.617 0.789 0.462 880s -0.591 -0.925 0.933 0.695 880s 880s y: [3, 3, 1, 1] 880s -0.346 -0.774 0.374 880s -0.443 0.332 0.311 880s -0.687 0.389 -0.009 880s testing: mul_mat, [1, 3, 4, 1] = [4, 1, 4, 1] * [4, 3, 4, 1] 880s x0: [4, 1, 4, 1] 880s 0.054 0.969 -0.780 -0.553 880s 880s x1: [4, 3, 4, 1] 880s -0.326 -0.213 -0.368 -0.850 880s 0.878 0.846 0.027 -0.839 880s -0.038 0.337 -0.479 0.345 880s 880s y: [1, 3, 4, 1] 880s 0.533 880s 1.310 880s 0.508 880s testing: mul_mat, [2, 1, 4, 2] = [4, 2, 4, 2] * [4, 1, 4, 2] 880s x0: [4, 2, 4, 2] 880s -0.473 0.394 0.102 -0.764 880s 0.919 -0.095 0.589 -0.306 880s 880s x1: [4, 1, 4, 2] 880s 0.616 -0.036 0.412 0.402 880s 880s y: [2, 1, 4, 2] 880s -0.570 0.689 880s testing: mul_mat, [2, 2, 1, 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 2, 1, 1] 880s -0.200 0.800 -0.458 0.009 880s -0.081 0.708 -0.602 0.604 880s 880s x1: [4, 2, 1, 1] 880s -0.431 0.507 0.085 -0.740 880s 0.107 -0.354 0.322 0.046 880s 880s y: [2, 2, 1, 1] 880s 0.446 -0.104 880s -0.452 -0.426 880s testing: mul_mat, [3, 4, 4, 1] = [2, 3, 4, 1] * [2, 4, 4, 1] 880s x0: [2, 3, 4, 1] 880s 0.700 0.405 880s -0.994 -0.544 880s 0.366 0.213 880s 880s x1: [2, 4, 4, 1] 880s 0.640 0.977 880s 0.309 0.978 880s -0.869 0.705 880s -0.550 0.847 880s 880s y: [3, 4, 4, 1] 880s 0.844 -1.168 0.442 880s 0.613 -0.840 0.321 880s -0.323 0.480 -0.168 880s -0.042 0.086 -0.021 880s testing: mul_mat, [2, 2, 4, 2] = [3, 2, 4, 2] * [3, 2, 4, 2] 880s x0: [3, 2, 4, 2] 880s 0.196 -0.182 0.989 880s -0.948 -0.749 0.143 880s 880s x1: [3, 2, 4, 2] 880s 0.905 -0.901 0.715 880s 0.163 -0.510 0.743 880s 880s y: [2, 2, 4, 2] 880s 1.049 -0.081 880s 0.860 0.334 880s test-mul-mat0: iter:154/500 880s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.774 -0.590 -0.865 880s -0.628 0.742 -0.560 880s 880s x1: [3, 2, 1, 1] 880s 0.255 0.196 0.341 880s 0.395 -0.628 0.131 880s 880s y: [2, 2, 1, 1] 880s -0.213 -0.206 880s 0.562 -0.787 880s testing: mul_mat, [4, 2, 3, 1] = [3, 4, 3, 1] * [3, 2, 3, 1] 880s x0: [3, 4, 3, 1] 880s -0.233 0.763 0.661 880s 0.140 -0.106 -0.691 880s -0.086 0.304 -0.557 880s 0.285 0.046 -0.117 880s 880s x1: [3, 2, 3, 1] 880s 0.221 0.062 0.888 880s -0.475 0.424 0.873 880s 880s y: [4, 2, 3, 1] 880s 0.583 -0.590 -0.495 -0.038 880s 1.011 -0.715 -0.316 -0.218 880s testing: mul_mat, [1, 4, 3, 4] = [3, 1, 3, 4] * [3, 4, 3, 4] 880s x0: [3, 1, 3, 4] 880s 0.953 -0.168 0.108 880s 880s x1: [3, 4, 3, 4] 880s 0.713 -0.893 0.485 880s -0.844 0.392 -0.469 880s 0.039 0.899 -0.361 880s 0.811 -0.069 -0.829 880s 880s y: [1, 4, 3, 4] 880s 0.882 880s -0.921 880s -0.153 880s 0.695 880s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 2, 1, 1] 880s -0.315 -0.363 0.623 880s -0.004 0.235 -0.341 880s 880s x1: [3, 1, 1, 1] 880s 0.193 0.916 0.471 880s 880s y: [2, 1, 1, 1] 880s -0.100 0.054 880s testing: mul_mat, [4, 3, 3, 1] = [2, 4, 3, 1] * [2, 3, 3, 1] 880s x0: [2, 4, 3, 1] 880s -0.830 -0.145 880s 0.384 0.577 880s 0.247 -0.229 880s -0.914 0.851 880s 880s x1: [2, 3, 3, 1] 880s -0.721 -0.644 880s 0.762 -0.678 880s 0.405 0.024 880s 880s y: [4, 3, 3, 1] 880s 0.692 -0.649 -0.031 0.112 880s -0.534 -0.099 0.344 -1.273 880s -0.340 0.170 0.095 -0.350 880s testing: mul_mat, [4, 2, 3, 4] = [4, 4, 3, 4] * [4, 2, 3, 4] 880s x0: [4, 4, 3, 4] 880s -0.842 -0.832 0.976 -0.243 880s -0.320 0.972 0.469 0.596 880s 0.165 -0.327 -0.805 0.566 880s 0.768 0.442 0.456 0.422 880s 880s x1: [4, 2, 3, 4] 880s -0.339 -0.649 0.020 0.187 880s 0.569 0.603 0.241 0.739 880s 880s y: [4, 2, 3, 4] 880s 0.799 -0.402 0.245 -0.459 880s -0.925 0.958 0.121 1.126 880s test-mul-mat0: iter:155/500 880s testing: mul_mat, [4, 4, 1, 1] = [4, 4, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 4, 1, 1] 880s -0.733 -0.551 0.879 -0.974 880s -0.778 -0.782 -0.168 0.193 880s -0.185 -0.989 0.999 0.716 880s 0.589 -0.464 0.784 -0.242 880s 880s x1: [4, 4, 1, 1] 880s -0.933 0.169 0.566 -0.884 880s -0.160 -0.755 -0.531 -0.460 880s 0.630 0.713 0.115 0.601 880s -0.710 0.947 -0.856 0.732 880s 880s y: [4, 4, 1, 1] 880s 1.949 0.329 -0.063 0.030 880s 0.515 0.716 -0.083 -0.048 880s -1.339 -0.951 -0.275 -0.015 880s -1.467 0.098 -1.135 -1.705 880s testing: mul_mat, [1, 4, 1, 1] = [4, 1, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 1, 1, 1] 880s 1.000 -0.985 -0.226 -0.782 880s 880s x1: [4, 4, 1, 1] 880s -0.898 0.900 0.598 -0.653 880s -0.631 -0.862 0.977 -0.918 880s 0.253 0.578 -0.629 0.200 880s 0.722 -0.896 -0.879 0.989 880s 880s y: [1, 4, 1, 1] 880s -1.408 880s 0.716 880s -0.330 880s 1.030 880s testing: mul_mat, [2, 1, 1, 2] = [4, 2, 1, 2] * [4, 1, 1, 2] 880s x0: [4, 2, 1, 2] 880s 0.205 -0.397 -0.567 -0.197 880s -0.050 -0.198 -0.058 -0.073ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s 880s 880s x1: [4, 1, 1, 2] 880s -0.153 0.967 0.033 -0.142 880s 880s y: [2, 1, 1, 2] 880s -0.406 -0.175 880s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 1, 1, 1] 880s 0.936 -0.383 0.517 -0.618 880s 880s x1: [4, 2, 1, 1] 880s -0.784 0.911 0.516 -0.769 880s -0.315 0.733 0.077 -0.348 880s 880s y: [1, 2, 1, 1] 880s -0.341 880s -0.320 880s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 1, 1, 1] 880s -0.609 880s 880s x1: [1, 4, 1, 1] 880s 0.118 880s -0.950 880s 0.587 880s 0.722 880s 880s y: [1, 4, 1, 1] 880s -0.072 880s 0.579 880s -0.358 880s -0.440 880s testing: mul_mat, [1, 1, 1, 2] = [1, 1, 1, 2] * [1, 1, 1, 2] 880s x0: [1, 1, 1, 2] 880s 0.599 880s 880s x1: [1, 1, 1, 2] 880s -0.328 880s 880s y: [1, 1, 1, 2] 880s -0.196 880s test-mul-mat0: iter:156/500 880s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 4, 1, 1] 880s 0.954 -0.223 0.380 0.687 880s 0.854 -0.969 -0.546 0.790 880s -0.352 0.970 -0.828 0.767 880s -0.980 0.760 0.488 -0.497 880s 880s x1: [4, 1, 1, 1] 880s -0.670 -0.217 -0.562 -0.454 880s 880s y: [4, 1, 1, 1] 880s -1.116 -0.414 0.142 0.443 880s testing: mul_mat, [1, 4, 3, 1] = [4, 1, 3, 1] * [4, 4, 3, 1] 880s x0: [4, 1, 3, 1] 880s -0.629 0.770 0.623 -0.164 880s 880s x1: [4, 4, 3, 1] 880s -0.850 -0.840 0.789 0.483 880s 0.759 0.958 0.010 0.862 880s 0.382 0.932 -0.808 -0.835 880s -0.630 -0.262 -0.140 -0.676 880s 880s y: [1, 4, 3, 1] 880s 0.300 880s 0.126 880s 0.111 880s 0.218 880s testing: mul_mat, [4, 1, 3, 2] = [4, 4, 3, 2] * [4, 1, 3, 2] 880s x0: [4, 4, 3, 2] 880s 0.590 -0.821 -0.659 0.318 880s 0.530 0.744 -0.995 -0.879 880s 0.480 0.440 0.437 0.310 880s 0.582 -0.514 0.095 0.811 880s 880s x1: [4, 1, 3, 2] 880s -0.313 -0.408 0.737 0.435 880s 880s y: [4, 1, 3, 2] 880s -0.196 -1.585 0.127 0.451 880s testing: mul_mat, [3, 4, 1, 1] = [4, 3, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 3, 1, 1] 880s -0.552 -0.213 0.621 -0.074 880s 0.822 0.113 0.444 0.336 880s -0.385 0.952 -0.363 0.123 880s 880s x1: [4, 4, 1, 1] 880s -0.942 -0.753 0.354 0.404 880s -0.632 -0.524 0.966 0.735 880s -0.003 0.325 -0.169 -0.990 880s 0.292 -0.143 -0.455 -0.839 880s 880s y: [3, 4, 1, 1] 880s 0.870 -0.567 -0.433 880s 1.006 0.097 -0.515 880s -0.099 -0.374 0.250 880s -0.352 -0.260 -0.187 880s testing: mul_mat, [3, 4, 3, 1] = [3, 3, 3, 1] * [3, 4, 3, 1] 880s x0: [3, 3, 3, 1] 880s -0.743 0.465 0.504 880s 0.198 0.681 -0.826 880s 0.306 0.183 -0.647 880s 880s x1: [3, 4, 3, 1] 880s -0.141 0.656 0.181 880s 0.105 0.010 -0.414 880s 0.473 0.486 -0.448 880s 0.208 -0.517 0.877 880s 880s y: [3, 4, 3, 1] 880s 0.502 0.268 -0.041 880s -0.283 0.370 0.302 880s -0.351 0.795 0.524 880s 0.047 -1.036 -0.599 880s testing: mul_mat, [1, 3, 3, 2] = [3, 1, 3, 2] * [3, 3, 3, 2] 880s x0: [3, 1, 3, 2] 880s -0.872 -0.403 0.066 880s 880s x1: [3, 3, 3, 2] 880s -0.417 -0.852 -0.726 880s 0.337 -0.595 0.472 880s -0.357 0.871 0.153 880s 880s y: [1, 3, 3, 2] 880s 0.659 880s -0.023 880s -0.030 880s test-mul-mat0: iter:157/500 880s testing: mul_mat, [1, 2, 1, 1] = [2, 1, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 1, 1, 1] 880s 0.028 -0.684 880s 880s x1: [2, 2, 1, 1] 880s 0.096 -0.202 880s 0.786 -0.196 880s 880s y: [1, 2, 1, 1] 880s 0.141 880s 0.156 880s testing: mul_mat, [1, 1, 3, 1] = [2, 1, 3, 1] * [2, 1, 3, 1] 880s x0: [2, 1, 3, 1] 880s 0.590 -0.308 880s 880s x1: [2, 1, 3, 1] 880s -0.068 0.339 880s 880s y: [1, 1, 3, 1] 880s -0.145 880s testing: mul_mat, [2, 1, 3, 3] = [2, 2, 3, 3] * [2, 1, 3, 3] 880s x0: [2, 2, 3, 3] 880s -0.077 0.530 880s -0.565 0.329 880s 880s x1: [2, 1, 3, 3] 880s 0.163 0.394 880s 880s y: [2, 1, 3, 3] 880s 0.196 0.037 880s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 3, 1, 1] 880s 0.663 0.976 880s -0.815 -0.927 880s -0.422 0.528 880s 880s x1: [2, 2, 1, 1] 880s 0.964 0.494 880s -0.257 -0.860 880s 880s y: [3, 2, 1, 1] 880s 1.121 -1.243 -0.147 880s -1.010 1.007 -0.345 880s testing: mul_mat, [2, 2, 3, 1] = [3, 2, 3, 1] * [3, 2, 3, 1] 880s x0: [3, 2, 3, 1] 880s 0.809 0.331 0.235 880s 0.095 -0.448 -0.214 880s 880s x1: [3, 2, 3, 1] 880s 0.365 -0.255 -0.116 880s -0.224 -0.714 -0.081 880s 880s y: [2, 2, 3, 1] 880s 0.184 0.174 880s -0.437 0.316 880s testing: mul_mat, [3, 3, 3, 3] = [2, 3, 3, 3] * [2, 3, 3, 3] 880s x0: [2, 3, 3, 3] 880s -0.807 -0.556 880s 0.875 -0.301 880s 0.207 0.688 880s 880s x1: [2, 3, 3, 3] 880s 0.290 -0.681 880s 0.124 -0.014 880s -0.607 0.039 880s 880s y: [3, 3, 3, 3] 880s 0.144 0.459 -0.408 880s -0.092 0.113 0.016 880s 0.468 -0.543 -0.099 880s test-mul-mat0: iter:158/500 880s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.707 0.479 -0.662 880s -0.032 0.403 -0.359 880s -0.337 0.712 -0.152 880s -0.692 -0.118 -0.869 880s 880s x1: [3, 4, 1, 1] 880s 0.127 0.465 0.902 880s 0.660 -0.721 -0.612 880s 0.399 0.537 0.573 880s -0.173 -0.815 0.570 880s 880s y: [4, 4, 1, 1] 880s -0.464 -0.140 0.151 -0.92ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s 7 880s -0.406 -0.092 -0.642 0.160 880s -0.404 -0.002 0.161 -0.837 880s -0.645 -0.527 -0.609 -0.279 880s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.050 0.346 0.255 880s 0.136 -0.084 0.997 880s 0.428 -0.606 -0.666 880s -0.604 0.797 -0.025 880s 880s x1: [3, 4, 1, 1] 880s -0.601 -0.177 -0.806 880s -0.757 0.668 0.484 880s 0.370 0.133 0.386 880s 0.029 0.413 0.774 880s 880s y: [4, 4, 1, 1] 880s -0.237 -0.870 0.386 0.242 880s 0.393 0.324 -1.051 0.978 880s 0.126 0.424 -0.179 -0.126 880s 0.339 0.741 -0.753 0.292 880s testing: mul_mat, [2, 4, 1, 4] = [3, 2, 1, 4] * [3, 4, 1, 4] 880s x0: [3, 2, 1, 4] 880s -0.837 0.071 -0.102 880s -0.745 -0.414 0.952 880s 880s x1: [3, 4, 1, 4] 880s 0.059 0.510 0.823 880s 0.367 -0.608 0.954 880s 0.766 0.215 -0.853 880s -0.991 -0.117 0.631 880s 880s y: [2, 4, 1, 4] 880s -0.097 0.528 880s -0.447 0.886 880s -0.539 -1.472 880s 0.757 1.388 880s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.482 0.761 0.812 880s 0.175 -0.587 0.880 880s -0.539 0.115 -0.840 880s -0.227 0.291 -0.543 880s 880s x1: [3, 2, 1, 1] 880s -0.168 0.027 0.413 880s -0.198 -0.645 -0.896 880s 880s y: [4, 2, 1, 1] 880s 0.437 0.318 -0.253 -0.178 880s -1.123 -0.444 0.785 0.344 880s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 1, 1, 1] 880s -0.836 -0.127 0.497 -0.482 880s 880s x1: [4, 3, 1, 1] 880s -0.957 0.428 0.856 0.427 880s -0.538 -0.599 0.984 0.029 880s 0.308 0.864 0.136 0.459 880s 880s y: [1, 3, 1, 1] 880s 0.966 880s 1.001 880s -0.521 880s testing: mul_mat, [4, 4, 1, 4] = [1, 4, 1, 4] * [1, 4, 1, 4] 880s x0: [1, 4, 1, 4] 880s -0.726 880s 0.256 880s -0.541 880s -0.741 880s 880s x1: [1, 4, 1, 4] 880s -0.023 880s 0.060 880s 0.036 880s -0.847 880s 880s y: [4, 4, 1, 4] 880s 0.017 -0.006 0.012 0.017 880s -0.043 0.015 -0.032 -0.044 880s -0.026 0.009 -0.020 -0.027 880s 0.615 -0.217 0.459 0.628 880s test-mul-mat0: iter:159/500 880s testing: mul_mat, [2, 2, 1, 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 2, 1, 1] 880s -0.494 -0.165 0.800 0.761 880s 0.293 -0.941 -0.954 0.060 880s 880s x1: [4, 2, 1, 1] 880s 0.142 -0.171 0.873 -0.653 880s -0.425 -0.322 0.712 -0.223 880s 880s y: [2, 2, 1, 1] 880s 0.160 -0.669 880s 0.664 -0.514 880s testing: mul_mat, [2, 2, 3, 1] = [4, 2, 3, 1] * [4, 2, 3, 1] 880s x0: [4, 2, 3, 1] 880s 0.121 -0.845 0.893 0.180 880s -0.799 -0.047 0.362 0.669 880s 880s x1: [4, 2, 3, 1] 880s -0.819 0.467 -0.715 -0.001 880s 0.052 -0.616 0.314 0.155 880s 880s y: [2, 2, 3, 1] 880s -1.132 0.372 880s 0.835 0.205 880s testing: mul_mat, [2, 2, 3, 1] = [4, 2, 3, 1] * [4, 2, 3, 1] 880s x0: [4, 2, 3, 1] 880s 0.657 -0.736 0.863 -0.982 880s -0.205 0.637 0.366 0.122 880s 880s x1: [4, 2, 3, 1] 880s 0.348 -0.673 0.662 0.113 880s 0.648 -0.944 0.712 -0.231 880s 880s y: [2, 2, 3, 1] 880s 1.185 -0.244 880s 1.963 -0.501 880s testing: mul_mat, [3, 2, 1, 1] = [4, 3, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 3, 1, 1] 880s -0.806 0.831 0.129 -0.517 880s 0.203 -0.431 0.047 0.238 880s 0.447 -0.431 0.339 -0.123 880s 880s x1: [4, 2, 1, 1] 880s 0.567 0.818 0.538 -0.935 880s 0.528 -0.612 -0.669 0.185 880s 880s y: [3, 2, 1, 1] 880s 0.775 -0.435 0.198 880s -1.116 0.384 0.250 880s testing: mul_mat, [2, 4, 3, 1] = [3, 2, 3, 1] * [3, 4, 3, 1] 880s x0: [3, 2, 3, 1] 880s 0.299 -0.082 0.290 880s -0.816 -0.757 -0.629 880s 880s x1: [3, 4, 3, 1] 880s -0.952 0.034 -0.635 880s -0.974 0.389 0.164 880s -0.916 0.539 0.635 880s -0.056 0.106 0.452 880s 880s y: [2, 4, 3, 1] 880s -0.471 1.150 880s -0.275 0.398 880s -0.134 -0.060 880s 0.106 -0.319 880s testing: mul_mat, [1, 3, 3, 1] = [2, 1, 3, 1] * [2, 3, 3, 1] 880s x0: [2, 1, 3, 1] 880s 0.657 0.867 880s 880s x1: [2, 3, 3, 1] 880s 0.627 0.845 880s 0.806 -0.496 880s -0.015 -0.972 880s 880s y: [1, 3, 3, 1] 880s 1.144 880s 0.100 880s -0.852 880s test-mul-mat0: iter:160/500 880s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 4, 1, 1] 880s 0.330 -0.407 0.507 880s 0.358 -0.148 0.681 880s -0.356 0.230 0.070 880s -0.425 -0.108 0.758 880s 880s x1: [3, 1, 1, 1] 880s -0.323 0.524 -0.911 880s 880s y: [4, 1, 1, 1] 880s -0.782 -0.814 0.172 -0.610 880s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.246 -0.565 0.959 880s -0.230 -0.477 -0.519 880s 880s x1: [3, 4, 1, 1] 880s -0.310 0.473 -0.311 880s -0.904 -0.576 -0.654 880s 0.962 -0.274 0.600 880s 0.792 -0.456 0.973 880s 880s y: [2, 4, 1, 1] 880s -0.642 0.008 880s -0.524 0.822 880s 0.967 -0.402 880s 1.386 -0.469 880s testing: mul_mat, [3, 2, 1, 1] = [3, 3, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.198 -0.260 -0.324 880s -0.910 -0.503 0.366 880s 0.563 0.186 0.462 880s 880s x1: [3, 2, 1, 1] 880s -0.900 0.116 0.988 880s 0.458 0.968 0.670 880s 880s y: [3, 2, 1, 1] 880s -0.529 1.122 -0.029 880s -0.378 -ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s 0.658 0.747 880s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.539 -0.937 0.179 880s -0.544 -0.062 -0.074 880s 0.272 0.372 -0.170 880s 880s x1: [3, 3, 1, 1] 880s 0.987 0.532 0.424 880s -0.288 0.131 0.216 880s 0.256 0.105 -0.503 880s 880s y: [3, 3, 1, 1] 880s 0.110 -0.601 0.394 880s -0.240 0.132 -0.066 880s -0.050 -0.109 0.194 880s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 4, 1, 1] 880s 0.019 0.859 0.954 880s -0.565 -0.302 -0.213 880s 0.986 0.682 0.571 880s -0.449 0.074 -0.544 880s 880s x1: [3, 3, 1, 1] 880s 0.147 -0.404 -0.069 880s -0.655 0.336 0.607 880s -0.564 0.833 -0.027 880s 880s y: [4, 3, 1, 1] 880s -0.410 0.054 -0.170 -0.059 880s 0.855 0.139 -0.069 -0.011 880s 0.679 0.073 -0.003 0.330 880s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 2, 1, 1] 880s 0.823 0.496 0.442 -0.325 880s -0.531 0.656 0.495 0.877 880s 880s x1: [4, 3, 1, 1] 880s -0.674 -0.972 0.728 -0.611 880s -0.035 0.100 0.568 0.891 880s 0.930 -0.285 -0.513 -0.139 880s 880s y: [2, 3, 1, 1] 880s -0.516 -0.454 880s -0.018 1.147 880s 0.442 -1.057 880s test-mul-mat0: iter:161/500 880s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 1, 1, 1] 880s 0.265 0.348 -0.308 0.752 880s 880s x1: [4, 3, 1, 1] 880s -0.092 -0.191 -0.867 0.694 880s -0.619 -0.411 -0.980 -0.592 880s -0.682 -0.591 0.373 0.418 880s 880s y: [1, 3, 1, 1] 880s 0.698 880s -0.450 880s -0.187 880s testing: mul_mat, [3, 1, 3, 1] = [4, 3, 3, 1] * [4, 1, 3, 1] 880s x0: [4, 3, 3, 1] 880s -0.595 -0.963 -0.458 -0.462 880s 0.731 -0.078 0.127 0.752 880s 0.331 0.445 -0.839 -0.296 880s 880s x1: [4, 1, 3, 1] 880s -0.790 0.753 0.574 -0.322 880s 880s y: [3, 1, 3, 1] 880s -0.369 -0.806 -0.313 880s testing: mul_mat, [3, 3, 3, 4] = [4, 3, 3, 4] * [4, 3, 3, 4] 880s x0: [4, 3, 3, 4] 880s 0.430 -0.032 -0.746 -0.241 880s -0.069 0.852 0.598 0.186 880s 0.675 -0.002 -0.072 0.134 880s 880s x1: [4, 3, 3, 4] 880s -0.588 -0.128 0.078 0.743 880s -0.683 0.240 -0.553 0.180 880s 0.378 -0.584 0.391 -0.791 880s 880s y: [3, 3, 3, 4] 880s -0.486 0.117 -0.302 880s 0.068 -0.046 -0.398 880s 0.080 -0.437 0.122 880s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 3, 1, 1] 880s 0.254 -0.853 -0.606 0.930 880s 0.590 0.965 0.190 -0.589 880s -0.096 -0.450 0.484 0.551 880s 880s x1: [4, 3, 1, 1] 880s -0.908 0.708 -0.333 -0.574 880s -0.814 0.428 -0.861 -0.283 880s 0.546 0.564 0.393 -0.344 880s 880s y: [3, 3, 1, 1] 880s -1.167 0.422 -0.709 880s -0.313 -0.064 -0.687 880s -0.900 1.143 -0.305 880s testing: mul_mat, [4, 4, 3, 1] = [3, 4, 3, 1] * [3, 4, 3, 1] 880s x0: [3, 4, 3, 1] 880s 0.221 -0.696 0.480 880s 0.377 -0.167 0.031 880s 0.731 -0.328 -0.014 880s 0.877 0.949 -0.215 880s 880s x1: [3, 4, 3, 1] 880s -0.910 0.142 -0.051 880s -0.861 0.033 0.007 880s -0.770 -0.259 0.674 880s -0.343 -0.073 0.102 880s 880s y: [4, 4, 3, 1] 880s -0.324 -0.368 -0.711 -0.652 880s -0.210 -0.330 -0.640 -0.726 880s 0.334 -0.226 -0.487 -1.066 880s 0.024 -0.114 -0.229 -0.392 880s testing: mul_mat, [2, 3, 3, 4] = [4, 2, 3, 4] * [4, 3, 3, 4] 880s x0: [4, 2, 3, 4] 880s -0.445 -0.299 -0.502 -0.821 880s 0.585 -0.677 0.689 -0.491 880s 880s x1: [4, 3, 3, 4] 880s -0.453 -0.080 -0.085 -0.972 880s 0.952 0.900 -0.188 -0.406 880s 0.178 -0.509 0.753 -0.107 880s 880s y: [2, 3, 3, 4] 880s 1.066 0.207 880s -0.266 0.017 880s -0.217 1.020 880s test-mul-mat0: iter:162/500 880s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.232 0.880 880s 0.533 0.677 880s 880s x1: [2, 2, 1, 1] 880s -0.259 -0.373 880s 0.096 0.831 880s 880s y: [2, 2, 1, 1] 880s -0.388 -0.390 880s 0.754 0.614 880s testing: mul_mat, [1, 2, 3, 1] = [2, 1, 3, 1] * [2, 2, 3, 1] 880s x0: [2, 1, 3, 1] 880s 0.082 -0.876 880s 880s x1: [2, 2, 3, 1] 880s -0.922 0.036 880s 0.340 -0.842 880s 880s y: [1, 2, 3, 1] 880s -0.107 880s 0.765 880s testing: mul_mat, [4, 1, 3, 2] = [2, 4, 3, 2] * [2, 1, 3, 2] 880s x0: [2, 4, 3, 2] 880s -0.617 -0.273 880s -0.403 -0.940 880s 0.041 0.800 880s 0.514 0.288 880s 880s x1: [2, 1, 3, 2] 880s 0.908 -0.898 880s 880s y: [4, 1, 3, 2] 880s -0.316 0.478 -0.681 0.208 880s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 4, 1, 1] 880s 0.764 0.664 880s -0.609 0.634 880s 0.495 0.567 880s 0.302 0.534 880s 880s x1: [2, 4, 1, 1] 880s 0.214 0.090 880s -0.266 0.175 880s 0.083 0.329 880s 0.614 0.557 880s 880s y: [4, 4, 1, 1] 880s 0.223 -0.073 0.157 0.113 880s -0.087 0.273 -0.033 0.013 880s 0.283 0.158 0.228 0.201 880s 0.840 -0.021 0.620 0.483 880s testing: mul_mat, [1, 2, 3, 1] = [4, 1, 3, 1] * [4, 2, 3, 1] 880s x0: [4, 1, 3, 1] 880s -0.476 0.164 -0.142 -0.842 880s 880s x1: [4, 2, 3, 1] 880s 0.833 -0.127 0.204 0.546 880s 0.468 0.090 0.016 0.255 880s 880s y: [1, 2, 3, 1] 880s -0.906 880s -0.425 880s testing: mul_mat, [1, 4, 3, 2] = [1, 1, 3, 2] * [1, 4, 3, 2] 880s x0: [1, 1, 3, 2] 880s -0.134 880s 880s x1: [1, 4, 3, 2] 880s -0.448 880s -0.673 880s -0.854 880s 0.878 880s 880s y: [1, 4, 3, 2] 880s 0.060 880s 0.090 880s 0.115 880s -0.118 880s test-mul-mat0: iter:163/500 880s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.822 -0.107 0.367 880s 0.856 0.248 -0.470 880s 880s x1: [3, 1, 1, 1] 880s 0.580 -0.728 0.782 880s 880s y: [2, 1, 1, 1] 880s 0.841 -0.052 880s testing: mul_mat, [2, 2, 2, 1] = [3, 2, 2, 1] * [3, 2, 2, 1] 880s x0: [3, 2, 2, 1] 880s -0.238 0.583 0.090 880s -0.354 -0.462 0.984 880s 880s x1: [3, 2, 2, 1] 880s 0.987 -0.594 -0.277 880s 0.981 -0.244 0.440 880s 880s y: [2, 2, 2, 1] 880s -0.606 -0.347 880s -0.336 0.198 880s testing: mul_mat, [4, 2, 2, 3] = [3, 4, 2, 3] * [3, 2, 2, 3] 880s x0: [3, 4, 2, 3] 880s 0.025 -0.701 -0.968 880s 0.618 -0.245 0.149 880s 0.350 0.176 -0.419 880s -0.889 0.161 0.194 880s 880s x1: [3, 2, 2, 3] 880s -0.230 -0.954 -0.577 880s -0.375 0.294 -0.046 880s 880s y: [4, 2, 2, 3] 880s 1.221 0.005 -0.007 -0.061 880s -0.171 -0.311 -0.060 0.372 880s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 3, 1, 1] 880s -0.867 0.898 0.055 880s 0.655 0.129 0.382 880s 0.454 0.982 -0.876 880s 880s x1: [3, 4, 1, 1] 880s 0.701 0.301 -0.259 880s -0.607 -0.573 -0.403 880s -0.611 0.679 -0.421 880s 0.758 0.161 -0.395 880s 880s y: [3, 4, 1, 1] 880s -0.352 0.398 0.841 880s -0.010 -0.625 -0.485 880s 1.116 -0.474 0.758 880s -0.535 0.366 0.848 880s testing: mul_mat, [1, 3, 2, 1] = [3, 1, 2, 1] * [3, 3, 2, 1] 880s x0: [3, 1, 2, 1] 880s 0.604 -0.369 0.874 880s 880s x1: [3, 3, 2, 1] 880s 0.904 -0.554 0.499 880s 0.199 0.471 0.457 880s -0.884 0.036 -0.891 880s 880s y: [1, 3, 2, 1] 880s 1.187 880s 0.346 880s -1.327 880s testing: mul_mat, [3, 3, 2, 3] = [1, 3, 2, 3] * [1, 3, 2, 3] 880s x0: [1, 3, 2, 3] 880s -0.148 880s -0.204 880s -0.466 880s 880s x1: [1, 3, 2, 3] 880s 0.635 880s 0.414 880s 0.310 880s 880s y: [3, 3, 2, 3] 880s -0.094 -0.129 -0.296 880s -0.061 -0.084 -0.193 880s -0.046 -0.063 -0.144 880s test-mul-mat0: iter:164/500 880s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.339 -0.019 0.741 880s -0.990 -0.084 -0.408 880s -0.194 0.450 0.887 880s 0.628 -0.658 -0.213 880s 880s x1: [3, 1, 1, 1] 880s 0.904 -0.838 0.256 880s 880s y: [4, 1, 1, 1] 880s -0.101 -0.929 -0.326 1.065 880s testing: mul_mat, [3, 4, 4, 1] = [3, 3, 4, 1] * [3, 4, 4, 1] 880s x0: [3, 3, 4, 1] 880s 0.067 0.488 0.823 880s 0.475 0.535 -0.466 880s -0.524 -0.612 -0.305 880s 880s x1: [3, 4, 4, 1] 880s 0.081 0.107 0.423 880s -0.181 0.201 0.694 880s 0.972 -0.395 -0.345 880s 0.920 0.443 -0.790 880s 880s y: [3, 4, 4, 1] 880s 0.406 -0.101 -0.237 880s 0.658 -0.302 -0.240 880s -0.412 0.411 -0.162 880s -0.373 1.043 -0.512 880s testing: mul_mat, [2, 3, 4, 2] = [3, 2, 4, 2] * [3, 3, 4, 2] 880s x0: [3, 2, 4, 2] 880s 0.548 -0.843 0.354 880s -0.103 -0.289 -0.948 880s 880s x1: [3, 3, 4, 2] 880s 0.638 -0.762 -0.169 880s -0.667 0.603 0.701 880s 0.531 -0.358 -0.868 880s 880s y: [2, 3, 4, 2] 880s 0.932 0.314 880s -0.625 -0.769 880s 0.286 0.871 880s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 4, 1, 1] 880s 0.511 0.874 0.417 880s 0.500 0.380 -0.162 880s 0.501 0.912 -0.386 880s -0.660 0.796 -0.835 880s 880s x1: [3, 2, 1, 1] 880s 0.389 0.261 0.386 880s 0.935 -0.475 0.436 880s 880s y: [4, 2, 1, 1] 880s 0.588 0.231 0.284 -0.372 880s 0.244 0.217 -0.133 -1.359 880s testing: mul_mat, [3, 3, 4, 1] = [4, 3, 4, 1] * [4, 3, 4, 1] 880s x0: [4, 3, 4, 1] 880s 0.237 0.727 0.597 -0.116 880s -0.498 -0.130 0.151 -0.730 880s -0.677 -0.472 -0.471 0.144 880s 880s x1: [4, 3, 4, 1] 880s 0.816 0.226 0.067 0.669 880s -0.100 -0.576 -0.406 -0.509 880s -0.184 0.446 -0.268 -0.992 880s 880s y: [3, 3, 4, 1] 880s 0.320 -0.914 -0.595 880s -0.625 0.435 0.458 880s 0.235 0.718 -0.102 880s testing: mul_mat, [4, 4, 4, 2] = [3, 4, 4, 2] * [3, 4, 4, 2] 880s x0: [3, 4, 4, 2] 880s -0.877 -0.829 -0.579 880s 0.660 -0.430 0.702 880s 0.748 0.111 -0.357 880s -0.520 -0.142 -0.006 880s 880s x1: [3, 4, 4, 2] 880s 0.722 0.813 -0.836 880s -0.319 0.375 0.812 880s 0.604 -0.294 -0.062 880s -0.516 0.199 -0.687 880s 880s y: [4, 4, 4, 2] 880s -0.823 -0.460 0.929 -0.486 880s -0.501 0.198 -0.487 0.108 880s -0.250 0.482 0.441 -0.272 880s 0.686 -0.909 -0.119 0.244 880s test-mul-mat0: iter:165/500 880s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.446 0.410 880s -0.186 -0.554 880s 880s x1: [2, 4, 1, 1] 880s 0.980 -0.613 880s -0.678 -0.662 880s 0.590 0.255 880s -0.336 0.315 880s 880s y: [2, 4, 1, 1] 880s 0.185 0.158 880s -0.574 0.492 880s 0.368 -0.251 880s -0.021 -0.112 880s testing: mul_mat, [2, 2, 3, 1] = [2, 2, 3, 1] * [2, 2, 3, 1] 880s x0: [2, 2, 3, 1] 880s -0.499 -0.400 880s -0.271 0.500 880s 880s x1: [2, 2, 3, 1] 880s -0.257 -0.801 880s 0.435 -0.143 880s 880s y: [2, 2, 3, 1] 880s 0.449 -0.331 880s -0.159 -0.190 880s testing: mul_mat, [3, 2, 3, 1] = [2, 3, 3, 1] * [2, 2, 3, 1] 880s x0: [2, 3, 3, 1] 880s 0.915 -0.846 880s -0.840 0.984 880s -0.056 -0.912 880s 880s x1: [2, 2, 3, 1] 880s -0.208 0.224 880s 0.156 -0.798 880s 880s y: [3, 2, 3, 1] 880s -0.380 0.396 -0.193 880s 0.818 -0.917 0.719 880s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 1, 1, 1] 880s 0.009 -0.921 880s 880s x1: [2, 3, 1, 1] 880s -0.858 -0.267 880s 0.439 -0.656 880s -0.228 0.042 880s 880s y: [1, 3, 1, 1] 880s 0.238 880s 0.608 880s -0.040 880s testing: mul_mat, [3, 2, 3, 1] = [1, 3, 3, 1] * [1, 2, 3, 1] 880s x0: [1, 3, 3, 1] 880s 0.019 880s -0.676 880s 0.725 880s 880s x1: [1, 2, 3, 1] 880s 0.091 880s 0.120 880s 880s y: [3, 2, 3, 1] 880s 0.002 -0.061 0.066 880s 0.002 -0.081 0.087 880s testing: mul_mat, [2, 1, 3, 1] = [3, 2, 3, 1] * [3, 1, 3, 1] 880s x0: [3, 2, 3, 1] 880s -0.642 -0.511 0.481 880s 0.402 0.368 0.579 880s 880s x1: [3, 1, 3, 1] 880s 0.364 0.344 -0.361 880s 880s y: [2, 1, 3, 1] 880s -0.583 0.063 880s test-mul-mat0: iter:166/500 880s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 4, 1, 1] 880s -0.479 -0.654 -0.840 0.888 880s 0.827 0.739 -0.624 -0.467 880s -0.205 0.900 -0.381 0.487 880s -0.557 0.029 0.904 -0.510 880s 880s x1: [4, 2, 1, 1] 880s 0.722 0.664 0.616 -0.424 880s -0.415 -0.529 -0.838 -0.055 880s 880s y: [4, 2, 1, 1] 880s -1.674 0.902 0.008 0.389 880s 1.200 -0.186 -0.098 -0.513 880s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 2, 1, 1] 880s -0.152 -0.447 0.253 -0.413 880s -0.071 0.786 0.382 -0.171 880s 880s x1: [4, 4, 1, 1] 880s 0.306 -0.138 -0.971 0.684 880s 0.887 -0.069 0.406 0.550 880s -0.453 0.982 -0.865 0.018 880s -0.856 0.080 -0.311 -0.335 880s 880s y: [2, 4, 1, 1] 880s -0.513 -0.618 880s -0.228 -0.056 880s -0.596 0.471 880s 0.154 0.062 880s testing: mul_mat, [4, 2, 1, 4] = [4, 4, 1, 4] * [4, 2, 1, 4] 880s x0: [4, 4, 1, 4] 880s -0.795 -0.409 0.434 0.967 880s 0.331 0.069 0.758 0.755 880s -0.410 -0.520 0.585 0.661 880s 0.749 0.398 -0.132 -0.965 880s 880s x1: [4, 2, 1, 4] 880s -0.595 -0.131 0.272 0.434 880s -0.227 0.762 -0.260 0.635 880s 880s y: [4, 2, 1, 4] 880s 1.064 0.328 0.758 -0.952 880s 0.369 0.259 -0.035 -0.445 880s testing: mul_mat, [4, 4, 1, 1] = [4, 4, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 4, 1, 1] 880s 0.773 0.097 -0.915 0.795 880s 0.926 0.577 0.091 -0.833 880s -0.892 -0.223 -0.656 0.632 880s -0.351 -0.148 -0.708 -0.145 880s 880s x1: [4, 4, 1, 1] 880s 0.341 0.060 0.482 0.543 880s 0.175 -0.103 -0.665 0.770 880s 0.897 -0.212 -0.886 -0.706 880s -0.890 -0.507 -0.954 -0.415 880s 880s y: [4, 4, 1, 1] 880s 0.259 -0.058 -0.290 -0.549 880s 1.346 -0.600 0.790 0.313 880s 0.922 1.215 -0.619 0.446 880s -0.194 -0.857 1.270 1.124 880s testing: mul_mat, [4, 4, 1, 1] = [4, 4, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 4, 1, 1] 880s -0.602 0.981 0.212 0.984 880s 0.204 -0.133 -0.648 0.723 880s -0.984 -0.940 -0.133 0.027 880s 0.975 0.073 0.379 0.159 880s 880s x1: [4, 4, 1, 1] 880s 0.649 0.175 -0.970 -0.454 880s 0.510 0.800 -0.558 -0.701 880s 0.914 -0.263 -0.592 -0.593 880s -0.217 -0.007 -0.710 -0.445 880s 880s y: [4, 4, 1, 1] 880s -0.871 0.410 -0.687 0.206 880s -0.330 -0.147 -1.199 0.233 880s -1.517 0.177 -0.590 0.554 880s -0.464 0.095 0.302 -0.552 880s testing: mul_mat, [4, 4, 1, 4] = [4, 4, 1, 4] * [4, 4, 1, 4] 880s x0: [4, 4, 1, 4] 880s 0.759 0.043 -0.505 0.110 880s -0.087 0.018 -0.200 0.511 880s 0.911 0.053 -0.538 0.329 880s 0.469 -0.293 0.260 0.956 880s 880s x1: [4, 4, 1, 4] 880s 0.753 0.573 -0.330 0.553 880s -0.985 -0.032 0.467 -0.248 880s 0.377 0.874 0.535 -0.630 880s -0.836 -0.910 0.288 -0.438 880s 880s y: [4, 4, 1, 4] 880s 0.824 0.293 1.075 0.628 880s -1.012 -0.135 -1.231 -0.568 880s -0.016 -0.446 -0.106 -0.543 880s -0.867 -0.225 -1.108 -0.470 880s test-mul-mat0: iter:167/500 880s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 1, 1, 1] 880s 0.827 0.005 880s 880s x1: [2, 3, 1, 1] 880s -0.908 0.494 880s -0.129 0.909 880s 0.905 -0.726 880s 880s y: [1, 3, 1, 1] 880s -0.748 880s -0.102 880s 0.745 880s testing: mul_mat, [3, 1, 3, 1] = [2, 3, 3, 1] * [2, 1, 3, 1] 880s x0: [2, 3, 3, 1] 880s -0.497 0.863 880s 0.277 -0.619 880s -0.320 0.011 880s 880s x1: [2, 1, 3, 1] 880s 0.720 0.583 880s 880s y: [3, 1, 3, 1] 880s 0.145 -0.162 -0.224 880s testing: mul_mat, [3, 3, 3, 2] = [2, 3, 3, 2] * [2, 3, 3, 2] 880s x0: [2, 3, 3, 2] 880s -0.351 -0.983 880s -0.901 -0.611 880s 0.302 0.621 880s 880s x1: [2, 3, 3, 2] 880s 0.180 -0.970 880s -0.582 -0.984 880s 0.857 0.423 880s 880s y: [3, 3, 3, 2] 880s 0.891 0.431 -0.548 880s 1.171 1.125 -0.787 880s -0.716 -1.030 0.521 880s testing: mul_mat, [3, 3, 1, 1] = [2, 3, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 3, 1, 1] 880s 0.640 -0.373 880s -0.496 -0.802 880s 0.055 -0.093 880s 880s x1: [2, 3, 1, 1] 880s 0.063 -0.084 880s -0.181 -0.932 880s -0.776 -0.921 880s 880s y: [3, 3, 1, 1] 880s 0.072 0.036 0.011 880s 0.232 0.838 0.076 880s -0.153 1.123 0.043 880s testing: mul_mat, [4, 2, 3, 1] = [3, 4, 3, 1] * [3, 2, 3, 1] 880s x0: [3, 4, 3, 1] 880s -0.444 0.725 -0.682 880s 0.646 0.264 0.046 880s -0.874 -0.581 -0.573 880s -0.221 -0.771 0.226 880s 880s x1: [3, 2, 3, 1] 880s 0.729 -0.662 0.326 880s -0.782 -0.804 -0.564 880s 880s y: [4, 2, 3, 1] 880s -1.026 0.311 -0.439 0.423 880s 0.149 -0.743 1.473 0.665 880s testing: mul_mat, [1, 3, 3, 2] = [4, 1, 3, 2] * [4, 3, 3, 2] 880s x0: [4, 1, 3, 2] 880s 0.633 0.223 0.371 -0.937 880s 880s x1: [4, 3, 3, 2] 880s 0.440 -0.142 0.455 0.758 880s 0.904 0.882 -0.016 -0.322 880s -0.352 0.536 0.670 0.610 880s 880s y: [1, 3, 3, 2] 880s -0.295 880s 1.064 880s -0.426 880s test-mul-mat0: iter:168/500 880s testing: mul_mat, [3, 1, 1, 1] = [4, 3, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 3, 1, 1] 880s -0.672 -0.999 -0.930 -0.230 880s -0.454 -0.855 0.621 0.086 880s -0.768 -0.788 -0.759 -0.080 880s 880s x1: [4, 1, 1, 1] 880s -0.696 -0.948 -0.043 0.937 880s 880s y: [3, 1, 1, 1] 880s 1.240 1.181 1.239 880s testing: mul_mat, [4, 3, 2, 1] = [4, 4, 2, 1] * [4, 3, 2, 1] 880s x0: [4, 4, 2, 1] 880s -0.731 0.289 0.215 -0.519 880s 0.530 -0.866 -0.599 0.818 880s -0.568 -0.297 0.761 -0.889 880s 0.052 -0.458 0.945 0.870 880s 880s x1: [4, 3, 2, 1] 880s 0.920 -0.712 -0.703 -0.698 880s 0.943 0.679 -0.650 -0.219 880s 0.834 -0.182 -0.721 0.415 880s 880s y: [4, 3, 2, 1] 880s -0.667 0.954 -0.225 -0.897 880s -0.519 0.122 -1.038 -1.067 880s -1.033 1.371 -1.338 -0.194 880s testing: mul_mat, [3, 4, 2, 3] = [4, 3, 2, 3] * [4, 4, 2, 3] 880s x0: [4, 3, 2, 3] 880s 0.037 -0.748 0.672 -0.492 880s -0.248 -0.907 0.546 -0.326 880s -0.047 0.383 -0.719 -0.395 880s 880s x1: [4, 4, 2, 3] 880s 0.883 -0.391 -0.763 0.413 880s -0.257 -0.362 0.230 0.174 880s 0.341 -0.009 0.285 -0.607 880s 0.533 0.230 -0.737 -0.647 880s 880s y: [3, 4, 2, 3] 880s -0.391 -0.415 0.194 880s 0.330 0.461 -0.361 880s 0.510 0.277 0.015 880s -0.329 -0.532 0.848 880s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 4, 1, 1] 880s 0.927 0.332 -0.503 0.810 880s 0.561 0.931 0.369 0.677 880s 0.839 -0.340 0.507 -0.882 880s -0.070 -0.910 -0.956 -0.344 880s 880s x1: [4, 3, 1, 1] 880s 0.532 -0.813 -0.498 0.368 880s 0.314 0.909 -0.138 -0.244 880s 0.950 -0.886 -0.130 -0.381 880s 880s y: [4, 3, 1, 1] 880s 0.773 -0.393 0.146 1.052 880s 0.466 0.807 0.099 -0.634 880s 0.343 -0.597 1.368 0.994 880s testing: mul_mat, [3, 4, 2, 1] = [4, 3, 2, 1] * [4, 4, 2, 1] 880s x0: [4, 3, 2, 1] 880s -0.780 0.777 -0.804 -0.271 880s -0.458 0.595 0.405 -0.915 880s -0.150 0.363 0.826 -0.091 880s 880s x1: [4, 4, 2, 1] 880s 0.553 -0.898 -0.811 0.739 880s -0.396 0.557 0.054 -0.487 880s -0.581 0.810 -0.537 -0.467 880s -0.320 0.082 0.716 -0.393 880s 880s y: [3, 4, 2, 1] 880s -0.677 -1.793 -1.147 880s 0.830 0.980 0.350 880s 1.641 0.958 -0.020 880s -0.156 0.845 0.705 880s testing: mul_mat, [3, 4, 2, 3] = [3, 3, 2, 3] * [3, 4, 2, 3] 880s x0: [3, 3, 2, 3] 880s 0.649 0.831 -0.141 880s 0.162 -0.366 -0.064 880s 0.321 0.202 0.849 880s 880s x1: [3, 4, 2, 3] 880s 0.492 0.367 -0.584 880s 0.636 0.399 0.219 880s 0.882 0.619 0.760 880s -0.269 0.396 0.355 880s 880s y: [3, 4, 2, 3] 880s 0.707 -0.017 -0.263 880s 0.713 -0.057 0.471 880s 0.980 -0.132 1.054 880s 0.105 -0.211 0.296 880s test-mul-mat0: iter:169/500 880s testing: mul_mat, [1, 1, 1, 1] = [2, 1, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 1, 1, 1] 880s -0.748 -0.605 880s 880s x1: [2, 1, 1, 1] 880s -0.796 0.416 880s 880s y: [1, 1, 1, 1] 880s 0.343 880s testing: mul_mat, [4, 1, 4, 1] = [2, 4, 4, 1] * [2, 1, 4, 1] 880s x0: [2, 4, 4, 1] 880s 0.010 -0.703 880s -0.177 0.657 880s -0.098 0.968 880s -0.383 0.807 880s 880s x1: [2, 1, 4, 1] 880s 0.893 0.621 880s 880s y: [4, 1, 4, 1] 880s -0.428 0.251 0.514 0.160 880s testing: mul_mat, [3, 4, 4, 2] = [2, 3, 4, 2] * [2, 4, 4, 2] 880s x0: [2, 3, 4, 2] 880s 0.910 -0.919 880s -0.161 -0.895 880s -0.509 -0.162 880s 880s x1: [2, 4, 4, 2] 880s -0.362 -0.589 880s 0.834 0.540 880s -0.621 -0.549 880s 0.347 -0.504 880s 880s y: [3, 4, 4, 2] 880s 0.212 0.586 0.279 880s 0.262 -0.618 -0.512 880s -0.061 0.591 0.405 880s 0.780 0.395 -0.095 880s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 1, 1, 1] 880s 0.634 0.054 880s 880s x1: [2, 3, 1, 1] 880s -1.000 -0.340 880s 0.377 0.732 880s -0.310 0.338 880s 880s y: [1, 3, 1, 1] 880s -0.652 880s 0.278 880s -0.179 880s testing: mul_mat, [4, 2, 4, 1] = [1, 4, 4, 1] * [1, 2, 4, 1] 880s x0: [1, 4, 4, 1] 880s 0.584 880s 0.847 880s 0.345 880s 0.125 880s 880s x1: [1, 2, 4, 1] 880s -0.165 880s -0.905 880s 880s y: [4, 2, 4, 1] 880s -0.097 -0.140 -0.057 -0.021 880s -0.529 -0.767 -0.313 -0.113 880s testing: mul_mat, [2, 1, 4, 2] = [4, 2, 4, 2] * [4, 1, 4, 2] 880s x0: [4, 2, 4, 2] 880s 0.191 0.231 -0.469 0.399 880s -0.292 0.380 -0.340 0.222 880s 880s x1: [4, 1, 4, 2] 880s -0.980 0.700 0.135 -0.011 880s 880s y: [2, 1, 4, 2] 880s -0.093 0.503 880s test-mul-mat0: iter:170/500 880s testing: mul_mat, [2, 2, 1, 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 2, 1, 1] 880s -0.631 -0.975 -0.982 -0.788 880s 0.351 -0.762 0.471 -0.036 880s 880s x1: [4, 2, 1, 1] 880s -0.695 -0.408 0.049 -0.187 880s -0.204 -0.480 0.104 0.147 880s 880s y: [2, 2, 1, 1] 880s 0.936 0.097 880s 0.379 0.338 880s testing: mul_mat, [1, 2, 4, 1] = [4, 1, 4, 1] * [4, 2, 4, 1] 880s x0: [4, 1, 4, 1] 880s -0.230 -0.304 0.775 -0.342 880s 880s x1: [4, 2, 4, 1] 880s 0.889 0.332 0.124 -0.329 880s 0.991 0.888 -0.697 0.344 880s 880s y: [1, 2, 4, 1] 880s -0.096 880s -1.155 880s testing: mul_mat, [4, 1, 4, 2] = [4, 4, 4, 2] * [4, 1, 4, 2] 880s x0: [4, 4, 4, 2] 880s 0.845 -0.257 0.655 -0.929 880s -0.830 0.275 0.277 -0.722 880s 0.031 -0.064 -0.967 -0.187 880s -0.621 -0.443 0.291 0.596 880s 880s x1: [4, 1, 4, 2] 880s -0.212 0.839 0.457 -0.905 880s 880s y: [4, 1, 4, 2] 880s 0.746 1.187 -0.334 -0.647 880s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 2, 1, 1] 880s -0.064 0.503 -0.474 -0.497 880s -0.024 0.746 -1.000 -0.447 880s 880s x1: [4, 4, 1, 1] 880s -0.988 -0.177 0.712 0.347 880s 0.643 -0.633 -0.139 0.707 880s 0.026 -0.187 0.439 0.105 880s 0.200 -0.965 0.954 -0.952 880s 880s y: [2, 4, 1, 1] 880s -0.536 -0.975 880s -0.645 -0.665 880s -0.356 -0.626 880s -0.479 -1.254 880s testing: mul_mat, [3, 4, 4, 1] = [2, 3, 4, 1] * [2, 4, 4, 1] 880s x0: [2, 3, 4, 1] 880s -0.323 0.987 880s 0.765 -0.949 880s -0.849 -0.225 880s 880s x1: [2, 4, 4, 1] 880s 0.116 0.690 880s -0.932 -0.726 880s 0.801 0.310 880s -0.714 -0.376 880s 880s y: [3, 4, 4, 1] 880s 0.643 -0.565 -0.253 880s -0.416 -0.024 0.954 880s 0.048 0.319 -0.750 880s -0.141 -0.190 0.690 880s testing: mul_mat, [1, 2, 4, 2] = [3, 1, 4, 2] * [3, 2, 4, 2] 880s x0: [3, 1, 4, 2] 880s -0.612 0.341 0.945 880s 880s x1: [3, 2, 4, 2] 880s -0.622 -0.866 0.502 880s 0.038 -0.257 -0.007 880s 880s y: [1, 2, 4, 2] 880s 0.560 880s -0.118 880s test-mul-mat0: iter:171/500 880s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 4, 1, 1] 880s -0.699 -0.744 880s -0.103 -0.607 880s -0.863 0.980 880s 0.178 0.455 880s 880s x1: [2, 2, 1, 1] 880s -0.327 -0.751 880s -0.644 0.061 880s 880s y: [4, 2, 1, 1] 880s 0.787 0.489 -0.454 -0.400 880s 0.404 0.029 0.616 -0.086 880s testing: mul_mat, [2, 4, 2, 1] = [2, 2, 2, 1] * [2, 4, 2, 1] 880s x0: [2, 2, 2, 1] 880s 0.124 -0.612 880s -0.103 0.480 880s 880s x1: [2, 4, 2, 1] 880s 0.059 -0.268 880s 0.337 -0.487 880s 0.189 0.284 880s 0.380 0.293 880s 880s y: [2, 4, 2, 1] 880s 0.172 -0.135 880s 0.340 -0.268 880s -0.150 0.117 880s -0.132 0.102 880s testing: mul_mat, [2, 2, 2, 2] = [2, 2, 2, 2] * [2, 2, 2, 2] 880s x0: [2, 2, 2, 2] 880s 0.319 -0.584 880s 0.139 0.878 880s 880s x1: [2, 2, 2, 2] 880s 0.384 -0.825 880s 0.842 0.364 880s 880s y: [2, 2, 2, 2] 880s 0.604 -0.671 880s 0.057 0.436 880s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.875 0.280 880s 0.448 0.148 880s 880s x1: [2, 2, 1, 1] 880s 0.516 0.920 880s 0.791 -0.186 880s 880s y: [2, 2, 1, 1] 880s 0.709 0.367 880s 0.640 0.327 880s testing: mul_mat, [4, 2, 2, 1] = [2, 4, 2, 1] * [2, 2, 2, 1] 880s x0: [2, 4, 2, 1] 880s -0.228 -0.974 880s 0.507 -0.647 880s 0.668 -0.744 880s 0.774 0.994 880s 880s x1: [2, 2, 2, 1] 880s 0.367 0.597 880s 0.713 0.575 880s 880s y: [4, 2, 2, 1] 880s -0.665 -0.200 -0.199 0.877 880s -0.723 -0.011 0.049 1.124 880s testing: mul_mat, [1, 2, 2, 2] = [4, 1, 2, 2] * [4, 2, 2, 2] 880s x0: [4, 1, 2, 2] 880s -0.064 0.379 0.702 -0.292 880s 880s x1: [4, 2, 2, 2] 880s 0.998 -0.656 -0.376 0.446 880s 0.624 0.772 -0.187 0.221 880s 880s y: [1, 2, 2, 2] 880s -0.707 880s 0.057 880s test-mul-mat0: iter:172/500 880s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 4, 1, 1] 880s 0.821 0.997 -0.416 880s 0.556 -0.691 -0.482 880s 0.633 0.245 0.897 880s 0.335 0.952 -0.911 880s 880s x1: [3, 1, 1, 1] 880s 0.896 -0.912 -0.565 880s 880s y: [4, 1, 1, 1] 880s 0.062 1.401 -0.163 -0.054 880s testing: mul_mat, [4, 4, 2, 1] = [3, 4, 2, 1] * [3, 4, 2, 1] 880s x0: [3, 4, 2, 1] 880s -0.071 -0.992 -0.902 880s -0.736 0.960 -0.813 880s -0.478 -0.895 -0.145 880s 0.882 -0.330 0.385 880s 880s x1: [3, 4, 2, 1] 880s -0.743 -0.855 -0.332 880s 0.360 -0.435 -0.470 880s -0.980 0.093 -0.553 880s 0.742 0.398 -0.808 880s 880s y: [4, 4, 2, 1] 880s 1.200 -0.004 1.169 -0.500 880s 0.830 -0.300 0.285 0.280 880s 0.476 1.261 0.466 -1.108 880s 0.282 0.493 -0.594 0.212 880s testing: mul_mat, [3, 4, 2, 1] = [3, 3, 2, 1] * [3, 4, 2, 1] 880s x0: [3, 3, 2, 1] 880s 0.289 -0.622 -0.940 880s 0.938 0.290 -0.437 880s 0.687 0.942 0.521 880s 880s x1: [3, 4, 2, 1] 880s 0.749 -0.348 -0.043 880s 0.432 0.949 0.720 880s 0.633 -0.122 0.728 880s 0.731 0.143 0.688 880s 880s y: [3, 4, 2, 1] 880s 0.472 0.621 0.165 880s -1.143 0.366 1.566 880s -0.426 0.241 0.699 880s -0.524 0.427 0.995 880s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.658 0.334 -0.099 880s -0.477 -0.053 0.394 880s 880s x1: [3, 3, 1, 1] 880s -0.599 0.576 -0.550 880s 0.174 0.121 -0.088 880s 0.332 -0.432 0.137 880s 880s y: [2, 3, 1, 1] 880s -0.147 0.038 880s 0.164 -0.124 880s 0.060 -0.082 880s testing: mul_mat, [3, 3, 2, 1] = [2, 3, 2, 1] * [2, 3, 2, 1] 880s x0: [2, 3, 2, 1] 880s -0.295 -0.719 880s -0.655 -0.206 880s 0.330 -0.496 880s 880s x1: [2, 3, 2, 1] 880s -0.114 -0.809 880s 0.957 -0.427 880s -0.867 0.478 880s 880s y: [3, 3, 2, 1] 880s 0.616 0.242 0.364 880s 0.025 -0.539 0.528 880s -0.088 0.470 -0.523 880s testing: mul_mat, [3, 2, 2, 1] = [3, 3, 2, 1] * [3, 2, 2, 1] 880s x0: [3, 3, 2, 1] 880s -0.387 0.406 -0.889 880s -0.962 0.726 -0.929 880s -0.829 0.099 -0.571 880s 880s x1: [3, 2, 2, 1] 880s 0.629 0.366 -0.509 880s -0.424 -0.734 0.885 880s 880s y: [3, 2, 2, 1] 880s 0.358 0.134 -0.194 880s -0.921 -0.947 -0.226 880s test-mul-mat0: iter:173/500 880s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 3, 1, 1] 880s 0.502 -0.857 880s 0.464 -0.908 880s 0.463 0.190 880s 880s x1: [2, 1, 1, 1] 880s 0.045 -0.699 880s 880s y: [3, 1, 1, 1] 880s 0.622 0.656 -0.111 880s testing: mul_mat, [3, 3, 2, 1] = [2, 3, 2, 1] * [2, 3, 2, 1] 880s x0: [2, 3, 2, 1] 880s -0.266 -0.410 880s -0.828 -0.853 880s -0.485 0.564 880s 880s x1: [2, 3, 2, 1] 880s -0.295 0.501 880s 0.360 -0.888 880s 0.228 -0.540 880s 880s y: [3, 3, 2, 1] 880s -0.127 -0.184 0.426 880s 0.268 0.459 -0.676 880s 0.161 0.272 -0.415 880s testing: mul_mat, [1, 3, 2, 4] = [2, 1, 2, 4] * [2, 3, 2, 4] 880s x0: [2, 1, 2, 4] 880s 0.962 0.197 880s 880s x1: [2, 3, 2, 4] 880s 0.474 0.376 880s -0.575 0.566 880s -0.161 0.615 880s 880s y: [1, 3, 2, 4] 880s 0.531 880s -0.442 880s -0.034 880s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.006 -0.054 880s -0.245 0.510 880s 880s x1: [2, 1, 1, 1] 880s -0.116 -0.215 880s 880s y: [2, 1, 1, 1] 880s 0.011 -0.082 880s testing: mul_mat, [4, 2, 2, 1] = [2, 4, 2, 1] * [2, 2, 2, 1] 880s x0: [2, 4, 2, 1] 880s 0.331 -0.053 880s -0.818 0.810 880s -0.319 -0.618 880s -0.562 0.895 880s 880s x1: [2, 2, 2, 1] 880s -0.706 0.255 880s -0.378 -0.301 880s 880s y: [4, 2, 2, 1] 880s -0.247 0.784 0.067 0.625 880s -0.109 0.066 0.307 -0.057 880s testing: mul_mat, [4, 2, 2, 4] = [4, 4, 2, 4] * [4, 2, 2, 4] 880s x0: [4, 4, 2, 4] 880s 0.945 0.364 0.624 -0.391 880s -0.694 -0.928 0.359 0.055 880s 0.667 0.520 -0.686 0.600 880s 0.759 -0.523 0.240 0.706 880s 880s x1: [4, 2, 2, 4] 880s -0.803 0.966 -0.941 -0.047 880s -0.088 0.569 0.247 -0.833 880s 880s y: [4, 2, 2, 4] 880s -0.976 -0.680 0.584 -1.374 880s 0.604 -0.424 -0.432 -0.894 880s test-mul-mat0: iter:174/500 880s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 4, 1, 1] 880s 0.185 0.171 880s -0.014 -0.749 880s 0.621 0.891 880s -0.130 0.051 880s 880s x1: [2, 1, 1, 1] 880s -0.252 -0.710 880s 880s y: [4, 1, 1, 1] 880s -0.168 0.535 -0.789 -0.004 880s testing: mul_mat, [1, 4, 4, 1] = [2, 1, 4, 1] * [2, 4, 4, 1] 880s x0: [2, 1, 4, 1] 880s 0.832 0.908 880s 880s x1: [2, 4, 4, 1] 880s -0.300 0.788 880s -0.410 0.002 880s -0.003 0.315 880s -0.517 0.177 880s 880s y: [1, 4, 4, 1] 880s 0.465 880s -0.340 880s 0.283 880s -0.269 880s testing: mul_mat, [2, 1, 4, 2] = [2, 2, 4, 2] * [2, 1, 4, 2] 880s x0: [2, 2, 4, 2] 880s 0.534 0.387 880s -0.922 -0.224 880s 880s x1: [2, 1, 4, 2] 880s -0.769 -0.961 880s 880s y: [2, 1, 4, 2] 880s -0.782 0.924 880s testing: mul_mat, [1, 2, 1, 1] = [2, 1, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 1, 1, 1] 880s 0.720 -0.058 880s 880s x1: [2, 2, 1, 1] 880s -0.553 0.524 880s 0.112 -0.260 880s 880s y: [1, 2, 1, 1] 880s -0.429 880s 0.096 880s testing: mul_mat, [2, 2, 4, 1] = [1, 2, 4, 1] * [1, 2, 4, 1] 880s x0: [1, 2, 4, 1] 880s -0.498 880s 0.034 880s 880s x1: [1, 2, 4, 1] 880s 0.530 880s 0.476 880s 880s y: [2, 2, 4, 1] 880s -0.264 0.018 880s -0.237 0.016 880s testing: mul_mat, [3, 1, 4, 2] = [2, 3, 4, 2] * [2, 1, 4, 2] 880s x0: [2, 3, 4, 2] 880s 0.494 0.458 880s -0.785 0.641 880s -0.582 -0.707 880s 880s x1: [2, 1, 4, 2] 880s 0.337 -0.129 880s 880s y: [3, 1, 4, 2] 880s 0.107 -0.348 -0.105 880s test-mul-mat0: iter:175/500 880s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.384 0.373 880s 0.167 0.231 880s 880s x1: [2, 3, 1, 1] 880s 0.776 -0.459 880s -0.824 0.003 880s 0.032 -0.617 880s 880s y: [2, 3, 1, 1] 880s 0.127 0.024 880s -0.316 -0.137 880s -0.218 -0.137 880s testing: mul_mat, [4, 2, 2, 1] = [2, 4, 2, 1] * [2, 2, 2, 1] 880s x0:ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s [2, 4, 2, 1] 880s 0.508 0.560 880s 0.136 0.387 880s -0.310 0.071 880s 0.391 -0.345 880s 880s x1: [2, 2, 2, 1] 880s -0.423 -0.536 880s -0.465 0.423 880s 880s y: [4, 2, 2, 1] 880s -0.515 -0.265 0.093 0.019 880s 0.001 0.101 0.174 -0.328 880s testing: mul_mat, [1, 4, 2, 2] = [2, 1, 2, 2] * [2, 4, 2, 2] 880s x0: [2, 1, 2, 2] 880s 0.261 0.920 880s 880s x1: [2, 4, 2, 2] 880s -0.509 0.805 880s -0.269 0.864 880s -0.028 0.962 880s -0.559 0.437 880s 880s y: [1, 4, 2, 2] 880s 0.608 880s 0.725 880s 0.878 880s 0.256 880s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 3, 1, 1] 880s -0.402 -0.223 880s 0.311 0.745 880s 0.352 0.056 880s 880s x1: [2, 1, 1, 1] 880s 0.979 -0.986 880s 880s y: [3, 1, 1, 1] 880s -0.174 -0.429 0.289 880s testing: mul_mat, [3, 2, 2, 1] = [3, 3, 2, 1] * [3, 2, 2, 1] 880s x0: [3, 3, 2, 1] 880s -0.399 0.652 -0.860 880s 0.992 0.215 -0.067 880s -0.828 -0.467 -0.242 880s 880s x1: [3, 2, 2, 1] 880s 0.647 0.249 -0.850 880s 0.171 0.492 0.018 880s 880s y: [3, 2, 2, 1] 880s 0.636 0.752 -0.447 880s 0.237 0.274 -0.376 880s testing: mul_mat, [1, 3, 2, 2] = [3, 1, 2, 2] * [3, 3, 2, 2] 880s x0: [3, 1, 2, 2] 880s -0.370 -0.357 0.275 880s 880s x1: [3, 3, 2, 2] 880s 0.140 0.253 -0.650 880s -0.688 -0.255 0.369 880s -0.059 -0.006 0.570 880s 880s y: [1, 3, 2, 2] 880s -0.321 880s 0.447 880s 0.181 880s test-mul-mat0: iter:176/500 880s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 1, 1, 1] 880s -0.659 0.412 880s 880s x1: [2, 4, 1, 1] 880s 0.982 0.209 880s -0.710 -0.274 880s -0.143 -0.924 880s -0.037 -0.492 880s 880s y: [1, 4, 1, 1] 880s -0.561 880s 0.355 880s -0.286 880s -0.179 880s testing: mul_mat, [1, 1, 3, 1] = [2, 1, 3, 1] * [2, 1, 3, 1] 880s x0: [2, 1, 3, 1] 880s -0.575 0.864 880s 880s x1: [2, 1, 3, 1] 880s -0.601 -0.970 880s 880s y: [1, 1, 3, 1] 880s -0.493 880s testing: mul_mat, [3, 1, 3, 4] = [2, 3, 3, 4] * [2, 1, 3, 4] 880s x0: [2, 3, 3, 4] 880s 0.271 -0.361 880s 0.999 -0.153 880s -0.220 -0.176 880s 880s x1: [2, 1, 3, 4] 880s -0.715 -0.813 880s 880s y: [3, 1, 3, 4] 880s 0.100 -0.590 0.300 880s testing: mul_mat, [3, 3, 1, 1] = [2, 3, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 3, 1, 1] 880s 0.868 -0.116 880s -0.125 0.658 880s -0.241 -0.657 880s 880s x1: [2, 3, 1, 1] 880s -0.948 -0.868 880s -0.148 0.598 880s 0.816 -0.673 880s 880s y: [3, 3, 1, 1] 880s -0.722 -0.452 0.798 880s -0.198 0.412 -0.357 880s 0.787 -0.544 0.245 880s testing: mul_mat, [2, 2, 3, 1] = [3, 2, 3, 1] * [3, 2, 3, 1] 880s x0: [3, 2, 3, 1] 880s 0.102 0.971 -0.868 880s 0.805 -0.082 0.200 880s 880s x1: [3, 2, 3, 1] 880s -0.129 -0.764 -0.834 880s -0.100 -0.725 0.073 880s 880s y: [2, 2, 3, 1] 880s -0.031 -0.208 880s -0.777 -0.007 880s testing: mul_mat, [1, 3, 3, 4] = [2, 1, 3, 4] * [2, 3, 3, 4] 880s x0: [2, 1, 3, 4] 880s 0.601 -0.051 880s 880s x1: [2, 3, 3, 4] 880s 0.185 0.995 880s 0.878 0.230 880s 0.861 -0.321 880s 880s y: [1, 3, 3, 4] 880s 0.060 880s 0.515 880s 0.534 880s test-mul-mat0: iter:177/500 880s testing: mul_mat, [1, 4, 1, 1] = [4, 1, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 1, 1, 1] 880s 0.395 -0.244 0.073 0.843 880s 880s x1: [4, 4, 1, 1] 880s -0.669 0.549 -0.353 0.932 880s -0.502 0.172 -0.413 -0.684 880s -0.091 0.465 -0.319 0.737 880s -0.258 0.225 0.203 0.259 880s 880s y: [1, 4, 1, 1] 880s 0.362 880s -0.847 880s 0.448 880s 0.076 880s testing: mul_mat, [1, 1, 4, 1] = [4, 1, 4, 1] * [4, 1, 4, 1] 880s x0: [4, 1, 4, 1] 880s -0.058 0.112 0.740 0.623 880s 880s x1: [4, 1, 4, 1] 880s -0.849 -0.272 0.259 -0.645 880s 880s y: [1, 1, 4, 1] 880s -0.192 880s testing: mul_mat, [3, 1, 4, 3] = [4, 3, 4, 3] * [4, 1, 4, 3] 880s x0: [4, 3, 4, 3] 880s 0.660 0.068 0.327 -0.459 880s 0.844 0.646 -0.842 -0.933 880s -0.537 0.583 0.580 0.927 880s 880s x1: [4, 1, 4, 3] 880s -0.451 0.004 0.214 -0.866 880s 880s y: [3, 1, 4, 3] 880s 0.169 0.248 -0.434 880s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 4, 1, 1] 880s -0.676 -0.229 0.120 0.516 880s 0.014 0.722 -0.949 0.342 880s -0.233 -0.317 0.782 0.894 880s -0.614 -0.098 -0.802 -0.690 880s 880s x1: [4, 3, 1, 1] 880s -0.296 0.807 -0.956 0.403 880s -0.922 0.384 -0.583 -0.368 880s 0.727 -0.063 0.461 -0.914 880s 880s y: [4, 3, 1, 1] 880s 0.108 1.624 -0.574 0.591 880s 0.275 0.691 -0.692 1.250 880s -0.894 -0.786 -0.606 -0.179 880s testing: mul_mat, [4, 4, 4, 1] = [4, 4, 4, 1] * [4, 4, 4, 1] 880s x0: [4, 4, 4, 1] 880s -0.936 0.368 -0.455 -0.964 880s -0.251 0.663 0.298 0.007 880s -0.449 -0.538 -0.053 0.047 880s 0.204 -0.261 -0.916 0.063 880s 880s x1: [4, 4, 4, 1] 880s 0.600 0.597 0.014 0.407 880s 0.640 -0.584 0.485 0.024 880s -0.167 -0.883 -0.249 0.770 880s 0.578 -0.163 -0.343 0.902 880s 880s y: [4, 4, 4, 1] 880s -0.741 0.251 -0.572 -0.020 880s -1.059 -0.403 0.002 -0.160 880s -0.798 -0.612 0.599 0.473 880s -1.315 -0.350 -0.112 0.532 880s testing: mul_mat, [4, 4, 4, 3] = [4, 4, 4, 3] * [4, 4, 4, 3] 880s x0: [4, 4, 4, 3] 880s -0.139 -0.944 0.920 0.210 880s 0.581 0.069 -0.056 0.876 880s 0.618 -0.045 -0.482 0.093 880s 0.233 -0.310 -0.180 -0.497 880s 880s x1: [4, 4, 4, 3] 880s -0.848 0.335 0.863 -0.308 880s 0.039 0.438 0.777 -0.960 880s 0.961 -0.054 0.077 0.491 880s 0.162 0.178 -0.291 -0.467 880s 880s y: [4, 4, 4, 3] 880s 0.531 -0.787 -0.983 -0.304 880s 0.095 -0.832 -0.460 0.210 880s 0.091 0.980 0.604 -0.018 880s -0.557 -0.286 0.189 0.267 880s test-mul-mat0: iter:178/500 880s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 2, 1, 1] 880s -0.597 -0.982 0.508 0.585 880s -0.756 0.107 0.028 0.764 880s 880s x1: [4, 1, 1, 1] 880s -0.827 0.929 -0.990 -0.336 880s 880s y: [2, 1, 1, 1] 880s -1.118 0.440 880s testing: mul_mat, [3, 2, 4, 1] = [4, 3, 4, 1] * [4, 2, 4, 1] 880s x0: [4, 3, 4, 1] 880s -0.646 0.712 0.052 0.782 880s 0.108 -0.905 0.607 -0.847 880s -0.017 0.100 0.842 -0.144 880s 880s x1: [4, 2, 4, 1] 880s 0.894 -0.683 0.530 0.800 880s -0.923 0.428 0.396 0.043 880s 880s y: [3, 2, 4, 1] 880s -0.411 0.358 0.248 880s 0.955 -0.284 0.385 880s testing: mul_mat, [1, 3, 4, 2] = [4, 1, 4, 2] * [4, 3, 4, 2] 880s x0: [4, 1, 4, 2] 880s -0.357 0.278 -0.645 -0.446 880s 880s x1: [4, 3, 4, 2] 880s -0.102 0.118 -0.200 0.562 880s -0.588 -0.647 0.138 0.336 880s -0.033 -0.268 -0.083 -0.595 880s 880s y: [1, 3, 4, 2] 880s -0.053 880s -0.209 880s 0.256 880s testing: mul_mat, [3, 1, 1, 1] = [4, 3, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 3, 1, 1] 880s -0.770 0.512 -0.832 -0.318 880s -0.617 0.252 0.347 -0.112 880s 0.682 0.639 0.440 0.803 880s 880s x1: [4, 1, 1, 1] 880s 0.027 0.260 -0.378 0.473 880s 880s y: [3, 1, 1, 1] 880s 0.277 -0.135 0.398 880s testing: mul_mat, [2, 4, 4, 1] = [3, 2, 4, 1] * [3, 4, 4, 1] 880s x0: [3, 2, 4, 1] 880s -0.302 -0.555 0.781 880s 0.825 -0.645 0.198 880s 880s x1: [3, 4, 4, 1] 880s 0.009 0.333 -0.972 880s -0.882 0.632 -0.876 880s 0.627 -0.902 -0.130 880s -0.303 0.524 -0.381 880s 880s y: [2, 4, 4, 1] 880s -0.947 -0.400 880s -0.769 -1.309 880s 0.210 1.073 880s -0.497 -0.663 880s testing: mul_mat, [3, 3, 4, 2] = [2, 3, 4, 2] * [2, 3, 4, 2] 880s x0: [2, 3, 4, 2] 880s 0.707 -0.761 880s 0.069 0.436 880s 0.627 0.575 880s 880s x1: [2, 3, 4, 2] 880s 0.926 -0.866 880s 0.361 0.752 880s -0.144 -0.258 880s 880s y: [3, 3, 4, 2] 880s 1.314 -0.314 0.083 880s -0.317 0.353 0.658 880s 0.094 -0.123 -0.239 880s test-mul-mat0: iter:179/500 880s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 1, 1, 1] 880s -0.270 -0.303 0.926 -0.082 880s 880s x1: [4, 3, 1, 1] 880s 0.413 0.229 -0.892 0.128 880s -0.647 0.215 -0.983 0.261 880s -0.483 0.044 0.876 -0.111 880s 880s y: [1, 3, 1, 1] 880s -1.017 880s -0.823 880s 0.938 880s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 2, 1, 1] 880s -0.664 0.180 -0.054 -0.266 880s -0.730 -0.968 0.353 -0.476 880s 880s x1: [4, 1, 1, 1] 880s -0.505 0.695 -0.848 -0.420 880s 880s y: [2, 1, 1, 1] 880s 0.618 -0.403 880s testing: mul_mat, [4, 2, 1, 2] = [4, 4, 1, 2] * [4, 2, 1, 2] 880s x0: [4, 4, 1, 2] 880s -0.107 0.205 -0.727 0.045 880s 0.785 0.516 0.381 -0.035 880s -0.538 -0.885 0.235 -0.506 880s 0.467 0.760 0.778 -0.767 880s 880s x1: [4, 2, 1, 2] 880s 0.284 -0.235 0.754 0.392 880s 0.893 -0.894 -0.393 0.910 880s 880s y: [4, 2, 1, 2] 880s -0.609 0.375 0.034 0.240 880s 0.048 0.058 -0.242 -1.266 880s testing: mul_mat, [1, 4, 1, 1] = [4, 1, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 1, 1, 1] 880s 0.464 0.707 0.687 0.499 880s 880s x1: [4, 4, 1, 1] 880s -0.171 -0.509 -0.857 -0.386 880s -0.993 0.524 0.579 -0.531 880s 0.638 -0.186 -0.037 0.106 880s -0.426 -0.259 0.338 -0.913 880s 880s y: [1, 4, 1, 1] 880s -1.221 880s 0.043 880s 0.192 880s -0.604 880s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 2, 1, 1] 880s -0.872 880s 0.131 880s 880s x1: [1, 4, 1, 1] 880s -0.305 880s -0.413 880s -0.512 880s -0.074 880s 880s y: [2, 4, 1, 1] 880s 0.266 -0.040 880s 0.360 -0.054 880s 0.447 -0.067 880s 0.065 -0.010 880s testing: mul_mat, [4, 1, 1, 2] = [2, 4, 1, 2] * [2, 1, 1, 2] 880s x0: [2, 4, 1, 2] 880s -0.628 -0.854 880s -0.433 0.312 880s -0.157 0.534 880s 0.896 -0.040 880s 880s x1: [2, 1, 1, 2] 880s 0.124 -0.654 880s 880s y: [4, 1, 1, 2] 880s 0.481 -0.258 -0.369 0.137 880s test-mul-mat0: iter:180/500 880s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.902 -0.690 880s 0.779 -0.261 880s 880s x1: [2, 3, 1, 1] 880s 0.201 0.429 880s -0.050 -0.298 880s 0.557 -0.919 880s 880s y: [2, 3, 1, 1] 880s -0.114 0.045 880s 0.160 0.039 880s 1.136 0.673 880s testing: mul_mat, [2, 2, 2, 1] = [2, 2, 2, 1] * [2, 2, 2, 1] 880s x0: [2, 2, 2, 1] 880s -0.247 -0.014 880s -0.814 -0.269 880s 880s x1: [2, 2, 2, 1] 880s -0.318 -0.654 880s 0.582 -0.421 880s 880s y: [2, 2, 2, 1] 880s 0.088 0.434 880s -0.138 -0.360 880s testing: mul_mat, [4, 2, 2, 2] = [2, 4, 2, 2] * [2, 2, 2, 2] 880s x0: [2, 4, 2, 2] 880s -0.438 -0.928 880s 0.216 -0.325 880s 0.524 -0.614 880s 0.428 -0.490 880s 880s x1: [2, 2, 2, 2] 880s 0.728 0.114 880s 0.960 -0.071 880s 880s y: [4, 2, 2, 2] 880s -0.425 0.120 0.311 0.255 880s -0.355 0.231 0.547 0.446 880s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 4, 1, 1] 880s 0.468 -0.574 880s 0.013 -0.199 880s 0.355 0.536 880s -0.478 -0.231 880s 880s x1: [2, 4, 1, 1] 880s -0.683 -0.632 880s 0.435 0.841 880s -0.246 -0.137 880s -0.649 0.327 880s 880s y: [4, 4, 1, 1] 880s 0.043 0.118 -0.581 0.473 880s -0.279 -0.162 0.605 -0.403 880s -0.037 0.024 -0.160 0.149 880s -0.491 -0.073 -0.055 0.235 880s testing: mul_mat, [3, 2, 2, 1] = [4, 3, 2, 1] * [4, 2, 2, 1] 880s x0: [4, 3, 2, 1] 880s 0.240 0.590 0.058 0.580 880s -0.932 0.394 -0.593 -0.167 880s 0.849 -0.128 -0.774 0.027 880s 880s x1: [4, 2, 2, 1] 880s -0.211 0.460 -0.258 -0.123 880s -0.625 -0.829 -0.035 -0.409 880s 880s y: [3, 2, 2, 1] 880s 0.135 0.552 -0.042 880s -0.878 0.344 -0.408 880s testing: mul_mat, [3, 4, 2, 2] = [3, 3, 2, 2] * [3, 4, 2, 2] 880s x0: [3, 3, 2, 2] 880s 0.536 -0.806 0.544 880s 0.650 -0.359 -0.416 880s 0.536 0.896 -0.230 880s 880s x1: [3, 4, 2, 2] 880s -0.668 -0.568 -0.602 880s 0.883 0.831 0.712 880s -0.131 0.070 0.780 880s -0.282 -0.339 0.174 880s 880s y: [3, 4, 2, 2] 880s -0.227 0.020 -0.729 880s 0.190 -0.020 1.055 880s 0.297 -0.435 -0.186 880s 0.218 -0.134 -0.495 880s test-mul-mat0: iter:181/500 880s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 1, 1, 1] 880s 0.512 0.934 -0.838 880s 880s x1: [3, 2, 1, 1] 880s -0.160 -0.962 -0.861 880s 0.852 0.925 0.793 880s 880s y: [1, 2, 1, 1] 880s -0.259 880s 0.636 880s testing: mul_mat, [4, 1, 3, 1] = [3, 4, 3, 1] * [3, 1, 3, 1] 880s x0: [3, 4, 3, 1] 880s -0.492 0.627 0.000 880s 0.252 -0.896 0.745 880s -0.362 -0.523 -0.967 880s -0.393 0.620 -0.126 880s 880s x1: [3, 1, 3, 1] 880s 0.668 0.635 -0.048 880s 880s y: [4, 1, 3, 1] 880s 0.069 -0.436 -0.528 0.138 880s testing: mul_mat, [4, 4, 3, 3] = [3, 4, 3, 3] * [3, 4, 3, 3] 880s x0: [3, 4, 3, 3] 880s -0.691 0.150 0.395 880s 0.864 0.751 -0.827 880s -0.611 0.013 0.450 880s 0.937 0.565 -0.550 880s 880s x1: [3, 4, 3, 3] 880s 0.493 -0.511 0.722 880s 0.526 0.096 0.342 880s -0.600 0.742 0.102 880s 0.126 -0.688 -0.346 880s 880s y: [4, 4, 3, 3] 880s -0.132 -0.555 0.017 -0.224 880s -0.214 0.244 -0.166 0.359 880s 0.566 -0.046 0.422 -0.199 880s -0.327 -0.122 -0.241 -0.080 880s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.745 0.964 0.921 880s -0.698 -0.621 -0.945 880s -0.699 -0.367 0.020 880s 880s x1: [3, 4, 1, 1] 880s -0.493 0.845 0.958 880s 0.865 -0.068 -0.839 880s 0.722 -0.754 0.633 880s 0.974 -0.770 0.059 880s 880s y: [3, 4, 1, 1] 880s 1.329 -1.086 0.054 880s -0.194 0.231 -0.597 880s 0.395 -0.634 -0.216 880s 0.038 -0.257 -0.398 880s testing: mul_mat, [1, 3, 3, 1] = [3, 1, 3, 1] * [3, 3, 3, 1] 880s x0: [3, 1, 3, 1] 880s 0.493 1.000 0.577 880s 880s x1: [3, 3, 3, 1] 880s 0.042 0.110 0.912 880s -0.312 -0.115 -0.791 880s -0.470 -0.390 -0.547 880s 880s y: [1, 3, 3, 1] 880s 0.656 880s -0.725 880s -0.937 880s testing: mul_mat, [3, 3, 3, 3] = [1, 3, 3, 3] * [1, 3, 3, 3] 880s x0: [1, 3, 3, 3] 880s -0.626 880s 0.214 880s 0.782 880s 880s x1: [1, 3, 3, 3] 880s 0.753 880s -0.395 880s -0.850 880s 880s y: [3, 3, 3, 3] 880s -0.471 0.161 0.589 880s 0.247 -0.084 -0.309 880s 0.532 -0.181 -0.665 880s test-mul-mat0: iter:182/500 880s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 2, 1, 1] 880s -0.738 0.214 0.079 -0.244 880s 0.823 -0.595 -0.618 -0.249 880s 880s x1: [4, 1, 1, 1] 880s -0.830 -0.618 -0.671 -0.703 880s 880s y: [2, 1, 1, 1] 880s 0.600 0.274 880s testing: mul_mat, [2, 2, 4, 1] = [4, 2, 4, 1] * [4, 2, 4, 1] 880s x0: [4, 2, 4, 1] 880s 0.380 -0.359 -0.316 -0.641 880s -0.294 0.324 -0.815 -0.203 880s 880s x1: [4, 2, 4, 1] 880s 0.812 0.346 -0.405 0.163 880s -0.090 -0.021 -0.935 -0.360 880s 880s y: [2, 2, 4, 1] 880s 0.208 0.171 880s 0.500 0.855 880s testing: mul_mat, [1, 2, 4, 2] = [4, 1, 4, 2] * [4, 2, 4, 2] 880s x0: [4, 1, 4, 2] 880s -0.264 -0.697 -0.970 0.587 880s 880s x1: [4, 2, 4, 2] 880s 0.036 0.983 -0.464 0.741 880s 0.307 -0.280 -0.461 0.110 880s 880s y: [1, 2, 4, 2] 880s 0.191 880s 0.626 880s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 4, 1, 1] 880s 0.136 -0.748 -0.304 0.816 880s -0.355 0.058 -0.608 -0.914 880s 0.992 -0.764 -0.626 0.442 880s -0.295 0.837 0.070 0.158 880s 880s x1: [4, 1, 1, 1] 880s 0.250 -0.122 -0.374 -0.965 880s 880s y: [4, 1, 1, 1] 880s -0.548 1.014 0.149 -0.354 880s testing: mul_mat, [1, 4, 4, 1] = [4, 1, 4, 1] * [4, 4, 4, 1] 880s x0: [4, 1, 4, 1] 880s 0.444 -0.865 -0.140 0.885 880s 880s x1: [4, 4, 4, 1] 880s -0.887 -0.406 0.866 0.993 880s -0.389 0.565 -0.704 0.559 880s -0.586 0.335 -0.192 0.292 880s 0.961 -0.156 0.677 0.097 880s 880s y: [1, 4, 4, 1] 880s 0.714 880s -0.068 880s -0.265 880s 0.553 880s testing: mul_mat, [4, 4, 4, 2] = [1, 4, 4, 2] * [1, 4, 4, 2] 880s x0: [1, 4, 4, 2] 880s 0.859 880s 0.328 880s -0.210 880s -0.496 880s 880s x1: [1, 4, 4, 2] 880s -0.884 880s 0.369 880s 0.540 880s 0.276 880s 880s y: [4, 4, 4, 2] 880s -0.760 -0.290 0.185 0.439 880s 0.318 0.121 -0.078 -0.183 880s 0.464 0.177 -0.113 -0.268 880s 0.237 0.091 -0.058 -0.137 880s test-mul-mat0: iter:183/500 880s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.082 0.468 880s -0.095 0.604 880s 880s x1: [2, 1, 1, 1] 880s 0.955 -0.778 880s 880s y: [2, 1, 1, 1] 880s -0.286 -0.560 880s testing: mul_mat, [2, 2, 2, 1] = [2, 2, 2, 1] * [2, 2, 2, 1] 880s x0: [2, 2, 2, 1] 880s -0.694 0.772 880s 0.729 0.346 880s 880s x1: [2, 2, 2, 1] 880s 0.486 0.278 880s 0.657 0.136 880s 880s y: [2, 2, 2, 1] 880s -0.123 0.450 880s -0.351 0.526 880s testing: mul_mat, [2, 2, 2, 3] = [2, 2, 2, 3] * [2, 2, 2, 3] 880s x0: [2, 2, 2, 3] 880s 0.369 0.610 880s 0.698 0.561 880s 880s x1: [2, 2, 2, 3] 880s -0.292 -0.162 880s 0.396 -0.447 880s 880s y: [2, 2, 2, 3] 880s -0.207 -0.295 880s -0.127 0.025 880s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 3, 1, 1] 880s -0.505 -0.895 880s 0.734 0.432 880s -0.445 -0.883 880s 880s x1: [2, 2, 1, 1] 880s 0.738 0.545 880s -0.770 -0.150 880s 880s y: [3, 2, 1, 1] 880s -0.861 0.777 -0.810 880s 0.524 -0.630 0.475 880s testing: mul_mat, [2, 2, 2, 1] = [3, 2, 2, 1] * [3, 2, 2, 1] 880s x0: [3, 2, 2, 1] 880s 0.232 0.585 -0.484 880s 0.707 -0.929 -0.309 880s 880s x1: [3, 2, 2, 1] 880s -0.886 0.304 -0.614 880s -0.038 -0.428 0.209 880s 880s y: [2, 2, 2, 1] 880s 0.269 -0.719 880s -0.360 0.306 880s testing: mul_mat, [2, 3, 2, 3] = [2, 2, 2, 3] * [2, 3, 2, 3] 880s x0: [2, 2, 2, 3] 880s 0.537 -0.737 880s -0.892 0.784 880s 880s x1: [2, 3, 2, 3] 880s 0.066 -0.162 880s 0.978 0.171 880s -0.730 -0.905 880s 880s y: [2, 3, 2, 3] 880s 0.155 -0.186 880s 0.400 -0.739 880s 0.275 -0.058 880s test-mul-mat0: iter:184/500 880s testing: mul_mat, [3, 2, 1, 1] = [4, 3, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 3, 1, 1] 880s 0.306 0.516 0.480 -0.859 880s -0.326 0.112 -0.252 -0.137 880s -0.706 -0.287 -0.848 0.544 880s 880s x1: [4, 2, 1, 1] 880s -0.921 0.793 0.476 0.617 880s 0.901 0.739 0.401 -0.845 880s 880s y: [3, 2, 1, 1] 880s -0.174 0.184 0.353 880s 1.575 -0.196 -1.648 880s testing: mul_mat, [4, 3, 4, 1] = [4, 4, 4, 1] * [4, 3, 4, 1] 880s x0: [4, 4, 4, 1] 880s -0.244 0.501 -0.160 0.871 880s 0.502 -0.751 -0.902 -0.749 880s -0.479 -0.381 0.713 -0.056 880s -0.316 0.901 -0.749 0.863 880s 880s x1: [4, 3, 4, 1] 880s -0.274 -0.116 0.114 0.025 880s 0.630 -0.890 0.033 0.458 880s 0.370 -0.273 0.537 0.163 880s 880s y: [4, 3, 4, 1] 880s 0.013 -0.172 0.255 -0.082 880s -0.206 0.612 0.034 -0.631 880s -0.170 -0.216 0.300 -0.623 880s testing: mul_mat, [3, 4, 4, 3] = [4, 3, 4, 3] * [4, 4, 4, 3] 880s x0: [4, 3, 4, 3] 880s 0.956 -0.864 0.744 0.831 880s 0.311 -0.373 0.859 0.893 880s -0.424 -0.951 0.105 -0.428 880s 880s x1: [4, 4, 4, 3] 880s 0.727 -0.283 0.556 0.441 880s -0.924 0.053 0.676 -0.946 880s -0.738 -0.146 -0.284 0.372 880s -0.716 0.241 0.597 -0.097 880s 880s y: [3, 4, 4, 3] 880s 1.721 1.204 -0.169 880s -1.213 -0.572 0.818 880s -0.481 -0.087 0.263 880s -0.530 0.113 0.179 880s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 4, 1, 1] 880s 0.231 0.225 0.086 -0.387 880s 0.041 0.479 -0.092 -0.202 880s -0.722 -0.278 0.913 0.625 880s 0.206 0.850 0.213 -0.561 880s 880s x1: [4, 3, 1, 1] 880s 0.072 -0.770 0.507 -0.669 880s 0.603 0.408 -0.797 0.406 880s 0.274 0.608 -0.015 -0.695 880s 880s y: [4, 3, 1, 1] 880s 0.145 -0.277 0.207 -0.157 880s 0.006 0.211 -1.022 0.073 880s 0.468 0.445 -0.815 0.961 880s testing: mul_mat, [1, 4, 4, 1] = [4, 1, 4, 1] * [4, 4, 4, 1] 880s x0: [4, 1, 4, 1] 880s 0.729 0.541 -0.062 -0.762 880s 880s x1: [4, 4, 4, 1] 880s -0.462 0.511 0.511 -0.232 880s 0.018 0.842 -0.629 -0.574 880s -0.954 0.777 0.700 0.654 880s -0.238 -0.995 0.574 0.993 880s 880s y: [1, 4, 4, 1] 880s 0.085 880s 0.945 880s -0.817 880s -1.504 880s testing: mul_mat, [2, 4, 4, 3] = [1, 2, 4, 3] * [1, 4, 4, 3] 880s x0: [1, 2, 4, 3] 880s -0.851 880s 0.828 880s 880s x1: [1, 4, 4, 3] 880s 0.767 880s 0.712 880s 0.014 880s 0.121 880s 880s y: [2, 4, 4, 3] 880s -0.653 0.635 880s -0.606 0.590 880s -0.012 0.011 880s -0.103 0.100 880s test-mul-mat0: iter:185/500 880s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 1, 1, 1] 880s 0.050 -0.703 880s 880s x1: [2, 4, 1, 1] 880s -0.403 0.467 880s 0.660 -0.254 880s 0.296 0.158 880s 0.707 -0.619 880s 880s y: [1, 4, 1, 1] 880s -0.349 880s 0.211 880s -0.096 880s 0.470 880s testing: mul_mat, [3, 1, 2, 1] = [2, 3, 2, 1] * [2, 1, 2, 1] 880s x0: [2, 3, 2, 1] 880s 0.993 -0.512 880s -0.367 -0.340 880s 0.733 -0.446 880s 880s x1: [2, 1, 2, 1] 880s -0.905 0.390 880s 880s y: [3, 1, 2, 1] 880s -1.099 0.199 -0.838 880s testing: mul_mat, [4, 3, 2, 1] = [2, 4, 2, 1] * [2, 3, 2, 1] 880s x0: [2, 4, 2, 1] 880s -0.962 0.143 880s 0.239 0.617 880s -0.642 -0.181 880s 0.610 -0.154 880s 880s x1: [2, 3, 2, 1] 880s 0.748 -0.404 880s -0.185 -0.654 880s -0.936 -0.525 880s 880s y: [4, 3, 2, 1] 880s -0.778 -0.070 -0.408 0.519 880s 0.085 -0.448 0.237 -0.012 880s 0.826 -0.548 0.696 -0.490 880s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 2, 1, 1] 880s -0.534 0.358 880s 0.748 -0.793 880s 880s x1: [2, 4, 1, 1] 880s -0.553 -0.174 880s 0.122 0.043 880s 0.641 0.468 880s 0.107 -0.885 880s 880s y: [2, 4, 1, 1] 880s 0.233 -0.276 880s -0.050 0.057 880s -0.174 0.108 880s -0.374 0.781 880s testing: mul_mat, [2, 2, 2, 1] = [2, 2, 2, 1] * [2, 2, 2, 1] 880s x0: [2, 2, 2, 1] 880s 0.142 0.742 880s -0.283 0.721 880s 880s x1: [2, 2, 2, 1] 880s -0.204 0.207 880s -0.755 0.939 880s 880s y: [2, 2, 2, 1] 880s 0.124 0.207 880s 0.589 0.890 880s testing: mul_mat, [1, 2, 2, 1] = [2, 1, 2, 1] * [2, 2, 2, 1] 880s x0: [2, 1, 2, 1] 880s -0.375 0.870 880s 880s x1: [2, 2, 2, 1] 880s 0.476 0.254 880s 0.330 -0.402 880s 880s y: [1, 2, 2, 1] 880s 0.043 880s -0.474 880s test-mul-mat0: iter:186/500 880s testing: mul_mat, [4, 4, 1, 1] = [4, 4, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 4, 1, 1] 880s 0.574 0.087 0.145 0.639 880s -0.509 -0.769 -0.736 -0.639 880s 0.065 -0.753 -0.562 -0.305 880s 0.436 -0.239 -0.737 0.071 880s 880s x1: [4, 4, 1, 1] 880s 0.568 0.635 -0.815 0.865 880s -0.100 0.657 0.007 0.617 880s 0.399 -0.272 -0.660 0.501 880s 0.535 -0.155 -0.023 -0.210 880s 880s y: [4, 4, 1, 1] 880s 0.816 -0.731 -0.248 0.758 880s 0.395 -0.853 -0.694 -0.162 880s 0.429 0.171 0.449 0.761 880s 0.156 -0.002 0.229 0.273 880s testing: mul_mat, [4, 4, 3, 1] = [4, 4, 3, 1] * [4, 4, 3, 1] 880s x0: [4, 4, 3, 1] 880s -0.402 -0.902 -0.662 -0.447 880s -0.776 0.728 -0.245 -0.213 880s 0.813 -0.607 -0.888 -0.839 880s 0.102 -0.618 -0.364 0.782 880s 880s x1: [4, 4, 3, 1] 880s -0.054 -0.872 0.971 -0.398 880s 0.135 0.588 -0.999 0.863 880s 0.929 0.502 0.398 -0.226 880s -0.522 -0.812 -0.052 -0.948 880s 880s y: [4, 4, 3, 1] 880s 0.343 -0.747 -0.043 -0.131 880s -0.309 0.385 -0.085 0.689 880s -0.988 -0.405 0.287 -0.537 880s 1.400 0.027 0.910 -0.274 880s testing: mul_mat, [4, 4, 3, 1] = [4, 4, 3, 1] * [4, 4, 3, 1] 880s x0: [4, 4, 3, 1] 880s -0.145 0.545 -0.595 -0.647 880s -0.449 -0.480 0.055 -0.249 880s -0.698 0.344 -0.219 0.532 880s -0.525 -0.113 -0.738 -0.424 880s 880s x1: [4, 4, 3, 1] 880s -0.923 0.500 -0.029 0.797 880s 0.749 0.476 -0.613 -0.031 880s -0.165 -0.963 -0.891 0.100 880s -0.980 0.262 0.963 -0.899 880s 880s y: [4, 4, 3, 1] 880s -0.092 -0.025 1.247 0.113 880s 0.535 -0.591 -0.241 0.018 880s -0.035 0.462 0.033 0.811 880s 0.293 0.592 0.085 0.155 880s testing: mul_mat, [3, 4, 1, 1] = [4, 3, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 3, 1, 1] 880s -0.622 -0.150 -0.334 -0.043 880s -0.288 0.693 0.454 -0.897 880s 0.770 0.404 -0.430 0.998 880s 880s x1: [4, 4, 1, 1] 880s -0.831 0.021 -0.478 -0.643 880s 0.587 0.932 -0.897 0.665 880s 0.690 -0.334 0.218 0.898 880s -0.199 0.625 -0.038 -0.519 880s 880s y: [3, 4, 1, 1] 880s 0.701 0.613 -1.068 880s -0.234 -0.528 1.878 880s -0.490 -1.137 1.199 880s 0.065 0.938 -0.401 880s testing: mul_mat, [3, 4, 3, 1] = [3, 3, 3, 1] * [3, 4, 3, 1] 880s x0: [3, 3, 3, 1] 880s -0.500 -0.404 0.911 880s -0.858 0.829 0.414 880s -0.565 0.727 0.821 880s 880s x1: [3, 4, 3, 1] 880s 0.824 0.543 -0.834 880s -0.155 -0.935 -0.476 880s -0.568 0.997 -0.374 880s -0.903 0.687 0.292 880s 880s y: [3, 4, 3, 1] 880s -1.391 -0.603 -0.755 880s 0.021 -0.839 -0.983 880s -0.459 1.159 0.739 880s 0.441 1.466 1.250 880s testing: mul_mat, [2, 3, 3, 1] = [3, 2, 3, 1] * [3, 3, 3, 1] 880s x0: [3, 2, 3, 1] 880s -0.610 -0.583 0.866 880s -0.925 0.842 -0.728 880s 880s x1: [3, 3, 3, 1] 880s 0.452 0.792 0.855 880s 0.261 -0.709 0.996 880s 0.696 -0.113 0.825 880s 880s y: [2, 3, 3, 1] 880s 0.003 -0.374 880s 1.117 -1.564 880s 0.356 -1.340 880s test-mul-mat0: iter:187/500 880s testing: mul_mat, [4, 4, 1, 1] = [4, 4, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 4, 1, 1] 880s 0.152 -0.849 -0.293 0.245 880s -0.626 -0.450 -0.424 0.841 880s -0.959 0.108 -0.053 0.408 880s 0.725 0.375 -0.975 0.089 880s 880s x1: [4, 4, 1, 1] 880s -0.383 0.364 -0.590 0.793 880s 0.213 0.487 -0.983 0.563 880s -0.806 -0.593 0.638 -0.389 880s -0.751 0.504 -0.117 0.352 880s 880s y: [4, 4, 1, 1] 880s 0.000 0.993 0.761 0.504 880s 0.045 0.538 0.131 1.345 880s 0.099 0.174 0.516 -1.463 880s -0.421 0.589 0.924 -0.211 880s testing: mul_mat, [4, 4, 2, 1] = [4, 4, 2, 1] * [4, 4, 2, 1] 880s x0: [4, 4, 2, 1] 880s -0.843 0.589 0.055 -0.978 880s 0.056 0.297 0.525 0.133 880s 0.788 -0ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s .467 -0.905 0.632 880s 0.202 -0.793 0.627 -0.422 880s 880s x1: [4, 4, 2, 1] 880s 0.785 0.819 -0.698 0.272 880s 0.835 0.865 0.466 -0.758 880s 0.503 -0.922 -0.509 0.007 880s -0.039 0.844 -0.331 -0.887 880s 880s y: [4, 4, 2, 1] 880s -0.484 -0.044 1.041 -1.043 880s 0.572 0.447 -0.647 0.095 880s -1.002 -0.512 1.293 0.510 880s 1.379 -0.044 -0.686 -0.509 880s testing: mul_mat, [3, 4, 2, 4] = [4, 3, 2, 4] * [4, 4, 2, 4] 880s x0: [4, 3, 2, 4] 880s -0.243 0.217 0.111 -0.974 880s 0.034 -0.333 -0.674 -0.636 880s 0.701 -0.263 0.621 -0.165 880s 880s x1: [4, 4, 2, 4] 880s 0.800 0.521 0.343 -0.144 880s -0.182 -0.132 0.989 -0.394 880s 0.401 -0.916 -0.762 -0.398 880s -0.709 0.865 0.180 0.856 880s 880s y: [3, 4, 2, 4] 880s 0.097 -0.285 0.660 880s 0.509 -0.379 0.586 880s 0.006 1.085 0.114 880s -0.454 -0.977 -0.754 880s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 3, 1, 1] 880s -0.203 -0.610 -0.716 -0.842 880s 0.517 0.710 0.132 0.296 880s -0.128 -0.907 0.897 -0.543 880s 880s x1: [4, 3, 1, 1] 880s 0.923 -0.692 0.492 0.759 880s -0.378 -0.607 0.743 0.045 880s -0.949 0.896 -0.854 0.227 880s 880s y: [3, 3, 1, 1] 880s -0.757 0.275 0.539 880s -0.123 -0.515 1.241 880s 0.067 0.101 -1.581 880s testing: mul_mat, [4, 4, 2, 1] = [3, 4, 2, 1] * [3, 4, 2, 1] 880s x0: [3, 4, 2, 1] 880s -0.852 0.549 0.980 880s -0.747 0.764 -0.568 880s 0.307 -0.379 -0.821 880s -0.693 0.624 -0.290 880s 880s x1: [3, 4, 2, 1] 880s -0.686 -0.926 -0.784 880s 0.811 -0.444 -0.269 880s 0.734 -0.136 -0.777 880s 0.492 0.486 -0.385 880s 880s y: [4, 4, 2, 1] 880s -0.692 0.251 0.784 0.125 880s -1.199 -0.792 0.639 -0.761 880s -1.461 -0.210 0.915 -0.368 880s -0.529 0.222 0.282 0.073 880s testing: mul_mat, [1, 3, 2, 4] = [4, 1, 2, 4] * [4, 3, 2, 4] 880s x0: [4, 1, 2, 4] 880s 0.664 0.079 -0.537 -0.324 880s 880s x1: [4, 3, 2, 4] 880s -0.531 0.212 0.518 0.786 880s 0.076 -0.361 0.196 0.225 880s -0.109 -0.497 0.531 -0.559 880s 880s y: [1, 3, 2, 4] 880s -0.868 880s -0.156 880s -0.215 880s test-mul-mat0: iter:188/500 880s testing: mul_mat, [3, 2, 1, 1] = [1, 3, 1, 1] * [1, 2, 1, 1] 880s x0: [1, 3, 1, 1] 880s 0.851 880s -0.710 880s 0.101 880s 880s x1: [1, 2, 1, 1] 880s -0.374 880s -0.544 880s 880s y: [3, 2, 1, 1] 880s -0.318 0.265 -0.038 880s -0.463 0.386 -0.055 880s testing: mul_mat, [3, 3, 2, 1] = [1, 3, 2, 1] * [1, 3, 2, 1] 880s x0: [1, 3, 2, 1] 880s -0.923 880s -0.964 880s -0.292 880s 880s x1: [1, 3, 2, 1] 880s -0.977 880s 0.007 880s -0.926 880s 880s y: [3, 3, 2, 1] 880s 0.902 0.942 0.285 880s -0.007 -0.007 -0.002 880s 0.854 0.893 0.270 880s testing: mul_mat, [3, 3, 2, 4] = [1, 3, 2, 4] * [1, 3, 2, 4] 880s x0: [1, 3, 2, 4] 880s 0.093 880s 0.360 880s -0.995 880s 880s x1: [1, 3, 2, 4] 880s -0.790 880s 0.208 880s 0.846 880s 880s y: [3, 3, 2, 4] 880s -0.073 -0.285 0.786 880s 0.019 0.075 -0.206 880s 0.078 0.305 -0.842 880s testing: mul_mat, [4, 3, 1, 1] = [1, 4, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 4, 1, 1] 880s -0.821 880s -0.116 880s 0.918 880s 0.271 880s 880s x1: [1, 3, 1, 1] 880s 0.418 880s 0.164 880s 0.559 880s 880s y: [4, 3, 1, 1] 880s -0.343 -0.049 0.384 0.113 880s -0.134 -0.019 0.150 0.044 880s -0.459 -0.065 0.514 0.152 880s testing: mul_mat, [3, 1, 2, 1] = [4, 3, 2, 1] * [4, 1, 2, 1] 880s x0: [4, 3, 2, 1] 880s -0.622 -0.455 -0.778 0.866 880s -0.355 0.068 -0.599 -0.615 880s -0.788 0.588 -0.597 -0.492 880s 880s x1: [4, 1, 2, 1] 880s -0.756 0.924 0.202 -0.175 880s 880s y: [3, 1, 2, 1] 880s -0.258 0.318 1.105 880s testing: mul_mat, [2, 4, 2, 4] = [3, 2, 2, 4] * [3, 4, 2, 4] 880s x0: [3, 2, 2, 4] 880s -0.709 0.818 0.914 880s 0.153 0.933 -0.469 880s 880s x1: [3, 4, 2, 4] 880s 0.581 0.631 0.781 880s -0.078 -0.754 -0.513 880s -0.430 -0.376 0.132 880s -0.218 0.169 -0.800 880s 880s y: [2, 4, 2, 4] 880s 0.819 0.312 880s -1.031 -0.474 880s 0.118 -0.479 880s -0.439 0.499 880s test-mul-mat0: iter:189/500 880s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.780 0.429 0.476 880s 0.029 0.137 0.682 880s -0.652 0.497 0.610 880s 880s x1: [3, 4, 1, 1] 880s -0.145 -0.949 -0.437 880s 0.911 -0.623 0.926 880s -0.540 -0.994 0.686 880s 0.410 0.927 -0.400 880s 880s y: [3, 4, 1, 1] 880s -0.728 -0.433 -0.644 880s 0.884 0.572 -0.340 880s -0.520 0.316 0.276 880s 0.527 -0.134 -0.050 880s testing: mul_mat, [3, 3, 4, 1] = [3, 3, 4, 1] * [3, 3, 4, 1] 880s x0: [3, 3, 4, 1] 880s -0.326 0.014 -0.967 880s 0.134 -0.542 0.982 880s 0.504 -0.723 0.857 880s 880s x1: [3, 3, 4, 1] 880s -0.116 -0.927 0.789 880s 0.275 0.325 0.672 880s 0.605 0.393 0.386 880s 880s y: [3, 3, 4, 1] 880s -0.738 1.262 1.287 880s -0.735 0.521 0.480 880s -0.565 0.247 0.352 880s testing: mul_mat, [2, 3, 4, 3] = [3, 2, 4, 3] * [3, 3, 4, 3] 880s x0: [3, 2, 4, 3] 880s -0.519 -0.413 0.218 880s -0.204 -0.621 -0.166 880s 880s x1: [3, 3, 4, 3] 880s 0.285 0.033 0.786 880s 0.141 0.368 0.8ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s 89 880s -0.441 0.943 0.745 880s 880s y: [2, 3, 4, 3] 880s 0.010 -0.209 880s -0.032 -0.405 880s 0.002 -0.620 880s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 1, 1, 1] 880s 0.070 -0.746 0.554 880s 880s x1: [3, 2, 1, 1] 880s 0.726 -0.059 0.967 880s -0.764 0.541 0.030 880s 880s y: [1, 2, 1, 1] 880s 0.630 880s -0.441 880s testing: mul_mat, [2, 3, 4, 1] = [1, 2, 4, 1] * [1, 3, 4, 1] 880s x0: [1, 2, 4, 1] 880s 0.274 880s -0.144 880s 880s x1: [1, 3, 4, 1] 880s 0.426 880s -0.129 880s 0.791 880s 880s y: [2, 3, 4, 1] 880s 0.117 -0.061 880s -0.035 0.019 880s 0.216 -0.114 880s testing: mul_mat, [4, 1, 4, 3] = [2, 4, 4, 3] * [2, 1, 4, 3] 880s x0: [2, 4, 4, 3] 880s -0.564 -0.898 880s 0.642 0.118 880s -0.935 0.216 880s 0.963 0.504 880s 880s x1: [2, 1, 4, 3] 880s 0.253 0.401 880s 880s y: [4, 1, 4, 3] 880s -0.503 0.210 -0.150 0.446 880s test-mul-mat0: iter:190/500 880s testing: mul_mat, [3, 3, 1, 1] = [2, 3, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 3, 1, 1] 880s -0.111 -0.991 880s -0.226 0.852 880s 0.461 0.977 880s 880s x1: [2, 3, 1, 1] 880s 0.794 -0.773 880s -0.279 -0.166 880s 0.570 0.768 880s 880s y: [3, 3, 1, 1] 880s 0.678 -0.838 -0.389 880s 0.195 -0.078 -0.291 880s -0.824 0.526 1.013 880s testing: mul_mat, [4, 3, 3, 1] = [2, 4, 3, 1] * [2, 3, 3, 1] 880s x0: [2, 4, 3, 1] 880s -0.020 0.877 880s 0.636 -0.450 880s 0.645 -0.836 880s 0.439 0.654 880s 880s x1: [2, 3, 3, 1] 880s -0.151 0.599 880s -0.064 -0.077 880s 0.198 -0.966 880s 880s y: [4, 3, 3, 1] 880s 0.528 -0.365 -0.598 0.326 880s -0.066 -0.006 0.023 -0.078 880s -0.851 0.560 0.935 -0.545 880s testing: mul_mat, [2, 4, 3, 1] = [2, 2, 3, 1] * [2, 4, 3, 1] 880s x0: [2, 2, 3, 1] 880s 0.582 -0.267 880s -0.605 -0.936 880s 880s x1: [2, 4, 3, 1] 880s -0.459 0.091 880s -0.459 0.549 880s 0.843 0.698 880s -0.649 -0.177 880s 880s y: [2, 4, 3, 1] 880s -0.292 0.193 880s -0.413 -0.236 880s 0.305 -1.164 880s -0.331 0.558 880s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.148 0.381 880s -0.356 -0.977 880s 880s x1: [2, 2, 1, 1] 880s -0.495 0.195 880s 0.927 -0.508 880s 880s y: [2, 2, 1, 1] 880s 0.001 -0.014 880s -0.057 0.166 880s testing: mul_mat, [3, 2, 3, 1] = [2, 3, 3, 1] * [2, 2, 3, 1] 880s x0: [2, 3, 3, 1] 880s -0.316 0.289 880s 0.507 -0.964 880s 0.845 -0.502 880s 880s x1: [2, 2, 3, 1] 880s 0.734 0.485 880s -0.987 0.739 880s 880s y: [3, 2, 3, 1] 880s -0.092 -0.096 0.377 880s 0.525 -1.212 -1.204 880s testing: mul_mat, [2, 2, 3, 1] = [3, 2, 3, 1] * [3, 2, 3, 1] 880s x0: [3, 2, 3, 1] 880s 0.841 -0.586 0.637 880s -0.966 0.788 0.191 880s 880s x1: [3, 2, 3, 1] 880s -0.184 0.038 0.961 880s -0.445 -0.232 -0.321 880s 880s y: [2, 2, 3, 1] 880s 0.435 0.391 880s -0.443 0.186 880s test-mul-mat0: iter:191/500 880s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 3, 1, 1] 880s -0.315 0.776 -0.235 -0.506 880s -0.109 -0.926 0.857 0.953 880s -0.801 -0.742 -0.024 -0.121 880s 880s x1: [4, 3, 1, 1] 880s 0.400 0.799 0.540 -0.489 880s 0.389 -0.394 -0.091 0.014 880s -0.829 -0.250 0.048 -0.415 880s 880s y: [3, 3, 1, 1] 880s 0.614 -0.787 -0.868 880s -0.414 0.258 -0.018 880s 0.266 -0.033 0.898 880s testing: mul_mat, [2, 3, 4, 1] = [4, 2, 4, 1] * [4, 3, 4, 1] 880s x0: [4, 2, 4, 1] 880s -0.425 0.929 0.889 0.849 880s -0.250 0.486 0.197 0.427 880s 880s x1: [4, 3, 4, 1] 880s 0.426 -0.323 0.710 -0.600 880s -0.607 -0.972 0.801 -0.809 880s 0.568 -0.689 0.580 -0.825 880s 880s y: [2, 3, 4, 1] 880s -0.359 -0.380 880s -0.619 -0.508 880s -1.067 -0.716 880s testing: mul_mat, [4, 2, 4, 3] = [4, 4, 4, 3] * [4, 2, 4, 3] 880s x0: [4, 4, 4, 3] 880s 0.056 -0.500 0.142 0.142 880s -0.778 0.981 -0.298 -0.313 880s -0.553 0.022 0.817 -0.705 880s 0.755 0.011 0.327 -0.304 880s 880s x1: [4, 2, 4, 3] 880s 0.006 -0.562 0.504 0.756 880s 0.924 -0.299 0.184 -0.039 880s 880s y: [4, 2, 4, 3] 880s 0.461 -0.944 -0.137 -0.066 880s 0.222 -1.054 -0.340 0.766 880s testing: mul_mat, [3, 4, 1, 1] = [4, 3, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 3, 1, 1] 880s -0.657 -0.710 -0.410 0.135 880s 0.841 -0.261 0.024 -0.895 880s -0.871 -0.116 0.592 0.711 880s 880s x1: [4, 4, 1, 1] 880s 0.147 0.865 0.181 -0.207 880s -0.441 -0.452 -0.948 0.417 880s 0.558 0.777 0.252 -0.839 880s 0.688 0.218 0.841 0.319 880s 880s y: [3, 4, 1, 1] 880s -0.813 0.087 -0.268 880s 1.055 -0.648 0.172 880s -1.135 1.022 -1.022 880s -0.908 0.256 0.101 880s testing: mul_mat, [3, 4, 4, 1] = [3, 3, 4, 1] * [3, 4, 4, 1] 880s x0: [3, 3, 4, 1] 880s 0.404 0.439 -0.642 880s -0.079 0.151 0.972 880s 0.899 -0.033 0.058 880s 880s x1: [3, 4, 4, 1] 880s 0.001 -0.692 -0.142 880s -0.133 0.489 0.651 880s 0.426 -0.963 0.703 880s -0.158 0.595 0.481 880s 880s y: [3, 4, 4, 1] 880s -0.212 -0.243 0.016 880s -0.258 0.717 -0.098 880s -0.702 0.504 0.456 880s -0.111 0.569 -0.134 880s testing: mul_mat, [3, 3, 4, 3] = [3, 3, 4, 3] * [3, 3, 4, 3] 880s x0: [3, 3, 4, 3] 880s -0.118 -0.599 -0.714 880s 0.283 0.953 -0.466 880s 0.478 -0.039 0.201 880s 880s x1: [3, 3, 4, 3] 880s -0.233 -0.067 0.816 880s 0.825 0.801 0.743 880s 0.079 -0.109 -0.373 880s 880s y: [3, 3, 4, 3] 880s -0.515 -0.511 0.056 880s -1.108 0.651 0.512 880s 0.322 0.093 -0.033 880s test-mul-mat0: iter:192/500 880s testing: mul_mat, [4, 4, 1, 1] = [1, 4, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 4, 1, 1] 880s -0.263 880s -0.579 880s -0.602 880s -0.085 880s 880s x1: [1, 4, 1, 1] 880s -0.549 880s 0.386 880s 0.891 880s -0.308 880s 880s y: [4, 4, 1, 1] 880s 0.144 0.318 0.331 0.047 880s -0.102 -0.224 -0.233 -0.033 880s -0.234 -0.516 -0.536 -0.076 880s 0.081 0.178 0.185 0.026 880s testing: mul_mat, [3, 4, 3, 1] = [1, 3, 3, 1] * [1, 4, 3, 1] 880s x0: [1, 3, 3, 1] 880s -0.934 880s 0.650 880s -0.512 880s 880s x1: [1, 4, 3, 1] 880s -0.267 880s -0.546 880s -0.842 880s 0.609 880s 880s y: [3, 4, 3, 1] 880s 0.250 -0.174 0.137 880s 0.510 -0.355 0.279 880s 0.787 -0.548 0.431 880s -0.569 0.396 -0.312 880s testing: mul_mat, [4, 3, 3, 4] = [1, 4, 3, 4] * [1, 3, 3, 4] 880s x0: [1, 4, 3, 4] 880s 0.367 880s 0.459 880s -0.848 880s -0.545 880s 880s x1: [1, 3, 3, 4] 880s 0.243 880s 0.228 880s 0.447 880s 880s y: [4, 3, 3, 4] 880s 0.089 0.111 -0.206 -0.132 880s 0.083 0.105 -0.193 -0.124 880s 0.164 0.205 -0.379 -0.244 880s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 1, 1, 1] 880s 0.971 880s 880s x1: [1, 4, 1, 1] 880s -0.969 880s 0.744 880s -0.550 880s 0.166 880s 880s y: [1, 4, 1, 1] 880s -0.941 880s 0.722 880s -0.534 880s 0.161 880s testing: mul_mat, [1, 1, 3, 1] = [1, 1, 3, 1] * [1, 1, 3, 1] 880s x0: [1, 1, 3, 1] 880s -0.495 880s 880s x1: [1, 1, 3, 1] 880s -0.307 880s 880s y: [1, 1, 3, 1] 880s 0.152 880s testing: mul_mat, [2, 1, 3, 4] = [1, 2, 3, 4] * [1, 1, 3, 4] 880s x0: [1, 2, 3, 4] 880s 0.149 880s 0.429 880s 880s x1: [1, 1, 3, 4] 880s -0.801 880s 880s y: [2, 1, 3, 4] 880s -0.120 -0.344 880s test-mul-mat0: iter:193/500 880s testing: mul_mat, [3, 1, 1, 1] = [4, 3, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 3, 1, 1] 880s 0.340 -0.178 0.519 -0.764 880s -0.025 -0.839 -0.702 0.221 880s -0.225 0.054 0.333 0.955 880s 880s x1: [4, 1, 1, 1] 880s 0.264 0.898 -0.525 -0.587 880s 880s y: [3, 1, 1, 1] 880s 0.106 -0.522 -0.746 880s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 1, 1, 1] 880s -0.157 -0.597 -0.289 0.318 880s 880s x1: [4, 3, 1, 1] 880s 0.650 0.950 0.311 0.575 880s -0.234 0.815 0.774 0.626 880s -0.103 -0.774 0.785 0.140 880s 880s y: [1, 3, 1, 1] 880s -0.576 880s -0.474 880s 0.295 880s testing: mul_mat, [2, 1, 1, 3] = [4, 2, 1, 3] * [4, 1, 1, 3] 880s x0: [4, 2, 1, 3] 880s -0.552 0.897 0.116 0.759 880s 0.472 0.882 0.574 0.246 880s 880s x1: [4, 1, 1, 3] 880s -0.184 -0.962 -0.342 0.638 880s 880s y: [2, 1, 1, 3] 880s -0.317 -0.975 880s testing: mul_mat, [3, 2, 1, 1] = [4, 3, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 3, 1, 1] 880s 0.198 0.080 -0.411 0.882 880s 0.010 -0.416 0.055 -0.334 880s 0.814 0.060 -0.468 -0.184 880s 880s x1: [4, 2, 1, 1] 880s 0.081 -0.691 -0.801 -0.085 880s -0.198 -0.555 -0.918 0.251 880s 880s y: [3, 2, 1, 1] 880s 0.215 0.273 0.415 880s 0.515 0.095 0.189 880s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.976 0.237 0.797 880s -0.454 -0.942 0.579 880s 880s x1: [3, 4, 1, 1] 880s -0.981 -0.320 0.812 880s 0.473 0.511 -0.155 880s 0.586 0.980 0.435 880s -0.408 0.061 0.744 880s 880s y: [2, 4, 1, 1] 880s -0.387 1.217 880s 0.459 -0.786 880s 1.150 -0.937 880s 0.209 0.558 880s testing: mul_mat, [3, 3, 1, 3] = [2, 3, 1, 3] * [2, 3, 1, 3] 880s x0: [2, 3, 1, 3] 880s 0.282 -0.657 880s 0.893 0.637 880s -0.522 0.269 880s 880s x1: [2, 3, 1, 3] 880s 0.256 -0.194 880s 0.394 -0.664 880s 0.390 -0.546 880s 880s y: [3, 3, 1, 3] 880s 0.199 0.105 -0.186 880s 0.547 -0.071 -0.384 880s 0.469 0.000 -0.351 880s test-mul-mat0: iter:194/500 880s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 2, 1, 1] 880s -0.173 -0.174 -0.208 -0.288 880s -0.892 -0.315 0.190 -0.549 880s 880s x1: [4, 1, 1, 1] 880s 0.732 -0.280 -0.472 0.524 880s 880s y: [2, 1, 1, 1] 880s -0.131 -0.942 880s testing: mul_mat, [2, 2, 3, 1] = [4, 2, 3, 1] * [4, 2, 3, 1] 880s x0: [4, 2, 3, 1] 880s 0.223 0.414 -0.741 0.908 880s -0.397 -0.289 -0.769 0.062 880s 880s x1: [4, 2, 3, 1] 880s -0.677 -0.541 -0.229 -0.494 880s -0.036 0.148 -0.514 0.049 880s 880s y: [2, 2, 3, 1] 880s -0.654 0.571 880s 0.479 0.370 880s testing: mul_mat, [3, 2, 3, 4] = [4, 3, 3, 4] * [4, 2, 3, 4] 880s x0: [4, 3, 3, 4] 880s -0.911 -0.347 0.129 0.126 880s 0.327 0.623 -0.013 -0.441 880s 0.897 0.002 0.159 0.689 880s 880s x1: [4, 2, 3, 4] 880s 0.812 0.539 0.029 0.137 880s 0.970 0.731 0.288 0.193 880s 880s y: [3, 2, 3, 4] 880s -0.906 0.541 0.828 880s -1.076 0.684 1.050 880s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 3, 1, 1] 880s 0.402 -0.482 -0.021 0.773 880s -0.011 0.125 -0.945 -ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s 0.652 880s -0.122 -0.940 0.784 0.865 880s 880s x1: [4, 3, 1, 1] 880s -0.718 -0.957 -0.817 0.928 880s 0.262 -0.643 0.082 -0.321 880s 0.551 -0.877 0.264 0.472 880s 880s y: [3, 3, 1, 1] 880s 0.908 0.056 1.150 880s 0.165 0.048 0.359 880s 1.003 -0.673 1.372 880s testing: mul_mat, [1, 4, 3, 1] = [3, 1, 3, 1] * [3, 4, 3, 1] 880s x0: [3, 1, 3, 1] 880s -0.520 -0.886 0.531 880s 880s x1: [3, 4, 3, 1] 880s 0.891 0.967 -0.960 880s 0.038 -0.689 0.402 880s 0.320 -0.646 0.586 880s 0.248 0.615 0.943 880s 880s y: [1, 4, 3, 1] 880s -1.829 880s 0.805 880s 0.717 880s -0.174 880s testing: mul_mat, [3, 3, 3, 4] = [1, 3, 3, 4] * [1, 3, 3, 4] 880s x0: [1, 3, 3, 4] 880s -0.579 880s -0.183 880s -0.532 880s 880s x1: [1, 3, 3, 4] 880s -0.489 880s 0.584 880s 0.934 880s 880s y: [3, 3, 3, 4] 880s 0.283 0.090 0.260 880s -0.338 -0.107 -0.311 880s -0.541 -0.171 -0.497 880s test-mul-mat0: iter:195/500 880s testing: mul_mat, [3, 4, 1, 1] = [2, 3, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 3, 1, 1] 880s 0.492 0.907 880s 0.412 0.317 880s 0.694 0.611 880s 880s x1: [2, 4, 1, 1] 880s -0.689 -0.784 880s -0.006 -0.508 880s 0.061 -0.019 880s -0.625 -0.618 880s 880s y: [3, 4, 1, 1] 880s -1.050 -0.533 -0.958 880s -0.464 -0.164 -0.315 880s 0.013 0.019 0.031 880s -0.868 -0.453 -0.811 880s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 2, 1, 1] 880s -0.804 0.351 880s 0.610 -0.126 880s 880s x1: [2, 3, 1, 1] 880s -0.183 -0.746 880s -0.942 0.926 880s -0.986 0.296 880s 880s y: [2, 3, 1, 1] 880s -0.115 -0.018 880s 1.083 -0.691 880s 0.896 -0.639 880s testing: mul_mat, [4, 2, 1, 3] = [2, 4, 1, 3] * [2, 2, 1, 3] 880s x0: [2, 4, 1, 3] 880s -0.089 0.431 880s 0.640 -0.677 880s -0.252 0.334 880s 0.935 0.565 880s 880s x1: [2, 2, 1, 3] 880s -0.295 0.257 880s -0.239 -0.927 880s 880s y: [4, 2, 1, 3] 880s 0.137 -0.362 0.160 -0.131 880s -0.379 0.475 -0.249 -0.747 880s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 4, 1, 1] 880s -0.572 -0.793 880s -0.728 0.753 880s -0.811 0.350 880s 0.762 -0.800 880s 880s x1: [2, 4, 1, 1] 880s 0.706 -0.996 880s -0.930 0.056 880s 0.038 -0.991 880s -0.211 0.949 880s 880s y: [4, 4, 1, 1] 880s 0.386 -1.264 -0.921 1.335 880s 0.487 0.719 0.775 -0.754 880s 0.764 -0.775 -0.378 0.822 880s -0.632 0.869 0.503 -0.921 880s testing: mul_mat, [2, 2, 1, 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 2, 1, 1] 880s 0.798 -0.110 0.363 0.754 880s -0.143 0.749 -0.404 0.433 880s 880s x1: [4, 2, 1, 1] 880s 0.244 -0.048 0.488 0.270 880s -0.250 -0.872 -0.095 0.423 880s 880s y: [2, 2, 1, 1] 880s 0.581 -0.151 880s 0.181 -0.397 880s testing: mul_mat, [2, 4, 1, 3] = [2, 2, 1, 3] * [2, 4, 1, 3] 880s x0: [2, 2, 1, 3] 880s 0.399 0.326 880s 0.791 0.995 880s 880s x1: [2, 4, 1, 3] 880s 0.653 -0.208 880s 0.442 -0.559 880s -0.259 -0.118 880s -0.130 0.014 880s 880s y: [2, 4, 1, 3] 880s 0.192 0.309 880s -0.006 -0.206 880s -0.142 -0.322 880s -0.047 -0.089 880s test-mul-mat0: iter:196/500 880s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 2, 1, 1] 880s -0.323 -0.371 880s 0.294 0.186 880s 880s x1: [2, 4, 1, 1] 880s 0.312 -0.022 880s 0.572 -0.268 880s 0.046 -0.495 880s 0.641 -0.615 880s 880s y: [2, 4, 1, 1] 880s -0.093 0.088 880s -0.085 0.119 880s 0.169 -0.078 880s 0.021 0.074 880s testing: mul_mat, [4, 2, 2, 1] = [2, 4, 2, 1] * [2, 2, 2, 1] 880s x0: [2, 4, 2, 1] 880s -0.658 0.544 880s -0.683 0.419 880s -0.373 0.276 880s 0.510 -0.219 880s 880s x1: [2, 2, 2, 1] 880s -0.330 0.257 880s -0.415 -0.539 880s 880s y: [4, 2, 2, 1] 880s 0.357 0.333 0.194 -0.225 880s -0.020 0.058 0.006 -0.094 880s testing: mul_mat, [1, 4, 2, 2] = [2, 1, 2, 2] * [2, 4, 2, 2] 880s x0: [2, 1, 2, 2] 880s 0.666 -0.866 880s 880s x1: [2, 4, 2, 2] 880s 0.190 0.133 880s 0.871 0.818 880s -0.573 0.057 880s -0.511 0.685 880s 880s y: [1, 4, 2, 2] 880s 0.011 880s -0.129 880s -0.430 880s -0.933 880s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 4, 1, 1] 880s 0.415 0.798 880s 0.033 -0.461 880s 0.824 -0.649 880s -0.430 0.839 880s 880s x1: [2, 1, 1, 1] 880s 0.804 -0.415 880s 880s y: [4, 1, 1, 1] 880s 0.002 0.218 0.931 -0.694 880s testing: mul_mat, [2, 2, 2, 1] = [4, 2, 2, 1] * [4, 2, 2, 1] 880s x0: [4, 2, 2, 1] 880s 0.192 0.005 0.544 -0.590 880s -0.041 -0.005 0.532 -0.424 880s 880s x1: [4, 2, 2, 1] 880s -0.235 -0.086 -0.254 0.812 880s 0.235 -0.908 -0.588 -0.639 880s 880s y: [2, 2, 2, 1] 880s -0.663 -0.469 880s 0.098 -0.047 880s testing: mul_mat, [1, 4, 2, 2] = [2, 1, 2, 2] * [2, 4, 2, 2] 880s x0: [2, 1, 2, 2] 880s 0.526 0.234 880s 880s x1: [2, 4, 2, 2] 880s -0.041 -0.528 880s 0.479 0.051 880s -0.116 0.840 880s 0.433 0.091 880s 880s y: [1, 4, 2, 2] 880s -0.146 880s 0.264 880s 0.136 880s 0.249 880s test-mul-mat0: iter:197/500 880s testing: mul_mat, [2, 2, 1, 1] = [1, 2, 1, 1] * [1, 2, 1, 1] 880s x0: [1, 2, 1, 1] 880s 0.733 880s 0.970 880s 880s x1: [1, 2, 1, 1] 880s 0.968 880s -0.112 880s 880s y: [2, 2, 1, 1] 880s 0.709 0.939 880s -0.082 -0.108 880s testing: mul_mat, [2, 2, 4, 1] = [1, 2, 4, 1] * [1, 2, 4, 1] 880s x0: [1, 2, 4, 1] 880s 0.634 880s 0.439 880s 880s x1: [1, 2, 4, 1] 880s -0.242 880s 0.030 880s 880s y: [2, 2, 4, 1] 880s -0.153 -0.106 880s 0.019 0.013 880s testing: mul_mat, [4, 2, 4, 1] = [1, 4, 4, 1] * [1, 2, 4, 1] 880s x0: [1, 4, 4, 1] 880s 0.732 880s 0.316 880s -0.168 880s 0.464 880s 880s x1: [1, 2, 4, 1] 880s -0.326 880s 0.720 880s 880s y: [4, 2, 4, 1] 880s -0.239 -0.103 0.055 -0.151 880s 0.527 0.227 -0.121 0.334 880s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 2, 1, 1] 880s -0.352 880s -0.817 880s 880s x1: [1, 4, 1, 1] 880s -0.799 880s -0.152 880s -0.035 880s 0.509 880s 880s y: [2, 4, 1, 1] 880s 0.281 0.653 880s 0.053 0.124 880s 0.012 0.028 880s -0.179 -0.416 880s testing: mul_mat, [4, 1, 4, 1] = [2, 4, 4, 1] * [2, 1, 4, 1] 880s x0: [2, 4, 4, 1] 880s -0.011 -0.500 880s -0.090 -0.808 880s 0.698 0.024 880s -0.725 0.487 880s 880s x1: [2, 1, 4, 1] 880s 0.620 0.986 880s 880s y: [4, 1, 4, 1] 880s -0.499 -0.852 0.457 0.031 880s testing: mul_mat, [1, 2, 4, 1] = [4, 1, 4, 1] * [4, 2, 4, 1] 880s x0: [4, 1, 4, 1] 880s 0.039 -0.589 0.585 -0.469 880s 880s x1: [4, 2, 4, 1] 880s 0.153 0.191 -0.532 0.653 880s 0.384 0.492 0.140 0.753 880s 880s y: [1, 2, 4, 1] 880s -0.724 880s -0.546 880s test-mul-mat0: iter:198/500 880s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 2, 1, 1] 880s -0.327 -0.621 -0.928 880s -0.742 -0.090 0.623 880s 880s x1: [3, 3, 1, 1] 880s 0.933 0.944 -0.608 880s 0.296 0.461 -0.230 880s -0.098 -0.529 0.341 880s 880s y: [2, 3, 1, 1] 880s -0.327 -1.156 880s -0.170 -0.404 880s 0.044 0.332 880s testing: mul_mat, [3, 2, 1, 1] = [3, 3, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 3, 1, 1] 880s -0.606 -0.080 -0.760 880s -0.421 -0.508 -0.025 880s -0.735 0.709 -0.091 880s 880s x1: [3, 2, 1, 1] 880s 0.596 -0.360 0.198 880s 0.463 0.693 -0.291 880s 880s y: [3, 2, 1, 1] 880s -0.482 -0.073 -0.711 880s -0.115 -0.540 0.177 880s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.409 -0.160 0.074 880s 0.502 -0.537 -0.330 880s -0.858 0.660 -0.867 880s 0.834 -0.631 -0.385 880s 880s x1: [3, 3, 1, 1] 880s -0.791 -0.899 -0.795 880s 0.671 -0.129 0.107 880s -0.858 -0.788 -0.232 880s 880s y: [4, 3, 1, 1] 880s 0.409 0.349 0.775 0.214 880s -0.246 0.370 -0.754 0.599 880s 0.460 0.070 0.417 -0.129 880s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 3, 1, 1] 880s -0.538 -0.326 0.265 880s -0.948 -0.180 0.659 880s 0.095 0.910 -0.017 880s 880s x1: [3, 4, 1, 1] 880s -0.772 0.289 -0.145 880s -0.193 0.781 0.830 880s 0.072 0.490 -0.261 880s 0.282 0.591 -0.055 880s 880s y: [3, 4, 1, 1] 880s 0.282 0.584 0.192 880s 0.069 0.590 0.678 880s -0.268 -0.329 0.457 880s -0.359 -0.410 0.565 880s testing: mul_mat, [1, 3, 1, 1] = [3, 1, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 1, 1, 1] 880s 0.440 0.255 0.778 880s 880s x1: [3, 3, 1, 1] 880s -0.233 -0.878 0.653 880s -0.092 0.782 0.786 880s -0.257 -0.849 -0.600 880s 880s y: [1, 3, 1, 1] 880s 0.181 880s 0.771 880s -0.797 880s testing: mul_mat, [1, 3, 1, 1] = [1, 1, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 1, 1, 1] 880s 0.825 880s 880s x1: [1, 3, 1, 1] 880s 0.221 880s 0.086 880s -0.150 880s 880s y: [1, 3, 1, 1] 880s 0.182 880s 0.071 880s -0.123 880s test-mul-mat0: iter:199/500 880s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 4, 1, 1] 880s 0.205 -0.489 0.121 880s 0.356 -0.089 -0.908 880s -0.204 -0.834 0.870 880s -0.983 0.252 -0.280 880s 880s x1: [3, 4, 1, 1] 880s -0.236 -0.178 -0.821 880s 0.438 0.641 -0.911 880s -0.297 0.301 0.072 880s 0.470 0.423 -0.275 880s 880s y: [4, 4, 1, 1] 880s -0.060 0.678 -0.517 0.417 880s -0.334 0.927 -1.417 -0.014 880s -0.199 -0.198 -0.128 0.348 880s -0.144 0.380 -0.688 -0.278 880s testing: mul_mat, [2, 4, 2, 1] = [3, 2, 2, 1] * [3, 4, 2, 1] 880s x0: [3, 2, 2, 1] 880s -0.610 0.589 -0.135 880s -0.593 -0.159 0.585 880s 880s x1: [3, 4, 2, 1] 880s -0.273 0.078 -0.148 880s 0.029 0.847 0.937 880s 0.793 -0.331 -0.885 880s 0.232 -0.690 -0.796 880s 880s y: [2, 4, 2, 1] 880s 0.232 0.063 880s 0.354 0.396 880s -0.559 -0.935 880s -0.440 -0.493 880s testing: mul_mat, [1, 2, 2, 1] = [3, 1, 2, 1] * [3, 2, 2, 1] 880s x0: [3, 1, 2, 1] 880s 0.629 -0.519 0.213 880s 880s x1: [3, 2, 2, 1] 880s 0.957 -0.565 -0.511 880s 0.188 -0.255 -0.307 880s 880s y: [1, 2, 2, 1] 880s 0.786 880s 0.185 880s testing: mul_mat, [3, 1, 1, 1] = [3, 3, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 3, 1, 1] 880s -0.666 -0.747 -0.981 880s -0.342 0.095 0.468 880s 0.155 0.318 0.275 880s 880s x1: [3, 1, 1, 1] 880s 0.614 0.493 0.073 880s 880s y: [3, 1, 1, 1] 880s -0.849 -0.129 0.272 880s testing: mul_mat, [1, 3, 2, 1] = [3, 1, 2, 1] * [3, 3, 2, 1] 880s x0: [3, 1, 2, 1] 880s -0.534 -0.082 0.433 880s 880s x1: [3, 3, 2, 1] 880s -0.546 0.957 -0.537 880s 0.199 -0.350 0.586 880s 0.556 -0.381 -0.887 880s 880s y: [1, 3, 2, 1] 880s -0.019 880s 0.176 880s -0.650 880s testing: mul_mat, [3,ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s 3, 2, 1] = [1, 3, 2, 1] * [1, 3, 2, 1] 880s x0: [1, 3, 2, 1] 880s -0.904 880s 0.644 880s -0.276 880s 880s x1: [1, 3, 2, 1] 880s 0.053 880s -0.329 880s 0.906 880s 880s y: [3, 3, 2, 1] 880s -0.047 0.034 -0.014 880s 0.297 -0.212 0.091 880s -0.819 0.583 -0.250 880s test-mul-mat0: iter:200/500 880s testing: mul_mat, [3, 2, 1, 1] = [1, 3, 1, 1] * [1, 2, 1, 1] 880s x0: [1, 3, 1, 1] 880s 0.403 880s 0.600 880s 0.998 880s 880s x1: [1, 2, 1, 1] 880s 0.766 880s 0.802 880s 880s y: [3, 2, 1, 1] 880s 0.308 0.460 0.764 880s 0.323 0.481 0.800 880s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 3, 1, 1] 880s -0.554 880s 0.264 880s 0.743 880s 880s x1: [1, 3, 1, 1] 880s 0.248 880s 0.013 880s -0.536 880s 880s y: [3, 3, 1, 1] 880s -0.137 0.065 0.184 880s -0.007 0.003 0.010 880s 0.297 -0.141 -0.398 880s testing: mul_mat, [2, 3, 1, 4] = [1, 2, 1, 4] * [1, 3, 1, 4] 880s x0: [1, 2, 1, 4] 880s 0.917 880s -0.239 880s 880s x1: [1, 3, 1, 4] 880s 0.034 880s -0.684 880s -0.586 880s 880s y: [2, 3, 1, 4] 880s 0.031 -0.008 880s -0.627 0.163 880s -0.538 0.140 880s testing: mul_mat, [4, 2, 1, 1] = [1, 4, 1, 1] * [1, 2, 1, 1] 880s x0: [1, 4, 1, 1] 880s -0.247 880s 0.558 880s -0.798 880s -0.081 880s 880s x1: [1, 2, 1, 1] 880s 0.505 880s -0.455 880s 880s y: [4, 2, 1, 1] 880s -0.125 0.282 -0.403 -0.041 880s 0.112 -0.254 0.363 0.037 880s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 2, 1, 1] 880s 0.075 0.170 -0.900 -0.723 880s 0.979 0.628 -0.917 0.864 880s 880s x1: [4, 1, 1, 1] 880s -0.995 0.466 -0.338 0.039 880s 880s y: [2, 1, 1, 1] 880s 0.281 -0.338 880s testing: mul_mat, [3, 4, 1, 4] = [2, 3, 1, 4] * [2, 4, 1, 4] 880s x0: [2, 3, 1, 4] 880s 0.673 -0.134 880s -0.309 0.471 880s -0.777 0.374 880s 880s x1: [2, 4, 1, 4] 880s 0.720 0.673 880s -0.142 0.637 880s -0.565 0.194 880s -0.219 0.151 880s 880s y: [3, 4, 1, 4] 880s 0.394 0.095 -0.308 880s -0.181 0.344 0.349 880s -0.406 0.266 0.512 880s -0.168 0.139 0.227 880s test-mul-mat0: iter:201/500 880s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 1, 1, 1] 880s 0.901 0.674 880s 880s x1: [2, 3, 1, 1] 880s -0.380 0.739 880s -0.923 -0.707 880s -0.570 -0.700 880s 880s y: [1, 3, 1, 1] 880s 0.156 880s -1.309 880s -0.986 880s testing: mul_mat, [3, 1, 3, 1] = [2, 3, 3, 1] * [2, 1, 3, 1] 880s x0: [2, 3, 3, 1] 880s 0.863 -0.444 880s 0.795 -0.007 880s -0.422 0.921 880s 880s x1: [2, 1, 3, 1] 880s 0.322 0.098 880s 880s y: [3, 1, 3, 1] 880s 0.234 0.255 -0.046 880s testing: mul_mat, [3, 3, 3, 1] = [2, 3, 3, 1] * [2, 3, 3, 1] 880s x0: [2, 3, 3, 1] 880s -0.697 0.352 880s -0.888 -0.847 880s 0.384 -0.459 880s 880s x1: [2, 3, 3, 1] 880s -0.431 -0.114 880s 0.094 0.729 880s -0.213 -0.233 880s 880s y: [3, 3, 3, 1] 880s 0.260 0.479 -0.113 880s 0.191 -0.700 -0.299 880s 0.066 0.386 0.025 880s testing: mul_mat, [4, 3, 1, 1] = [2, 4, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 4, 1, 1] 880s -0.120 -0.078 880s -0.750 -0.296 880s -0.152 -0.510 880s -0.813 -0.726 880s 880s x1: [2, 3, 1, 1] 880s 0.434 -0.884 880s -0.382 -0.229 880s -0.433 -0.844 880s 880s y: [4, 3, 1, 1] 880s 0.017 -0.064 0.385 0.289 880s 0.063 0.354 0.175 0.477 880s 0.118 0.575 0.496 0.965 880s testing: mul_mat, [4, 2, 3, 1] = [4, 4, 3, 1] * [4, 2, 3, 1] 880s x0: [4, 4, 3, 1] 880s 0.543 -0.769 0.601 0.035 880s 0.740 0.943 -0.580 0.867 880s 0.669 0.063 0.328 0.678 880s -0.753 -0.125 0.361 0.701 880s 880s x1: [4, 2, 3, 1] 880s -0.184 -0.356 0.658 0.357 880s -0.547 0.759 -0.385 -0.461 880s 880s y: [4, 2, 3, 1] 880s 0.583 -0.544 0.312 0.671 880s -1.129 0.135 -0.756 -0.146 880s testing: mul_mat, [2, 4, 3, 1] = [4, 2, 3, 1] * [4, 4, 3, 1] 880s x0: [4, 2, 3, 1] 880s 0.449 0.787 -0.375 0.935 880s -0.697 -0.275 -0.528 -0.656 880s 880s x1: [4, 4, 3, 1] 880s -0.488 0.651 0.500 0.940 880s -0.113 0.150 0.474 -0.535 880s -0.416 0.680 -0.460 -0.288 880s 0.440 0.362 -0.616 -0.060 880s 880s y: [2, 4, 3, 1] 880s 0.984 -0.719 880s -0.610 0.138 880s 0.252 0.535 880s 0.657 -0.041 880s test-mul-mat0: iter:202/500 880s testing: mul_mat, [2, 2, 1, 1] = [1, 2, 1, 1] * [1, 2, 1, 1] 880s x0: [1, 2, 1, 1] 880s -0.135 880s 0.113 880s 880s x1: [1, 2, 1, 1] 880s 0.416 880s -0.190 880s 880s y: [2, 2, 1, 1] 880s -0.056 0.047 880s 0.026 -0.021 880s testing: mul_mat, [4, 2, 4, 1] = [1, 4, 4, 1] * [1, 2, 4, 1] 880s x0: [1, 4, 4, 1] 880s 0.039 880s 0.286 880s 0.659 880s -0.196 880s 880s x1: [1, 2, 4, 1] 880s -0.879 880s 0.590 880s 880s y: [4, 2, 4, 1] 880s -0.035 -0.252 -0.579 0.172 880s 0.023 0.169 0.389 -0.116 880s testing: mul_mat, [2, 4, 4, 1] = [1, 2, 4, 1] * [1, 4, 4, 1] 880s x0: [1, 2, 4, 1] 880s -0.766 880s -0.469 880s 880s x1: [1, 4, 4, 1] 880s 0.454 880s -0.399 880s -0.576 880s 0.789 880s 880s y: [2, 4, 4, 1] 880s -0.348 -0.213 880s 0.306 0.187 880s 0.442 0.270 880s -0.604 -0.370 880s testing: mul_mat, [3, 2, 1, 1] = [1, 3, 1, 1] * [1, 2, 1, 1] 880s x0: [1, 3, 1, 1] 880s 0.609 880s 0.054 880s 0.526 880s 880s x1: [1, 2, 1, 1] 880s 0.985 880s 0.869 880s 880s y: [3, 2, 1, 1] 880s 0.600 0.053 0.518 880s 0.529 0.047 0.457 880s testing: mul_mat, [3, 1, 4, 1] = [3, 3, 4, 1] * [3, 1, 4, 1] 880s x0: [3, 3, 4, 1] 880s 0.810 -0.851 0.960 880s -0.308 -0.074 -0.497 880s 0.677 -0.792 0.106 880s 880s x1: [3, 1, 4, 1] 880s 0.063 0.655 0.949 880s 880s y: [3, 1, 4, 1] 880s 0.405 -0.540 -0.376 880s testing: mul_mat, [1, 3, 4, 1] = [3, 1, 4, 1] * [3, 3, 4, 1] 880s x0: [3, 1, 4, 1] 880s -0.498 -0.896 -0.462 880s 880s x1: [3, 3, 4, 1] 880s 0.319 -0.620 -0.933 880s -0.575 -0.668 0.893 880s -0.379 -0.731 0.589 880s 880s y: [1, 3, 4, 1] 880s 0.827 880s 0.473 880s 0.572 880s test-mul-mat0: iter:203/500 880s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 1, 1, 1] 880s -0.454 0.622 0.645 -0.573 880s 880s x1: [4, 2, 1, 1] 880s 0.928 -0.708 -0.825 -0.047 880s -0.005 0.600 0.090 -0.881 880s 880s y: [1, 2, 1, 1] 880s -1.367 880s 0.939 880s testing: mul_mat, [1, 1, 3, 1] = [4, 1, 3, 1] * [4, 1, 3, 1] 880s x0: [4, 1, 3, 1] 880s 0.517 0.834 -0.609 0.376 880s 880s x1: [4, 1, 3, 1] 880s 0.458 -0.406 -0.072 0.562 880s 880s y: [1, 1, 3, 1] 880s 0.153 880s testing: mul_mat, [1, 1, 3, 4] = [4, 1, 3, 4] * [4, 1, 3, 4] 880s x0: [4, 1, 3, 4] 880s -0.285 -0.832 -0.016 0.893 880s 880s x1: [4, 1, 3, 4] 880s -0.218 0.585 -0.091 -0.595 880s 880s y: [1, 1, 3, 4] 880s -0.955 880s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 1, 1, 1] 880s 0.279 -0.453 -0.245 0.859 880s 880s x1: [4, 1, 1, 1] 880s 0.885 0.999 0.896 0.965 880s 880s y: [1, 1, 1, 1] 880s 0.405 880s testing: mul_mat, [3, 4, 3, 1] = [1, 3, 3, 1] * [1, 4, 3, 1] 880s x0: [1, 3, 3, 1] 880s -0.903 880s 0.884 880s -0.001 880s 880s x1: [1, 4, 3, 1] 880s -0.404 880s 0.398 880s -0.632 880s -0.296 880s 880s y: [3, 4, 3, 1] 880s 0.365 -0.357 0.000 880s -0.360 0.352 -0.000 880s 0.571 -0.559 0.001 880s 0.267 -0.262 0.000 880s testing: mul_mat, [2, 1, 3, 4] = [3, 2, 3, 4] * [3, 1, 3, 4] 880s x0: [3, 2, 3, 4] 880s 0.129 0.432 -0.358 880s 0.009 -0.884 0.217 880s 880s x1: [3, 1, 3, 4] 880s 0.878 0.133 -0.721 880s 880s y: [2, 1, 3, 4] 880s 0.429 -0.266 880s test-mul-mat0: iter:204/500 880s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.768 0.526 0.258 880s 0.513 0.877 -0.650 880s 880s x1: [3, 4, 1, 1] 880s -0.410 -0.473 0.347 880s 0.603 0.658 0.814 880s -0.187 -0.924 0.428 880s -0.618 0.961 0.086 880s 880s y: [2, 4, 1, 1] 880s -0.473 -0.851 880s 1.019 0.357 880s -0.519 -1.185 880s 0.052 0.470 880s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.006 -0.441 0.959 880s 0.883 0.570 -0.451 880s -0.589 -0.083 -0.848 880s -0.932 -0.269 -0.035 880s 880s x1: [3, 2, 1, 1] 880s -0.372 -0.743 0.522 880s -0.670 -0.530 -0.218 880s 880s y: [4, 2, 1, 1] 880s 0.830 -0.987 -0.161 0.528 880s 0.029 -0.796 0.623 0.775 880s testing: mul_mat, [1, 4, 1, 4] = [3, 1, 1, 4] * [3, 4, 1, 4] 880s x0: [3, 1, 1, 4] 880s -0.845 -0.890 -0.379 880s 880s x1: [3, 4, 1, 4] 880s -0.855 -0.841 0.347 880s -0.895 0.245 -0.048 880s 0.873 -0.230 -0.790 880s 0.386 -0.352 -0.440 880s 880s y: [1, 4, 1, 4] 880s 1.341 880s 0.557 880s -0.234 880s 0.154 880s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 1, 1, 1] 880s 0.609 0.011 0.150 880s 880s x1: [3, 1, 1, 1] 880s 0.138 -0.371 -0.034 880s 880s y: [1, 1, 1, 1] 880s 0.075 880s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 3, 1, 1] 880s 0.289 880s 0.095 880s -0.044 880s 880s x1: [1, 3, 1, 1] 880s -0.538 880s 0.826 880s -0.158 880s 880s y: [3, 3, 1, 1] 880s -0.155 -0.051 0.024 880s 0.239 0.078 -0.036 880s -0.046 -0.015 0.007 880s testing: mul_mat, [4, 1, 1, 4] = [3, 4, 1, 4] * [3, 1, 1, 4] 880s x0: [3, 4, 1, 4] 880s -0.614 -0.792 -0.118 880s 0.070 0.273 -0.472 880s -0.355 0.517 0.026 880s -0.449 0.876 0.345 880s 880s x1: [3, 1, 1, 4] 880s 0.656 0.429 0.554 880s 880s y: [4, 1, 1, 4] 880s -0.808 -0.098 0.003 0.272 880s test-mul-mat0: iter:205/500 880s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 2, 1, 1] 880s -0.020 880s -0.692 880s 880s x1: [1, 1, 1, 1] 880s -0.896 880s 880s y: [2, 1, 1, 1] 880s 0.018 0.620 880s testing: mul_mat, [2, 2, 3, 1] = [1, 2, 3, 1] * [1, 2, 3, 1] 880s x0: [1, 2, 3, 1] 880s 0.213 880s -0.050 880s 880s x1: [1, 2, 3, 1] 880s 0.449 880s -0.376 880s 880s y: [2, 2, 3, 1] 880s 0.096 -0.022 880s -0.080 0.019 880s testing: mul_mat, [1, 2, 3, 2] = [1, 1, 3, 2] * [1, 2, 3, 2] 880s x0: [1, 1, 3, 2] 880s -0.269 880s 880s x1: [1, 2, 3, 2] 880s 0.164 880s 0.594 880s 880s y: [1, 2, 3, 2] 880s -0.044 880s -0.160 880s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 2, 1, 1] 880s -0.947 880s 0.718 880s 880s x1: [1, 1, 1, 1] 880s -0.666 880s 880s y: [2, 1, 1, 1] 880s 0.630 -0.478 880s testing: mul_mat, [1, 1, 3, 1] = [2, 1, 3, 1] * [2, 1, 3, 1] 880s x0: [2, 1, 3, 1] 880s -0.963 0.345 880s 880s x1: [2, 1, 3, 1] 880s 0.961 -0.856 880s 880s y: [1, 1, 3, 1] 880s -1.221 880s testing: mul_mat, [4, 2, 3, 2] = [1, 4, 3, 2ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ] * [1, 2, 3, 2] 880s x0: [1, 4, 3, 2] 880s 0.392 880s 0.688 880s 0.227 880s 0.445 880s 880s x1: [1, 2, 3, 2] 880s -0.805 880s -0.640 880s 880s y: [4, 2, 3, 2] 880s -0.315 -0.554 -0.183 -0.358 880s -0.251 -0.440 -0.145 -0.285 880s test-mul-mat0: iter:206/500 880s testing: mul_mat, [1, 2, 1, 1] = [2, 1, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 1, 1, 1] 880s 0.267 0.356 880s 880s x1: [2, 2, 1, 1] 880s -0.481 0.022 880s -0.961 0.911 880s 880s y: [1, 2, 1, 1] 880s -0.120 880s 0.068 880s testing: mul_mat, [3, 1, 2, 1] = [2, 3, 2, 1] * [2, 1, 2, 1] 880s x0: [2, 3, 2, 1] 880s -0.167 -0.030 880s 0.394 -0.242 880s -0.167 -0.430 880s 880s x1: [2, 1, 2, 1] 880s -0.883 -0.546 880s 880s y: [3, 1, 2, 1] 880s 0.164 -0.215 0.382 880s testing: mul_mat, [3, 3, 2, 3] = [2, 3, 2, 3] * [2, 3, 2, 3] 880s x0: [2, 3, 2, 3] 880s 0.709 -0.015 880s -0.234 0.426 880s -0.449 -0.910 880s 880s x1: [2, 3, 2, 3] 880s 0.189 -0.467 880s 0.923 -0.752 880s 0.136 0.591 880s 880s y: [3, 3, 2, 3] 880s 0.141 -0.243 0.340 880s 0.665 -0.536 0.271 880s 0.088 0.220 -0.599 880s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 1, 1, 1] 880s 0.163 0.403 880s 880s x1: [2, 3, 1, 1] 880s -0.872 0.476 880s -0.979 0.658 880s 0.324 0.362 880s 880s y: [1, 3, 1, 1] 880s 0.050 880s 0.106 880s 0.199 880s testing: mul_mat, [3, 2, 2, 1] = [1, 3, 2, 1] * [1, 2, 2, 1] 880s x0: [1, 3, 2, 1] 880s -0.575 880s 0.183 880s 0.485 880s 880s x1: [1, 2, 2, 1] 880s 0.370 880s -0.613 880s 880s y: [3, 2, 2, 1] 880s -0.213 0.068 0.179 880s 0.353 -0.112 -0.297 880s testing: mul_mat, [2, 1, 2, 3] = [3, 2, 2, 3] * [3, 1, 2, 3] 880s x0: [3, 2, 2, 3] 880s -0.305 0.432 -0.934 880s -0.289 0.596 -0.002 880s 880s x1: [3, 1, 2, 3] 880s -0.557 -0.647 0.163 880s 880s y: [2, 1, 2, 3] 880s -0.261 -0.226 880s test-mul-mat0: iter:207/500 880s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 2, 1, 1] 880s -0.347 0.210 -0.216 880s 0.650 0.806 0.261 880s 880s x1: [3, 2, 1, 1] 880s 0.002 -0.256 0.399 880s -0.992 0.016 0.087 880s 880s y: [2, 2, 1, 1] 880s -0.141 -0.101 880s 0.329 -0.609 880s testing: mul_mat, [4, 2, 3, 1] = [3, 4, 3, 1] * [3, 2, 3, 1] 880s x0: [3, 4, 3, 1] 880s 0.907 0.234 -0.792 880s -0.076 -0.678 0.597 880s 0.577 0.532 -0.620 880s 0.227 0.338 0.641 880s 880s x1: [3, 2, 3, 1] 880s -0.444 0.003 -0.704 880s 0.391 -0.541 0.213 880s 880s y: [4, 2, 3, 1] 880s 0.155 -0.388 0.182 -0.552 880s 0.060 0.464 -0.194 0.042 880s testing: mul_mat, [3, 4, 3, 3] = [3, 3, 3, 3] * [3, 4, 3, 3] 880s x0: [3, 3, 3, 3] 880s -0.186 0.842 0.630 880s -0.601 -0.028 -0.540 880s -0.583 -0.918 0.483 880s 880s x1: [3, 4, 3, 3] 880s -0.009 -0.269 -0.223 880s 0.372 0.958 -0.885 880s 0.013 0.741 -0.544 880s -0.050 -0.085 0.256 880s 880s y: [3, 4, 3, 3] 880s -0.365 0.133 0.144 880s 0.181 0.227 -1.523 880s 0.279 0.265 -0.950 880s 0.099 -0.105 0.231 880s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.685 -0.880 0.335 880s -0.078 0.176 0.124 880s -0.979 -0.075 0.898 880s 880s x1: [3, 3, 1, 1] 880s 0.844 -0.366 -0.465 880s 0.211 -0.596 -0.518 880s -0.230 0.061 0.901 880s 880s y: [3, 3, 1, 1] 880s 0.744 -0.188 -1.216 880s 0.495 -0.186 -0.627 880s 0.091 0.140 1.030 880s testing: mul_mat, [2, 3, 3, 1] = [3, 2, 3, 1] * [3, 3, 3, 1] 880s x0: [3, 2, 3, 1] 880s 0.546 0.844 -0.960 880s 0.793 -0.039 -0.745 880s 880s x1: [3, 3, 3, 1] 880s 0.415 -0.752 -0.588 880s 0.377 0.028 -0.979 880s -0.995 0.949 -0.612 880s 880s y: [2, 3, 3, 1] 880s 0.156 0.796 880s 1.169 1.027 880s 0.845 -0.370 880s testing: mul_mat, [4, 3, 3, 3] = [2, 4, 3, 3] * [2, 3, 3, 3] 880s x0: [2, 4, 3, 3] 880s 0.379 -0.683 880s 0.068 0.194 880s 0.783 0.893 880s 0.384 -0.978 880s 880s x1: [2, 3, 3, 3] 880s -0.545 0.609 880s 0.173 -0.315 880s -0.985 0.744 880s 880s y: [4, 3, 3, 3] 880s -0.622 0.081 0.118 -0.805 880s 0.281 -0.049 -0.146 0.375 880s -0.881 0.078 -0.106 -1.106 880s test-mul-mat0: iter:208/500 880s testing: mul_mat, [1, 3, 1, 1] = [3, 1, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 1, 1, 1] 880s 0.198 -0.086 -0.338 880s 880s x1: [3, 3, 1, 1] 880s 0.348 0.420 0.376 880s 0.982 0.511 0.064 880s 0.788 -0.898 0.355 880s 880s y: [1, 3, 1, 1] 880s -0.094 880s 0.129 880s 0.114 880s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 2, 1, 1] 880s -0.720 0.553 -0.924 880s -0.149 -0.922 0.104 880s 880s x1: [3, 1, 1, 1] 880s 0.611 0.429 0.854 880s 880s y: [2, 1, 1, 1] 880s -0.992 -0.398 880s testing: mul_mat, [3, 2, 1, 2] = [3, 3, 1, 2] * [3, 2, 1, 2] 880s x0: [3, 3, 1, 2] 880s -0.108 -0.044 0.004 880s -0.753 0.867 -0.798 880s 0.161 -0.470 0.813 880s 880s x1: [3, 2, 1, 2] 880s 0.034 0.450 0.275 880s -0.594 -0.920 0.839 880s 880s y: [3, 2, 1, 2] 880s -0.023 0.145 0.018 880s 0.108 -1.021 1.019 880s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.824 -0.284 -0.037 880s -0.564 0.391 0.259 880s 0.423 -0.573 -0.371 880s 880s x1: [3, 3, 1, 1] 880s -0.245 0.464 -0.757 880s 0.161 0.544 -0.918 880s 0.131 0.996 0.268 880s 880s y: [3, 3, 1, 1] 880s -0.306 0.124 -0.089 880s 0.012 -0.116 0.097 880s -0.185 0.385 -0.615 880s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 2, 1, 1] 880s -0.877 -0.941 0.307 880s 0.063 0.587 0.220 880s 880s x1: [3, 3, 1, 1] 880s -0.876 0.789 -0.558 880s -0.286 -0.828 -0.769 880s 0.584 -0.891 -0.905 880s 880s y: [2, 3, 1, 1] 880s -0.145 0.284 880s 0.794 -0.674 880s 0.049 -0.686 880s testing: mul_mat, [1, 3, 1, 2] = [2, 1, 1, 2] * [2, 3, 1, 2] 880s x0: [2, 1, 1, 2] 880s 0.742 0.352 880s 880s x1: [2, 3, 1, 2] 880s 0.130 0.388 880s -0.648 0.126 880s -0.344 -0.127 880s 880s y: [1, 3, 1, 2] 880s 0.233 880s -0.437 880s -0.300 880s test-mul-mat0: iter:209/500 880s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 2, 1, 1] 880s 0.183 -0.584 0.521 -0.511 880s 0.636 -0.349 0.878 0.988 880s 880s x1: [4, 1, 1, 1] 880s 0.052 0.811 -0.881 -0.644 880s 880s y: [2, 1, 1, 1] 880s -0.595 -1.658 880s testing: mul_mat, [2, 2, 3, 1] = [4, 2, 3, 1] * [4, 2, 3, 1] 880s x0: [4, 2, 3, 1] 880s -0.649 -0.159 -0.294 0.002 880s -0.281 -0.306 -0.221 0.253 880s 880s x1: [4, 2, 3, 1] 880s 0.777 -0.466 -0.139 -0.272 880s 0.626 0.157 -0.606 -0.891 880s 880s y: [2, 2, 3, 1] 880s -0.390 -0.114 880s -0.255 -0.315 880s testing: mul_mat, [3, 2, 3, 1] = [4, 3, 3, 1] * [4, 2, 3, 1] 880s x0: [4, 3, 3, 1] 880s -0.802 0.157 -0.099 -0.853 880s -0.539 -0.200 -0.607 -0.340 880s 0.869 0.293 0.690 0.290 880s 880s x1: [4, 2, 3, 1] 880s -0.754 -0.800 0.069 -0.100 880s 0.030 0.421 0.969 0.380 880s 880s y: [3, 2, 3, 1] 880s 0.558 0.559 -0.871 880s -0.378 -0.818 0.928 880s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 4, 1, 1] 880s 0.885 0.302 0.713 -0.779 880s -0.981 0.736 -0.924 -0.949 880s 0.857 0.982 -0.496 -0.301 880s -0.884 0.155 -0.372 0.267 880s 880s x1: [4, 3, 1, 1] 880s 0.846 -0.914 0.192 0.715 880s 0.379 -0.118 0.005 -0.359 880s 0.952 -0.443 -0.422 0.405 880s 880s y: [4, 3, 1, 1] 880s 0.053 -2.359 -0.483 -0.770 880s 0.583 -0.123 0.314 -0.451 880s 0.093 -1.255 0.468 -0.645 880s testing: mul_mat, [2, 4, 3, 1] = [4, 2, 3, 1] * [4, 4, 3, 1] 880s x0: [4, 2, 3, 1] 880s 0.499 -0.961 -0.211 -0.601 880s 0.652 0.911 -0.174 -0.571 880s 880s x1: [4, 4, 3, 1] 880s 0.744 0.231 0.114 0.831 880s -0.577 -0.171 0.210 0.305 880s 0.834 0.851 0.257 -0.609 880s -0.571 -0.339 0.482 -0.686 880s 880s y: [2, 4, 3, 1] 880s -0.373 0.202 880s -0.351 -0.743 880s -0.090 1.622 880s 0.351 -0.373 880s testing: mul_mat, [3, 4, 3, 1] = [2, 3, 3, 1] * [2, 4, 3, 1] 880s x0: [2, 3, 3, 1] 880s 0.719 0.566 880s -0.806 0.817 880s 0.753 -0.378 880s 880s x1: [2, 4, 3, 1] 880s 0.992 -0.622 880s 0.859 0.619 880s 0.448 0.530 880s -0.416 -0.053 880s 880s y: [3, 4, 3, 1] 880s 0.361 -1.308 0.982 880s 0.968 -0.188 0.413 880s 0.622 0.072 0.136 880s -0.329 0.292 -0.293 880s test-mul-mat0: iter:210/500 880s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.250 -0.722 0.429 880s 0.067 -0.101 -0.012 880s 0.262 0.380 -0.835 880s 880s x1: [3, 3, 1, 1] 880s 0.070 -0.916 0.567 880s -0.480 -0.840 0.312 880s -0.856 0.057 -0.475 880s 880s y: [3, 3, 1, 1] 880s 0.922 0.090 -0.804 880s 0.621 0.049 -0.706 880s -0.459 -0.057 0.194 880s testing: mul_mat, [1, 3, 1, 1] = [3, 1, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 1, 1, 1] 880s 0.311 0.662 -0.286 880s 880s x1: [3, 3, 1, 1] 880s -0.096 0.391 0.925 880s 0.493 -0.416 -0.692 880s -0.059 0.747 -0.860 880s 880s y: [1, 3, 1, 1] 880s -0.035 880s 0.075 880s 0.722 880s testing: mul_mat, [3, 1, 1, 2] = [3, 3, 1, 2] * [3, 1, 1, 2] 880s x0: [3, 3, 1, 2] 880s 0.066 0.241 0.862 880s 0.344 -0.330 -0.071 880s -0.757 0.658 -0.809 880s 880s x1: [3, 1, 1, 2] 880s 0.395 0.229 0.234 880s 880s y: [3, 1, 1, 2] 880s 0.283 0.044 -0.338 880s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.628 -0.782 0.742 880s -0.499 -0.306 -0.920 880s 880s x1: [3, 3, 1, 1] 880s 0.345 -0.803 0.174 880s -0.344 0.858 0.888 880s -0.948 0.087 0.122 880s 880s y: [2, 3, 1, 1] 880s 0.974 -0.086 880s -0.228 -0.909 880s -0.573 0.334 880s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 1, 1, 1] 880s 0.893 -0.706 880s 880s x1: [2, 3, 1, 1] 880s -0.963 -0.588 880s 0.008 -0.719 880s -0.930 0.199 880s 880s y: [1, 3, 1, 1] 880s -0.444 880s 0.515 880s -0.971 880s testing: mul_mat, [3, 2, 1, 2] = [1, 3, 1, 2] * [1, 2, 1, 2] 880s x0: [1, 3, 1, 2] 880s -0.589 880s 0.861 880s 0.894 880s 880s x1: [1, 2, 1, 2] 880s -0.082 880s 0.641 880s 880s y: [3, 2, 1, 2] 880s 0.048 -0.071 -0.073 880s -0.377 0.552 0.573 880s test-mul-mat0: iter:211/500 880s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 1, 1, 1] 880s 0.482 0.479 0.321 -0.319 880s 880s x1: [4, 2, 1, 1] 880s -0.824 -0.533 0.097 -0.606 880s 0.161 -0.162 -0.527 0.198 880s 880s y: [1, 2, 1, 1] 880s -0.428 880s -0.232 880s testing: mul_mat, [3, 1, 1, 1] = [4, 3, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 3, 1, 1] 880s 0.858 -0.183 -0.448 -0.731 880s -0.322 -0.555 0.854 0.196 880s -0.802 0.328 0.765 0.037 880s 880s x1: [4, 1, 1, 1] 880s 0.383 0.214 -0.025 -0.699 880s 880s y: [3, 1, 1, 1] 880s 0.811 -0.400 -0.282 880s testing: mul_mat, [2, 3, 1, 4] = [4, 2, 1, 4] * [4, 3, 1, 4] 880s x0: [4, 2, 1, 4] 880s 0.741 0.576 0.693 -0.209 880s 0.649 -0.787 -0.713 0.576 880s 880s x1: [4, 3, 1, 4] 880s 0.924 0.941 0.504 0.021 880s -0.665 -0.336 0.859 -0.192 880s 0.863 0.110 -0.710 0.341 880s 880s y: [2, 3, 1, 4] 880s 1.572 -0.488 880s -0.051 -0.891 880s 0.139 1.177 880s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 4, 1, 1] 880s 0.237 0.457 -0.426 0.163 880s -0.374 -0.268 0.726 0.202 880s 0.788 0.458 0.196 0.882 880s 0.248 -0.956 0.177 -0.802 880s 880s x1: [4, 2, 1, 1] 880s 0.181 0.076 -0.189 -0.170 880s 0.289 0.097 -0.594 0.217 880s 880s y: [4, 2, 1, 1] 880s 0.131 -0.260 -0.010 0.075 880s 0.402 -0.522 0.348 -0.301 880s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 2, 1, 1] 880s 0.269 -0.536 -0.002 -0.425 880s 0.540 -0.999 -0.492 -0.276 880s 880s x1: [4, 4, 1, 1] 880s 0.035 0.414 -0.524 -0.361 880s -0.202 -0.504 0.820 0.874 880s 0.306 -0.351 0.163 -0.596 880s 0.055 -0.620 0.481 -0.708 880s 880s y: [2, 4, 1, 1] 880s -0.058 -0.038 880s -0.158 -0.250 880s 0.524 0.600 880s 0.647 0.608 880s testing: mul_mat, [3, 4, 1, 4] = [2, 3, 1, 4] * [2, 4, 1, 4] 880s x0: [2, 3, 1, 4] 880s 0.546 0.631 880s 0.853 0.794 880s 0.478 -0.474 880s 880s x1: [2, 4, 1, 4] 880s 0.704 0.751 880s 0.887 -0.093 880s 0.634 -0.915 880s 0.942 0.048 880s 880s y: [3, 4, 1, 4] 880s 0.859 1.198 -0.019 880s 0.425 0.682 0.468 880s -0.231 -0.186 0.736 880s 0.544 0.841 0.428 880s test-mul-mat0: iter:212/500 880s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 2, 1, 1] 880s 0.084 880s -0.966 880s 880s x1: [1, 4, 1, 1] 880s -0.142 880s 0.425 880s 0.606 880s -0.597 880s 880s y: [2, 4, 1, 1] 880s -0.012 0.138 880s 0.036 -0.410 880s 0.051 -0.585 880s -0.050 0.576 880s testing: mul_mat, [4, 2, 3, 1] = [1, 4, 3, 1] * [1, 2, 3, 1] 880s x0: [1, 4, 3, 1] 880s -0.927 880s 0.092 880s -0.953 880s 0.047 880s 880s x1: [1, 2, 3, 1] 880s 0.072 880s 0.611 880s 880s y: [4, 2, 3, 1] 880s -0.067 0.007 -0.069 0.003 880s -0.566 0.056 -0.582 0.028 880s testing: mul_mat, [4, 4, 3, 1] = [1, 4, 3, 1] * [1, 4, 3, 1] 880s x0: [1, 4, 3, 1] 880s 0.984 880s 0.289 880s -0.275 880s 0.967 880s 880s x1: [1, 4, 3, 1] 880s 0.024 880s 0.922 880s 0.766 880s 0.503 880s 880s y: [4, 4, 3, 1] 880s 0.024 0.007 -0.007 0.023 880s 0.908 0.266 -0.254 0.891 880s 0.754 0.221 -0.211 0.741 880s 0.495 0.145 -0.138 0.486 880s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 1, 1, 1] 880s 0.843 880s 880s x1: [1, 4, 1, 1] 880s -0.631 880s 0.438 880s 0.275 880s -0.845 880s 880s y: [1, 4, 1, 1] 880s -0.532 880s 0.369 880s 0.232 880s -0.712 880s testing: mul_mat, [2, 1, 3, 1] = [1, 2, 3, 1] * [1, 1, 3, 1] 880s x0: [1, 2, 3, 1] 880s 0.298 880s -0.767 880s 880s x1: [1, 1, 3, 1] 880s 0.179 880s 880s y: [2, 1, 3, 1] 880s 0.053 -0.137 880s testing: mul_mat, [2, 1, 3, 1] = [2, 2, 3, 1] * [2, 1, 3, 1] 880s x0: [2, 2, 3, 1] 880s -0.623 -0.048 880s -0.234 -0.764 880s 880s x1: [2, 1, 3, 1] 880s -0.039 -0.678 880s 880s y: [2, 1, 3, 1] 880s 0.057 0.528 880s test-mul-mat0: iter:213/500 880s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 1, 1, 1] 880s -0.802 -0.048 0.850 0.158 880s 880s x1: [4, 1, 1, 1] 880s 0.459 -0.576 0.611 0.691 880s 880s y: [1, 1, 1, 1] 880s 0.289 880s testing: mul_mat, [4, 1, 3, 1] = [4, 4, 3, 1] * [4, 1, 3, 1] 880s x0: [4, 4, 3, 1] 880s -0.981 -0.586 -0.927 -0.974 880s -0.023 -0.228 0.875 -0.281 880s 0.085 0.334 0.144 -0.304 880s 0.025 0.178 -0.106 0.977 880s 880s x1: [4, 1, 3, 1] 880s 0.273 -0.056 -0.896 -0.116 880s 880s y: [4, 1, 3, 1] 880s 0.710 -0.746 -0.089 -0.022 880s testing: mul_mat, [2, 4, 3, 4] = [4, 2, 3, 4] * [4, 4, 3, 4] 880s x0: [4, 2, 3, 4] 880s -0.238 -0.790 0.486 -0.983 880s -0.534 0.925 -0.122 -0.460 880s 880s x1: [4, 4, 3, 4] 880s 0.529 0.056 -0.424 0.685 880s 0.190 0.310 -0.144 -0.416 880s 0.571 0.994 0.476 0.120 880s 0.139 0.433 -0.312 -0.302 880s 880s y: [2, 4, 3, 4] 880s -1.049 -0.494 880s 0.048 0.394 880s -0.808 0.501 880s -0.230 0.503 880s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 4, 1, 1] 880s 0.921 -0.819 -0.274 -0.921 880s -0.668 -0.887 -0.825 0.832 880s -0.994 0.245 -0.037 -0.433 880s -0.185 -0.316 -0.599 -0.528 880s 880s x1: [4, 2, 1, 1] 880s 0.500 -0.528 0.876 -0.007 880s -0.778 -0.482 0.425 -0.191 880s 880s y: [4, 2, 1, 1] 880s 0.659 -0.593 -0.657 -0.447 880s -0.262 0.437 0.722 0.143 880s testing: mul_mat, [1, 4, 3, 1] = [4, 1, 3, 1] * [4, 4, 3, 1] 880s x0: [4, 1, 3, 1] 880s -0.793 -0.414 0.262 0.915 880s 880s x1: [4, 4, 3, 1] 880s 0.718 -0.487 -0.374 0.808 880s 0.740 0.300 0.309 -0.788 880s 0.176 -0.698 -0.566 0.694 880s 0.727 0.243 0.303 0.647 880s 880s y: [1, 4, 3, 1] 880s 0.274 880s -1.351 880s 0.636 880s -0.005 880s testing: mul_mat, [4, 4, 3, 4] = [1, 4, 3, 4] * [1, 4, 3, 4] 880s x0: [1, 4, 3, 4] 880s 0.601 880s -0.873 880s 0.514 880s 0.618 880s 880s x1: [1, 4, 3, 4] 880s -0.806 880s -0.386 880s -0.301 880s 0.254 880s 880s y: [4, 4, 3, 4] 880s -0.485 0.704 -0.415 -0.499 880s -0.232 0.337 -0.199 -0.239 880s -0.181 0.263 -0.155 -0.186 880s 0.153 -0.222 0.131 0.157 880s test-mul-mat0: iter:214/500 880s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 2, 1, 1] 880s -0.173 0.680 0.664 880s 0.913 0.731 0.322 880s 880s x1: [3, 3, 1, 1] 880s -0.137 -0.528 0.761 880s 0.512 0.493 0.518 880s 0.602 0.359 0.536 880s 880s y: [2, 3, 1, 1] 880s 0.171 -0.266 880s 0.590 0.995 880s 0.496 0.985 880s testing: mul_mat, [3, 2, 1, 1] = [3, 3, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 3, 1, 1] 880s -0.667 -0.402 0.527 880s -0.820 -0.256 -0.735 880s 0.564 0.352 0.128 880s 880s x1: [3, 2, 1, 1] 880s -0.956 0.267 0.632 880s -0.397 0.624 -0.962 880s 880s y: [3, 2, 1, 1] 880s 0.863 0.251 -0.365 880s -0.494 0.872 -0.127 880s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.847 -0.218 0.627 880s 0.578 -0.896 0.671 880s 880s x1: [3, 3, 1, 1] 880s -0.964 0.113 -0.360 880s 0.529 -0.369 -0.758 880s -0.112 -0.833 0.118 880s 880s y: [2, 3, 1, 1] 880s -1.066 -0.900 880s 0.053 0.127 880s 0.161 0.761 880s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 1, 1, 1] 880s -0.628 0.848 -0.018 880s 880s x1: [3, 2, 1, 1] 880s -0.155 0.736 -0.725 880s -0.531 0.774 -0.679 880s 880s y: [1, 2, 1, 1] 880s 0.735 880s 1.002 880s testing: mul_mat, [2, 3, 1, 1] = [1, 2, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 2, 1, 1] 880s 0.241 880s -0.418 880s 880s x1: [1, 3, 1, 1] 880s 0.116 880s -0.887 880s -0.454 880s 880s y: [2, 3, 1, 1] 880s 0.028 -0.048 880s -0.214 0.370 880s -0.109 0.189 880s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 3, 1, 1] 880s -0.788 -0.622 880s -0.877 0.241 880s -0.001 -0.286 880s 880s x1: [2, 1, 1, 1] 880s 0.581 0.881 880s 880s y: [3, 1, 1, 1] 880s -1.005 -0.297 -0.253 880s test-mul-mat0: iter:215/500 880s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 4, 1, 1] 880s 0.863 -0.714 880s 0.116 0.638 880s -0.393 0.918 880s -0.990 -0.544 880s 880s x1: [2, 1, 1, 1] 880s -0.873 -0.988 880s 880s y: [4, 1, 1, 1] 880s -0.049 -0.731 -0.565 1.403 880s testing: mul_mat, [1, 4, 2, 1] = [2, 1, 2, 1] * [2, 4, 2, 1] 880s x0: [2, 1, 2, 1] 880s 0.077 -0.806 880s 880s x1: [2, 4, 2, 1] 880s -0.100 0.125 880s -0.431 0.447 880s 0.593 0.810 880s -0.971 0.173 880s 880s y: [1, 4, 2, 1] 880s -0.108 880s -0.393 880s -0.607 880s -0.214 880s testing: mul_mat, [2, 1, 2, 3] = [2, 2, 2, 3] * [2, 1, 2, 3] 880s x0: [2, 2, 2, 3] 880s 0.820 0.432 880s 0.933 0.849 880s 880s x1: [2, 1, 2, 3] 880s -0.794 -0.562 880s 880s y: [2, 1, 2, 3] 880s -0.894 -1.218 880s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.096 0.191 880s -0.349 -0.472 880s 880s x1: [2, 2, 1, 1] 880s -0.028 -0.002 880s 0.257 -0.473 880s 880s y: [2, 2, 1, 1] 880s -0.003 0.011 880s -0.065 0.133 880s testing: mul_mat, [2, 2, 2, 1] = [2, 2, 2, 1] * [2, 2, 2, 1] 880s x0: [2, 2, 2, 1] 880s -0.864 -0.381 880s 0.139 0.703 880s 880s x1: [2, 2, 2, 1] 880s -0.416 0.040 880s 0.133 -0.792 880s 880s y: [2, 2, 2, 1] 880s 0.344 -0.030 880s 0.187 -0.539 880s testing: mul_mat, [3, 2, 2, 3] = [2, 3, 2, 3] * [2, 2, 2, 3] 880s x0: [2, 3, 2, 3] 880s 0.490 0.641 880s 0.994 -0.532 880s -0.112 -0.417 880s 880s x1: [2, 2, 2, 3] 880s -0.877 -0.687 880s 0.568 0.648 880s 880s y: [3, 2, 2, 3] 880s -0.870 -0.506 0.384 880s 0.694 0.219 -0.334 880s test-mul-mat0: iter:216/500 880s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 4, 1, 1] 880s -0.413 -0.569 -0.495 -0.851 880s -0.128 -0.249 -0.418 0.604 880s 0.571 -0.495 -0.670 0.279 880s -0.931 0.524 0.570 0.397 880s 880s x1: [4, 3, 1, 1] 880s -0.260 0.612 0.133 -0.201 880s 0.362 -0.109 -0.124 -0.678 880s -0.634 -0.839 0.019 -0.856 880s 880s y: [4, 3, 1, 1] 880s -0.136 -0.296 -0.596 0.558 880s 0.551 -0.377 0.155 -0.735 880s 1.459 -0.236 -0.199 -0.179 880s testing: mul_mat, [3, 4, 4, 1] = [4, 3, 4, 1] * [4, 4, 4, 1] 880s x0: [4, 3, 4, 1] 880s 0.573 -0.788 -0.053 0.961 880s -0.248 -0.062 0.144 0.693 880s -0.096 -0.859 0.180 -0.463 880s 880s x1: [4, 4, 4, 1] 880s 0.499 -0.987 -0.862 0.111 880s 0.145 -0.063 -0.527 -0.964 880s 0.813 -0.205 -0.598 0.974 880s 0.814 -0.455 0.668 -0.600 880s 880s y: [3, 4, 4, 1] 880s 1.216 -0.111 0.593 880s -0.766 -0.777 0.392 880s 1.595 0.399 -0.461 880s 0.213 -0.493 0.710 880s testing: mul_mat, [1, 3, 4, 3] = [4, 1, 4, 3] * [4, 3, 4, 3] 880s x0: [4, 1, 4, 3] 880s -0.576 0.777 -0.157 -0.466 880s 880s x1: [4, 3, 4, 3] 880s -0.254 -0.816 -0.478 0.051 880s 0.465 0.788 -0.640 0.168 880s 0.907 -0.982 -0.601 0.683 880s 880s y: [1, 3, 4, 3] 880s -0.436 880s 0.366 880s -1.509 880s testing: mul_mat, [3, 1, 1, 1] = [4, 3, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 3, 1, 1] 880s -0.418 -0.240 0.621 -0.093 880s 0.186 0.840 -0.245 0.448 880s -0.530 0.233 -0.826 0.959 880s 880s x1: [4, 1, 1, 1] 880s -0.077 -0.514 0.051 0.614 880s 880s y: [3, 1, 1, 1] 880s 0.131 -0.183 0.468 880s testing: mul_mat, [2, 4, 4, 1] = [3, 2, 4, 1] * [3, 4, 4, 1] 880s x0: [3, 2, 4, 1] 880s 0.555 -0.056 0.905 880s -0.727 -0.488 -0.266 880s 880s x1: [3, 4, 4, 1] 880s -0.273 -0.538 -0.319 880s 0.594 0.080 -0.824 880s -0.376 -0.212 0.671 880s -0.565 0.547 -0.772 880s 880s y: [2, 4, 4, 1] 880s -0.410 0.546 880s -0.421 -0.252 880s 0.410 0.199 880s -1.043 0.349 880s testing: mul_mat, [4, 3, 4, 3] = [2, 4, 4, 3] * [2, 3, 4, 3] 880s x0: [2, 4, 4, 3] 880s -0.332 -0.423 880s -0.211 0.038 880s -0.529 0.716 880s 0.882 0.322 880s 880s x1: [2, 3, 4, 3] 880s 0.502 0.394 880s -0.433 0.731 880s -0.538 0.967 880s 880s y: [4, 3, 4, 3] 880s -0.333 -0.091 0.017 0.570 880s -0.166 0.119 0.753 -0.147 880s -0.231 0.151 0.978 -0.164 880s test-mul-mat0: iter:217/500 880s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 1, 1, 1] 880s 0.879 0.051 0.085 880s 880s x1: [3, 1, 1, 1] 880s -0.835 -0.849 0.527 880s 880s y: [1, 1, 1, 1] 880s -0.732 880s testing: mul_mat, [1, 1, 4, 1] = [3, 1, 4, 1] * [3, 1, 4, 1] 880s x0: [3, 1, 4, 1] 880s 0.835 -0.787 -0.014 880s 880s x1: [3, 1, 4, 1] 880s 0.960 0.960 0.814 880s 880s y: [1, 1, 4, 1] 880s 0.035 880s testing: mul_mat, [2, 1, 4, 1] = [3, 2, 4, 1] * [3, 1, 4, 1] 880s x0: [3, 2, 4, 1] 880s -0.779 0.905 0.667 880s -0.972 0.342 -0.950 880s 880s x1: [3, 1, 4, 1] 880s 0.731 -0.903 -0.096 880s 880s y: [2, 1, 4, 1] 880s -1.451 -0.928 880s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 2, 1, 1] 880s -0.332 -0.691 0.723 880s -0.620 -0.304 -0.641 880s 880s x1: [3, 2, 1, 1] 880s 0.870 -0.069 -0.341 880s 0.505 -0.553 -0.525 880s 880s y: [2, 2, 1, 1] 880s -0.488 -0.300 880s -0.165 0.191 880s testing: mul_mat, [4, 3, 4, 1] = [2, 4, 4, 1] * [2, 3, 4, 1] 880s x0: [2, 4, 4, 1] 880s 0.205 -0.657 880s -0.379 0.548 880s 0.189 -0.539 880s 0.901 0.621 880s 880s x1: [2, 3, 4, 1] 880s -0.440 0.278 880s 0.395 0.529 880s -0.929 0.480 880s 880s y: [4, 3, 4, 1] 880s -0.273 0.319 -0.233 -0.224 880s -0.266 0.140 -0.211 0.685 880s -0.505 0.615 -0.434 -0.540 880s testing: mul_mat, [1, 2, 4, 1] = [4, 1, 4, 1] * [4, 2, 4, 1] 880s x0: [4, 1, 4, 1] 880s 0.746 0.001 -0.533 -0.135 880s 880s x1: [4, 2, 4, 1] 880s 0.214 0.235 -0.342 0.557 880s -0.217 0.119 0.179 0.726 880s 880s y: [1, 2, 4, 1] 880s 0.267 880s -0.356 880s test-mul-mat0: iter:218/500 880s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 1, 1, 1] 880s -0.850 880s 880s x1: [1, 1, 1, 1] 880s 0.537 880s 880s y: [1, 1, 1, 1] 880s -0.456 880s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 4, 1, 1] 880s -0.518 880s 0.404 880s 0.274 880s 0.570 880s 880s x1: [1, 1, 1, 1] 880s 0.174 880s 880s y: [4, 1, 1, 1] 880s -0.090 0.070 0.048 0.099 880s testing: mul_mat, [2, 4, 1, 2] = [1, 2, 1, 2] * [1, 4, 1, 2] 880s x0: [1, 2, 1, 2] 880s 0.079 880s 0.960 880s 880s x1: [1, 4, 1, 2] 880s -0.152 880s -0.041 880s 0.316 880s 0.849 880s 880s y: [2, 4, 1, 2] 880s -0.012 -0.146 880s -0.003 -0.040 880s 0.025 0.303 880s 0.067 0.815 880s testing: mul_mat, [3, 2, 1, 1] = [1, 3, 1, 1] * [1, 2, 1, 1] 880s x0: [1, 3, 1, 1] 880s 0.846 880s 0.773 880s 0.144 880s 880s x1: [1, 2, 1, 1] 880s -0.707 880s -0.102 880s 880s y: [3, 2, 1, 1] 880s -0.598 -0.546 -0.102 880s -0.086 -0.079 -0.015 880s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 4, 1, 1] 880s 0.378 0.147 0.061 880s -0.335 -0.787 -0.564 880s -0.191 0.477 -0.968 880s -0.798 -0.283 -0.090 880s 880s x1: [3, 1, 1, 1] 880s -0.902 -0.044 -0.485 880s 880s y: [4, 1, 1, 1] 880s -0.376 0.610 0.621 0.776 880s testing: mul_mat, [3, 3, 1, 2] = [4, 3, 1, 2] * [4, 3, 1, 2] 880s x0: [4, 3, 1, 2] 880s 0.186 0.622 0.483 -0.037 880s -0.485 -0.453 -0.240 0.126 880s 0.393 -0.698 0.241 0.258 880s 880s x1: [4, 3, 1, 2] 880s 0.861 -0.787 -0.062 0.480 880s -0.634 -0.983 0.439 -0.498 880s -0.211 0.732 0.400 -0.673 880s 880s y: [3, 3, 1, 2] 880s -0.377 0.014 0.996 880s -0.498 0.584 0.414 880s 0.634 -0.410 -0.671 880s test-mul-mat0: iter:219/500 880s testing: mul_mat, [4, 4, 1, 1] = [4, 4, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 4, 1, 1] 880s 0.393 0.337 0.882 0.766 880s 0.050 -0.917 0.610 -0.562 880s -0.586 0.439 0.138 -0.748 880s 0.919 0.957 0.799 -0.803 880s 880s x1: [4, 4, 1, 1] 880s 0.547 -0.722 0.152 -0.267 880s -0.208 -0.455 -0.645 0.339 ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s 880s -0.153 0.839 -0.901 -0.912 880s -0.199 0.225 0.346 0.203 880s 880s y: [4, 4, 1, 1] 880s -0.099 0.932 -0.418 0.147 880s -0.544 -0.177 -0.421 -1.415 880s -1.271 -0.813 1.016 0.674 880s 0.459 -0.119 0.111 0.146 880s testing: mul_mat, [1, 4, 1, 1] = [4, 1, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 1, 1, 1] 880s -0.612 0.093 0.118 -0.529 880s 880s x1: [4, 4, 1, 1] 880s 0.108 -0.467 -0.011 0.653 880s -0.112 -0.672 -0.500 -0.273 880s -0.573 -0.412 0.528 0.652 880s 0.934 -0.269 -0.494 0.327 880s 880s y: [1, 4, 1, 1] 880s -0.457 880s 0.091 880s 0.030 880s -0.828 880s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 2, 1, 1] 880s 0.138 -0.938 -0.349 -0.063 880s -0.741 0.759 0.470 0.248 880s 880s x1: [4, 1, 1, 1] 880s -0.297 0.556 -0.115 -0.857 880s 880s y: [2, 1, 1, 1] 880s -0.469 0.376 880s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 4, 1, 1] 880s -0.565 0.413 0.021 0.701 880s 0.345 -0.450 0.558 -0.218 880s -0.838 -0.145 -0.912 0.044 880s 0.762 -0.468 0.907 0.763 880s 880s x1: [4, 2, 1, 1] 880s 0.412 -0.642 0.576 0.913 880s 0.085 -0.996 -0.499 -0.387 880s 880s y: [4, 2, 1, 1] 880s 0.154 0.554 -0.738 1.834 880s -0.742 0.283 0.511 -0.217 880s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 2, 1, 1] 880s -0.331 -0.734 -0.931 -0.759 880s 0.214 0.219 -0.202 0.628 880s 880s x1: [4, 4, 1, 1] 880s 0.433 0.982 -0.978 0.192 880s 0.452 0.270 -0.396 0.810 880s -0.153 -0.483 -0.105 -0.149 880s 0.018 0.508 -0.493 0.452 880s 880s y: [2, 4, 1, 1] 880s -0.099 0.626 880s -0.594 0.744 880s 0.616 -0.211 880s -0.263 0.499 880s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 1, 1, 1] 880s -0.707 -0.375 880s 880s x1: [2, 4, 1, 1] 880s -0.114 -0.853 880s 0.329 0.668 880s 0.192 0.092 880s 0.101 0.173 880s 880s y: [1, 4, 1, 1] 880s 0.400 880s -0.483 880s -0.170 880s -0.136 880s test-mul-mat0: iter:220/500 880s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 4, 1, 1] 880s -0.244 -0.281 -0.205 0.642 880s -0.134 -0.876 0.310 -0.942 880s 0.216 -0.589 0.231 -0.670 880s -0.296 0.856 -0.285 0.601 880s 880s x1: [4, 3, 1, 1] 880s 0.414 0.330 -0.917 -0.568 880s -0.162 -0.410 0.884 0.691 880s 0.259 0.098 0.957 -0.522 880s 880s y: [4, 3, 1, 1] 880s -0.371 -0.093 0.064 0.079 880s 0.417 0.004 -0.052 -0.140 880s -0.622 0.668 0.569 -0.579 880s testing: mul_mat, [1, 4, 3, 1] = [4, 1, 3, 1] * [4, 4, 3, 1] 880s x0: [4, 1, 3, 1] 880s 0.076 -0.375 0.254 -0.110 880s 880s x1: [4, 4, 3, 1] 880s -0.710 0.947 0.015 0.620 880s -0.970 0.447 -0.542 -0.381 880s 0.331 -0.851 0.878 -0.571 880s -0.894 -0.643 0.596 -0.138 880s 880s y: [1, 4, 3, 1] 880s -0.473 880s -0.337 880s 0.629 880s 0.339 880s testing: mul_mat, [2, 1, 3, 3] = [4, 2, 3, 3] * [4, 1, 3, 3] 880s x0: [4, 2, 3, 3] 880s 0.585 -0.336 -0.105 0.019 880s 0.036 -0.180 0.203 0.119 880s 880s x1: [4, 1, 3, 3] 880s -0.426 -0.682 -0.454 -0.881 880s 880s y: [2, 1, 3, 3] 880s 0.011 -0.089 880s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 4, 1, 1] 880s 0.382 0.054 -0.931 0.001 880s -0.637 -0.974 0.820 0.716 880s -0.123 -0.306 0.451 -0.601 880s -0.960 -0.215 0.228 0.257 880s 880s x1: [4, 2, 1, 1] 880s -0.820 -0.064 0.564 0.210 880s -0.904 -0.888 -0.634 -0.869 880s 880s y: [4, 2, 1, 1] 880s -0.842 1.198 0.249 0.984 880s 0.197 0.299 0.619 0.691 880s testing: mul_mat, [2, 4, 3, 1] = [4, 2, 3, 1] * [4, 4, 3, 1] 880s x0: [4, 2, 3, 1] 880s 0.980 -0.828 -0.585 -0.922 880s 0.465 0.119 -0.145 -0.183 880s 880s x1: [4, 4, 3, 1] 880s -0.385 -0.599 0.790 0.268 880s 0.586 -0.663 0.448 -0.478 880s 0.901 -0.342 -0.382 -0.987 880s 0.023 -0.251 0.711 -0.595 880s 880s y: [2, 4, 3, 1] 880s -0.590 -0.414 880s 1.302 0.216 880s 2.299 0.614 880s 0.363 -0.013 880s testing: mul_mat, [1, 4, 3, 3] = [2, 1, 3, 3] * [2, 4, 3, 3] 880s x0: [2, 1, 3, 3] 880s 0.182 -0.976 880s 880s x1: [2, 4, 3, 3] 880s -0.892 -0.651 880s -0.472 0.077 880s 0.884 0.827 880s -0.366 0.864 880s 880s y: [1, 4, 3, 3] 880s 0.473 880s -0.161 880s -0.646 880s -0.909 880s test-mul-mat0: iter:221/500 880s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 3, 1, 1] 880s -0.750 880s -0.164 880s -0.453 880s 880s x1: [1, 4, 1, 1] 880s -0.330 880s -0.083 880s 0.305 880s 0.623 880s 880s y: [3, 4, 1, 1] 880s 0.247 0.054 0.149 880s 0.062 0.014 0.038 880s -0.229 -0.050 -0.138 880s -0.467 -0.102 -0.282 880s testing: mul_mat, [1, 3, 2, 1] = [1, 1, 2, 1] * [1, 3, 2, 1] 880s x0: [1, 1, 2, 1] 880s 0.315 880s 880s x1: [1, 3, 2, 1] 880s 0.399 880s -0.982 880s -0.428 880s 880s y: [1, 3, 2, 1] 880s 0.126 880s -0.309 880s -0.135 880s testing: mul_mat, [1, 1, 2, 1] = [1, 1, 2, 1] * [1, 1, 2, 1] 880s x0: [1, 1, 2, 1] 880s -0.068 880s 880s x1: [1, 1, 2, 1] 880s -0.906 880s 880s y: [1, 1, 2, 1] 880s 0.062 880s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s [1, 1, 1, 1] 880s x0: [1, 1, 1, 1] 880s -0.106 880s 880s x1: [1, 1, 1, 1] 880s -0.218 880s 880s y: [1, 1, 1, 1] 880s 0.023 880s testing: mul_mat, [2, 1, 2, 1] = [1, 2, 2, 1] * [1, 1, 2, 1] 880s x0: [1, 2, 2, 1] 880s 0.319 880s 0.204 880s 880s x1: [1, 1, 2, 1] 880s -0.369 880s 880s y: [2, 1, 2, 1] 880s -0.118 -0.075 880s testing: mul_mat, [4, 1, 2, 1] = [2, 4, 2, 1] * [2, 1, 2, 1] 880s x0: [2, 4, 2, 1] 880s -0.552 0.122 880s -0.449 -0.540 880s -0.353 -0.842 880s -0.534 0.505 880s 880s x1: [2, 1, 2, 1] 880s -0.879 -0.002 880s 880s y: [4, 1, 2, 1] 880s 0.485 0.396 0.313 0.468 880s test-mul-mat0: iter:222/500 880s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 2, 1, 1] 880s -0.124 880s -0.027 880s 880s x1: [1, 1, 1, 1] 880s 0.769 880s 880s y: [2, 1, 1, 1] 880s -0.096 -0.021 880s testing: mul_mat, [2, 2, 4, 1] = [1, 2, 4, 1] * [1, 2, 4, 1] 880s x0: [1, 2, 4, 1] 880s 0.640 880s 0.391 880s 880s x1: [1, 2, 4, 1] 880s -0.252 880s -0.135 880s 880s y: [2, 2, 4, 1] 880s -0.161 -0.098 880s -0.087 -0.053 880s testing: mul_mat, [4, 2, 4, 1] = [1, 4, 4, 1] * [1, 2, 4, 1] 880s x0: [1, 4, 4, 1] 880s 0.887 880s 0.563 880s -0.443 880s -0.237 880s 880s x1: [1, 2, 4, 1] 880s 0.261 880s -0.697 880s 880s y: [4, 2, 4, 1] 880s 0.232 0.147 -0.116 -0.062 880s -0.618 -0.392 0.309 0.165 880s testing: mul_mat, [4, 4, 1, 1] = [1, 4, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 4, 1, 1] 880s -0.009 880s -0.319 880s 0.243 880s 0.496 880s 880s x1: [1, 4, 1, 1] 880s 0.145 880s 0.785 880s 0.969 880s 0.420 880s 880s y: [4, 4, 1, 1] 880s -0.001 -0.046 0.035 0.072 880s -0.007 -0.251 0.191 0.390 880s -0.009 -0.309 0.235 0.481 880s -0.004 -0.134 0.102 0.208 880s testing: mul_mat, [2, 1, 4, 1] = [4, 2, 4, 1] * [4, 1, 4, 1] 880s x0: [4, 2, 4, 1] 880s -0.487 -0.749 -0.753 0.983 880s 0.828 -0.803 0.396 -0.784 880s 880s x1: [4, 1, 4, 1] 880s -0.915 0.627 -0.947 -0.584 880s 880s y: [2, 1, 4, 1] 880s 0.115 -1.177 880s testing: mul_mat, [2, 4, 4, 1] = [2, 2, 4, 1] * [2, 4, 4, 1] 880s x0: [2, 2, 4, 1] 880s -0.457 0.981 880s -0.512 0.544 880s 880s x1: [2, 4, 4, 1] 880s -0.312 -0.009 880s 0.019 -0.065 880s -0.614 0.001 880s 0.152 0.202 880s 880s y: [2, 4, 4, 1] 880s 0.133 0.155 880s -0.072 -0.045 880s 0.281 0.315 880s 0.129 0.032 880s test-mul-mat0: iter:223/500 880s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 3, 1, 1] 880s -0.036 -0.319 880s -0.333 0.829 880s 0.285 -0.809 880s 880s x1: [2, 1, 1, 1] 880s -0.041 -0.184 880s 880s y: [3, 1, 1, 1] 880s 0.060 -0.139 0.137 880s testing: mul_mat, [1, 3, 2, 1] = [2, 1, 2, 1] * [2, 3, 2, 1] 880s x0: [2, 1, 2, 1] 880s -0.915 -0.260 880s 880s x1: [2, 3, 2, 1] 880s 0.382 -0.121 880s -0.265 0.869 880s -0.140 -0.721 880s 880s y: [1, 3, 2, 1] 880s -0.318 880s 0.016 880s 0.316 880s testing: mul_mat, [3, 1, 2, 3] = [2, 3, 2, 3] * [2, 1, 2, 3] 880s x0: [2, 3, 2, 3] 880s -0.150 -0.328 880s -0.256 0.585 880s -0.458 0.604 880s 880s x1: [2, 1, 2, 3] 880s 0.268 -0.538 880s 880s y: [3, 1, 2, 3] 880s 0.136 -0.383 -0.448 880s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 1, 1, 1] 880s 0.540 0.913 880s 880s x1: [2, 3, 1, 1] 880s -0.303 0.016 880s -0.769 -0.655 880s -0.374 0.788 880s 880s y: [1, 3, 1, 1] 880s -0.149 880s -1.013 880s 0.518 880s testing: mul_mat, [1, 2, 2, 1] = [1, 1, 2, 1] * [1, 2, 2, 1] 880s x0: [1, 1, 2, 1] 880s 0.522 880s 880s x1: [1, 2, 2, 1] 880s 0.573 880s 0.194 880s 880s y: [1, 2, 2, 1] 880s 0.300 880s 0.101 880s testing: mul_mat, [4, 1, 2, 3] = [1, 4, 2, 3] * [1, 1, 2, 3] 880s x0: [1, 4, 2, 3] 880s -0.638 880s -0.669 880s -0.142 880s 0.929 880s 880s x1: [1, 1, 2, 3] 880s -0.880 880s 880s y: [4, 1, 2, 3] 880s 0.561 0.589 0.125 -0.817 880s test-mul-mat0: iter:224/500 880s testing: mul_mat, [3, 3, 1, 1] = [2, 3, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 3, 1, 1] 880s 0.058 0.034 880s 0.698 0.210 880s 0.683 0.794 880s 880s x1: [2, 3, 1, 1] 880s -0.674 -0.243 880s -0.775 -0.312 880s 0.088 0.083 880s 880s y: [3, 3, 1, 1] 880s -0.047 -0.521 -0.653 880s -0.056 -0.606 -0.776 880s 0.008 0.079 0.126 880s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 1, 1, 1] 880s -0.932 -0.763 880s 880s x1: [2, 3, 1, 1] 880s 0.592 -0.324 880s -0.483 0.763 880s 0.585 -0.220 880s 880s y: [1, 3, 1, 1] 880s -0.305 880s -0.132 880s -0.377 880s testing: mul_mat, [1, 1, 1, 3] = [2, 1, 1, 3] * [2, 1, 1, 3] 880s x0: [2, 1, 1, 3] 880s 0.405 0.218 880s 880s x1: [2, 1, 1, 3] 880s 0.418 -0.696 880s 880s y: [1, 1, 1, 3] 880s 0.018 880s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 3, 1, 1] 880s -0.691 0.518 880s 0.561 0.244 880s -0.480 -0.686 880s 880s x1: [2, 1, 1, 1] 880s 0.250 -0.473 880s 880s y: [3, 1, 1, 1] 880s -0.418 0.025 0.204 880s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.573 -0.861 0.656 880s 0.847 0.877 -0.350 880s 880s x1: [3, 2, 1, 1] 880s 0.111 0.920 -0.169 880s -0.126 0.505 0.611 880s 880s y: [2, 2, 1, 1] 880s -0.839 0.959 880s -0.106 0.122 880s testing: mul_mat, [1, 3, 1, 3] = [2, 1, 1, 3] * [2, 3, 1, 3] 880s x0: [2, 1, 1, ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s 3] 880s 0.070 -0.948 880s 880s x1: [2, 3, 1, 3] 880s 0.888 0.157 880s 0.841 0.293 880s -0.625 0.359 880s 880s y: [1, 3, 1, 3] 880s -0.087 880s -0.219 880s -0.384 880s test-mul-mat0: iter:225/500 880s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 4, 1, 1] 880s 0.912 880s -0.677 880s 0.869 880s 0.206 880s 880s x1: [1, 1, 1, 1] 880s 0.166 880s 880s y: [4, 1, 1, 1] 880s 0.151 -0.112 0.144 0.034 880s testing: mul_mat, [2, 4, 2, 1] = [1, 2, 2, 1] * [1, 4, 2, 1] 880s x0: [1, 2, 2, 1] 880s -0.231 880s -0.915 880s 880s x1: [1, 4, 2, 1] 880s -0.302 880s 0.227 880s -0.338 880s -0.483 880s 880s y: [2, 4, 2, 1] 880s 0.070 0.276 880s -0.052 -0.208 880s 0.078 0.310 880s 0.112 0.442 880s testing: mul_mat, [2, 2, 2, 4] = [1, 2, 2, 4] * [1, 2, 2, 4] 880s x0: [1, 2, 2, 4] 880s -0.146 880s -0.284 880s 880s x1: [1, 2, 2, 4] 880s 0.306 880s -0.440 880s 880s y: [2, 2, 2, 4] 880s -0.045 -0.087 880s 0.064 0.125 880s testing: mul_mat, [1, 2, 1, 1] = [1, 1, 1, 1] * [1, 2, 1, 1] 880s x0: [1, 1, 1, 1] 880s 0.338 880s 880s x1: [1, 2, 1, 1] 880s 0.773 880s 0.536 880s 880s y: [1, 2, 1, 1] 880s 0.261 880s 0.181 880s testing: mul_mat, [2, 1, 2, 1] = [1, 2, 2, 1] * [1, 1, 2, 1] 880s x0: [1, 2, 2, 1] 880s 0.086 880s 0.832 880s 880s x1: [1, 1, 2, 1] 880s -0.444 880s 880s y: [2, 1, 2, 1] 880s -0.038 -0.369 880s testing: mul_mat, [4, 1, 2, 4] = [2, 4, 2, 4] * [2, 1, 2, 4] 880s x0: [2, 4, 2, 4] 880s -0.507 -0.054 880s 0.410 0.705 880s 0.884 0.604 880s -0.733 0.502 880s 880s x1: [2, 1, 2, 4] 880s 0.554 -0.566 880s 880s y: [4, 1, 2, 4] 880s -0.250 -0.171 0.148 -0.690 880s test-mul-mat0: iter:226/500 880s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 1, 1, 1] 880s 0.504 880s 880s x1: [1, 4, 1, 1] 880s -0.544 880s -0.404 880s -0.355 880s -0.537 880s 880s y: [1, 4, 1, 1] 880s -0.274 880s -0.204 880s -0.179 880s -0.271 880s testing: mul_mat, [4, 1, 3, 1] = [1, 4, 3, 1] * [1, 1, 3, 1] 880s x0: [1, 4, 3, 1] 880s 0.343 880s -0.948 880s 0.716 880s -0.486 880s 880s x1: [1, 1, 3, 1] 880s 0.910 880s 880s y: [4, 1, 3, 1] 880s 0.312 -0.863 0.652 -0.442 880s testing: mul_mat, [1, 4, 3, 4] = [1, 1, 3, 4] * [1, 4, 3, 4] 880s x0: [1, 1, 3, 4] 880s 0.640 880s 880s x1: [1, 4, 3, 4] 880s -0.123 880s -0.005 880s 0.480 880s -0.768 880s 880s y: [1, 4, 3, 4] 880s -0.079 880s -0.003 880s 0.307 880s -0.492 880s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 2, 1, 1] 880s -0.271 880s -0.808 880s 880s x1: [1, 1, 1, 1] 880s -0.849 880s 880s y: [2, 1, 1, 1] 880s 0.230 0.686 880s testing: mul_mat, [4, 1, 3, 1] = [2, 4, 3, 1] * [2, 1, 3, 1] 880s x0: [2, 4, 3, 1] 880s 0.599 -0.479 880s -0.080 -0.727 880s -0.970 0.649 880s 0.319 -0.081 880s 880s x1: [2, 1, 3, 1] 880s -0.907 0.873 880s 880s y: [4, 1, 3, 1] 880s -0.962 -0.561 1.447 -0.360 880s testing: mul_mat, [1, 2, 3, 4] = [4, 1, 3, 4] * [4, 2, 3, 4] 880s x0: [4, 1, 3, 4] 880s 0.676 0.841 -0.210 -0.176 880s 880s x1: [4, 2, 3, 4] 880s -0.567 0.864 -0.540 -0.440 880s -0.505 -0.016 -0.489 -0.906 880s 880s y: [1, 2, 3, 4] 880s 0.535 880s -0.093 880s test-mul-mat0: iter:227/500 880s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 3, 1, 1] 880s -0.426 0.791 880s 0.633 -0.546 880s -0.064 -0.158 880s 880s x1: [2, 2, 1, 1] 880s -0.533 -0.280 880s 0.054 0.464 880s 880s y: [3, 2, 1, 1] 880s 0.005 -0.184 0.079 880s 0.344 -0.219 -0.077 880s testing: mul_mat, [2, 3, 4, 1] = [2, 2, 4, 1] * [2, 3, 4, 1] 880s x0: [2, 2, 4, 1] 880s -0.278 -0.867 880s -0.109 0.176 880s 880s x1: [2, 3, 4, 1] 880s -0.825 -0.130 880s -0.125 -0.988 880s -0.405 0.918 880s 880s y: [2, 3, 4, 1] 880s 0.342 0.067 880s 0.891 -0.160 880s -0.684 0.206 880s testing: mul_mat, [4, 2, 4, 1] = [2, 4, 4, 1] * [2, 2, 4, 1] 880s x0: [2, 4, 4, 1] 880s 0.517 -0.072 880s -0.993 0.249 880s -0.721 0.945 880s 0.857 0.643 880s 880s x1: [2, 2, 4, 1] 880s -0.163 -0.943 880s -0.166 0.995 880s 880s y: [4, 2, 4, 1] 880s -0.016 -0.073 -0.774 -0.747 880s -0.158 0.412 1.060 0.498 880s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 4, 1, 1] 880s 0.559 0.393 880s -0.409 -0.951 880s -0.173 -0.912 880s 0.406 -0.810 880s 880s x1: [2, 4, 1, 1] 880s 0.172 -0.800 880s 0.422 0.452 880s -0.855 0.279 880s 0.095 -0.375 880s 880s y: [4, 4, 1, 1] 880s -0.218 0.690 0.700 0.718 880s 0.414 -0.602 -0.485 -0.194 880s -0.368 0.085 -0.106 -0.573 880s -0.094 0.318 0.326 0.342 880s testing: mul_mat, [4, 2, 4, 1] = [4, 4, 4, 1] * [4, 2, 4, 1] 880s x0: [4, 4, 4, 1] 880s -0.754 0.063 -0.383 -0.663 880s -0.818 -0.274 -0.810 -0.944 880s 0.492 0.214 0.528 -0.832 880s -0.517 0.199 -0.319 0.937 880s 880s x1: [4, 2, 4, 1] 880s -0.552 -0.459 -0.227 -0.568 880s 0.204 -0.762 -0.119 0.234 880s 880s y: [4, 2, 4, 1] 880s 0.850 1.297 -0.017 -0.265 880s -0.312 -0.083 -0.321 0.000 880s testing: mul_mat, [4, 4, 4, 1] = [4, 4, 4, 1] * [4, 4, 4, 1] 880s x0: [4, 4, 4, 1] 880s 0.949 0.173 0.580 0.859 880s -0.759 -0.699 -0.442 0.779 880s 0.957 0.115 0.545 -0.337 880s -0.666 0.247 -0.784 0.574 880s 880s x1: [4, 4, 4, 1] 880s -0.915 -0.428 0.499 ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s -0.117 880s 0.190 0.607 -0.701 0.997 880s 0.243 -0.604 0.142 -0.065 880s -0.339 -0.776 0.358 0.858 880s 880s y: [4, 4, 4, 1] 880s -0.753 0.682 -0.614 0.046 880s 0.735 0.519 -0.466 1.145 880s 0.152 0.124 0.262 -0.460 880s 0.489 1.310 -0.508 0.246 880s test-mul-mat0: iter:228/500 880s testing: mul_mat, [1, 3, 1, 1] = [1, 1, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 1, 1, 1] 880s -0.034 880s 880s x1: [1, 3, 1, 1] 880s 0.992 880s 0.828 880s 0.575 880s 880s y: [1, 3, 1, 1] 880s -0.034 880s -0.028 880s -0.019 880s testing: mul_mat, [1, 1, 3, 1] = [1, 1, 3, 1] * [1, 1, 3, 1] 880s x0: [1, 1, 3, 1] 880s -0.697 880s 880s x1: [1, 1, 3, 1] 880s -0.960 880s 880s y: [1, 1, 3, 1] 880s 0.669 880s testing: mul_mat, [4, 1, 3, 4] = [1, 4, 3, 4] * [1, 1, 3, 4] 880s x0: [1, 4, 3, 4] 880s -0.659 880s -0.766 880s -0.087 880s 0.884 880s 880s x1: [1, 1, 3, 4] 880s 0.611 880s 880s y: [4, 1, 3, 4] 880s -0.403 -0.468 -0.053 0.541 880s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 2, 1, 1] 880s 0.327 880s 0.578 880s 880s x1: [1, 4, 1, 1] 880s 0.153 880s 0.507 880s -0.715 880s 0.445 880s 880s y: [2, 4, 1, 1] 880s 0.050 0.089 880s 0.166 0.293 880s -0.234 -0.413 880s 0.145 0.257 880s testing: mul_mat, [4, 1, 3, 1] = [2, 4, 3, 1] * [2, 1, 3, 1] 880s x0: [2, 4, 3, 1] 880s 0.815 -0.418 880s 0.776 -0.019 880s 0.132 -0.674 880s -0.868 0.685 880s 880s x1: [2, 1, 3, 1] 880s -0.611 0.289 880s 880s y: [4, 1, 3, 1] 880s -0.619 -0.480 -0.275 0.728 880s testing: mul_mat, [2, 2, 3, 4] = [4, 2, 3, 4] * [4, 2, 3, 4] 880s x0: [4, 2, 3, 4] 880s 0.255 0.642 0.445 0.160 880s 0.424 0.358 0.288 -0.289 880s 880s x1: [4, 2, 3, 4] 880s 0.863 -0.532 -0.778 -0.307 880s 0.850 0.661 -0.783 0.665 880s 880s y: [2, 2, 3, 4] 880s -0.518 0.041 880s 0.399 0.180 880s test-mul-mat0: iter:229/500 880s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 4, 1, 1] 880s 0.934 -0.386 880s 0.326 0.611 880s -0.180 -0.395 880s 0.390 0.372 880s 880s x1: [2, 2, 1, 1] 880s -0.545 0.792 880s 0.682 -0.908 880s 880s y: [4, 2, 1, 1] 880s -0.815 0.306 -0.215 0.083 880s 0.988 -0.332 0.236 -0.072 880s testing: mul_mat, [4, 4, 2, 1] = [2, 4, 2, 1] * [2, 4, 2, 1] 880s x0: [2, 4, 2, 1] 880s -0.592 0.829 880s 0.846 -0.910 880s 0.921 -0.177 880s -0.976 -0.465 880s 880s x1: [2, 4, 2, 1] 880s -0.657 0.045 880s -0.550 -0.186 880s -0.527 0.324 880s 0.914 -0.584 880s 880s y: [4, 4, 2, 1] 880s 0.426 -0.597 -0.613 0.620 880s 0.172 -0.296 -0.473 0.623 880s 0.581 -0.741 -0.543 0.364 880s -1.025 1.305 0.945 -0.620 880s testing: mul_mat, [4, 4, 2, 4] = [2, 4, 2, 4] * [2, 4, 2, 4] 880s x0: [2, 4, 2, 4] 880s 0.260 0.212 880s 0.716 0.011 880s -0.586 0.162 880s -0.407 0.129 880s 880s x1: [2, 4, 2, 4] 880s 0.177 0.911 880s -0.458 -0.499 880s 0.825 -0.042 880s -0.180 0.991 880s 880s y: [4, 4, 2, 4] 880s 0.239 0.137 0.043 0.045 880s -0.225 -0.333 0.188 0.122 880s 0.206 0.590 -0.490 -0.341 880s 0.163 -0.119 0.266 0.201 880s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 4, 1, 1] 880s 0.688 0.879 880s 0.277 -0.235 880s -0.442 0.015 880s -0.228 0.890 880s 880s x1: [2, 4, 1, 1] 880s -0.401 -0.700 880s -0.239 0.934 880s -0.837 0.191 880s -0.372 -0.625 880s 880s y: [4, 4, 1, 1] 880s -0.891 0.054 0.166 -0.532 880s 0.656 -0.285 0.120 0.886 880s -0.408 -0.276 0.373 0.361 880s -0.805 0.044 0.155 -0.471 880s testing: mul_mat, [3, 2, 2, 1] = [4, 3, 2, 1] * [4, 2, 2, 1] 880s x0: [4, 3, 2, 1] 880s -0.416 0.212 -0.100 0.671 880s -0.774 -0.399 0.878 0.757 880s 0.288 0.639 -0.802 -0.037 880s 880s x1: [4, 2, 2, 1] 880s 0.095 0.180 0.350 -0.805 880s -0.596 0.654 0.965 0.302 880s 880s y: [3, 2, 2, 1] 880s -0.577 -0.448 -0.108 880s 0.493 1.277 -0.539 880s testing: mul_mat, [2, 4, 2, 4] = [3, 2, 2, 4] * [3, 4, 2, 4] 880s x0: [3, 2, 2, 4] 880s 0.375 -0.840 -0.091 880s -0.206 0.687 -0.537 880s 880s x1: [3, 4, 2, 4] 880s 0.448 -0.675 -0.284 880s -0.215 -0.025 -0.221 880s -0.118 0.559 0.005 880s -0.830 -0.229 0.606 880s 880s y: [2, 4, 2, 4] 880s 0.761 -0.403 880s -0.040 0.146 880s -0.514 0.406 880s -0.174 -0.312 880s test-mul-mat0: iter:230/500 880s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 2, 1, 1] 880s -0.326 -0.396 0.770 880s -0.445 -0.044 -0.392 880s 880s x1: [3, 4, 1, 1] 880s -0.893 0.448 0.458 880s 0.397 0.096 -0.100 880s 0.135 0.476 0.969 880s 0.752 0.231 0.832 880s 880s y: [2, 4, 1, 1] 880s 0.467 0.197 880s -0.244 -0.141 880s 0.513 -0.461 880s 0.303 -0.671 880s testing: mul_mat, [3, 2, 1, 1] = [3, 3, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.865 -0.504 0.567 880s 0.864 -0.085 0.674 880s 0.312 -0.627 0.071 880s 880s x1: [3, 2, 1, 1] 880s 0.739 0.595 0.015 880s -0.458 -0.488 -0.227 880s 880s y: [3, 2, 1, 1] 880s 0.348 0.599 -0.141 880s -0.279 -0.508 0.147 880s testing: mul_mat, [1, 3, 1, 4] = [3, 1, 1, 4] * [3, 3, 1, 4] 880s x0: [3, 1, 1, 4] 880s 0.188 -0.975 -0.655 880s 880s x1: [3, 3, ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s 1, 4] 880s -0.592 0.273 -0.795 880s 0.884 0.242 0.957 880s 0.115 0.074 -0.514 880s 880s y: [1, 3, 1, 4] 880s 0.143 880s -0.697 880s 0.286 880s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 4, 1, 1] 880s 0.915 0.204 -0.317 880s 0.142 -0.013 -0.642 880s -0.638 -0.832 0.348 880s -0.615 0.639 -0.416 880s 880s x1: [3, 1, 1, 1] 880s -0.973 0.308 0.995 880s 880s y: [4, 1, 1, 1] 880s -1.143 -0.781 0.710 0.382 880s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 4, 1, 1] 880s -0.069 -0.810 0.120 -0.293 880s -0.805 -0.242 -0.461 -0.825 880s 0.346 -0.846 -0.464 -0.210 880s -0.760 0.332 -0.893 0.065 880s 880s x1: [4, 3, 1, 1] 880s -0.568 -0.701 -0.558 0.620 880s -0.676 -0.213 -0.403 -0.973 880s 0.388 0.917 -0.635 -0.148 880s 880s y: [4, 3, 1, 1] 880s 0.358 0.372 0.524 0.738 880s 0.456 1.584 0.338 0.741 880s -0.802 -0.120 -0.315 0.567 880s testing: mul_mat, [1, 4, 1, 4] = [4, 1, 1, 4] * [4, 4, 1, 4] 880s x0: [4, 1, 1, 4] 880s -0.696 -0.343 -0.339 0.068 880s 880s x1: [4, 4, 1, 4] 880s 0.523 0.373 0.496 0.822 880s 0.816 0.117 -0.854 -0.397 880s 0.714 -0.826 0.991 0.630 880s -0.462 -0.157 -0.050 0.469 880s 880s y: [1, 4, 1, 4] 880s -0.605 880s -0.345 880s -0.506 880s 0.424 880s test-mul-mat0: iter:231/500 880s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 3, 1, 1] 880s 0.284 -0.135 -0.803 -0.555 880s -0.130 -0.430 0.738 -0.649 880s -0.362 -0.889 -0.225 0.442 880s 880s x1: [4, 3, 1, 1] 880s -0.215 -0.484 -0.086 0.670 880s -0.455 0.806 0.624 -0.251 880s 0.493 -0.978 -0.377 0.798 880s 880s y: [3, 3, 1, 1] 880s -0.298 -0.262 0.824 880s -0.599 0.336 -0.804 880s 0.132 -0.439 1.129 880s testing: mul_mat, [2, 3, 4, 1] = [4, 2, 4, 1] * [4, 3, 4, 1] 880s x0: [4, 2, 4, 1] 880s 0.778 -0.533 0.640 -0.202 880s -0.921 -0.109 -0.613 0.429 880s 880s x1: [4, 3, 4, 1] 880s 0.861 0.854 -0.789 -0.958 880s 0.450 0.504 -0.172 0.966 880s -0.582 -0.502 -0.489 -0.776 880s 880s y: [2, 3, 4, 1] 880s -0.096 -0.813 880s -0.224 0.051 880s -0.341 0.558 880s testing: mul_mat, [1, 2, 4, 2] = [4, 1, 4, 2] * [4, 2, 4, 2] 880s x0: [4, 1, 4, 2] 880s 0.960 0.992 -0.161 0.349 880s 880s x1: [4, 2, 4, 2] 880s 0.773 -0.980 -0.197 0.852 880s -0.088 0.189 0.281 0.405 880s 880s y: [1, 2, 4, 2] 880s 0.099 880s 0.199 880s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 4, 1, 1] 880s -0.343 -0.229 -0.579 -0.149 880s 0.758 0.063 -0.353 0.785 880s 0.333 -0.738 0.904 0.998 880s 0.508 0.957 -0.443 -0.665 880s 880s x1: [4, 1, 1, 1] 880s -0.597 0.102 0.748 -0.240 880s 880s y: [4, 1, 1, 1] 880s -0.216 -0.898 0.163 -0.377 880s testing: mul_mat, [4, 4, 4, 1] = [4, 4, 4, 1] * [4, 4, 4, 1] 880s x0: [4, 4, 4, 1] 880s -0.616 -0.168 0.049 0.352 880s -0.581 -0.257 0.132 0.633 880s 0.698 0.517 -0.820 0.062 880s 0.850 0.833 -0.956 0.686 880s 880s x1: [4, 4, 4, 1] 880s -0.223 -0.606 -0.215 0.364 880s 0.773 0.894 0.803 -0.816 880s -0.843 -0.799 -0.413 0.260 880s 0.948 0.347 -0.310 -0.395 880s 880s y: [4, 4, 4, 1] 880s 0.356 0.487 -0.270 -0.239 880s -0.874 -1.089 0.293 0.075 880s 0.724 0.804 -0.646 -0.809 880s -0.796 -0.931 1.071 1.121 880s testing: mul_mat, [2, 4, 4, 2] = [4, 2, 4, 2] * [4, 4, 4, 2] 880s x0: [4, 2, 4, 2] 880s 0.324 0.954 -0.524 0.725 880s -0.855 0.815 0.925 0.878 880s 880s x1: [4, 4, 4, 2] 880s -0.697 0.931 -0.689 0.103 880s 0.847 0.768 -0.049 -0.105 880s 0.405 -0.157 0.294 -0.325 880s 0.798 0.380 -0.094 0.917 880s 880s y: [2, 4, 4, 2] 880s 1.098 0.808 880s 0.956 -0.237 880s -0.408 -0.487 880s 1.334 0.346 880s test-mul-mat0: iter:232/500 880s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 2, 1, 1] 880s -0.238 880s -0.909 880s 880s x1: [1, 1, 1, 1] 880s 0.430 880s 880s y: [2, 1, 1, 1] 880s -0.102 -0.391 880s testing: mul_mat, [2, 2, 3, 1] = [1, 2, 3, 1] * [1, 2, 3, 1] 880s x0: [1, 2, 3, 1] 880s -0.997 880s 0.764 880s 880s x1: [1, 2, 3, 1] 880s 0.388 880s -0.810 880s 880s y: [2, 2, 3, 1] 880s -0.387 0.297 880s 0.808 -0.619 880s testing: mul_mat, [4, 2, 3, 4] = [1, 4, 3, 4] * [1, 2, 3, 4] 880s x0: [1, 4, 3, 4] 880s 0.292 880s -0.566 880s 0.431 880s -0.388 880s 880s x1: [1, 2, 3, 4] 880s -0.218 880s 0.420 880s 880s y: [4, 2, 3, 4] 880s -0.064 0.123 -0.094 0.085 880s 0.123 -0.238 0.181 -0.163 880s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 3, 1, 1] 880s 0.630 880s -0.418 880s 0.391 880s 880s x1: [1, 4, 1, 1] 880s -0.794 880s 0.398 880s -0.969 880s -0.837 880s 880s y: [3, 4, 1, 1] 880s -0.500 0.332 -0.310 880s 0.251 -0.166 0.156 880s -0.611 0.405 -0.379 880s -0.528 0.350 -0.327 880s testing: mul_mat, [1, 1, 3, 1] = [3, 1, 3, 1] * [3, 1, 3, 1] 880s x0: [3, 1, 3, 1] 880s 0.258 -0.937 -0.432 880s 880s x1: [3, 1, 3, 1] 880s 0.888 -0.411 0.714 880s 880s y: [1, 1, 3, 1] 880s 0.306 880s testing: mul_mat, [2, 3, 3, 4] = [1, 2, 3, 4] *ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s [1, 3, 3, 4] 880s x0: [1, 2, 3, 4] 880s -0.339 880s 0.596 880s 880s x1: [1, 3, 3, 4] 880s -0.816 880s 0.801 880s 0.357 880s 880s y: [2, 3, 3, 4] 880s 0.276 -0.486 880s -0.271 0.477 880s -0.121 0.213 880s test-mul-mat0: iter:233/500 880s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 1, 1, 1] 880s -0.739 0.046 880s 880s x1: [2, 3, 1, 1] 880s 0.385 -0.682 880s -0.967 -0.954 880s 0.914 0.817 880s 880s y: [1, 3, 1, 1] 880s -0.316 880s 0.671 880s -0.638 880s testing: mul_mat, [4, 1, 4, 1] = [2, 4, 4, 1] * [2, 1, 4, 1] 880s x0: [2, 4, 4, 1] 880s -0.425 0.058 880s 0.995 0.730 880s -0.843 0.848 880s 0.444 0.534 880s 880s x1: [2, 1, 4, 1] 880s 0.494 0.950 880s 880s y: [4, 1, 4, 1] 880s -0.154 1.185 0.389 0.726 880s testing: mul_mat, [3, 4, 4, 3] = [2, 3, 4, 3] * [2, 4, 4, 3] 880s x0: [2, 3, 4, 3] 880s -0.383 -0.363 880s -0.594 -0.922 880s -0.892 0.753 880s 880s x1: [2, 4, 4, 3] 880s 0.489 -0.598 880s 0.716 0.646 880s -0.750 0.160 880s 0.180 0.571 880s 880s y: [3, 4, 4, 3] 880s 0.030 0.260 -0.886 880s -0.508 -1.021 -0.152 880s 0.229 0.299 0.789 880s -0.276 -0.633 0.269 880s testing: mul_mat, [4, 3, 1, 1] = [2, 4, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 4, 1, 1] 880s -0.690 -0.254 880s 0.551 -0.322 880s 0.472 0.402 880s -0.832 -0.548 880s 880s x1: [2, 3, 1, 1] 880s 0.156 0.958 880s -0.148 -0.508 880s -0.510 0.466 880s 880s y: [4, 3, 1, 1] 880s -0.351 -0.222 0.459 -0.655 880s 0.231 0.082 -0.274 0.402 880s 0.234 -0.431 -0.053 0.169 880s testing: mul_mat, [3, 2, 4, 1] = [4, 3, 4, 1] * [4, 2, 4, 1] 880s x0: [4, 3, 4, 1] 880s -0.101 0.955 0.685 -0.773 880s 0.860 0.469 -0.903 -0.222 880s -0.407 0.921 0.102 -0.832 880s 880s x1: [4, 2, 4, 1] 880s 0.981 -0.188 -0.235 0.419 880s 0.241 0.056 0.609 0.329 880s 880s y: [3, 2, 4, 1] 880s -0.764 0.875 -0.946 880s 0.192 -0.389 -0.258 880s testing: mul_mat, [2, 4, 4, 3] = [3, 2, 4, 3] * [3, 4, 4, 3] 880s x0: [3, 2, 4, 3] 880s -0.430 -0.418 -0.600 880s -0.318 -0.401 0.205 880s 880s x1: [3, 4, 4, 3] 880s -0.456 -0.014 0.066 880s 0.959 0.223 0.154 880s 0.155 -0.403 -0.075 880s -0.188 -0.489 -0.169 880s 880s y: [2, 4, 4, 3] 880s 0.162 0.164 880s -0.598 -0.363 880s 0.147 0.097 880s 0.387 0.222 880s test-mul-mat0: iter:234/500 880s testing: mul_mat, [3, 4, 1, 1] = [2, 3, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 3, 1, 1] 880s 0.345 -0.436 880s -0.675 0.495 880s 0.102 -0.703 880s 880s x1: [2, 4, 1, 1] 880s -0.647 -0.414 880s -0.227 0.894 880s -0.072 -0.393 880s 0.265 -0.836 880s 880s y: [3, 4, 1, 1] 880s -0.043 0.232 0.225 880s -0.468 0.596 -0.652 880s 0.146 -0.146 0.269 880s 0.455 -0.593 0.615 880s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.861 0.338 880s 0.573 -0.700 880s 880s x1: [2, 3, 1, 1] 880s -0.244 0.339 880s -0.549 0.808 880s 0.857 0.389 880s 880s y: [2, 3, 1, 1] 880s -0.096 -0.377 880s -0.200 -0.880 880s 0.869 0.219 880s testing: mul_mat, [2, 2, 1, 3] = [2, 2, 1, 3] * [2, 2, 1, 3] 880s x0: [2, 2, 1, 3] 880s -0.877 -0.312 880s -0.554 -0.552 880s 880s x1: [2, 2, 1, 3] 880s 0.476 -0.679 880s 0.830 -0.797 880s 880s y: [2, 2, 1, 3] 880s -0.205 0.112 880s -0.479 -0.020 880s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 3, 1, 1] 880s -0.096 -0.266 880s 0.397 0.600 880s 0.692 -0.533 880s 880s x1: [2, 2, 1, 1] 880s -0.416 0.798 880s -0.396 0.922 880s 880s y: [3, 2, 1, 1] 880s -0.172 0.313 -0.713 880s -0.207 0.396 -0.765 880s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.987 0.025 -0.257 880s -0.705 -0.567 0.208 880s 880s x1: [3, 2, 1, 1] 880s -0.083 -0.611 -0.984 880s -0.005 -0.894 -0.828 880s 880s y: [2, 2, 1, 1] 880s 0.156 0.200 880s 0.185 0.339 880s testing: mul_mat, [3, 3, 1, 3] = [2, 3, 1, 3] * [2, 3, 1, 3] 880s x0: [2, 3, 1, 3] 880s -0.425 -0.319 880s -0.763 -0.591 880s 0.765 -0.897 880s 880s x1: [2, 3, 1, 3] 880s 0.980 0.487 880s -0.853 -0.133 880s -0.318 -0.107 880s 880s y: [3, 3, 1, 3] 880s -0.572 -1.036 0.313 880s 0.405 0.730 -0.533 880s 0.169 0.306 -0.147 880s test-mul-mat0: iter:235/500 880s testing: mul_mat, [2, 2, 1, 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 2, 1, 1] 880s 0.418 -0.662 0.279 -0.902 880s -0.253 0.382 -0.234 0.451 880s 880s x1: [4, 2, 1, 1] 880s -0.929 -0.872 -0.109 -0.343 880s 0.881 -0.256 -0.157 -0.899 880s 880s y: [2, 2, 1, 1] 880s 0.468 -0.227 880s 1.304 -0.689 880s testing: mul_mat, [1, 2, 4, 1] = [4, 1, 4, 1] * [4, 2, 4, 1] 880s x0: [4, 1, 4, 1] 880s -0.687 0.392 0.000 -0.855 880s 880s x1: [4, 2, 4, 1] 880s -0.490 0.867 -0.102 -0.154 880s -0.052 0.832 0.329 -0.104 880s 880s y: [1, 2, 4, 1] 880s 0.809 880s 0.452 880s testing: mul_mat, [1, 1, 4, 3] = [4, 1, 4, 3] * [4, 1, 4, 3] 880s x0: [4, 1, 4, 3] 880s -0.662 0.500 0.892 0.738 880s 880s x1: [4, 1, 4, 3] 880s 0.872 0.555 0.290 -0.249 880s 880s y: [1, 1, 4, 3] 880s -0.224 880s testing: mul_mat, [3, 1, 1, 1] = [4, 3, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 3, 1, 1] 880s -0.476 0.107 -0.779 0.673 880s 0.962 0.851 -0.044 0.561 880s 0.011 0.655 0.026 -0.711 880s 880s x1: [4, 1, 1, 1] 880s -0.028 0.828 -0.189 0.726 880s 880s y: [3, 1, 1, 1] 880s 0.737 1.094 0.021 880s testing: mul_mat, [2, 4, 4, 1] = [3, 2, 4, 1] * [3, 4, 4, 1] 880s x0: [3, 2, 4, 1] 880s 0.461 -0.125 -0.418 880s -0.841 0.728 -0.954 880s 880s x1: [3, 4, 4, 1] 880s -0.138 0.375 -0.040 880s 0.114 -0.697 0.129 880s -0.920 0.342 -0.418 880s -0.201 0.361 -0.341 880s 880s y: [2, 4, 4, 1] 880s -0.093 0.428 880s 0.086 -0.727 880s -0.292 1.421 880s 0.005 0.757 880s testing: mul_mat, [1, 3, 4, 3] = [2, 1, 4, 3] * [2, 3, 4, 3] 880s x0: [2, 1, 4, 3] 880s 0.637 0.318 880s 880s x1: [2, 3, 4, 3] 880s 0.836 0.330 880s 0.120 -0.965 880s 0.534 0.963 880s 880s y: [1, 3, 4, 3] 880s 0.638 880s -0.230 880s 0.647 880s test-mul-mat0: iter:236/500 880s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 1, 1, 1] 880s -0.731 0.109 880s 880s x1: [2, 4, 1, 1] 880s -0.130 -0.181 880s -0.578 -0.493 880s -0.864 0.999 880s 0.817 -0.949 880s 880s y: [1, 4, 1, 1] 880s 0.075 880s 0.369 880s 0.740 880s -0.701 880s testing: mul_mat, [1, 1, 2, 1] = [2, 1, 2, 1] * [2, 1, 2, 1] 880s x0: [2, 1, 2, 1] 880s -0.160 -0.340 880s 880s x1: [2, 1, 2, 1] 880s 0.187 -0.946 880s 880s y: [1, 1, 2, 1] 880s 0.292 880s testing: mul_mat, [2, 1, 2, 1] = [2, 2, 2, 1] * [2, 1, 2, 1] 880s x0: [2, 2, 2, 1] 880s -0.650 0.025 880s 0.176 0.326 880s 880s x1: [2, 1, 2, 1] 880s 0.277 0.530 880s 880s y: [2, 1, 2, 1] 880s -0.167 0.221 880s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 3, 1, 1] 880s 0.444 -0.447 880s 0.193 -0.620 880s -0.211 -0.156 880s 880s x1: [2, 2, 1, 1] 880s -0.456 -0.297 880s 0.393 -0.457 880s 880s y: [3, 2, 1, 1] 880s -0.070 0.096 0.142 880s 0.379 0.359 -0.011 880s testing: mul_mat, [2, 2, 2, 1] = [3, 2, 2, 1] * [3, 2, 2, 1] 880s x0: [3, 2, 2, 1] 880s 0.413 -0.876 0.847 880s 0.725 0.851 -0.681 880s 880s x1: [3, 2, 2, 1] 880s 0.337 -0.165 0.219 880s -0.823 0.495 0.039 880s 880s y: [2, 2, 2, 1] 880s 0.470 -0.045 880s -0.741 -0.202 880s testing: mul_mat, [1, 3, 2, 1] = [2, 1, 2, 1] * [2, 3, 2, 1] 880s x0: [2, 1, 2, 1] 880s 0.609 -0.997 880s 880s x1: [2, 3, 2, 1] 880s 0.946 -0.872 880s -0.668 -0.501 880s -0.493 0.176 880s 880s y: [1, 3, 2, 1] 880s 1.446 880s 0.092 880s -0.475 880s test-mul-mat0: iter:237/500 880s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 3, 1, 1] 880s 0.317 -0.914 0.998 0.153 880s 0.428 -0.607 0.167 -0.734 880s 0.827 0.402 0.875 0.830 880s 880s x1: [4, 3, 1, 1] 880s 0.605 0.262 -0.876 0.924 880s -0.630 -0.484 -0.053 -0.128 880s 0.451 -0.510 -0.183 0.579 880s 880s y: [3, 3, 1, 1] 880s -0.780 -0.725 0.607 880s 0.170 0.109 -0.868 880s 0.516 0.048 0.488 880s testing: mul_mat, [2, 3, 3, 1] = [4, 2, 3, 1] * [4, 3, 3, 1] 880s x0: [4, 2, 3, 1] 880s -0.537 0.681 0.224 0.558 880s -0.404 -0.476 0.600 -0.618 880s 880s x1: [4, 3, 3, 1] 880s -0.915 0.872 -0.618 0.566 880s 0.282 -0.341 0.171 -0.456 880s -0.217 0.095 -0.086 0.300 880s 880s y: [2, 3, 3, 1] 880s 1.263 -0.766 880s -0.600 0.433 880s 0.329 -0.194 880s testing: mul_mat, [2, 2, 3, 4] = [4, 2, 3, 4] * [4, 2, 3, 4] 880s x0: [4, 2, 3, 4] 880s -0.919 -0.167 -0.954 0.800 880s -0.370 0.253 0.773 0.238 880s 880s x1: [4, 2, 3, 4] 880s 0.552 0.586 0.809 0.335 880s -0.122 -0.054 0.776 0.341 880s 880s y: [2, 2, 3, 4] 880s -1.109 0.649 880s -0.346 0.712 880s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 4, 1, 1] 880s -0.408 0.781 -0.174 0.695 880s 0.340 -0.206 0.504 0.580 880s -0.939 0.730 0.747 -0.831 880s -0.777 -0.907 -0.340 0.143 880s 880s x1: [4, 2, 1, 1] 880s 0.761 -0.339 -0.207 0.787 880s 0.331 -0.849 -0.475 0.461 880s 880s y: [4, 2, 1, 1] 880s 0.008 0.681 -1.771 -0.101 880s -0.395 0.316 -1.668 0.741 880s testing: mul_mat, [2, 4, 3, 1] = [4, 2, 3, 1] * [4, 4, 3, 1] 880s x0: [4, 2, 3, 1] 880s -0.286 -0.919 -0.002 0.652 880s 0.287 0.431 -0.051 -0.545 880s 880s x1: [4, 4, 3, 1] 880s -0.849 0.853 -0.122 -0.902 880s 0.045 0.978 0.859 0.705 880s -0.230 0.646 0.036 -0.079 880s -0.829 -0.503 0.186 -0.236 880s 880s y: [2, 4, 3, 1] 880s -1.128 0.621 880s -0.453 0.006 880s -0.579 0.253 880s 0.545 -0.335 880s testing: mul_mat, [4, 4, 3, 4] = [2, 4, 3, 4] * [2, 4, 3, 4] 880s x0: [2, 4, 3, 4] 880s 0.088 -0.927 880s 0.743 0.863 880s -0.511 -0.694 880s 0.258 -0.995 880s 880s x1: [2, 4, 3, 4] 880s -0.224 -0.129 880s 0.783 0.569 880s -0.402 0.452 880s 0.609 0.312 880s 880s y: [4, 4, 3, 4] 880s 0.100 -0.278 0.204 0.071 880s -0.459 1.073 -0.794 -0.364 880s -0.454 0.092 -0.108 -0.554 880s -0.235 0.721 -0.527 -0.153 880s test-mul-mat0: iter:238/500 880s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 3, 1, 1] 880s 0.419 0.435 0.938 0.909 880s -0.666 0.873 -0.125 0.975 880s -0.687 -0.842 -0.386 0.318 880s 880s x1: [4, 3, 1, 1] 880s 0.950 -0.985 -0.840 -0.443 880s 0.238 -0.649 0.492 0.568 880s -0.475 -0.255 -0.154 -0.408 880s 880s y: [3, 3, 1, 1] 880s -1.220 -1.821 0.359 880s 0.794 -0.233 0.374 880s -0.824 -0.284 0.470 880s testing: mul_mat, [4, 3, 3, 1] = [4, 4, 3, 1] * [4, 3, 3, 1] 880s x0: [4, 4, 3, 1] 880s 0.197 0.049 0.383 0.781 880s -0.976 -0.497 0.857 -0.433 880s -0.853 -0.321 0.957 -0.617 880s 0.036 -0.234 -0.945 0.330 880s 880s x1: [4, 3, 3, 1] 880s 0.341 -0.006 -0.791 -0.923 880s 0.723 -0.392 -0.972 0.738 880s -0.232 -0.416 -0.024 0.119 880s 880s y: [4, 3, 3, 1] 880s -0.957 -0.609 -0.477 0.457 880s 0.326 -1.663 -1.877 1.280 880s 0.018 0.360 0.234 0.151 880s testing: mul_mat, [2, 4, 3, 2] = [4, 2, 3, 2] * [4, 4, 3, 2] 880s x0: [4, 2, 3, 2] 880s -0.698 -0.599 0.087 -0.139 880s 0.028 -0.915 -0.705 -0.007 880s 880s x1: [4, 4, 3, 2] 880s -0.071 -0.537 -0.546 -0.031 880s -0.560 0.281 0.603 -0.785 880s 0.199 0.825 0.998 -0.714 880s 0.357 -0.605 -0.431 0.938 880s 880s y: [2, 4, 3, 2] 880s 0.328 0.874 880s 0.384 -0.692 880s -0.446 -1.447 880s -0.054 0.861 880s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 4, 1, 1] 880s 0.023 -0.085 -0.349 -0.213 880s -0.785 0.933 0.987 -0.215 880s -0.924 -0.821 -0.424 0.674 880s 0.400 -0.297 0.664 -0.551 880s 880s x1: [4, 2, 1, 1] 880s -0.477 -0.623 -0.748 0.021 880s 0.741 0.587 0.471 0.500 880s 880s y: [4, 2, 1, 1] 880s 0.299 -0.949 1.284 -0.514 880s -0.304 0.322 -1.030 0.160 880s testing: mul_mat, [3, 4, 3, 1] = [4, 3, 3, 1] * [4, 4, 3, 1] 880s x0: [4, 3, 3, 1] 880s 0.603 -0.065 -0.746 -0.640 880s -0.995 -0.373 0.824 0.807 880s 0.804 0.925 0.709 0.167 880s 880s x1: [4, 4, 3, 1] 880s -0.227 -0.174 0.386 0.715 880s 0.171 0.102 -0.762 0.548 880s 0.353 0.259 0.289 -0.060 880s -0.270 -0.211 0.461 0.753 880s 880s y: [3, 4, 3, 1] 880s -0.872 1.186 0.050 880s 0.314 -0.394 -0.217 880s 0.019 -0.258 0.719 880s -0.975 1.335 0.041 880s testing: mul_mat, [2, 4, 3, 2] = [3, 2, 3, 2] * [3, 4, 3, 2] 880s x0: [3, 2, 3, 2] 880s -0.736 -0.106 -0.434 880s -0.018 -0.978 -0.315 880s 880s x1: [3, 4, 3, 2] 880s -0.511 0.648 0.830 880s -0.802 -0.992 0.636 880s 0.366 -0.078 -0.802 880s -0.182 0.777 0.231 880s 880s y: [2, 4, 3, 2] 880s -0.053 -0.886 880s 0.418 0.785 880s 0.088 0.322 880s -0.048 -0.829 880s test-mul-mat0: iter:239/500 880s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.582 0.245 0.929 880s 0.568 0.258 -0.445 880s 0.478 -0.912 -0.896 880s 880s x1: [3, 3, 1, 1] 880s -0.882 0.962 0.753 880s 0.566 0.585 0.429 880s -0.120 -0.839 -0.353 880s 880s y: [3, 3, 1, 1] 880s 0.422 -0.589 -1.974 880s 0.871 0.282 -0.647 880s -0.603 -0.127 1.025 880s testing: mul_mat, [1, 3, 3, 1] = [3, 1, 3, 1] * [3, 3, 3, 1] 880s x0: [3, 1, 3, 1] 880s 0.959 0.127 0.331 880s 880s x1: [3, 3, 3, 1] 880s -0.135 0.954 -0.277 880s -0.483 -0.198 -0.763 880s -0.808 -0.784 -0.564 880s 880s y: [1, 3, 3, 1] 880s -0.099 880s -0.740 880s -1.061 880s testing: mul_mat, [2, 1, 3, 3] = [3, 2, 3, 3] * [3, 1, 3, 3] 880s x0: [3, 2, 3, 3] 880s 0.328 0.236 0.522 880s -0.139 0.336 -0.749 880s 880s x1: [3, 1, 3, 3] 880s 0.533 -0.900 -0.272 880s 880s y: [2, 1, 3, 3] 880s -0.180 -0.173 880s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 4, 1, 1] 880s 0.532 -0.515 0.050 880s -0.436 -0.140 -0.659 880s 0.730 0.531 -0.212 880s -0.594 0.125 0.608 880s 880s x1: [3, 2, 1, 1] 880s -0.827 -0.941 0.773 880s 0.378 -0.820 0.504 880s 880s y: [4, 2, 1, 1] 880s 0.083 -0.016 -1.267 0.843 880s 0.649 -0.382 -0.266 -0.021 880s testing: mul_mat, [4, 3, 3, 1] = [4, 4, 3, 1] * [4, 3, 3, 1] 880s x0: [4, 4, 3, 1] 880s 0.657 0.675 0.388 0.563 880s 0.226 0.068 -0.098 0.089 880s -0.480 -0.823 -0.130 0.601 880s -0.169 -0.818 0.213 0.953 880s 880s x1: [4, 3, 3, 1] 880s -0.267 0.786 0.014 -0.510 880s 0.319 0.077 0.936 0.334 880s -0.479 0.174 -0.552 -0.452 880s 880s y: [4, 3, 3, 1] 880s 0.073 -0.054 -0.826 -1.080 880s 0.813 0.016 -0.138 0.400 880s -0.666 -0.083 -0.113 -0.610 880s testing: mul_mat, [3, 4, 3, 3] = [4, 3, 3, 3] * [4, 4, 3, 3] 880s x0: [4, 3, 3, 3] 880s -0.137 -0.440 -0.840 -0.867 880s -0.347 0.028 -0.297 -0.395 880s -0.743 -0.083 0.506 -0.250 880s 880s x1: [4, 4, 3, 3] 880s -0.403 0.608 -0.645 0.071 880s 0.111 -0.276 -0.668 -0.477 880s -0.676 -0.834 -0.144 0.645 880s 0.892 0.189 -0.378 -0.155 880s 880s y: [3, 4, 3, 3] 880s 0.268 0.321 -0.095 880s 1.082 0.341 -0.279 880s 0.022 -0.001 0.338 880s 0.247 -0.131 -0.831 880s test-mul-mat0: iter:240/500 880s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 2, 1, 1] 880s -0.471 0.535 0.662 0.201 880s -0.532 0.064 -0.958 -0.787 880s 880s x1: [4, 1, 1, 1] 880s -0.141 0.244 0.610 0.887 880s 880s y: [2, 1, 1, 1] 880s 0.779 -1.192 880s testing: mul_mat, [2, 2, 1, ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 2, 1, 1] 880s 0.000 -0.463 0.679 0.162 880s 0.056 -0.646 -0.233 -0.806 880s 880s x1: [4, 2, 1, 1] 880s -0.249 0.728 -0.145 -0.472 880s 0.849 -0.359 -0.093 0.503 880s 880s y: [2, 2, 1, 1] 880s -0.512 -0.070 880s 0.185 -0.104 880s testing: mul_mat, [4, 2, 1, 4] = [4, 4, 1, 4] * [4, 2, 1, 4] 880s x0: [4, 4, 1, 4] 880s -0.366 -0.289 -0.818 0.836 880s 0.796 0.230 -0.300 -0.918 880s -0.254 -0.372 -0.589 -0.526 880s 0.307 -0.533 0.058 -0.485 880s 880s x1: [4, 2, 1, 4] 880s -0.053 0.626 0.439 -0.443 880s 0.513 0.202 0.086 0.048 880s 880s y: [4, 2, 1, 4] 880s -0.891 0.376 -0.245 -0.110 880s -0.276 0.385 -0.281 0.031 880s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 2, 1, 1] 880s -0.901 -0.952 0.814 -0.150 880s -0.659 0.112 0.981 -0.195 880s 880s x1: [4, 4, 1, 1] 880s -0.353 -0.399 0.226 0.792 880s 0.689 0.488 0.684 0.406 880s -0.840 -0.950 -0.570 0.299 880s 0.473 0.969 -0.191 0.179 880s 880s y: [2, 4, 1, 1] 880s 0.763 0.255 880s -0.590 0.192 880s 1.153 -0.171 880s -1.530 -0.425 880s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 4, 1, 1] 880s -0.851 -0.585 880s 0.009 0.387 880s 0.227 0.882 880s 0.833 -0.155 880s 880s x1: [2, 4, 1, 1] 880s -0.021 -0.492 880s 0.468 0.710 880s -0.281 -0.486 880s -0.643 0.320 880s 880s y: [4, 4, 1, 1] 880s 0.306 -0.191 -0.439 0.058 880s -0.813 0.279 0.733 0.280 880s 0.523 -0.191 -0.493 -0.159 880s 0.360 0.118 0.136 -0.585 880s testing: mul_mat, [2, 2, 1, 4] = [4, 2, 1, 4] * [4, 2, 1, 4] 880s x0: [4, 2, 1, 4] 880s 0.187 0.779 0.607 0.136 880s -0.987 -0.863 -0.345 -0.686 880s 880s x1: [4, 2, 1, 4] 880s -0.314 0.356 -0.186 0.496 880s -0.466 -0.050 0.594 -0.125 880s 880s y: [2, 2, 1, 4] 880s 0.173 -0.273 880s 0.218 0.383 880s test-mul-mat0: iter:241/500 880s testing: mul_mat, [3, 1, 1, 1] = [1, 3, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 3, 1, 1] 880s 0.220 880s -0.138 880s -0.590 880s 880s x1: [1, 1, 1, 1] 880s 0.518 880s 880s y: [3, 1, 1, 1] 880s 0.114 -0.071 -0.305 880s testing: mul_mat, [2, 3, 1, 1] = [1, 2, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 2, 1, 1] 880s 0.501 880s -0.774 880s 880s x1: [1, 3, 1, 1] 880s 0.090 880s 0.700 880s -0.516 880s 880s y: [2, 3, 1, 1] 880s 0.045 -0.070 880s 0.351 -0.542 880s -0.258 0.399 880s testing: mul_mat, [3, 2, 1, 2] = [1, 3, 1, 2] * [1, 2, 1, 2] 880s x0: [1, 3, 1, 2] 880s 0.936 880s -0.502 880s -0.792 880s 880s x1: [1, 2, 1, 2] 880s 0.742 880s -0.222 880s 880s y: [3, 2, 1, 2] 880s 0.695 -0.373 -0.588 880s -0.208 0.112 0.176 880s testing: mul_mat, [2, 3, 1, 1] = [1, 2, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 2, 1, 1] 880s -0.182 880s 0.560 880s 880s x1: [1, 3, 1, 1] 880s 0.723 880s 0.796 880s -0.739 880s 880s y: [2, 3, 1, 1] 880s -0.131 0.404 880s -0.145 0.445 880s 0.134 -0.414 880s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.187 0.686 880s -0.728 -0.951 880s 880s x1: [2, 1, 1, 1] 880s 0.669 0.977 880s 880s y: [2, 1, 1, 1] 880s 0.795 -1.417 880s testing: mul_mat, [1, 2, 1, 2] = [2, 1, 1, 2] * [2, 2, 1, 2] 880s x0: [2, 1, 1, 2] 880s 0.387 -0.894 880s 880s x1: [2, 2, 1, 2] 880s -0.318 -0.225 880s 0.749 0.166 880s 880s y: [1, 2, 1, 2] 880s 0.078 880s 0.141 880s test-mul-mat0: iter:242/500 880s testing: mul_mat, [3, 2, 1, 1] = [1, 3, 1, 1] * [1, 2, 1, 1] 880s x0: [1, 3, 1, 1] 880s -0.258 880s 0.735 880s 0.179 880s 880s x1: [1, 2, 1, 1] 880s -0.519 880s -0.071 880s 880s y: [3, 2, 1, 1] 880s 0.134 -0.381 -0.093 880s 0.018 -0.052 -0.013 880s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 3, 1, 1] 880s -0.408 880s 0.428 880s -0.547 880s 880s x1: [1, 3, 1, 1] 880s -0.698 880s 0.405 880s 0.156 880s 880s y: [3, 3, 1, 1] 880s 0.285 -0.299 0.382 880s -0.165 0.173 -0.221 880s -0.064 0.067 -0.085 880s testing: mul_mat, [2, 3, 1, 4] = [1, 2, 1, 4] * [1, 3, 1, 4] 880s x0: [1, 2, 1, 4] 880s 0.378 880s 0.282 880s 880s x1: [1, 3, 1, 4] 880s -0.359 880s -0.890 880s 0.229 880s 880s y: [2, 3, 1, 4] 880s -0.136 -0.101 880s -0.336 -0.251 880s 0.086 0.064 880s testing: mul_mat, [4, 2, 1, 1] = [1, 4, 1, 1] * [1, 2, 1, 1] 880s x0: [1, 4, 1, 1] 880s 0.998 880s -0.347 880s -0.458 880s -0.234 880s 880s x1: [1, 2, 1, 1] 880s 0.696 880s 0.248 880s 880s y: [4, 2, 1, 1] 880s 0.695 -0.242 -0.319 -0.163 880s 0.248 -0.086 -0.114 -0.058 880s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 2, 1, 1] 880s -0.552 0.356 0.916 -0.952 880s -0.724 0.477 -0.975 0.585 880s 880s x1: [4, 1, 1, 1] 880s 0.245 0.970 0.220 0.886 880s 880s y: [2, 1, 1, 1] 880s -0.432 0.589 880s testing: mul_mat, [4, 4, 1, 4] = [2, 4, 1, 4] * [2, 4, 1, 4] 880s x0: [2, 4, 1, 4] 880s -0.085 0.001 880s -0.041 -0.782 880s -0.636 0.916 880s 0.939 -0.444 880s 880s x1: [2, 4, 1, 4] 880s -0.919 -0.275 880s -0.449 0.458 880s -0.993 0.024 880s -0.958 0.854 880s 880s y: [4, 4, 1, 4] 880s 0.078 0.253 0.333 -0.741 880s 0.038 -0.340 0.705 -0.625 880s 0.084 0.022 0.653 -0.943 880s 0.082 -0.628 1.392 -1.27ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s 9 880s test-mul-mat0: iter:243/500 880s testing: mul_mat, [2, 2, 1, 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 2, 1, 1] 880s 0.337 0.436 -0.765 0.011 880s -0.205 -0.660 -0.277 -0.739 880s 880s x1: [4, 2, 1, 1] 880s 0.619 -0.118 -0.456 0.897 880s -0.527 0.903 0.623 0.735 880s 880s y: [2, 2, 1, 1] 880s 0.516 -0.586 880s -0.252 -1.204 880s testing: mul_mat, [4, 2, 2, 1] = [4, 4, 2, 1] * [4, 2, 2, 1] 880s x0: [4, 4, 2, 1] 880s 0.945 -0.486 0.808 0.568 880s -0.751 -0.473 -0.095 0.685 880s -0.238 0.916 -0.519 0.102 880s -0.360 -0.258 -0.591 -0.530 880s 880s x1: [4, 2, 2, 1] 880s 0.307 0.831 0.310 0.054 880s -0.642 0.990 0.166 -0.613 880s 880s y: [4, 2, 2, 1] 880s 0.168 -0.616 0.533 -0.537 880s -1.302 -0.423 0.912 0.202 880s testing: mul_mat, [3, 4, 2, 1] = [4, 3, 2, 1] * [4, 4, 2, 1] 880s x0: [4, 3, 2, 1] 880s -0.545 -0.292 0.293 -0.580 880s 0.179 -0.752 -0.698 -0.869 880s -0.695 -0.177 0.824 -0.230 880s 880s x1: [4, 4, 2, 1] 880s -0.463 0.666 0.060 -0.214 880s -0.807 0.965 -0.529 -0.045 880s 0.882 -0.048 -0.943 -0.479 880s 0.694 -0.534 -0.008 0.745 880s 880s y: [3, 4, 2, 1] 880s 0.200 -0.439 0.303 880s 0.029 -0.461 -0.035 880s -0.465 1.269 -1.272 880s -0.657 -0.116 -0.566 880s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 3, 1, 1] 880s 0.046 0.869 0.683 0.318 880s 0.324 0.148 -0.282 -0.925 880s -0.746 0.025 0.724 -0.018 880s 880s x1: [4, 3, 1, 1] 880s 0.874 -0.526 0.386 0.308 880s 0.869 0.308 -0.591 -0.676 880s -0.985 0.702 -0.256 0.194 880s 880s y: [3, 3, 1, 1] 880s -0.055 -0.189 -0.391 880s -0.311 1.120 -1.056 880s 0.451 -0.323 0.563 880s testing: mul_mat, [1, 4, 2, 1] = [3, 1, 2, 1] * [3, 4, 2, 1] 880s x0: [3, 1, 2, 1] 880s -0.130 -0.286 -0.469 880s 880s x1: [3, 4, 2, 1] 880s 0.988 -0.711 0.620 880s -0.715 -0.140 0.621 880s -0.841 0.335 0.007 880s 0.467 0.204 -0.686 880s 880s y: [1, 4, 2, 1] 880s -0.216 880s -0.159 880s 0.010 880s 0.203 880s testing: mul_mat, [1, 3, 2, 1] = [1, 1, 2, 1] * [1, 3, 2, 1] 880s x0: [1, 1, 2, 1] 880s 0.273 880s 880s x1: [1, 3, 2, 1] 880s -0.824 880s 0.078 880s 0.132 880s 880s y: [1, 3, 2, 1] 880s -0.225 880s 0.021 880s 0.036 880s test-mul-mat0: iter:244/500 880s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 4, 1, 1] 880s -0.216 -0.229 880s 0.655 0.655 880s 0.486 -0.814 880s 0.842 -0.725 880s 880s x1: [2, 4, 1, 1] 880s -0.994 -0.596 880s 0.540 -0.721 880s 0.927 0.067 880s 0.597 -0.225 880s 880s y: [4, 4, 1, 1] 880s 0.351 -1.042 0.003 -0.405 880s 0.049 -0.119 0.850 0.978 880s -0.215 0.651 0.396 0.732 880s -0.077 0.244 0.473 0.666 880s testing: mul_mat, [3, 4, 4, 1] = [2, 3, 4, 1] * [2, 4, 4, 1] 880s x0: [2, 3, 4, 1] 880s -0.592 -0.027 880s 0.704 0.699 880s -0.443 -0.197 880s 880s x1: [2, 4, 4, 1] 880s 0.698 -0.981 880s 0.353 -0.170 880s -0.520 0.292 880s 0.584 0.100 880s 880s y: [3, 4, 4, 1] 880s -0.387 -0.195 -0.116 880s -0.204 0.130 -0.123 880s 0.300 -0.162 0.173 880s -0.348 0.480 -0.278 880s testing: mul_mat, [2, 3, 4, 3] = [2, 2, 4, 3] * [2, 3, 4, 3] 880s x0: [2, 2, 4, 3] 880s -0.317 0.834 880s -0.398 0.395 880s 880s x1: [2, 3, 4, 3] 880s 0.675 0.486 880s -0.113 -0.120 880s -0.459 0.030 880s 880s y: [2, 3, 4, 3] 880s 0.191 -0.077 880s -0.064 -0.003 880s 0.170 0.195 880s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 4, 1, 1] 880s 0.749 0.405 880s 0.023 0.042 880s 0.702 0.232 880s 0.275 0.371 880s 880s x1: [2, 2, 1, 1] 880s -0.735 0.927 880s -0.035 -0.413 880s 880s y: [4, 2, 1, 1] 880s -0.176 0.022 -0.302 0.142 880s -0.193 -0.018 -0.120 -0.163 880s testing: mul_mat, [1, 2, 4, 1] = [4, 1, 4, 1] * [4, 2, 4, 1] 880s x0: [4, 1, 4, 1] 880s 0.952 -0.825 -0.289 0.607 880s 880s x1: [4, 2, 4, 1] 880s -0.473 0.872 0.171 -0.387 880s 0.907 0.655 0.979 0.827 880s 880s y: [1, 2, 4, 1] 880s -1.454 880s 0.543 880s testing: mul_mat, [1, 4, 4, 3] = [1, 1, 4, 3] * [1, 4, 4, 3] 880s x0: [1, 1, 4, 3] 880s 0.964 880s 880s x1: [1, 4, 4, 3] 880s -0.647 880s 0.061 880s -0.530 880s 0.335 880s 880s y: [1, 4, 4, 3] 880s -0.624 880s 0.058 880s -0.511 880s 0.323 880s test-mul-mat0: iter:245/500 880s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 2, 1, 1] 880s -0.187 0.707 0.380 880s -0.827 -0.217 -0.228 880s 880s x1: [3, 1, 1, 1] 880s -0.594 0.832 -0.841 880s 880s y: [2, 1, 1, 1] 880s 0.380 0.502 880s testing: mul_mat, [1, 2, 3, 1] = [3, 1, 3, 1] * [3, 2, 3, 1] 880s x0: [3, 1, 3, 1] 880s -0.333 0.388 0.922 880s 880s x1: [3, 2, 3, 1] 880s 0.476 -0.808 0.647 880s -0.105 -0.355 0.611 880s 880s y: [1, 2, 3, 1] 880s 0.125 880s 0.461 880s testing: mul_mat, [3, 1, 3, 4] = [3, 3, 3, 4] * [3, 1, 3, 4] 880s x0: [3, 3, 3, 4] 880s 0.238 -0.473 0.956 880s 0.678 -0.919 -0.513 880s -0.312 0.509 -0.524 880s 880s x1: [3, 1, 3, 4] 880s 0.485 -0.962 0.592 880s 880s y: [3, 1, 3, 4] 880s 1.137 0.910 -0.951 880s testing: mul_mat, [1, 3, 1, 1] = [3, 1, 1, 1] * [3, 3, 1, 1] 880s xggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s 0: [3, 1, 1, 1] 880s -0.823 -0.785 -0.701 880s 880s x1: [3, 3, 1, 1] 880s 0.551 0.712 -0.831 880s 0.917 0.622 0.786 880s 0.323 -0.581 0.732 880s 880s y: [1, 3, 1, 1] 880s -0.431 880s -1.795 880s -0.322 880s testing: mul_mat, [2, 3, 3, 1] = [1, 2, 3, 1] * [1, 3, 3, 1] 880s x0: [1, 2, 3, 1] 880s 0.875 880s 0.572 880s 880s x1: [1, 3, 3, 1] 880s -0.649 880s 0.098 880s -0.699 880s 880s y: [2, 3, 3, 1] 880s -0.568 -0.371 880s 0.086 0.056 880s -0.612 -0.399 880s testing: mul_mat, [1, 1, 3, 4] = [2, 1, 3, 4] * [2, 1, 3, 4] 880s x0: [2, 1, 3, 4] 880s 0.170 0.049 880s 880s x1: [2, 1, 3, 4] 880s 0.800 0.819 880s 880s y: [1, 1, 3, 4] 880s 0.176 880s test-mul-mat0: iter:246/500 880s testing: mul_mat, [2, 3, 1, 1] = [1, 2, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 2, 1, 1] 880s -0.621 880s -0.652 880s 880s x1: [1, 3, 1, 1] 880s 0.285 880s 0.330 880s 0.451 880s 880s y: [2, 3, 1, 1] 880s -0.177 -0.186 880s -0.205 -0.215 880s -0.280 -0.294 880s testing: mul_mat, [2, 2, 3, 1] = [1, 2, 3, 1] * [1, 2, 3, 1] 880s x0: [1, 2, 3, 1] 880s -0.672 880s -0.937 880s 880s x1: [1, 2, 3, 1] 880s -0.614 880s 0.154 880s 880s y: [2, 2, 3, 1] 880s 0.412 0.575 880s -0.103 -0.144 880s testing: mul_mat, [2, 2, 3, 4] = [1, 2, 3, 4] * [1, 2, 3, 4] 880s x0: [1, 2, 3, 4] 880s 0.330 880s 0.904 880s 880s x1: [1, 2, 3, 4] 880s -0.456 880s 0.511 880s 880s y: [2, 2, 3, 4] 880s -0.150 -0.412 880s 0.169 0.462 880s testing: mul_mat, [4, 2, 1, 1] = [1, 4, 1, 1] * [1, 2, 1, 1] 880s x0: [1, 4, 1, 1] 880s 0.173 880s 0.843 880s 0.907 880s 0.292 880s 880s x1: [1, 2, 1, 1] 880s -0.585 880s -0.159 880s 880s y: [4, 2, 1, 1] 880s -0.101 -0.493 -0.531 -0.171 880s -0.027 -0.134 -0.144 -0.046 880s testing: mul_mat, [4, 1, 3, 1] = [4, 4, 3, 1] * [4, 1, 3, 1] 880s x0: [4, 4, 3, 1] 880s -0.187 -0.390 -0.347 -0.508 880s 0.985 -0.122 0.651 0.031 880s 0.357 -0.937 -0.035 -0.759 880s -0.423 -0.180 0.068 -0.665 880s 880s x1: [4, 1, 3, 1] 880s 0.173 0.811 0.152 0.187 880s 880s y: [4, 1, 3, 1] 880s -0.497 0.176 -0.845 -0.333 880s testing: mul_mat, [1, 4, 3, 4] = [4, 1, 3, 4] * [4, 4, 3, 4] 880s x0: [4, 1, 3, 4] 880s 0.610 -0.210 0.514 0.247 880s 880s x1: [4, 4, 3, 4] 880s -0.770 0.694 0.510 -0.086 880s -0.611 0.714 -0.718 -0.172 880s -0.473 0.199 0.695 -0.218 880s 0.612 -0.804 0.849 -0.990 880s 880s y: [1, 4, 3, 4] 880s -0.375 880s -0.935 880s -0.027 880s 0.735 880s test-mul-mat0: iter:247/500 880s testing: mul_mat, [3, 2, 1, 1] = [4, 3, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 3, 1, 1] 880s -0.091 0.296 -0.127 -0.622 880s 0.650 -0.694 -0.985 -0.035 880s -0.183 0.070 -0.166 0.235 880s 880s x1: [4, 2, 1, 1] 880s 0.283 -0.170 -0.326 0.297 880s -0.308 -0.606 -0.046 -0.811 880s 880s y: [3, 2, 1, 1] 880s -0.220 0.613 0.060 880s 0.359 0.295 -0.169 880s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 3, 1, 1] 880s 0.846 0.248 0.404 -0.200 880s 0.437 0.642 0.709 -0.267 880s -0.486 -0.913 -0.616 -0.180 880s 880s x1: [4, 3, 1, 1] 880s 0.427 -0.841 0.864 0.953 880s -0.377 0.919 -0.750 0.267 880s 0.726 -0.664 -0.451 -0.444 880s 880s y: [3, 3, 1, 1] 880s 0.311 0.006 -0.143 880s -0.447 -0.177 -0.242 880s 0.356 -0.310 0.612 880s testing: mul_mat, [1, 3, 1, 2] = [4, 1, 1, 2] * [4, 3, 1, 2] 880s x0: [4, 1, 1, 2] 880s 0.136 -0.645 0.708 0.223 880s 880s x1: [4, 3, 1, 2] 880s -0.898 0.349 0.637 0.172 880s -0.817 -0.128 -0.401 -0.658 880s -0.264 -0.448 -0.035 -0.345 880s 880s y: [1, 3, 1, 2] 880s 0.142 880s -0.460 880s 0.151 880s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 4, 1, 1] 880s -0.388 -0.571 0.588 0.216 880s 0.774 -0.550 -0.411 0.931 880s 0.648 0.808 0.747 0.288 880s 0.577 -0.190 0.737 -0.356 880s 880s x1: [4, 1, 1, 1] 880s -0.834 -0.503 -0.730 0.038 880s 880s y: [4, 1, 1, 1] 880s 0.190 -0.034 -1.481 -0.937 880s testing: mul_mat, [1, 4, 1, 1] = [4, 1, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 1, 1, 1] 880s 0.569 0.191 -0.875 -0.981 880s 880s x1: [4, 4, 1, 1] 880s 0.553 -0.518 0.780 0.908 880s -0.810 0.003 -0.354 -0.282 880s -0.672 -0.266 -0.116 -0.175 880s 0.003 0.921 0.921 0.615 880s 880s y: [1, 4, 1, 1] 880s -1.358 880s 0.126 880s -0.160 880s -1.231 880s testing: mul_mat, [2, 4, 1, 2] = [1, 2, 1, 2] * [1, 4, 1, 2] 880s x0: [1, 2, 1, 2] 880s 0.448 880s -0.834 880s 880s x1: [1, 4, 1, 2] 880s -0.000 880s -0.065 880s 0.607 880s 0.588 880s 880s y: [2, 4, 1, 2] 880s -0.000 0.000 880s -0.029 0.054 880s 0.272 -0.506 880s 0.264 -0.491 880s test-mul-mat0: iter:248/500 880s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 1, 1, 1] 880s -0.543 -0.173 0.305 -0.928 880s 880s x1: [4, 1, 1, 1] 880s -0.979 -0.441 0.454 -0.095 880s 880s y: [1, 1, 1, 1] 880s 0.834 880s testing: mul_mat, [1, 1, 4, 1] = [4, 1, 4, 1] * [4, 1, 4, 1] 880s x0: [4, 1, 4, 1] 880s 0.332 0.116 -0.108 0.563 880s 880s x1: [4, 1, 4, 1] 880s -0.478 -0.126 0.263 -0.353 880s 880s y: [1, 1, 4, 1] 880s -0.400 880s testing: mul_mat, [3, 1, 4, 3] = [4, 3, 4, 3] * [4, 1, 4, 3] 880s x0: [4, 3, 4, 3] 880s -0.040 -0.370 ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s -0.391 0.933 880s -0.549 0.220 -0.588 -0.981 880s -0.523 0.703 0.406 -0.099 880s 880s x1: [4, 1, 4, 3] 880s 0.082 0.814 -0.422 0.345 880s 880s y: [3, 1, 4, 3] 880s 0.183 0.043 0.324 880s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 4, 1, 1] 880s -0.348 0.512 0.849 0.369 880s 0.488 0.315 0.876 0.447 880s 0.513 -0.676 -0.660 -0.010 880s 0.247 -0.185 -0.282 -0.482 880s 880s x1: [4, 3, 1, 1] 880s -0.164 0.901 -0.453 -0.243 880s 0.900 0.278 -0.579 0.719 880s 0.732 -0.269 -0.117 0.982 880s 880s y: [4, 3, 1, 1] 880s 0.045 -0.302 -0.392 0.037 880s -0.397 0.341 0.649 -0.012 880s -0.130 0.609 0.625 -0.209 880s testing: mul_mat, [2, 4, 4, 1] = [4, 2, 4, 1] * [4, 4, 4, 1] 880s x0: [4, 2, 4, 1] 880s -0.061 0.129 0.732 0.408 880s 0.358 -0.975 -0.707 0.575 880s 880s x1: [4, 4, 4, 1] 880s -0.223 0.910 0.354 -0.322 880s -0.542 -0.889 -0.422 0.736 880s -0.468 -0.703 0.468 0.263 880s 0.180 0.449 0.886 0.833 880s 880s y: [2, 4, 4, 1] 880s 0.259 -1.402 880s -0.091 1.393 880s 0.388 0.338 880s 1.036 -0.521 880s testing: mul_mat, [4, 4, 4, 3] = [2, 4, 4, 3] * [2, 4, 4, 3] 880s x0: [2, 4, 4, 3] 880s -0.428 -0.481 880s 0.372 -0.774 880s 0.443 0.901 880s -0.032 0.776 880s 880s x1: [2, 4, 4, 3] 880s 0.040 0.295 880s 0.269 -0.227 880s 0.589 -0.323 880s -0.652 -0.690 880s 880s y: [4, 4, 4, 3] 880s -0.159 -0.214 0.284 0.228 880s -0.006 0.276 -0.086 -0.185 880s -0.097 0.469 -0.030 -0.270 880s 0.612 0.292 -0.911 -0.515 880s test-mul-mat0: iter:249/500 880s testing: mul_mat, [3, 4, 1, 1] = [2, 3, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 3, 1, 1] 880s -0.266 -0.944 880s -0.127 -0.650 880s -0.494 -0.106 880s 880s x1: [2, 4, 1, 1] 880s 0.510 0.336 880s 0.726 -0.584 880s -0.168 -0.244 880s 0.290 0.705 880s 880s y: [3, 4, 1, 1] 880s -0.453 -0.283 -0.288 880s 0.358 0.287 -0.297 880s 0.275 0.180 0.109 880s -0.743 -0.495 -0.218 880s testing: mul_mat, [2, 3, 3, 1] = [2, 2, 3, 1] * [2, 3, 3, 1] 880s x0: [2, 2, 3, 1] 880s -0.651 -0.405 880s 0.042 0.638 880s 880s x1: [2, 3, 3, 1] 880s 0.769 0.535 880s 0.026 0.671 880s 0.334 0.196 880s 880s y: [2, 3, 3, 1] 880s -0.718 0.374 880s -0.288 0.429 880s -0.297 0.139 880s testing: mul_mat, [2, 2, 3, 1] = [2, 2, 3, 1] * [2, 2, 3, 1] 880s x0: [2, 2, 3, 1] 880s -0.795 0.490 880s -0.479 -0.862 880s 880s x1: [2, 2, 3, 1] 880s -0.509 -0.602 880s 0.124 -0.838 880s 880s y: [2, 2, 3, 1] 880s 0.109 0.762 880s -0.509 0.663 880s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.481 0.766 880s 0.710 -0.395 880s 880s x1: [2, 2, 1, 1] 880s -0.503 -0.495 880s 0.384 0.219 880s 880s y: [2, 2, 1, 1] 880s -0.621 -0.162 880s 0.352 0.186 880s testing: mul_mat, [1, 2, 3, 1] = [2, 1, 3, 1] * [2, 2, 3, 1] 880s x0: [2, 1, 3, 1] 880s -0.234 -0.098 880s 880s x1: [2, 2, 3, 1] 880s 0.872 0.498 880s -0.076 -0.863 880s 880s y: [1, 2, 3, 1] 880s -0.253 880s 0.102 880s testing: mul_mat, [3, 2, 3, 1] = [1, 3, 3, 1] * [1, 2, 3, 1] 880s x0: [1, 3, 3, 1] 880s -0.228 880s -0.727 880s -0.196 880s 880s x1: [1, 2, 3, 1] 880s 0.378 880s -0.101 880s 880s y: [3, 2, 3, 1] 880s -0.086 -0.275 -0.074 880s 0.023 0.073 0.020 880s test-mul-mat0: iter:250/500 880s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 4, 1, 1] 880s -0.639 -0.865 880s 0.124 0.260 880s 0.543 -0.001 880s -0.336 -0.665 880s 880s x1: [2, 4, 1, 1] 880s 0.058 -0.046 880s 0.727 -0.292 880s 0.527 -0.507 880s 0.610 -0.093 880s 880s y: [4, 4, 1, 1] 880s 0.002 -0.005 0.032 0.011 880s -0.212 0.015 0.395 -0.050 880s 0.101 -0.066 0.287 0.160 880s -0.309 0.052 0.332 -0.143 880s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 1, 1, 1] 880s -0.614 -0.313 880s 880s x1: [2, 4, 1, 1] 880s -0.907 0.436 880s -0.392 -0.103 880s 0.974 0.486 880s 0.573 -0.990 880s 880s y: [1, 4, 1, 1] 880s 0.420 880s 0.273 880s -0.750 880s -0.041 880s testing: mul_mat, [4, 1, 1, 3] = [2, 4, 1, 3] * [2, 1, 1, 3] 880s x0: [2, 4, 1, 3] 880s 0.771 -0.993 880s 0.564 -0.323 880s 0.753 0.925 880s -0.188 -0.123 880s 880s x1: [2, 1, 1, 3] 880s -0.813 0.289 880s 880s y: [4, 1, 1, 3] 880s -0.914 -0.552 -0.344 0.117 880s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 1, 1, 1] 880s 0.024 -0.193 880s 880s x1: [2, 4, 1, 1] 880s -0.739 -0.885 880s 0.741 0.507 880s -0.641 0.255 880s 0.460 -0.871 880s 880s y: [1, 4, 1, 1] 880s 0.153 880s -0.080 880s -0.065 880s 0.179 880s testing: mul_mat, [1, 2, 1, 1] = [1, 1, 1, 1] * [1, 2, 1, 1] 880s x0: [1, 1, 1, 1] 880s 0.891 880s 880s x1: [1, 2, 1, 1] 880s 0.014 880s -0.051 880s 880s y: [1, 2, 1, 1] 880s 0.013 880s -0.045 880s testing: mul_mat, [2, 1, 1, 3] = [1, 2, 1, 3] * [1, 1, 1, 3] 880s x0: [1, 2, 1, 3] 880s -0.334 880s 0.737 880s 880s x1: [1, 1, 1, 3] 880s -0.866 880s 880s y: [2, 1, 1, 3] 880s 0.289 -0.639 880s test-mul-mat0: iter:251/500 880s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.022 -0.973 -0.103 880s -0.087 -0.839 -0.128 880s 880s x1: ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s [3, 4, 1, 1] 880s 0.868 0.751 0.087 880s 0.983 0.492 -0.406 880s -0.658 -0.253 -0.946 880s -0.528 -0.992 0.078 880s 880s y: [2, 4, 1, 1] 880s -0.720 -0.716 880s -0.415 -0.446 880s 0.329 0.390 880s 0.945 0.868 880s testing: mul_mat, [4, 2, 3, 1] = [3, 4, 3, 1] * [3, 2, 3, 1] 880s x0: [3, 4, 3, 1] 880s 0.723 -0.184 -0.270 880s 0.731 0.895 -0.992 880s -0.247 0.922 -0.095 880s 0.666 -0.917 0.778 880s 880s x1: [3, 2, 3, 1] 880s -0.319 -0.856 0.538 880s -0.582 0.411 0.504 880s 880s y: [4, 2, 3, 1] 880s -0.219 -1.533 -0.761 0.992 880s -0.632 -0.557 0.475 -0.373 880s testing: mul_mat, [1, 4, 3, 2] = [3, 1, 3, 2] * [3, 4, 3, 2] 880s x0: [3, 1, 3, 2] 880s -0.655 0.030 0.197 880s 880s x1: [3, 4, 3, 2] 880s -0.304 0.972 0.720 880s 0.602 0.638 0.803 880s 0.379 0.985 -0.971 880s -0.305 -0.249 0.666 880s 880s y: [1, 4, 3, 2] 880s 0.370 880s -0.217 880s -0.410 880s 0.323 880s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 1, 1, 1] 880s 0.962 0.391 0.096 880s 880s x1: [3, 1, 1, 1] 880s -0.336 0.488 0.861 880s 880s y: [1, 1, 1, 1] 880s -0.049 880s testing: mul_mat, [1, 3, 3, 1] = [1, 1, 3, 1] * [1, 3, 3, 1] 880s x0: [1, 1, 3, 1] 880s -0.202 880s 880s x1: [1, 3, 3, 1] 880s 0.819 880s 0.119 880s 0.434 880s 880s y: [1, 3, 3, 1] 880s -0.165 880s -0.024 880s -0.088 880s testing: mul_mat, [2, 1, 3, 2] = [1, 2, 3, 2] * [1, 1, 3, 2] 880s x0: [1, 2, 3, 2] 880s 0.306 880s 0.343 880s 880s x1: [1, 1, 3, 2] 880s -0.431 880s 880s y: [2, 1, 3, 2] 880s -0.132 -0.148 880s test-mul-mat0: iter:252/500 880s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 1, 1, 1] 880s -0.437 -0.424 0.097 0.906 880s 880s x1: [4, 3, 1, 1] 880s -0.362 0.095 0.339 0.767 880s 0.189 -0.863 0.181 -0.189 880s -0.293 -0.857 -0.834 0.862 880s 880s y: [1, 3, 1, 1] 880s 0.846 880s 0.129 880s 1.192 880s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 1, 1, 1] 880s -0.910 0.439 0.578 0.363 880s 880s x1: [4, 1, 1, 1] 880s 0.884 0.553 0.914 -0.320 880s 880s y: [1, 1, 1, 1] 880s -0.149 880s testing: mul_mat, [3, 1, 1, 4] = [4, 3, 1, 4] * [4, 1, 1, 4] 880s x0: [4, 3, 1, 4] 880s 0.590 0.505 0.595 -0.313 880s 0.411 0.479 -0.760 0.325 880s -0.840 -0.890 0.415 0.598 880s 880s x1: [4, 1, 1, 4] 880s -0.984 -0.595 -0.931 0.681 880s 880s y: [3, 1, 1, 4] 880s -1.648 0.240 1.377 880s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 2, 1, 1] 880s 0.165 0.682 -0.383 0.850 880s -0.143 -0.233 -0.648 -0.384 880s 880s x1: [4, 3, 1, 1] 880s 0.750 -0.042 -0.412 0.821 880s -0.662 -0.384 0.858 -0.831 880s -0.699 -0.330 0.157 0.784 880s 880s y: [2, 3, 1, 1] 880s 0.951 -0.146 880s -1.405 -0.053 880s 0.265 -0.226 880s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 2, 1, 1] 880s -0.023 0.373 880s -0.864 0.564 880s 880s x1: [2, 4, 1, 1] 880s -0.654 0.091 880s -0.274 0.756 880s 0.931 -0.476 880s 0.943 -0.614 880s 880s y: [2, 4, 1, 1] 880s 0.049 0.616 880s 0.288 0.663 880s -0.199 -1.073 880s -0.251 -1.161 880s testing: mul_mat, [4, 2, 1, 4] = [2, 4, 1, 4] * [2, 2, 1, 4] 880s x0: [2, 4, 1, 4] 880s -0.645 0.045 880s 0.290 0.842 880s 0.911 0.604 880s 0.081 -0.012 880s 880s x1: [2, 2, 1, 4] 880s 0.958 0.711 880s -0.819 0.815 880s 880s y: [4, 2, 1, 4] 880s -0.586 0.876 1.302 0.069 880s 0.566 0.449 -0.254 -0.076 880s test-mul-mat0: iter:253/500 880s testing: mul_mat, [2, 2, 1, 1] = [1, 2, 1, 1] * [1, 2, 1, 1] 880s x0: [1, 2, 1, 1] 880s 0.994 880s -0.625 880s 880s x1: [1, 2, 1, 1] 880s 0.765 880s -0.644 880s 880s y: [2, 2, 1, 1] 880s 0.760 -0.478 880s -0.640 0.403 880s testing: mul_mat, [2, 2, 2, 1] = [1, 2, 2, 1] * [1, 2, 2, 1] 880s x0: [1, 2, 2, 1] 880s -0.131 880s 0.594 880s 880s x1: [1, 2, 2, 1] 880s -0.645 880s -0.641 880s 880s y: [2, 2, 2, 1] 880s 0.085 -0.383 880s 0.084 -0.381 880s testing: mul_mat, [4, 2, 2, 3] = [1, 4, 2, 3] * [1, 2, 2, 3] 880s x0: [1, 4, 2, 3] 880s -0.479 880s 0.222 880s 0.559 880s 0.923 880s 880s x1: [1, 2, 2, 3] 880s 0.786 880s 0.443 880s 880s y: [4, 2, 2, 3] 880s -0.376 0.174 0.439 0.725 880s -0.212 0.098 0.247 0.408 880s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 2, 1, 1] 880s 0.327 880s 0.938 880s 880s x1: [1, 4, 1, 1] 880s 0.994 880s -0.645 880s -0.972 880s 0.131 880s 880s y: [2, 4, 1, 1] 880s 0.326 0.933 880s -0.211 -0.605 880s -0.318 -0.911 880s 0.043 0.123 880s testing: mul_mat, [1, 1, 2, 1] = [2, 1, 2, 1] * [2, 1, 2, 1] 880s x0: [2, 1, 2, 1] 880s 0.412 0.276 880s 880s x1: [2, 1, 2, 1] 880s 0.041 -0.451 880s 880s y: [1, 1, 2, 1] 880s -0.108 880s testing: mul_mat, [4, 2, 2, 3] = [1, 4, 2, 3] * [1, 2, 2, 3] 880s x0: [1, 4, 2, 3] 880s -0.743 880s -0.153 880s -0.256 880s -0.388 880s 880s x1: [1, 2, 2, 3] 880s 0.944 880s 0.856 880s 880s y: [4, 2, 2, 3] 880s -0.702 -0.144 -0.241 -0.367 880s -0.637 -0.131 -0.219 -0.333 880s test-mul-mat0: iter:254/500 880s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 1, 1, 1] 880s 0.274 0.566 0.123 880s 880s x1: [3, 1, 1, 1] 880s -0ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s .745 -0.573 -0.178 880s 880s y: [1, 1, 1, 1] 880s -0.551 880s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.254 -0.573 0.105 880s -0.435 0.562 0.394 880s 880s x1: [3, 1, 1, 1] 880s -0.850 0.441 -0.745 880s 880s y: [2, 1, 1, 1] 880s -0.548 0.325 880s testing: mul_mat, [3, 2, 1, 1] = [3, 3, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.183 0.400 -0.247 880s -0.321 -0.345 0.074 880s -0.249 -0.089 0.501 880s 880s x1: [3, 2, 1, 1] 880s 0.286 0.974 -0.801 880s 0.710 -0.670 -0.139 880s 880s y: [3, 2, 1, 1] 880s 0.640 -0.487 -0.560 880s -0.104 -0.007 -0.187 880s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.227 -0.211 -0.236 880s -0.617 0.776 0.976 880s -0.602 0.684 0.394 880s 880s x1: [3, 3, 1, 1] 880s 0.572 0.422 -0.225 880s 0.138 -0.455 -0.076 880s -0.421 -0.200 0.277 880s 880s y: [3, 3, 1, 1] 880s 0.094 -0.245 -0.144 880s 0.145 -0.512 -0.424 880s -0.119 0.375 0.226 880s testing: mul_mat, [1, 3, 1, 1] = [3, 1, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 1, 1, 1] 880s 0.803 -0.387 0.279 880s 880s x1: [3, 3, 1, 1] 880s 0.094 -0.163 0.284 880s -0.723 -0.763 -0.962 880s -0.044 -0.108 0.112 880s 880s y: [1, 3, 1, 1] 880s 0.218 880s -0.553 880s 0.038 880s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 3, 1, 1] 880s -0.688 880s -0.004 880s -0.429 880s 880s x1: [1, 3, 1, 1] 880s 0.225 880s 0.387 880s -0.583 880s 880s y: [3, 3, 1, 1] 880s -0.155 -0.001 -0.097 880s -0.267 -0.002 -0.166 880s 0.401 0.002 0.250 880s test-mul-mat0: iter:255/500 880s testing: mul_mat, [3, 3, 1, 1] = [2, 3, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 3, 1, 1] 880s -0.555 -0.439 880s 0.591 -0.277 880s -0.202 0.629 880s 880s x1: [2, 3, 1, 1] 880s 0.228 -0.413 880s 0.748 -0.088 880s 0.351 0.724 880s 880s y: [3, 3, 1, 1] 880s 0.055 0.249 -0.305 880s -0.376 0.466 -0.206 880s -0.513 0.007 0.384 880s testing: mul_mat, [3, 3, 3, 1] = [2, 3, 3, 1] * [2, 3, 3, 1] 880s x0: [2, 3, 3, 1] 880s 0.446 -0.638 880s 0.047 -0.203 880s -0.914 0.354 880s 880s x1: [2, 3, 3, 1] 880s 0.679 0.691 880s -0.260 0.386 880s 0.494 0.353 880s 880s y: [3, 3, 3, 1] 880s -0.138 -0.108 -0.376 880s -0.362 -0.091 0.374 880s -0.005 -0.048 -0.326 880s testing: mul_mat, [3, 3, 3, 4] = [2, 3, 3, 4] * [2, 3, 3, 4] 880s x0: [2, 3, 3, 4] 880s -0.812 0.200 880s -0.194 -0.256 880s 0.006 0.868 880s 880s x1: [2, 3, 3, 4] 880s 0.695 0.348 880s 0.407 -0.224 880s 0.838 -0.540 880s 880s y: [3, 3, 3, 4] 880s -0.495 -0.224 0.307 880s -0.376 -0.022 -0.192 880s -0.789 -0.024 -0.464 880s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.948 -0.815 880s -0.452 -0.978 880s 880s x1: [2, 3, 1, 1] 880s -0.667 -0.721 880s 0.299 0.690 880s 0.972 0.345 880s 880s y: [2, 3, 1, 1] 880s -0.045 1.007 880s -0.278 -0.810 880s 0.641 -0.777 880s testing: mul_mat, [1, 2, 3, 1] = [2, 1, 3, 1] * [2, 2, 3, 1] 880s x0: [2, 1, 3, 1] 880s -0.638 -0.302 880s 880s x1: [2, 2, 3, 1] 880s 0.475 -0.311 880s -0.641 0.903 880s 880s y: [1, 2, 3, 1] 880s -0.209 880s 0.136 880s testing: mul_mat, [1, 2, 3, 4] = [1, 1, 3, 4] * [1, 2, 3, 4] 880s x0: [1, 1, 3, 4] 880s 0.703 880s 880s x1: [1, 2, 3, 4] 880s 0.476 880s 0.621 880s 880s y: [1, 2, 3, 4] 880s 0.335 880s 0.436 880s test-mul-mat0: iter:256/500 880s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 4, 1, 1] 880s -0.872 -0.670 880s 0.200 -0.339 880s -0.968 -0.042 880s 0.229 -0.987 880s 880s x1: [2, 1, 1, 1] 880s -0.003 -0.040 880s 880s y: [4, 1, 1, 1] 880s 0.029 0.013 0.005 0.039 880s testing: mul_mat, [2, 4, 3, 1] = [2, 2, 3, 1] * [2, 4, 3, 1] 880s x0: [2, 2, 3, 1] 880s 0.609 -0.538 880s -0.664 -0.432 880s 880s x1: [2, 4, 3, 1] 880s 0.359 -0.430 880s 0.393 0.263 880s -0.728 0.471 880s 0.328 -0.760 880s 880s y: [2, 4, 3, 1] 880s 0.450 -0.052 880s 0.098 -0.375 880s -0.696 0.280 880s 0.608 0.111 880s testing: mul_mat, [2, 2, 3, 3] = [2, 2, 3, 3] * [2, 2, 3, 3] 880s x0: [2, 2, 3, 3] 880s 0.582 -0.312 880s -0.288 0.447 880s 880s x1: [2, 2, 3, 3] 880s -0.483 -0.226 880s -0.950 -0.301 880s 880s y: [2, 2, 3, 3] 880s -0.210 0.038 880s -0.459 0.139 880s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.187 0.423 880s 0.289 -0.504 880s 880s x1: [2, 2, 1, 1] 880s 0.697 0.911 880s 0.758 0.421 880s 880s y: [2, 2, 1, 1] 880s 0.515 -0.257 880s 0.320 0.007 880s testing: mul_mat, [1, 2, 3, 1] = [2, 1, 3, 1] * [2, 2, 3, 1] 880s x0: [2, 1, 3, 1] 880s 0.088 -0.588 880s 880s x1: [2, 2, 3, 1] 880s 0.763 0.248 880s -0.746 -0.465 880s 880s y: [1, 2, 3, 1] 880s -0.078 880s 0.207 880s testing: mul_mat, [4, 2, 3, 3] = [1, 4, 3, 3] * [1, 2, 3, 3] 880s x0: [1, 4, 3, 3] 880s -0.121 880s 0.260 880s 0.866 880s 0.082 880s 880s x1: [1, 2, 3, 3] 880s 0.007 880s -0.866 880s 880s y: [4, 2, 3, 3] 880s -0.001 0.002 0.006 0.001 880s 0.104 -0.225 -0.750 -0.071 880s test-mul-mat0: iter:257/500 880s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.117 -0.976 880s -0.3ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s 97 -0.493 880s 880s x1: [2, 3, 1, 1] 880s 0.465 -0.351 880s -0.325 0.334 880s -0.139 0.920 880s 880s y: [2, 3, 1, 1] 880s 0.397 -0.012 880s -0.364 -0.036 880s -0.915 -0.398 880s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 3, 1, 1] 880s -0.793 -0.650 880s -0.004 -0.322 880s 0.016 0.570 880s 880s x1: [2, 2, 1, 1] 880s -0.827 -0.287 880s -0.545 -0.136 880s 880s y: [3, 2, 1, 1] 880s 0.843 0.096 -0.177 880s 0.521 0.046 -0.087 880s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 1, 1, 1] 880s -0.443 0.429 880s 880s x1: [2, 3, 1, 1] 880s -0.972 -0.057 880s -0.989 -0.539 880s 0.986 -0.102 880s 880s y: [1, 3, 1, 1] 880s 0.406 880s 0.206 880s -0.480 880s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 4, 1, 1] 880s 0.864 0.184 880s -0.933 -0.951 880s -0.975 -0.627 880s 0.770 0.691 880s 880s x1: [2, 1, 1, 1] 880s -0.471 0.206 880s 880s y: [4, 1, 1, 1] 880s -0.369 0.243 0.330 -0.220 880s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 1, 1, 1] 880s -0.547 0.650 0.059 0.396 880s 880s x1: [4, 2, 1, 1] 880s -0.778 0.086 -0.854 0.086 880s 0.084 -0.647 0.436 -0.920 880s 880s y: [1, 2, 1, 1] 880s 0.465 880s -0.805 880s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 1, 1, 1] 880s 0.939 880s 880s x1: [1, 4, 1, 1] 880s 0.661 880s 0.520 880s 0.382 880s -0.441 880s 880s y: [1, 4, 1, 1] 880s 0.621 880s 0.488 880s 0.359 880s -0.414 880s test-mul-mat0: iter:258/500 880s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.767 0.477 0.290 880s -0.414 -0.087 0.370 880s 0.616 0.366 0.020 880s -0.325 -0.239 -0.319 880s 880s x1: [3, 3, 1, 1] 880s -0.023 0.212 -0.881 880s -0.253 -0.604 -0.832 880s 0.120 -0.913 -0.610 880s 880s y: [4, 3, 1, 1] 880s -0.137 -0.335 0.046 0.238 880s -0.335 -0.150 -0.393 0.492 880s -0.704 -0.196 -0.272 0.374 880s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.164 0.608 -0.263 880s -0.077 0.999 -0.057 880s 0.156 0.475 -0.767 880s 0.742 -0.612 0.602 880s 880s x1: [3, 4, 1, 1] 880s -0.805 -0.857 0.240 880s -0.221 -0.918 0.228 880s 0.893 0.228 0.322 880s -0.130 -0.560 0.440 880s 880s y: [4, 4, 1, 1] 880s -0.452 -0.808 -0.717 0.072 880s -0.582 -0.912 -0.646 0.535 880s -0.092 0.141 0.001 0.717 880s -0.435 -0.574 -0.624 0.511 880s testing: mul_mat, [2, 4, 1, 3] = [3, 2, 1, 3] * [3, 4, 1, 3] 880s x0: [3, 2, 1, 3] 880s -0.313 -0.071 0.369 880s 0.695 0.670 0.140 880s 880s x1: [3, 4, 1, 3] 880s 0.358 0.753 -0.378 880s -0.966 -0.485 0.303 880s -0.771 -0.342 -0.457 880s 0.008 -0.259 0.771 880s 880s y: [2, 4, 1, 3] 880s -0.305 0.701 880s 0.449 -0.954 880s 0.097 -0.829 880s 0.301 -0.060 880s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 2, 1, 1] 880s -0.727 -0.333 -0.307 880s -0.578 -0.269 0.494 880s 880s x1: [3, 2, 1, 1] 880s -0.057 -0.217 -0.338 880s -0.758 0.849 -0.335 880s 880s y: [2, 2, 1, 1] 880s 0.217 -0.075 880s 0.371 0.045 880s testing: mul_mat, [4, 3, 1, 1] = [2, 4, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 4, 1, 1] 880s -0.070 0.002 880s -0.764 0.827 880s -0.926 -0.732 880s -0.023 0.910 880s 880s x1: [2, 3, 1, 1] 880s 0.418 0.622 880s -0.137 0.113 880s 0.292 -0.997 880s 880s y: [4, 3, 1, 1] 880s -0.028 0.196 -0.842 0.556 880s 0.010 0.198 0.044 0.106 880s -0.023 -1.047 0.459 -0.914 880s testing: mul_mat, [2, 2, 1, 3] = [4, 2, 1, 3] * [4, 2, 1, 3] 880s x0: [4, 2, 1, 3] 880s 0.676 -0.701 -0.511 0.207 880s -0.779 0.503 -0.791 -0.784 880s 880s x1: [4, 2, 1, 3] 880s 0.704 0.727 -0.043 -0.353 880s -0.491 0.620 -0.111 -0.642 880s 880s y: [2, 2, 1, 3] 880s -0.085 0.128 880s -0.842 1.285 880s test-mul-mat0: iter:259/500 880s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 4, 1, 1] 880s 0.382 -0.468 880s -0.532 0.591 880s 0.739 -0.317 880s -0.243 0.593 880s 880s x1: [2, 2, 1, 1] 880s 0.353 0.758 880s 0.083 0.028 880s 880s y: [4, 2, 1, 1] 880s -0.220 0.260 0.020 0.364 880s 0.018 -0.027 0.052 -0.003 880s testing: mul_mat, [4, 4, 4, 1] = [2, 4, 4, 1] * [2, 4, 4, 1] 880s x0: [2, 4, 4, 1] 880s 0.474 0.071 880s 0.739 -0.414 880s 0.277 -0.869 880s 0.174 -0.439 880s 880s x1: [2, 4, 4, 1] 880s 0.408 0.543 880s -0.664 -0.508 880s -0.386 0.112 880s -0.794 -0.608 880s 880s y: [4, 4, 4, 1] 880s 0.232 0.077 -0.359 -0.167 880s -0.350 -0.280 0.258 0.108 880s -0.175 -0.332 -0.204 -0.116 880s -0.419 -0.335 0.308 0.129 880s testing: mul_mat, [1, 4, 4, 2] = [2, 1, 4, 2] * [2, 4, 4, 2] 880s x0: [2, 1, 4, 2] 880s 0.595 0.792 880s 880s x1: [2, 4, 4, 2] 880s 0.123 0.539 880s -0.339 -0.599 880s 0.670 0.835 880s -0.038 0.372 880s 880s y: [1, 4, 4, 2] 880s 0.501 880s -0.676 880s 1.060 880s 0.272 880s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 3, 1, 1] 880s -0.621 0.222 880s -0.920 0.921 880s -0.535 0.173 880s 880s x1: [2, 1, 1, 1] 880s 0.518 -0.826 880s 880s y: [3, 1, 1ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s , 1] 880s -0.505 -1.237 -0.420 880s testing: mul_mat, [1, 2, 4, 1] = [3, 1, 4, 1] * [3, 2, 4, 1] 880s x0: [3, 1, 4, 1] 880s 0.595 -0.172 -0.322 880s 880s x1: [3, 2, 4, 1] 880s -0.900 0.496 -0.841 880s -0.864 -0.655 -0.913 880s 880s y: [1, 2, 4, 1] 880s -0.350 880s -0.107 880s testing: mul_mat, [1, 3, 4, 2] = [1, 1, 4, 2] * [1, 3, 4, 2] 880s x0: [1, 1, 4, 2] 880s 0.255 880s 880s x1: [1, 3, 4, 2] 880s -0.215 880s -0.273 880s -0.114 880s 880s y: [1, 3, 4, 2] 880s -0.055 880s -0.070 880s -0.029 880s test-mul-mat0: iter:260/500 880s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 3, 1, 1] 880s -0.612 880s -0.847 880s -0.950 880s 880s x1: [1, 3, 1, 1] 880s 0.506 880s 0.375 880s -0.310 880s 880s y: [3, 3, 1, 1] 880s -0.310 -0.429 -0.481 880s -0.229 -0.318 -0.356 880s 0.190 0.262 0.294 880s testing: mul_mat, [1, 3, 2, 1] = [1, 1, 2, 1] * [1, 3, 2, 1] 880s x0: [1, 1, 2, 1] 880s -0.417 880s 880s x1: [1, 3, 2, 1] 880s 0.099 880s -0.811 880s -0.832 880s 880s y: [1, 3, 2, 1] 880s -0.041 880s 0.338 880s 0.347 880s testing: mul_mat, [4, 1, 2, 1] = [1, 4, 2, 1] * [1, 1, 2, 1] 880s x0: [1, 4, 2, 1] 880s -0.860 880s 0.549 880s 0.239 880s -0.551 880s 880s x1: [1, 1, 2, 1] 880s -0.116 880s 880s y: [4, 1, 2, 1] 880s 0.099 -0.063 -0.028 0.064 880s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 1, 1, 1] 880s -0.068 880s 880s x1: [1, 4, 1, 1] 880s -0.941 880s 0.269 880s 0.557 880s -0.668 880s 880s y: [1, 4, 1, 1] 880s 0.064 880s -0.018 880s -0.038 880s 0.045 880s testing: mul_mat, [2, 1, 2, 1] = [1, 2, 2, 1] * [1, 1, 2, 1] 880s x0: [1, 2, 2, 1] 880s 0.175 880s 0.007 880s 880s x1: [1, 1, 2, 1] 880s 0.022 880s 880s y: [2, 1, 2, 1] 880s 0.004 0.000 880s testing: mul_mat, [1, 1, 2, 1] = [2, 1, 2, 1] * [2, 1, 2, 1] 880s x0: [2, 1, 2, 1] 880s -0.081 0.182 880s 880s x1: [2, 1, 2, 1] 880s 0.175 0.445 880s 880s y: [1, 1, 2, 1] 880s 0.067 880s test-mul-mat0: iter:261/500 880s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 4, 1, 1] 880s 0.007 -0.181 880s 0.056 -0.725 880s -0.624 0.388 880s 0.050 0.309 880s 880s x1: [2, 1, 1, 1] 880s 0.464 -0.846 880s 880s y: [4, 1, 1, 1] 880s 0.156 0.639 -0.618 -0.238 880s testing: mul_mat, [2, 4, 2, 1] = [2, 2, 2, 1] * [2, 4, 2, 1] 880s x0: [2, 2, 2, 1] 880s -0.623 0.477 880s 0.386 0.841 880s 880s x1: [2, 4, 2, 1] 880s -0.589 -0.993 880s -0.371 0.586 880s 0.014 0.049 880s -0.050 -0.812 880s 880s y: [2, 4, 2, 1] 880s -0.107 -1.063 880s 0.510 0.350 880s 0.015 0.047 880s -0.356 -0.702 880s testing: mul_mat, [3, 2, 2, 3] = [2, 3, 2, 3] * [2, 2, 2, 3] 880s x0: [2, 3, 2, 3] 880s -0.727 -0.626 880s -0.562 -0.344 880s 0.223 -0.112 880s 880s x1: [2, 2, 2, 3] 880s 0.439 0.124 880s -0.174 -0.173 880s 880s y: [3, 2, 2, 3] 880s -0.397 -0.289 0.084 880s 0.234 0.157 -0.019 880s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 1, 1, 1] 880s 0.810 0.387 880s 880s x1: [2, 3, 1, 1] 880s -0.458 -0.272 880s -0.836 -0.743 880s 0.650 -0.315 880s 880s y: [1, 3, 1, 1] 880s -0.476 880s -0.964 880s 0.404 880s testing: mul_mat, [3, 2, 2, 1] = [1, 3, 2, 1] * [1, 2, 2, 1] 880s x0: [1, 3, 2, 1] 880s -0.434 880s 0.888 880s 0.366 880s 880s x1: [1, 2, 2, 1] 880s -0.278 880s -0.207 880s 880s y: [3, 2, 2, 1] 880s 0.121 -0.247 -0.102 880s 0.090 -0.184 -0.076 880s testing: mul_mat, [4, 1, 2, 3] = [3, 4, 2, 3] * [3, 1, 2, 3] 880s x0: [3, 4, 2, 3] 880s 0.975 -0.497 -0.877 880s 0.352 -0.243 -0.081 880s 0.207 0.826 0.811 880s 0.260 -0.609 -0.432 880s 880s x1: [3, 1, 2, 3] 880s 0.911 0.107 -0.146 880s 880s y: [4, 1, 2, 3] 880s 0.964 0.307 0.158 0.235 880s test-mul-mat0: iter:262/500 880s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 3, 1, 1] 880s -0.368 0.220 880s -0.542 -0.445 880s 0.715 0.630 880s 880s x1: [2, 2, 1, 1] 880s -0.598 -0.184 880s -0.385 -0.796 880s 880s y: [3, 2, 1, 1] 880s 0.180 0.406 -0.544 880s -0.034 0.563 -0.776 880s testing: mul_mat, [3, 3, 4, 1] = [2, 3, 4, 1] * [2, 3, 4, 1] 880s x0: [2, 3, 4, 1] 880s 0.682 -0.479 880s 0.531 -0.764 880s -0.764 0.161 880s 880s x1: [2, 3, 4, 1] 880s -0.715 -0.790 880s -0.375 -0.588 880s 0.336 0.857 880s 880s y: [3, 3, 4, 1] 880s -0.109 0.224 0.419 880s 0.026 0.250 0.192 880s -0.181 -0.476 -0.120 880s testing: mul_mat, [2, 3, 4, 1] = [2, 2, 4, 1] * [2, 3, 4, 1] 880s x0: [2, 2, 4, 1] 880s 0.541 0.532 880s 0.549 0.638 880s 880s x1: [2, 3, 4, 1] 880s 0.849 -0.120 880s 0.977 -0.583 880s 0.930 0.277 880s 880s y: [2, 3, 4, 1] 880s 0.396 0.389 880s 0.218 0.164 880s 0.651 0.687 880s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 2, 1, 1] 880s -0.438 -0.744 880s 0.920 -0.830 880s 880s x1: [2, 2, 1, 1] 880s -0.752 0.515 880s -0.405 -0.383 880s 880s y: [2, 2, 1, 1] 880s -0.053 -1.119 880s 0.463 -0.055 880s testing: mul_mat, [2, 2, 4, 1] = [2, 2, 4, 1] * [2, 2, 4, 1] 880s x0: [2, 2, 4, 1] 880s 0.382 0.058 880s 0.060 -0.137 880s 880s x1: [2, 2, 4, 1] 880s 0.067 -0.255 880s 0.953 0.099 880s 880s y: [2, 2, 4, 1] 880s 0.011 0.039 880s 0.370 0.044 880s testing: mul_mat, [4, 2, 4, 1] = [2, 4, 4, 1] * [2, 2, 4,ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s 1] 880s x0: [2, 4, 4, 1] 880s -0.497 -0.450 880s -0.828 -0.042 880s -0.110 0.731 880s 0.808 -0.078 880s 880s x1: [2, 2, 4, 1] 880s -0.247 -0.992 880s -0.126 -0.931 880s 880s y: [4, 2, 4, 1] 880s 0.569 0.246 -0.697 -0.123 880s 0.482 0.144 -0.667 -0.030 880s test-mul-mat0: iter:263/500 880s testing: mul_mat, [3, 1, 1, 1] = [3, 3, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 3, 1, 1] 880s -0.533 -0.200 -0.649 880s -0.050 0.724 0.492 880s 0.442 0.522 -0.298 880s 880s x1: [3, 1, 1, 1] 880s -0.624 -0.106 0.249 880s 880s y: [3, 1, 1, 1] 880s 0.192 0.077 -0.406 880s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.894 0.115 0.043 880s 0.460 -0.108 -0.539 880s -0.377 0.218 -0.163 880s 0.517 -0.533 -0.200 880s 880s x1: [3, 3, 1, 1] 880s -0.418 0.225 -0.155 880s -0.352 0.846 0.215 880s 0.403 0.254 0.511 880s 880s y: [4, 3, 1, 1] 880s 0.393 -0.133 0.232 -0.305 880s 0.422 -0.369 0.282 -0.676 880s -0.309 -0.117 -0.180 -0.028 880s testing: mul_mat, [2, 4, 1, 2] = [3, 2, 1, 2] * [3, 4, 1, 2] 880s x0: [3, 2, 1, 2] 880s 0.027 -0.859 0.868 880s -0.456 -0.392 -0.332 880s 880s x1: [3, 4, 1, 2] 880s 0.985 0.267 0.152 880s -0.065 -0.009 -0.356 880s -0.623 -0.487 0.346 880s -0.041 0.738 -0.809 880s 880s y: [2, 4, 1, 2] 880s -0.071 -0.604 880s -0.303 0.151 880s 0.702 0.360 880s -1.337 -0.002 880s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 1, 1, 1] 880s -0.298 0.406 0.417 880s 880s x1: [3, 2, 1, 1] 880s 0.841 0.380 0.810 880s -0.200 0.118 -1.000 880s 880s y: [1, 2, 1, 1] 880s 0.241 880s -0.310 880s testing: mul_mat, [4, 3, 1, 1] = [1, 4, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 4, 1, 1] 880s -0.647 880s 0.512 880s -0.114 880s 0.276 880s 880s x1: [1, 3, 1, 1] 880s 0.539 880s -0.678 880s -0.057 880s 880s y: [4, 3, 1, 1] 880s -0.349 0.276 -0.061 0.149 880s 0.439 -0.347 0.077 -0.187 880s 0.037 -0.029 0.006 -0.016 880s testing: mul_mat, [1, 1, 1, 2] = [4, 1, 1, 2] * [4, 1, 1, 2] 880s x0: [4, 1, 1, 2] 880s -0.102 -0.550 -0.232 0.362 880s 880s x1: [4, 1, 1, 2] 880s -0.457 0.913 0.417 -0.590 880s 880s y: [1, 1, 1, 2] 880s -0.766 880s test-mul-mat0: iter:264/500 880s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 4, 1, 1] 880s 0.989 0.253 880s 0.370 -0.124 880s -0.471 0.912 880s -0.211 0.946 880s 880s x1: [2, 2, 1, 1] 880s -0.422 -0.085 880s 0.506 -0.100 880s 880s y: [4, 2, 1, 1] 880s -0.439 -0.145 0.121 0.008 880s 0.476 0.200 -0.330 -0.202 880s testing: mul_mat, [4, 4, 2, 1] = [2, 4, 2, 1] * [2, 4, 2, 1] 880s x0: [2, 4, 2, 1] 880s -0.981 -0.416 880s -0.522 0.526 880s 0.484 -0.664 880s 0.515 -0.263 880s 880s x1: [2, 4, 2, 1] 880s -0.677 0.247 880s 0.971 0.401 880s -0.767 0.872 880s -0.700 -0.316 880s 880s y: [4, 4, 2, 1] 880s 0.561 0.483 -0.492 -0.414 880s -1.119 -0.296 0.203 0.395 880s 0.389 0.859 -0.950 -0.624 880s 0.818 0.199 -0.129 -0.278 880s testing: mul_mat, [3, 4, 2, 4] = [2, 3, 2, 4] * [2, 4, 2, 4] 880s x0: [2, 3, 2, 4] 880s -0.600 -0.605 880s 0.181 0.756 880s -0.290 0.094 880s 880s x1: [2, 4, 2, 4] 880s 0.182 0.342 880s -0.340 0.054 880s 0.642 0.344 880s 0.694 0.304 880s 880s y: [3, 4, 2, 4] 880s -0.316 0.292 -0.021 880s 0.171 -0.021 0.104 880s -0.593 0.376 -0.154 880s -0.600 0.355 -0.173 880s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 1, 1, 1] 880s 0.525 0.658 880s 880s x1: [2, 3, 1, 1] 880s 0.095 0.655 880s 0.893 -0.254 880s -0.282 0.711 880s 880s y: [1, 3, 1, 1] 880s 0.481 880s 0.302 880s 0.319 880s testing: mul_mat, [4, 2, 2, 1] = [1, 4, 2, 1] * [1, 2, 2, 1] 880s x0: [1, 4, 2, 1] 880s 0.430 880s -0.485 880s -0.695 880s -0.727 880s 880s x1: [1, 2, 2, 1] 880s -0.959 880s 0.668 880s 880s y: [4, 2, 2, 1] 880s -0.413 0.466 0.666 0.698 880s 0.287 -0.324 -0.464 -0.486 880s testing: mul_mat, [4, 1, 2, 4] = [4, 4, 2, 4] * [4, 1, 2, 4] 880s x0: [4, 4, 2, 4] 880s 0.606 0.665 -0.738 -0.805 880s 0.580 0.044 -0.377 0.082 880s 0.482 -0.238 -0.229 0.658 880s -0.205 -0.155 -0.803 0.425 880s 880s x1: [4, 1, 2, 4] 880s 0.704 0.107 0.034 -0.262 880s 880s y: [4, 1, 2, 4] 880s 0.683 0.379 0.134 -0.300 880s test-mul-mat0: iter:265/500 880s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 2, 1, 1] 880s -0.664 -0.527 880s 0.662 0.346 880s 880s x1: [2, 2, 1, 1] 880s -0.506 0.638 880s 0.790 -0.251 880s 880s y: [2, 2, 1, 1] 880s -0.000 -0.114 880s -0.392 0.436 880s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.027 0.652 880s -0.677 0.465 880s 880s x1: [2, 2, 1, 1] 880s 0.446 -0.650 880s 0.185 -0.239 880s 880s y: [2, 2, 1, 1] 880s -0.412 -0.604 880s -0.151 -0.237 880s testing: mul_mat, [1, 2, 1, 2] = [2, 1, 1, 2] * [2, 2, 1, 2] 880s x0: [2, 1, 1, 2] 880s -0.373 -0.589 880s 880s x1: [2, 2, 1, 2] 880s -0.368 -0.788 880s 0.080 -0.665 880s 880s y: [1, 2, 1, 2] 880s 0.602 880s 0.362 880s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 4, 1, 1] 880s 0.654 -0.037 880s 0.3ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s 02 -0.693 880s -0.039 0.573 880s -0.873 0.313 880s 880s x1: [2, 1, 1, 1] 880s -0.095 0.080 880s 880s y: [4, 1, 1, 1] 880s -0.065 -0.084 0.050 0.108 880s testing: mul_mat, [3, 2, 1, 1] = [4, 3, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 3, 1, 1] 880s 0.065 0.111 -0.961 0.481 880s -0.048 -0.949 -0.323 0.513 880s -0.842 0.856 -0.733 0.172 880s 880s x1: [4, 2, 1, 1] 880s 0.492 -0.666 0.993 -0.480 880s 0.986 -0.684 0.985 -0.383 880s 880s y: [3, 2, 1, 1] 880s -1.228 0.042 -1.795 880s -1.143 0.088 -2.203 880s testing: mul_mat, [2, 4, 1, 2] = [3, 2, 1, 2] * [3, 4, 1, 2] 880s x0: [3, 2, 1, 2] 880s 0.617 0.258 -0.344 880s 0.143 0.098 -0.570 880s 880s x1: [3, 4, 1, 2] 880s -0.439 -0.863 -0.174 880s 0.863 0.098 -0.047 880s -0.174 0.405 -0.474 880s -0.861 -0.102 -0.141 880s 880s y: [2, 4, 1, 2] 880s -0.433 -0.048 880s 0.574 0.160 880s 0.161 0.285 880s -0.509 -0.053 880s test-mul-mat0: iter:266/500 880s testing: mul_mat, [2, 2, 1, 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 2, 1, 1] 880s 0.482 -0.684 0.180 0.060 880s -0.219 -0.829 -0.324 0.924 880s 880s x1: [4, 2, 1, 1] 880s -0.806 0.594 -0.380 0.612 880s -0.561 0.068 0.015 0.901 880s 880s y: [2, 2, 1, 1] 880s -0.826 0.373 880s -0.261 0.894 880s testing: mul_mat, [2, 2, 3, 1] = [4, 2, 3, 1] * [4, 2, 3, 1] 880s x0: [4, 2, 3, 1] 880s -0.297 0.923 0.087 -0.126 880s -0.400 0.011 -0.698 -0.626 880s 880s x1: [4, 2, 3, 1] 880s 0.429 0.774 -0.420 0.858 880s 0.433 0.373 0.114 -0.045 880s 880s y: [2, 2, 3, 1] 880s 0.442 -0.407 880s 0.231 -0.220 880s testing: mul_mat, [2, 2, 3, 4] = [4, 2, 3, 4] * [4, 2, 3, 4] 880s x0: [4, 2, 3, 4] 880s 0.665 0.962 -0.388 0.418 880s 0.231 -0.159 -0.203 -0.472 880s 880s x1: [4, 2, 3, 4] 880s -0.606 0.743 0.294 -0.616 880s 0.266 -0.843 -0.310 0.969 880s 880s y: [2, 2, 3, 4] 880s -0.060 -0.027 880s -0.108 -0.199 880s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 1, 1, 1] 880s -0.389 0.543 0.197 0.269 880s 880s x1: [4, 2, 1, 1] 880s -0.575 -0.250 -0.388 0.809 880s -0.391 -0.374 0.478 0.796 880s 880s y: [1, 2, 1, 1] 880s 0.229 880s 0.258 880s testing: mul_mat, [2, 4, 3, 1] = [1, 2, 3, 1] * [1, 4, 3, 1] 880s x0: [1, 2, 3, 1] 880s -0.464 880s -0.721 880s 880s x1: [1, 4, 3, 1] 880s -0.026 880s -0.920 880s -0.763 880s -0.638 880s 880s y: [2, 4, 3, 1] 880s 0.012 0.018 880s 0.427 0.663 880s 0.354 0.550 880s 0.296 0.460 880s testing: mul_mat, [3, 1, 3, 4] = [2, 3, 3, 4] * [2, 1, 3, 4] 880s x0: [2, 3, 3, 4] 880s 0.312 0.157 880s -0.644 0.561 880s 0.141 0.222 880s 880s x1: [2, 1, 3, 4] 880s -0.404 0.817 880s 880s y: [3, 1, 3, 4] 880s 0.002 0.718 0.125 880s test-mul-mat0: iter:267/500 880s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 4, 1, 1] 880s -0.640 -0.142 880s 0.901 0.083 880s -0.636 -0.564 880s 0.171 0.571 880s 880s x1: [2, 1, 1, 1] 880s -0.165 0.254 880s 880s y: [4, 1, 1, 1] 880s 0.070 -0.128 -0.039 0.117 880s testing: mul_mat, [4, 4, 4, 1] = [2, 4, 4, 1] * [2, 4, 4, 1] 880s x0: [2, 4, 4, 1] 880s -0.551 -0.838 880s 0.847 0.646 880s -0.162 -0.155 880s 0.887 -0.181 880s 880s x1: [2, 4, 4, 1] 880s -0.863 -0.245 880s -0.312 0.779 880s 0.912 0.197 880s -0.325 -0.002 880s 880s y: [4, 4, 4, 1] 880s 0.681 -0.890 0.178 -0.721 880s -0.481 0.239 -0.070 -0.418 880s -0.668 0.900 -0.179 0.773 880s 0.181 -0.276 0.053 -0.287 880s testing: mul_mat, [1, 4, 4, 3] = [2, 1, 4, 3] * [2, 4, 4, 3] 880s x0: [2, 1, 4, 3] 880s -0.089 -0.850 880s 880s x1: [2, 4, 4, 3] 880s -0.526 0.834 880s -0.136 0.312 880s -0.320 -0.249 880s -0.869 0.099 880s 880s y: [1, 4, 4, 3] 880s -0.662 880s -0.253 880s 0.240 880s -0.007 880s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 3, 1, 1] 880s -0.976 0.962 880s -0.165 0.745 880s 0.936 -0.914 880s 880s x1: [2, 1, 1, 1] 880s 0.909 0.169 880s 880s y: [3, 1, 1, 1] 880s -0.724 -0.024 0.696 880s testing: mul_mat, [1, 2, 4, 1] = [3, 1, 4, 1] * [3, 2, 4, 1] 880s x0: [3, 1, 4, 1] 880s 0.103 0.056 0.136 880s 880s x1: [3, 2, 4, 1] 880s -0.478 0.909 0.021 880s -0.094 0.869 -0.102 880s 880s y: [1, 2, 4, 1] 880s 0.004 880s 0.025 880s testing: mul_mat, [2, 3, 4, 3] = [1, 2, 4, 3] * [1, 3, 4, 3] 880s x0: [1, 2, 4, 3] 880s -0.514 880s 0.256 880s 880s x1: [1, 3, 4, 3] 880s 0.525 880s 0.867 880s 0.702 880s 880s y: [2, 3, 4, 3] 880s -0.270 0.135 880s -0.446 0.222 880s -0.361 0.180 880s test-mul-mat0: iter:268/500 880s testing: mul_mat, [3, 1, 1, 1] = [3, 3, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.501 0.558 0.823 880s -0.804 -0.400 0.168 880s 0.500 0.670 -0.007 880s 880s x1: [3, 1, 1, 1] 880s 0.002 -0.755 0.492 880s 880s y: [3, 1, 1, 1] 880s -0.016 0.383 -0.508 880s testing: mul_mat, [3, 3, 2, 1] = [3, 3, 2, 1] * [3, 3, 2, 1] 880s x0: [3, 3, 2, 1] 880s 0.280 -0.430 -0.064 880s -0.219 -0.872 -0.241 880s -0.022 -0.272 0.927 880s 880s x1: [3, 3, 2, 1] 880s -0.510 -0.011 -0.502 880s -0.731 0.613 -0.180 880s 0.903 -0.658 0.180 880s 880s y: [3, 3, 2, 1] 880s -0.106 0.242 -0.451 880s -0.457 -0.331 -0.318 880s 0.525 0.333 0.326 880s testing: mul_mat, [4, 3, 2, 3] = [3, 4, 2, 3] * [3, 3, 2, 3] 880s x0: [3, 4, 2, 3] 880s 0.478 0.997 -0.058 880s 0.435 -0.190 0.357 880s 0.629 -0.237 0.114 880s 0.242 -0.169 -0.900 880s 880s x1: [3, 3, 2, 3] 880s 0.139 0.341 0.418 880s -0.885 0.229 0.240 880s -0.902 -0.940 0.128 880s 880s y: [4, 3, 2, 3] 880s 0.382 0.145 0.054 -0.401 880s -0.208 -0.342 -0.584 -0.469 880s -1.376 -0.168 -0.330 -0.174 880s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.793 0.987 -0.343 880s -0.454 0.904 0.165 880s -0.056 0.146 -0.847 880s -0.174 0.542 -0.362 880s 880s x1: [3, 4, 1, 1] 880s 0.900 -0.821 0.131 880s -0.001 -0.049 0.402 880s 0.807 0.009 0.935 880s 0.028 -0.130 0.695 880s 880s y: [4, 4, 1, 1] 880s -1.569 -1.129 -0.281 -0.649 880s -0.185 0.022 -0.347 -0.172 880s -0.951 -0.203 -0.836 -0.474 880s -0.389 -0.016 -0.609 -0.327 880s testing: mul_mat, [4, 3, 2, 1] = [4, 4, 2, 1] * [4, 3, 2, 1] 880s x0: [4, 4, 2, 1] 880s -0.081 -0.195 0.143 -0.346 880s -0.349 0.667 0.265 0.624 880s 0.029 0.775 0.714 0.101 880s -0.916 0.983 0.557 0.042 880s 880s x1: [4, 3, 2, 1] 880s 0.692 0.178 0.476 0.489 880s 0.731 -0.587 0.389 0.910 880s 0.545 -0.612 -0.139 -0.054 880s 880s y: [4, 3, 2, 1] 880s -0.192 0.309 0.548 -0.172 880s -0.204 0.025 -0.064 -0.991 880s 0.074 -0.668 -0.563 -1.180 880s testing: mul_mat, [4, 4, 2, 3] = [4, 4, 2, 3] * [4, 4, 2, 3] 880s x0: [4, 4, 2, 3] 880s 0.620 0.254 0.585 0.930 880s 0.611 -0.988 0.054 -0.393 880s -0.213 0.613 -0.421 -0.867 880s -0.759 0.725 -0.494 -0.463 880s 880s x1: [4, 4, 2, 3] 880s -0.609 0.850 -0.182 0.196 880s 0.517 -0.406 0.179 -0.340 880s 0.858 -0.107 -0.783 -0.487 880s -0.483 0.318 0.555 0.002 880s 880s y: [4, 4, 2, 3] 880s -0.085 -1.298 0.557 1.077 880s 0.006 0.860 -0.140 -0.618 880s -0.407 0.779 0.504 -0.117 880s 0.107 -0.580 0.063 0.323 880s test-mul-mat0: iter:269/500 880s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 4, 1, 1] 880s -0.313 0.321 880s 0.881 -0.250 880s -0.912 -0.498 880s -0.073 0.257 880s 880s x1: [2, 4, 1, 1] 880s -0.280 0.879 880s -0.316 0.793 880s -0.645 0.698 880s -0.905 0.196 880s 880s y: [4, 4, 1, 1] 880s 0.370 -0.467 -0.182 0.247 880s 0.354 -0.477 -0.107 0.227 880s 0.426 -0.743 0.241 0.227 880s 0.346 -0.847 0.727 0.117 880s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 4, 1, 1] 880s -0.577 -0.844 880s 0.895 -0.942 880s 0.783 -0.386 880s 0.936 -0.533 880s 880s x1: [2, 4, 1, 1] 880s -0.918 0.185 880s 0.999 -0.430 880s 0.023 0.768 880s 0.307 0.561 880s 880s y: [4, 4, 1, 1] 880s 0.374 -0.996 -0.790 -0.958 880s -0.214 1.300 0.948 1.165 880s -0.661 -0.703 -0.278 -0.388 880s -0.650 -0.253 0.024 -0.011 880s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.940 -0.103 880s 0.754 -0.134 880s 880s x1: [2, 4, 1, 1] 880s -0.592 -0.709 880s -0.835 -0.497 880s 0.487 0.370 880s 0.190 -0.191 880s 880s y: [2, 4, 1, 1] 880s -0.484 -0.352 880s -0.734 -0.563 880s 0.420 0.318 880s 0.198 0.169 880s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.075 0.712 880s 0.922 -0.365 880s 880s x1: [2, 2, 1, 1] 880s -0.845 0.835 880s -0.949 -0.846 880s 880s y: [2, 2, 1, 1] 880s 0.532 -1.084 880s -0.673 -0.566 880s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 3, 1, 1] 880s -0.426 0.041 880s -0.942 -0.534 880s -0.871 -0.579 880s 880s x1: [2, 2, 1, 1] 880s 0.549 -0.865 880s -0.209 0.443 880s 880s y: [3, 2, 1, 1] 880s -0.269 -0.055 0.023 880s 0.107 -0.040 -0.075 880s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 1, 1, 1] 880s -0.902 0.614 0.581 880s 880s x1: [3, 2, 1, 1] 880s 0.206 -0.032 0.908 880s -0.424 -0.842 -0.283 880s 880s y: [1, 2, 1, 1] 880s 0.323 880s -0.299 880s test-mul-mat0: iter:270/500 880s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 4, 1, 1] 880s -0.164 0.862 880s -0.908 -0.106 880s 0.992 0.133 880s 0.359 -0.588 880s 880s x1: [2, 4, 1, 1] 880s 0.922 0.822 880s -0.909 0.844 880s 0.534 -0.274 880s 0.393 0.669 880s 880s y: [4, 4, 1, 1] 880s 0.557 -0.925 1.024 -0.152 880s 0.877 0.736 -0.790 -0.823 880s -0.324 -0.456 0.493 0.353 880s 0.513 -0.428 0.478 -0.252 880s testing: mul_mat, [2, 4, 4, 1] = [2, 2, 4, 1] * [2, 4, 4, 1] 880s x0: [2, 2, 4, 1] 880s 0.353 0.377 880s 0.481 -0.609 880s 880s x1: [2, 4, 4, 1] 880s -0.661 -0.672 880s -0.680 -0.085 880s -0.514 0.037 880s -0.257 -0.416 880s 880s y: [2, 4, 4, 1] 880s -0.487 0.092 880s -0.272 -0.275 880s -0.168 -0.270 880s -0.248 0.130 880s testing: mul_mat, [3, 2, 4, 2] = [2, 3, 4, 2] * [2, 2, 4, 2] 880s x0: [2, 3, 4, 2] 880s 0.505 0.033 880s -0.547 0.487 880s 0.277 -0.443 880s 880s x1: [2, 2, 4, 2] 880s -0.152 -0.785 880s 0.567 0.615 880s 880s y: [3, 2, ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s 4, 2] 880s -0.103 -0.299 0.306 880s 0.307 -0.011 -0.115 880s testing: mul_mat, [3, 3, 1, 1] = [2, 3, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 3, 1, 1] 880s -0.688 -0.547 880s -0.022 -0.372 880s 0.284 -0.553 880s 880s x1: [2, 3, 1, 1] 880s -0.994 -0.430 880s 0.501 -0.637 880s 0.296 0.265 880s 880s y: [3, 3, 1, 1] 880s 0.919 0.182 -0.044 880s 0.004 0.226 0.495 880s -0.349 -0.105 -0.063 880s testing: mul_mat, [1, 2, 4, 1] = [3, 1, 4, 1] * [3, 2, 4, 1] 880s x0: [3, 1, 4, 1] 880s 0.964 0.964 0.843 880s 880s x1: [3, 2, 4, 1] 880s 0.932 0.782 -0.357 880s 0.864 -0.576 -0.883 880s 880s y: [1, 2, 4, 1] 880s 1.352 880s -0.466 880s testing: mul_mat, [3, 3, 4, 2] = [1, 3, 4, 2] * [1, 3, 4, 2] 880s x0: [1, 3, 4, 2] 880s -0.010 880s -0.636 880s -0.484 880s 880s x1: [1, 3, 4, 2] 880s -0.434 880s -0.359 880s -0.218 880s 880s y: [3, 3, 4, 2] 880s 0.004 0.276 0.210 880s 0.004 0.228 0.174 880s 0.002 0.139 0.106 880s test-mul-mat0: iter:271/500 880s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 4, 1, 1] 880s -0.148 -0.650 0.258 -0.187 880s -0.211 0.775 0.336 0.693 880s -0.200 0.073 -0.151 -0.449 880s -0.320 -0.387 -0.251 0.180 880s 880s x1: [4, 2, 1, 1] 880s -0.111 0.021 -0.172 0.879 880s 0.385 0.344 0.082 -0.453 880s 880s y: [4, 2, 1, 1] 880s -0.206 0.591 -0.345 0.229 880s -0.175 -0.101 0.139 -0.358 880s testing: mul_mat, [1, 4, 2, 1] = [4, 1, 2, 1] * [4, 4, 2, 1] 880s x0: [4, 1, 2, 1] 880s 0.800 -0.819 -0.615 -0.824 880s 880s x1: [4, 4, 2, 1] 880s -0.363 -0.618 0.131 -0.115 880s 0.413 -0.625 0.774 -0.566 880s 0.203 0.653 0.818 -0.453 880s -0.264 -0.634 0.457 0.587 880s 880s y: [1, 4, 2, 1] 880s 0.230 880s 0.833 880s -0.502 880s -0.457 880s testing: mul_mat, [3, 1, 2, 4] = [4, 3, 2, 4] * [4, 1, 2, 4] 880s x0: [4, 3, 2, 4] 880s -0.487 -0.878 0.697 -0.556 880s 0.840 0.794 -0.225 0.063 880s -0.620 -0.157 0.484 -0.744 880s 880s x1: [4, 1, 2, 4] 880s -0.363 0.925 -0.656 0.003 880s 880s y: [3, 1, 2, 4] 880s -1.094 0.577 -0.239 880s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 1, 1, 1] 880s -0.600 -0.296 -0.146 -0.825 880s 880s x1: [4, 3, 1, 1] 880s 0.249 0.731 -0.796 -0.586 880s -0.333 -0.422 -0.013 -0.065 880s -0.247 -0.311 -0.513 -0.298 880s 880s y: [1, 3, 1, 1] 880s 0.234 880s 0.380 880s 0.562 880s testing: mul_mat, [4, 4, 2, 1] = [1, 4, 2, 1] * [1, 4, 2, 1] 880s x0: [1, 4, 2, 1] 880s -0.918 880s -0.672 880s -0.223 880s -0.275 880s 880s x1: [1, 4, 2, 1] 880s -0.683 880s -0.605 880s 0.917 880s -0.238 880s 880s y: [4, 4, 2, 1] 880s 0.627 0.459 0.152 0.188 880s 0.555 0.406 0.135 0.166 880s -0.842 -0.616 -0.204 -0.252 880s 0.219 0.160 0.053 0.066 880s testing: mul_mat, [3, 1, 2, 4] = [4, 3, 2, 4] * [4, 1, 2, 4] 880s x0: [4, 3, 2, 4] 880s 0.205 -0.558 -0.618 0.024 880s -0.875 -0.760 0.959 -0.467 880s 0.906 -0.550 0.395 -0.485 880s 880s x1: [4, 1, 2, 4] 880s -0.561 0.122 -0.515 0.427 880s 880s y: [3, 1, 2, 4] 880s 0.146 -0.296 -0.985 880s test-mul-mat0: iter:272/500 880s testing: mul_mat, [4, 4, 1, 1] = [1, 4, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 4, 1, 1] 880s -0.532 880s -0.112 880s -0.188 880s 0.738 880s 880s x1: [1, 4, 1, 1] 880s -0.550 880s -0.517 880s 0.031 880s 0.728 880s 880s y: [4, 4, 1, 1] 880s 0.292 0.061 0.104 -0.405 880s 0.275 0.058 0.097 -0.381 880s -0.017 -0.003 -0.006 0.023 880s -0.387 -0.081 -0.137 0.537 880s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 2, 1, 1] 880s 0.992 880s -0.436 880s 880s x1: [1, 4, 1, 1] 880s 0.598 880s -0.081 880s -0.992 880s -0.696 880s 880s y: [2, 4, 1, 1] 880s 0.593 -0.261 880s -0.080 0.035 880s -0.985 0.433 880s -0.691 0.303 880s testing: mul_mat, [4, 2, 1, 2] = [1, 4, 1, 2] * [1, 2, 1, 2] 880s x0: [1, 4, 1, 2] 880s 0.947 880s -0.338 880s -0.173 880s -0.545 880s 880s x1: [1, 2, 1, 2] 880s 0.473 880s -0.324 880s 880s y: [4, 2, 1, 2] 880s 0.447 -0.160 -0.082 -0.258 880s -0.307 0.110 0.056 0.177 880s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 3, 1, 1] 880s -0.099 880s -0.410 880s -0.769 880s 880s x1: [1, 4, 1, 1] 880s 0.147 880s -0.269 880s 0.173 880s 0.630 880s 880s y: [3, 4, 1, 1] 880s -0.014 -0.060 -0.113 880s 0.027 0.110 0.207 880s -0.017 -0.071 -0.133 880s -0.062 -0.259 -0.485 880s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.680 0.968 0.003 880s 0.976 -0.745 -0.278 880s 0.691 0.410 -0.587 880s 0.937 -0.356 -0.050 880s 880s x1: [3, 1, 1, 1] 880s 0.789 0.401 0.968 880s 880s y: [4, 1, 1, 1] 880s -0.145 0.203 0.141 0.549 880s testing: mul_mat, [2, 3, 1, 2] = [4, 2, 1, 2] * [4, 3, 1, 2] 880s x0: [4, 2, 1, 2] 880s -0.009 0.476 -0.890 0.097 880s 0.697 0.713 -0.574 0.350 880s 880s x1: [4, 3, 1, 2] 880s 0.384 0.240 0.405 -0.447 880s -0.445 -0.585 0.276 0.702 880s 0.146 -0.551 0.332 -0.091 880s 880s y: [2, 3, 1, 2] 880s -0.293 0.050 880s -0.451 -0.639 880s -0.568 -0.513 880s test-mul-mat0: iter:273/500 880s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s 1] * [3, 2, 1, 1] 880s x0: [3, 2, 1, 1] 880s -0.392 0.409 -0.313 880s 0.081 -0.601 -0.616 880s 880s x1: [3, 2, 1, 1] 880s 0.441 0.864 -0.815 880s -0.450 -0.610 0.828 880s 880s y: [2, 2, 1, 1] 880s 0.435 0.019 880s -0.332 -0.180 880s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 2, 1, 1] 880s -0.141 0.750 0.059 880s 0.784 0.357 -0.548 880s 880s x1: [3, 2, 1, 1] 880s -0.444 -0.888 -0.506 880s -0.018 0.209 0.844 880s 880s y: [2, 2, 1, 1] 880s -0.633 -0.387 880s 0.209 -0.402 880s testing: mul_mat, [4, 2, 1, 4] = [3, 4, 1, 4] * [3, 2, 1, 4] 880s x0: [3, 4, 1, 4] 880s 0.159 0.538 0.236 880s -0.057 -0.105 0.687 880s -0.899 0.192 -0.560 880s 0.983 -0.650 -0.365 880s 880s x1: [3, 2, 1, 4] 880s 0.158 -0.703 -0.247 880s 0.882 0.158 -0.805 880s 880s y: [4, 2, 1, 4] 880s -0.411 -0.105 -0.139 0.702 880s 0.036 -0.620 -0.311 1.059 880s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 4, 1, 1] 880s 0.976 -0.378 -0.588 880s -0.295 -0.217 -0.446 880s -0.655 -0.161 -0.358 880s 0.564 -0.360 -0.608 880s 880s x1: [3, 4, 1, 1] 880s 0.486 -0.797 0.981 880s -0.691 -0.365 -0.597 880s -0.418 0.448 0.917 880s -0.201 0.624 0.731 880s 880s y: [4, 4, 1, 1] 880s 0.200 -0.408 -0.541 -0.035 880s -0.186 0.549 0.725 0.105 880s -1.116 -0.383 -0.128 -0.955 880s -0.862 -0.402 -0.231 -0.783 880s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 4, 1, 1] 880s -0.747 -0.365 -0.043 -0.744 880s -0.097 -0.388 0.417 -0.398 880s 0.809 0.853 0.961 -0.848 880s 0.877 0.340 0.740 -0.190 880s 880s x1: [4, 3, 1, 1] 880s -0.097 -0.628 -0.935 -0.368 880s 0.967 -0.398 -0.881 -0.830 880s -0.417 -0.572 -0.195 -0.014 880s 880s y: [4, 3, 1, 1] 880s 0.615 0.010 -1.201 -0.920 880s 0.079 0.024 0.298 0.219 880s 0.539 0.187 -1.001 -0.702 880s testing: mul_mat, [2, 4, 1, 4] = [4, 2, 1, 4] * [4, 4, 1, 4] 880s x0: [4, 2, 1, 4] 880s -0.851 -0.988 0.417 0.220 880s 0.881 -0.191 0.556 -0.253 880s 880s x1: [4, 4, 1, 4] 880s 0.243 -0.456 0.713 0.615 880s -0.391 -0.655 0.582 0.211 880s -0.536 0.751 0.794 -0.108 880s -0.444 -0.220 0.902 -0.191 880s 880s y: [2, 4, 1, 4] 880s 0.676 0.542 880s 1.269 0.051 880s 0.021 -0.148 880s 0.929 0.201 880s test-mul-mat0: iter:274/500 880s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 2, 1, 1] 880s -0.750 -0.044 880s 0.359 -0.421 880s 880s x1: [2, 1, 1, 1] 880s -0.742 -0.231 880s 880s y: [2, 1, 1, 1] 880s 0.567 -0.169 880s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 4, 1, 1] 880s 0.702 -0.222 880s 0.297 -0.903 880s 0.169 0.212 880s 0.160 -0.267 880s 880s x1: [2, 2, 1, 1] 880s -0.080 -0.991 880s -0.045 0.347 880s 880s y: [4, 2, 1, 1] 880s 0.164 0.872 -0.223 0.252 880s -0.109 -0.327 0.066 -0.100 880s testing: mul_mat, [1, 4, 1, 4] = [2, 1, 1, 4] * [2, 4, 1, 4] 880s x0: [2, 1, 1, 4] 880s -0.220 0.823 880s 880s x1: [2, 4, 1, 4] 880s -0.267 -0.937 880s -0.507 0.326 880s -0.479 0.184 880s -0.933 -0.140 880s 880s y: [1, 4, 1, 4] 880s -0.713 880s 0.379 880s 0.257 880s 0.090 880s testing: mul_mat, [1, 1, 1, 1] = [2, 1, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 1, 1, 1] 880s -0.934 -0.610 880s 880s x1: [2, 1, 1, 1] 880s -0.053 0.996 880s 880s y: [1, 1, 1, 1] 880s -0.557 880s testing: mul_mat, [2, 2, 1, 1] = [1, 2, 1, 1] * [1, 2, 1, 1] 880s x0: [1, 2, 1, 1] 880s 0.493 880s 0.031 880s 880s x1: [1, 2, 1, 1] 880s 0.232 880s -0.456 880s 880s y: [2, 2, 1, 1] 880s 0.115 0.007 880s -0.225 -0.014 880s testing: mul_mat, [3, 1, 1, 4] = [2, 3, 1, 4] * [2, 1, 1, 4] 880s x0: [2, 3, 1, 4] 880s 0.536 -0.515 880s 0.330 -0.890 880s 0.100 -0.077 880s 880s x1: [2, 1, 1, 4] 880s 0.292 0.102 880s 880s y: [3, 1, 1, 4] 880s 0.104 0.006 0.021 880s test-mul-mat0: iter:275/500 880s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 1, 1, 1] 880s 0.897 0.703 -0.440 880s 880s x1: [3, 1, 1, 1] 880s -0.461 -0.433 -0.398 880s 880s y: [1, 1, 1, 1] 880s -0.543 880s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 1, 1, 1] 880s 0.438 -0.197 -0.993 880s 880s x1: [3, 1, 1, 1] 880s -0.993 -0.374 -0.484 880s 880s y: [1, 1, 1, 1] 880s 0.119 880s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 1, 1, 1] 880s 0.335 -0.761 0.406 880s 880s x1: [3, 1, 1, 1] 880s 0.756 -0.964 0.928 880s 880s y: [1, 1, 1, 1] 880s 1.364 880s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 2, 1, 1] 880s -0.119 0.213 0.901 880s -0.623 -0.229 0.213 880s 880s x1: [3, 1, 1, 1] 880s 0.178 0.914 -0.659 880s 880s y: [2, 1, 1, 1] 880s -0.421 -0.460 880s testing: mul_mat, [3, 3, 1, 1] = [2, 3, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 3, 1, 1] 880s -0.628 -0.546 880s -0.857 0.582 880s -0.539 0.264 880s 880s x1: [2, 3, 1, 1] 880s 0.310 -0.532 880s 0.815 0.385 880s -0.635 0.518 880s 880s y: [3, 3, 1, 1] 880s 0.096 -0.575 -0.307 880s -0.722 -0.474 -0.337 880s 0.116 0.845 0.479 880s testing: mul_mat, [1, 2, 1, 1]ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s = [3, 1, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 1, 1, 1] 880s -0.291 0.814 -0.457 880s 880s x1: [3, 2, 1, 1] 880s -0.539 0.338 0.300 880s -0.611 0.303 -0.365 880s 880s y: [1, 2, 1, 1] 880s 0.294 880s 0.591 880s test-mul-mat0: iter:276/500 880s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.109 -0.839 0.624 880s -0.164 -0.467 0.768 880s 0.298 -0.013 0.349 880s -0.438 0.449 -0.313 880s 880s x1: [3, 2, 1, 1] 880s 0.079 -0.805 0.328 880s -0.708 0.506 0.796 880s 880s y: [4, 2, 1, 1] 880s 0.871 0.614 0.148 -0.498 880s 0.150 0.491 0.061 0.288 880s testing: mul_mat, [1, 4, 3, 1] = [3, 1, 3, 1] * [3, 4, 3, 1] 880s x0: [3, 1, 3, 1] 880s -0.610 -0.258 0.520 880s 880s x1: [3, 4, 3, 1] 880s 0.862 0.838 0.990 880s -0.503 -0.534 -0.301 880s -0.689 0.009 0.748 880s -0.823 -0.567 0.175 880s 880s y: [1, 4, 3, 1] 880s -0.228 880s 0.288 880s 0.807 880s 0.740 880s testing: mul_mat, [2, 1, 3, 1] = [3, 2, 3, 1] * [3, 1, 3, 1] 880s x0: [3, 2, 3, 1] 880s 0.982 -0.074 0.747 880s 0.702 -0.645 -0.170 880s 880s x1: [3, 1, 3, 1] 880s 0.371 0.937 0.495 880s 880s y: [2, 1, 3, 1] 880s 0.665 -0.429 880s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.973 -0.159 0.482 880s -0.465 -0.204 -0.807 880s 880s x1: [3, 2, 1, 1] 880s -0.859 0.074 0.086 880s 0.512 0.011 -0.419 880s 880s y: [2, 2, 1, 1] 880s -0.806 0.315 880s 0.294 0.098 880s testing: mul_mat, [4, 3, 3, 1] = [2, 4, 3, 1] * [2, 3, 3, 1] 880s x0: [2, 4, 3, 1] 880s -0.821 0.097 880s -0.484 0.386 880s -0.151 0.163 880s 0.190 0.632 880s 880s x1: [2, 3, 3, 1] 880s -0.222 -0.592 880s 0.940 -0.521 880s -0.237 -0.230 880s 880s y: [4, 3, 3, 1] 880s 0.125 -0.121 -0.063 -0.416 880s -0.822 -0.656 -0.227 -0.151 880s 0.172 0.026 -0.002 -0.190 880s testing: mul_mat, [1, 2, 3, 1] = [4, 1, 3, 1] * [4, 2, 3, 1] 880s x0: [4, 1, 3, 1] 880s -0.233 0.956 0.687 -0.129 880s 880s x1: [4, 2, 3, 1] 880s -0.019 -0.758 0.691 -0.823 880s 0.347 0.121 0.514 0.526 880s 880s y: [1, 2, 3, 1] 880s -0.140 880s 0.320 880s test-mul-mat0: iter:277/500 880s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.387 0.630 0.909 880s -0.133 -0.975 -0.517 880s 880s x1: [3, 1, 1, 1] 880s -0.805 0.020 0.654 880s 880s y: [2, 1, 1, 1] 880s 0.296 -0.251 880s testing: mul_mat, [3, 2, 4, 1] = [3, 3, 4, 1] * [3, 2, 4, 1] 880s x0: [3, 3, 4, 1] 880s 0.058 -0.478 0.694 880s 0.926 -0.452 -0.824 880s 0.795 0.328 -0.552 880s 880s x1: [3, 2, 4, 1] 880s 0.869 -0.219 -0.729 880s 0.156 -0.334 0.039 880s 880s y: [3, 2, 4, 1] 880s -0.350 1.504 1.021 880s 0.196 0.264 -0.007 880s testing: mul_mat, [2, 3, 4, 1] = [3, 2, 4, 1] * [3, 3, 4, 1] 880s x0: [3, 2, 4, 1] 880s -0.314 0.824 0.401 880s -0.439 -0.902 0.092 880s 880s x1: [3, 3, 4, 1] 880s -0.456 0.720 -0.187 880s -0.008 -0.329 -0.193 880s -0.522 -0.266 0.154 880s 880s y: [2, 3, 4, 1] 880s 0.661 -0.466 880s -0.346 0.283 880s 0.007 0.483 880s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 4, 1, 1] 880s 0.013 0.104 -0.183 880s 0.005 0.781 -0.256 880s -0.311 0.168 -0.340 880s 0.574 0.101 -0.586 880s 880s x1: [3, 2, 1, 1] 880s 0.602 -0.880 -0.908 880s 0.041 -0.673 0.182 880s 880s y: [4, 2, 1, 1] 880s 0.083 -0.452 -0.026 0.787 880s -0.103 -0.572 -0.187 -0.151 880s testing: mul_mat, [1, 3, 4, 1] = [4, 1, 4, 1] * [4, 3, 4, 1] 880s x0: [4, 1, 4, 1] 880s -0.886 -0.082 -0.999 0.762 880s 880s x1: [4, 3, 4, 1] 880s -0.413 -0.042 0.844 -0.782 880s -0.114 -0.954 -0.947 0.118 880s 0.022 0.028 -0.978 -0.304 880s 880s y: [1, 3, 4, 1] 880s -1.070 880s 1.216 880s 0.724 880s testing: mul_mat, [2, 4, 4, 1] = [1, 2, 4, 1] * [1, 4, 4, 1] 880s x0: [1, 2, 4, 1] 880s 0.079 880s 0.686 880s 880s x1: [1, 4, 4, 1] 880s -0.508 880s 0.583 880s 0.155 880s -0.682 880s 880s y: [2, 4, 4, 1] 880s -0.040 -0.348 880s 0.046 0.400 880s 0.012 0.106 880s -0.054 -0.468 880s test-mul-mat0: iter:278/500 880s testing: mul_mat, [1, 1, 1, 1] = [2, 1, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 1, 1, 1] 880s -0.002 -0.825 880s 880s x1: [2, 1, 1, 1] 880s 0.415 0.020 880s 880s y: [1, 1, 1, 1] 880s -0.017 880s testing: mul_mat, [4, 1, 4, 1] = [2, 4, 4, 1] * [2, 1, 4, 1] 880s x0: [2, 4, 4, 1] 880s 0.036 -0.608 880s 0.030 -0.040 880s -0.075 -0.891 880s -0.354 0.433 880s 880s x1: [2, 1, 4, 1] 880s 0.401 0.385 880s 880s y: [4, 1, 4, 1] 880s -0.220 -0.003 -0.373 0.025 880s testing: mul_mat, [1, 4, 4, 1] = [2, 1, 4, 1] * [2, 4, 4, 1] 880s x0: [2, 1, 4, 1] 880s 0.967 0.152 880s 880s x1: [2, 4, 4, 1] 880s -0.235 0.235 880s -0.123 0.690 880s 0.344 0.523 880s 0.123 -0.389 880s 880s y: [1, 4, 4, 1] 880s -0.191 880s -0.013 880s 0.412 880s 0.060 880s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 4, 1, 1] 880s 0.663 -0.018 880s -0.435 -0.770 880s -0.322 0.503 880s 0.776 -0.149 880s 880s x1: [2, 1, 1, 1] 880s -0.834 0.423 880s 880s y: [4, 1, 1, 1] 880s -0.560 0.038 0.481 -0.710 880s testing: mul_mat, [2, 2, 4, 1] =ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s [4, 2, 4, 1] * [4, 2, 4, 1] 880s x0: [4, 2, 4, 1] 880s -0.695 0.171 0.746 -0.916 880s 0.418 0.854 -0.578 0.165 880s 880s x1: [4, 2, 4, 1] 880s -0.134 -0.770 0.073 0.157 880s -0.446 0.549 -0.671 0.252 880s 880s y: [2, 2, 4, 1] 880s -0.127 -0.729 880s -0.328 0.712 880s testing: mul_mat, [4, 4, 4, 1] = [2, 4, 4, 1] * [2, 4, 4, 1] 880s x0: [2, 4, 4, 1] 880s 0.976 -0.866 880s 0.675 0.501 880s 0.422 -0.943 880s -0.099 0.144 880s 880s x1: [2, 4, 4, 1] 880s 0.548 -0.480 880s -0.158 0.295 880s -0.057 -0.074 880s -0.541 -0.921 880s 880s y: [4, 4, 4, 1] 880s 0.950 0.130 0.684 -0.123 880s -0.409 0.041 -0.345 0.058 880s 0.009 -0.076 0.046 -0.005 880s 0.270 -0.826 0.641 -0.079 880s test-mul-mat0: iter:279/500 880s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 4, 1, 1] 880s 0.643 0.931 -0.505 880s 0.664 0.196 -0.756 880s -0.706 -0.394 -0.176 880s -0.118 0.150 0.785 880s 880s x1: [3, 4, 1, 1] 880s 0.989 0.477 0.914 880s -0.429 -0.674 0.491 880s 0.378 -0.120 -0.660 880s 0.234 0.497 -0.870 880s 880s y: [4, 4, 1, 1] 880s 0.619 0.059 -1.047 0.672 880s -1.152 -0.789 0.482 0.335 880s 0.464 0.727 -0.104 -0.581 880s 1.053 0.911 -0.208 -0.636 880s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.772 -0.065 0.611 880s 0.269 0.065 0.059 880s 880s x1: [3, 4, 1, 1] 880s 0.949 0.669 -0.580 880s -0.562 0.090 0.190 880s -0.573 -0.433 0.104 880s -0.002 -0.108 -0.405 880s 880s y: [2, 4, 1, 1] 880s 0.335 0.265 880s -0.324 -0.134 880s -0.351 -0.176 880s -0.242 -0.032 880s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 1, 1, 1] 880s -0.202 -0.378 0.752 880s 880s x1: [3, 2, 1, 1] 880s -0.088 -0.004 0.554 880s -0.424 -0.808 0.798 880s 880s y: [1, 2, 1, 1] 880s 0.436 880s 0.991 880s testing: mul_mat, [3, 1, 1, 1] = [3, 3, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 3, 1, 1] 880s -0.173 -0.983 0.121 880s -0.861 -0.434 0.564 880s -0.293 0.274 -0.834 880s 880s x1: [3, 1, 1, 1] 880s 0.948 -0.593 0.702 880s 880s y: [3, 1, 1, 1] 880s 0.504 -0.163 -1.025 880s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 2, 1, 1] 880s -0.836 -0.096 0.566 880s -0.304 0.962 0.702 880s 880s x1: [3, 3, 1, 1] 880s 0.716 0.940 0.938 880s -0.349 0.550 0.208 880s 0.716 -0.390 -0.880 880s 880s y: [2, 3, 1, 1] 880s -0.157 1.346 880s 0.357 0.781 880s -1.060 -1.212 880s testing: mul_mat, [3, 3, 1, 1] = [2, 3, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 3, 1, 1] 880s 0.160 0.726 880s 0.974 0.247 880s 0.835 -0.045 880s 880s x1: [2, 3, 1, 1] 880s -0.416 0.319 880s 0.650 -0.009 880s 0.020 0.267 880s 880s y: [3, 3, 1, 1] 880s 0.165 -0.326 -0.361 880s 0.097 0.631 0.543 880s 0.197 0.086 0.005 880s test-mul-mat0: iter:280/500 880s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 4, 1, 1] 880s -0.220 880s -0.441 880s 0.039 880s 0.389 880s 880s x1: [1, 1, 1, 1] 880s 0.352 880s 880s y: [4, 1, 1, 1] 880s -0.077 -0.155 0.014 0.137 880s testing: mul_mat, [4, 4, 2, 1] = [1, 4, 2, 1] * [1, 4, 2, 1] 880s x0: [1, 4, 2, 1] 880s -0.740 880s -0.992 880s 0.090 880s 0.280 880s 880s x1: [1, 4, 2, 1] 880s -0.321 880s 0.751 880s 0.553 880s 0.375 880s 880s y: [4, 4, 2, 1] 880s 0.238 0.318 -0.029 -0.090 880s -0.556 -0.745 0.068 0.210 880s -0.410 -0.549 0.050 0.155 880s -0.278 -0.372 0.034 0.105 880s testing: mul_mat, [3, 4, 2, 4] = [1, 3, 2, 4] * [1, 4, 2, 4] 880s x0: [1, 3, 2, 4] 880s -0.477 880s -0.539 880s -0.582 880s 880s x1: [1, 4, 2, 4] 880s -0.258 880s -0.834 880s -0.503 880s 0.698 880s 880s y: [3, 4, 2, 4] 880s 0.123 0.139 0.150 880s 0.398 0.450 0.486 880s 0.240 0.271 0.293 880s -0.333 -0.376 -0.406 880s testing: mul_mat, [1, 3, 1, 1] = [1, 1, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 1, 1, 1] 880s -0.329 880s 880s x1: [1, 3, 1, 1] 880s -0.859 880s -0.356 880s -0.092 880s 880s y: [1, 3, 1, 1] 880s 0.283 880s 0.117 880s 0.030 880s testing: mul_mat, [3, 1, 2, 1] = [1, 3, 2, 1] * [1, 1, 2, 1] 880s x0: [1, 3, 2, 1] 880s -0.867 880s -0.683 880s 0.627 880s 880s x1: [1, 1, 2, 1] 880s 0.672 880s 880s y: [3, 1, 2, 1] 880s -0.583 -0.459 0.421 880s testing: mul_mat, [3, 1, 2, 4] = [3, 3, 2, 4] * [3, 1, 2, 4] 880s x0: [3, 3, 2, 4] 880s -0.637 0.326 -0.189 880s -0.469 0.951 0.937 880s 0.707 0.029 0.866 880s 880s x1: [3, 1, 2, 4] 880s 0.485 0.987 -0.163 880s 880s y: [3, 1, 2, 4] 880s 0.043 0.558 0.231 880s test-mul-mat0: iter:281/500 880s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 4, 1, 1] 880s 0.238 0.820 0.485 0.525 880s 0.032 -0.750 -0.649 -0.909 880s -0.241 0.946 0.208 0.093 880s -0.328 -0.084 -0.041 0.857 880s 880s x1: [4, 1, 1, 1] 880s -0.390 -0.691 -0.067 -0.862 880s 880s y: [4, 1, 1, 1] 880s -1.144 1.333 -0.653 -0.550 880s testing: mul_mat, [3, 4, 3, 1] = [4, 3, 3, 1] * [4, 4, 3, 1] 880s x0: [4, 3, 3, 1] 880s 0.993 -0.547 0.464 0.448 880s -0.620 0.536 -0.152 0.514 880s -0.300 -0.079 0.566 -0.176 880s 880s x1: [4, 4, 3, 1] 880s 0.166 0.ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s 385 -0.690 0.896 880s -0.762 0.581 -0.551 -0.764 880s 0.691 0.408 -0.154 -1.000 880s -0.660 -0.016 0.577 0.579 880s 880s y: [3, 4, 3, 1] 880s 0.035 0.668 -0.628 880s -1.673 0.476 0.005 880s -0.056 -0.701 -0.150 880s -0.119 0.611 0.423 880s testing: mul_mat, [1, 3, 3, 1] = [4, 1, 3, 1] * [4, 3, 3, 1] 880s x0: [4, 1, 3, 1] 880s -0.359 -0.717 0.909 -0.327 880s 880s x1: [4, 3, 3, 1] 880s -0.824 -0.085 0.665 -0.124 880s 0.836 0.230 0.700 -0.314 880s -0.987 -0.968 0.777 0.910 880s 880s y: [1, 3, 3, 1] 880s 1.001 880s 0.273 880s 1.456 880s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 2, 1, 1] 880s -0.326 -0.770 0.703 -0.701 880s -0.147 0.453 -0.882 -0.297 880s 880s x1: [4, 1, 1, 1] 880s -0.986 -0.091 -0.485 0.774 880s 880s y: [2, 1, 1, 1] 880s -0.492 0.301 880s testing: mul_mat, [2, 4, 3, 1] = [2, 2, 3, 1] * [2, 4, 3, 1] 880s x0: [2, 2, 3, 1] 880s -0.269 -0.480 880s 0.800 -0.151 880s 880s x1: [2, 4, 3, 1] 880s -0.817 -0.940 880s 0.368 0.123 880s 0.657 0.263 880s 0.764 0.941 880s 880s y: [2, 4, 3, 1] 880s 0.672 -0.512 880s -0.158 0.276 880s -0.304 0.486 880s -0.658 0.470 880s testing: mul_mat, [3, 2, 3, 1] = [2, 3, 3, 1] * [2, 2, 3, 1] 880s x0: [2, 3, 3, 1] 880s -0.361 0.140 880s 0.024 0.371 880s 0.905 -0.279 880s 880s x1: [2, 2, 3, 1] 880s 0.919 0.758 880s 0.026 -0.643 880s 880s y: [3, 2, 3, 1] 880s -0.226 0.303 0.621 880s -0.099 -0.238 0.203 880s test-mul-mat0: iter:282/500 880s testing: mul_mat, [4, 4, 1, 1] = [4, 4, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 4, 1, 1] 880s 0.322 -0.240 -0.666 0.786 880s -0.489 0.765 -0.160 -0.583 880s 0.014 0.750 -0.757 -0.980 880s 0.685 -0.574 -0.216 0.638 880s 880s x1: [4, 4, 1, 1] 880s -0.235 0.952 0.760 0.273 880s -0.854 0.513 -0.608 0.497 880s 0.455 0.031 -0.479 0.360 880s -0.829 0.892 -0.919 -0.959 880s 880s y: [4, 4, 1, 1] 880s -0.596 0.564 -0.132 -0.698 880s 0.397 0.618 0.346 -0.432 880s 0.741 -0.332 0.039 0.627 880s -0.623 1.793 2.293 -1.494 880s testing: mul_mat, [2, 4, 4, 1] = [4, 2, 4, 1] * [4, 4, 4, 1] 880s x0: [4, 2, 4, 1] 880s -0.354 0.217 -0.668 0.737 880s 0.877 -0.187 0.342 -0.812 880s 880s x1: [4, 4, 4, 1] 880s -0.814 -0.943 0.783 0.700 880s -0.551 0.280 0.155 0.480 880s 0.801 -0.486 0.651 0.693 880s -0.405 0.692 -0.244 0.917 880s 880s y: [2, 4, 4, 1] 880s 0.075 -0.837 880s 0.506 -0.873 880s -0.313 0.454 880s 1.132 -1.312 880s testing: mul_mat, [3, 2, 4, 1] = [4, 3, 4, 1] * [4, 2, 4, 1] 880s x0: [4, 3, 4, 1] 880s -0.907 0.992 -0.641 -0.708 880s -0.191 0.532 -0.967 0.817 880s -0.379 0.433 0.473 0.644 880s 880s x1: [4, 2, 4, 1] 880s -0.249 0.374 -0.143 -0.372 880s -0.813 -0.801 -0.183 0.173 880s 880s y: [3, 2, 4, 1] 880s 0.951 0.082 -0.050 880s -0.062 0.048 -0.014 880s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 3, 1, 1] 880s 0.708 0.429 -0.317 -0.442 880s -0.643 0.397 -0.530 -0.024 880s -0.024 0.278 0.398 -0.009 880s 880s x1: [4, 3, 1, 1] 880s 0.866 -0.870 -0.769 -0.685 880s -0.235 0.534 0.841 0.156 880s -0.047 0.980 -0.655 0.608 880s 880s y: [3, 3, 1, 1] 880s 0.785 -0.478 -0.563 880s -0.272 -0.087 0.488 880s 0.326 0.751 0.008 880s testing: mul_mat, [4, 4, 4, 1] = [3, 4, 4, 1] * [3, 4, 4, 1] 880s x0: [3, 4, 4, 1] 880s -0.499 -0.621 -0.875 880s 0.189 0.226 0.054 880s -0.203 -0.155 -0.316 880s -0.966 -0.748 0.999 880s 880s x1: [3, 4, 4, 1] 880s -0.923 0.796 0.778 880s 0.073 0.657 -0.651 880s -0.061 0.787 -0.420 880s 0.254 -0.448 -0.886 880s 880s y: [4, 4, 4, 1] 880s -0.715 0.047 -0.182 1.074 880s 0.125 0.127 0.089 -1.213 880s -0.091 0.144 0.023 -0.949 880s 0.927 -0.101 0.298 -0.795 880s testing: mul_mat, [4, 3, 4, 1] = [4, 4, 4, 1] * [4, 3, 4, 1] 880s x0: [4, 4, 4, 1] 880s -0.433 -0.175 0.569 0.912 880s 0.625 0.401 0.374 0.340 880s 0.801 0.889 0.349 0.860 880s -0.316 0.510 0.536 -0.205 880s 880s x1: [4, 3, 4, 1] 880s -0.848 -0.643 -0.654 -0.734 880s -0.178 0.413 -0.953 0.308 880s 0.349 0.203 -0.118 0.230 880s 880s y: [4, 3, 4, 1] 880s -0.562 -1.283 -2.111 -0.260 880s -0.257 -0.198 0.156 -0.307 880s -0.044 0.334 0.617 -0.118 880s test-mul-mat0: iter:283/500 880s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 3, 1, 1] 880s -0.146 0.274 -0.020 -0.332 880s 0.313 0.196 0.181 0.703 880s 0.511 0.248 -0.648 -0.560 880s 880s x1: [4, 3, 1, 1] 880s 0.686 0.462 -0.456 0.754 880s 0.031 0.464 0.735 0.941 880s -0.757 -0.490 -0.553 -0.971 880s 880s y: [3, 3, 1, 1] 880s -0.215 0.753 0.338 880s -0.205 0.895 -0.872 880s 0.310 -1.116 0.394 880s testing: mul_mat, [4, 3, 4, 1] = [4, 4, 4, 1] * [4, 3, 4, 1] 880s x0: [4, 4, 4, 1] 880s 0.006 -0.473 -0.311 -0.525 880s 0.059 -0.501 0.139 -0.228 880s -0.989 0.314 0.734 0.876 880s 0.929 -0.870 -0.375 -0.673 880s 880s x1: [4, 3, 4, 1] 880s 0.435 -0.115 0.783 -0.573 880s -0.872 -0.057 -0.887 0.590 880s 0.486 0.867 -0.378 -0.050 880s 880s y: [4, 3, 4, 1] 880s 0.114 0.323 -0.393 0.596 880s -0.012 -0.280 0.710 -0.825 880s -0.263 -0.447 -0.530 -0.127 880s testing: mul_mat, [3, 4, 4, 3] = [4, 3, 4, 3] * [4, 4, 4, 3] 880s x0: [4, 3, 4, 3] 880s -0.989 -0.880 -0.796 -0.862 880s 0.779 -0.126 0.909 -0.603 880s 0.978 -0.226 -0.402 0.067 880s 880s x1: [4, 4, 4, 3] 880s 0.042 0.573 -0.390 0.664 880s 0.810 -0.658 -0.703 0.361 880s 0.807 -0.194 -0.159 0.403 880s 0.940 -0.353 0.953 -0.424 880s 880s y: [3, 4, 4, 3] 880s -0.807 -0.795 0.112 880s 0.027 -0.143 1.247 880s -0.848 0.265 0.924 880s -1.012 1.898 0.587 880s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 3, 1, 1] 880s 0.030 -0.922 -0.367 -0.929 880s -0.114 -0.211 -0.806 0.268 880s -0.696 0.400 0.354 0.299 880s 880s x1: [4, 3, 1, 1] 880s 0.092 0.260 -0.503 -0.537 880s 0.412 0.480 0.450 0.401 880s 0.641 0.524 0.815 -0.294 880s 880s y: [3, 3, 1, 1] 880s 0.446 0.196 -0.299 880s -0.968 -0.404 0.185 880s -0.489 -0.920 -0.036 880s testing: mul_mat, [2, 4, 4, 1] = [3, 2, 4, 1] * [3, 4, 4, 1] 880s x0: [3, 2, 4, 1] 880s -0.142 -0.936 0.889 880s -0.239 -0.996 0.571 880s 880s x1: [3, 4, 4, 1] 880s -0.076 -0.807 -0.808 880s 0.416 -0.869 0.507 880s -0.492 0.391 -0.995 880s -0.029 -0.197 0.485 880s 880s y: [2, 4, 4, 1] 880s 0.048 0.361 880s 1.205 1.056 880s -1.181 -0.840 880s 0.619 0.480 880s testing: mul_mat, [4, 3, 4, 3] = [2, 4, 4, 3] * [2, 3, 4, 3] 880s x0: [2, 4, 4, 3] 880s -0.034 -0.021 880s 0.872 0.402 880s -0.062 -0.136 880s 0.890 0.327 880s 880s x1: [2, 3, 4, 3] 880s 0.231 -0.189 880s -0.443 -0.255 880s -0.172 -0.731 880s 880s y: [4, 3, 4, 3] 880s -0.004 0.125 0.011 0.144 880s 0.020 -0.488 0.062 -0.477 880s 0.021 -0.444 0.110 -0.392 880s test-mul-mat0: iter:284/500 880s testing: mul_mat, [3, 4, 1, 1] = [4, 3, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 3, 1, 1] 880s 0.513 -0.088 0.905 -0.930 880s -0.355 0.046 0.909 -0.601 880s 0.623 -0.768 0.896 0.333 880s 880s x1: [4, 4, 1, 1] 880s 0.866 0.300 -0.136 0.364 880s -0.627 -0.301 -0.153 -0.508 880s 0.161 -0.996 0.164 -0.650 880s 0.151 -0.072 0.973 0.369 880s 880s y: [3, 4, 1, 1] 880s -0.044 -0.636 0.308 880s 0.039 0.374 -0.466 880s 0.923 0.437 0.796 880s 0.621 0.606 1.144 880s testing: mul_mat, [1, 3, 2, 1] = [4, 1, 2, 1] * [4, 3, 2, 1] 880s x0: [4, 1, 2, 1] 880s -0.741 0.890 0.952 -0.509 880s 880s x1: [4, 3, 2, 1] 880s 0.641 0.720 0.199 -0.059 880s -0.416 -0.437 0.314 0.283 880s 0.409 0.806 -0.556 0.413 880s 880s y: [1, 3, 2, 1] 880s 0.386 880s 0.074 880s -0.326 880s testing: mul_mat, [3, 1, 2, 1] = [4, 3, 2, 1] * [4, 1, 2, 1] 880s x0: [4, 3, 2, 1] 880s 0.869 0.161 -0.389 -0.566 880s 0.059 0.378 -0.633 -0.563 880s 0.657 0.213 0.720 -0.993 880s 880s x1: [4, 1, 2, 1] 880s -0.516 0.072 -0.910 0.047 880s 880s y: [3, 1, 2, 1] 880s -0.109 0.546 -1.026 880s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 1, 1, 1] 880s 0.945 0.195 -0.187 0.158 880s 880s x1: [4, 3, 1, 1] 880s 0.977 -0.182 -0.654 0.350 880s 0.274 -0.463 0.167 0.144 880s 0.699 0.778 0.577 -0.243 880s 880s y: [1, 3, 1, 1] 880s 1.065 880s 0.161 880s 0.665 880s testing: mul_mat, [1, 4, 2, 1] = [1, 1, 2, 1] * [1, 4, 2, 1] 880s x0: [1, 1, 2, 1] 880s -0.746 880s 880s x1: [1, 4, 2, 1] 880s -0.085 880s -0.180 880s 0.862 880s -0.035 880s 880s y: [1, 4, 2, 1] 880s 0.063 880s 0.135 880s -0.643 880s 0.026 880s testing: mul_mat, [4, 1, 2, 1] = [1, 4, 2, 1] * [1, 1, 2, 1] 880s x0: [1, 4, 2, 1] 880s -0.299 880s -0.157 880s 0.034 880s -0.948 880s 880s x1: [1, 1, 2, 1] 880s -0.276 880s 880s y: [4, 1, 2, 1] 880s 0.082 0.043 -0.009 0.262 880s test-mul-mat0: iter:285/500 880s testing: mul_mat, [3, 1, 1, 1] = [4, 3, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 3, 1, 1] 880s -0.863 0.785 0.803 0.102 880s -0.130 -0.741 -0.931 -0.289 880s 0.096 -0.678 0.440 0.821 880s 880s x1: [4, 1, 1, 1] 880s -0.848 -0.216 0.223 -0.035 880s 880s y: [3, 1, 1, 1] 880s 0.737 0.073 0.134 880s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 3, 1, 1] 880s -0.967 -0.059 -0.807 0.255 880s 0.906 -0.865 0.393 0.691 880s 0.938 -0.505 -0.439 -0.803 880s 880s x1: [4, 3, 1, 1] 880s -0.653 -0.871 -0.478 0.191 880s 0.163 -0.426 0.309 -0.265 880s 0.792 -0.430 -0.995 -0.211 880s 880s y: [3, 3, 1, 1] 880s 1.117 0.107 -0.115 880s -0.449 0.454 0.446 880s 0.008 0.553 1.567 880s testing: mul_mat, [3, 3, 1, 2] = [4, 3, 1, 2] * [4, 3, 1, 2] 880s x0: [4, 3, 1, 2] 880s 0.424 0.329 0.978 0.918 880s 0.890 -0.825 -0.518 0.161 880s 0.468 0.367 0.873 -0.418 880s 880s x1: [4, 3, 1, 2] 880s -0.437 0.272 0.293 -0.114 880s -0.288 0.114 0.233 -0.160 880s 0.636 -0.576 -0.997 -0.790 880s 880s y: [3, 3, 1, 2] 880s 0.086 -0.783 0.199 880s -0.004 -0.497 0.177 880s -1.621 1.431 -0.454 880s testing: mul_mat, [4, 3, 1,ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 4, 1, 1] 880s -0.174 -0.154 0.137 -0.425 880s -0.027 -0.081 0.818 0.729 880s -0.663 -0.206 -0.725 0.994 880s -0.806 0.559 0.564 0.634 880s 880s x1: [4, 3, 1, 1] 880s -0.228 -0.028 -0.047 -0.323 880s 0.003 -0.408 -0.283 -0.574 880s 0.921 -0.305 -0.655 0.811 880s 880s y: [4, 3, 1, 1] 880s 0.175 -0.265 -0.130 -0.063 880s 0.268 -0.617 -0.283 -0.754 880s -0.547 0.054 0.734 -0.769 880s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 2, 1, 1] 880s 0.700 -0.689 -0.516 0.756 880s 0.690 -0.381 0.870 -0.697 880s 880s x1: [4, 4, 1, 1] 880s 0.925 0.215 -0.594 -0.103 880s -0.831 0.083 0.900 -0.239 880s 0.800 -0.674 -0.318 -0.506 880s -0.330 -0.507 0.364 0.497 880s 880s y: [2, 4, 1, 1] 880s 0.728 0.111 880s -1.284 0.344 880s 0.806 0.885 880s 0.306 -0.065 880s testing: mul_mat, [4, 4, 1, 2] = [2, 4, 1, 2] * [2, 4, 1, 2] 880s x0: [2, 4, 1, 2] 880s -0.706 0.088 880s 0.592 -0.900 880s -0.365 0.358 880s 0.657 -0.221 880s 880s x1: [2, 4, 1, 2] 880s -0.855 0.320 880s -0.123 0.874 880s 0.315 -0.489 880s 0.799 -0.470 880s 880s y: [4, 4, 1, 2] 880s 0.632 -0.794 0.427 -0.633 880s 0.164 -0.859 0.358 -0.274 880s -0.265 0.626 -0.290 0.315 880s -0.606 0.896 -0.460 0.629 880s test-mul-mat0: iter:286/500 880s testing: mul_mat, [1, 3, 1, 1] = [1, 1, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 1, 1, 1] 880s 0.597 880s 880s x1: [1, 3, 1, 1] 880s -0.900 880s -0.102 880s 0.871 880s 880s y: [1, 3, 1, 1] 880s -0.537 880s -0.061 880s 0.520 880s testing: mul_mat, [2, 1, 4, 1] = [1, 2, 4, 1] * [1, 1, 4, 1] 880s x0: [1, 2, 4, 1] 880s 0.198 880s 0.964 880s 880s x1: [1, 1, 4, 1] 880s 0.871 880s 880s y: [2, 1, 4, 1] 880s 0.173 0.840 880s testing: mul_mat, [1, 2, 4, 2] = [1, 1, 4, 2] * [1, 2, 4, 2] 880s x0: [1, 1, 4, 2] 880s -0.394 880s 880s x1: [1, 2, 4, 2] 880s 0.773 880s -0.655 880s 880s y: [1, 2, 4, 2] 880s -0.305 880s 0.258 880s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 1, 1, 1] 880s -0.002 880s 880s x1: [1, 1, 1, 1] 880s 0.864 880s 880s y: [1, 1, 1, 1] 880s -0.002 880s testing: mul_mat, [3, 1, 4, 1] = [1, 3, 4, 1] * [1, 1, 4, 1] 880s x0: [1, 3, 4, 1] 880s -0.066 880s 0.668 880s 0.599 880s 880s x1: [1, 1, 4, 1] 880s -0.141 880s 880s y: [3, 1, 4, 1] 880s 0.009 -0.094 -0.085 880s testing: mul_mat, [2, 1, 4, 2] = [3, 2, 4, 2] * [3, 1, 4, 2] 880s x0: [3, 2, 4, 2] 880s -0.386 -0.835 0.236 880s 0.923 -0.909 0.626 880s 880s x1: [3, 1, 4, 2] 880s -0.514 0.184 0.713 880s 880s y: [2, 1, 4, 2] 880s 0.213 -0.195 880s test-mul-mat0: iter:287/500 880s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.891 -0.169 -0.562 880s -0.519 -0.169 0.077 880s 0.429 0.250 0.055 880s -0.953 0.015 0.728 880s 880s x1: [3, 3, 1, 1] 880s -0.298 0.744 -0.007 880s 0.887 -0.837 0.226 880s -0.423 0.829 -0.933 880s 880s y: [4, 3, 1, 1] 880s 0.144 0.028 0.057 0.290 880s -0.775 -0.301 0.184 -0.692 880s 0.760 0.007 -0.025 -0.264 880s testing: mul_mat, [3, 4, 4, 1] = [3, 3, 4, 1] * [3, 4, 4, 1] 880s x0: [3, 3, 4, 1] 880s -0.701 0.839 -0.309 880s -0.083 -0.585 -0.981 880s -0.388 -0.580 -0.403 880s 880s x1: [3, 4, 4, 1] 880s 0.088 -0.539 -0.589 880s -0.834 0.302 -0.717 880s 0.589 -0.750 0.381 880s -0.709 0.994 -0.626 880s 880s y: [3, 4, 4, 1] 880s -0.333 0.885 0.515 880s 1.060 0.595 0.437 880s -1.160 0.016 0.053 880s 1.525 0.092 -0.049 880s testing: mul_mat, [4, 3, 4, 3] = [3, 4, 4, 3] * [3, 3, 4, 3] 880s x0: [3, 4, 4, 3] 880s 0.807 0.259 0.804 880s -0.953 0.544 0.474 880s -0.139 0.825 0.521 880s -0.745 -0.445 -0.579 880s 880s x1: [3, 3, 4, 3] 880s -0.186 -0.684 -0.018 880s 0.411 0.678 0.455 880s 0.601 0.519 0.068 880s 880s y: [4, 3, 4, 3] 880s -0.342 -0.203 -0.548 0.454 880s 0.873 0.193 0.740 -0.872 880s 0.675 -0.258 0.380 -0.719 880s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.562 -0.915 -0.587 880s -0.252 -0.915 0.084 880s 880s x1: [3, 4, 1, 1] 880s 0.326 0.604 -0.644 880s 0.554 -0.428 -0.728 880s -0.260 -0.288 0.390 880s -0.015 0.506 0.108 880s 880s y: [2, 4, 1, 1] 880s 0.009 -0.689 880s 1.130 0.191 880s -0.111 0.362 880s -0.535 -0.451 880s testing: mul_mat, [4, 3, 4, 1] = [2, 4, 4, 1] * [2, 3, 4, 1] 880s x0: [2, 4, 4, 1] 880s 0.723 0.017 880s -0.396 -0.340 880s 0.576 -0.795 880s 0.807 -0.736 880s 880s x1: [2, 3, 4, 1] 880s -0.602 0.246 880s 0.811 -0.635 880s -0.542 -0.496 880s 880s y: [4, 3, 4, 1] 880s -0.431 0.155 -0.543 -0.668 880s 0.575 -0.105 0.972 1.123 880s -0.400 0.383 0.082 -0.072 880s testing: mul_mat, [1, 2, 4, 3] = [4, 1, 4, 3] * [4, 2, 4, 3] 880s x0: [4, 1, 4, 3] 880s 0.983 -0.582 0.198 0.785 880s 880s x1: [4, 2, 4, 3] 880s 0.968 -0.341 -0.730 -0.015 880s 0.319 -0.525 0.249 0.791 880s 880s y: [1, 2, 4, 3] 880s 0.995 880s 1.289 880s test-mul-mat0: iter:288/500 880s testing: mul_mat, [3, 2, 1, 1] = [1, 3, 1, 1] * [1, 2, 1, 1] 880s x0: [1, 3, 1, 1] 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s 0.748 880s -0.609 880s -0.874 880s 880s x1: [1, 2, 1, 1] 880s -0.301 880s 0.461 880s 880s y: [3, 2, 1, 1] 880s -0.225 0.184 0.263 880s 0.345 -0.281 -0.403 880s testing: mul_mat, [1, 3, 1, 1] = [1, 1, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 1, 1, 1] 880s 0.973 880s 880s x1: [1, 3, 1, 1] 880s -0.455 880s 0.190 880s 0.140 880s 880s y: [1, 3, 1, 1] 880s -0.443 880s 0.185 880s 0.137 880s testing: mul_mat, [1, 1, 1, 2] = [1, 1, 1, 2] * [1, 1, 1, 2] 880s x0: [1, 1, 1, 2] 880s 0.160 880s 880s x1: [1, 1, 1, 2] 880s -0.427 880s 880s y: [1, 1, 1, 2] 880s -0.069 880s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 1, 1, 1] 880s 0.933 880s 880s x1: [1, 1, 1, 1] 880s 0.148 880s 880s y: [1, 1, 1, 1] 880s 0.138 880s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 1, 1, 1] 880s 0.626 880s 880s x1: [1, 1, 1, 1] 880s -0.143 880s 880s y: [1, 1, 1, 1] 880s -0.090 880s testing: mul_mat, [4, 1, 1, 2] = [1, 4, 1, 2] * [1, 1, 1, 2] 880s x0: [1, 4, 1, 2] 880s 0.288 880s 0.045 880s -0.183 880s -0.424 880s 880s x1: [1, 1, 1, 2] 880s -0.793 880s 880s y: [4, 1, 1, 2] 880s -0.228 -0.036 0.145 0.336 880s test-mul-mat0: iter:289/500 880s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 4, 1, 1] 880s -0.736 0.802 -0.213 0.425 880s -0.553 0.935 0.191 -0.620 880s -0.208 0.356 -0.993 -0.001 880s 0.478 -0.342 -0.713 -0.477 880s 880s x1: [4, 1, 1, 1] 880s 0.567 -0.727 0.692 0.428 880s 880s y: [4, 1, 1, 1] 880s -0.966 -1.127 -1.064 -0.178 880s testing: mul_mat, [3, 4, 3, 1] = [4, 3, 3, 1] * [4, 4, 3, 1] 880s x0: [4, 3, 3, 1] 880s 0.470 -0.747 -0.723 0.216 880s 0.038 0.838 -0.683 0.828 880s -0.601 -0.103 0.277 0.246 880s 880s x1: [4, 4, 3, 1] 880s 0.475 -0.137 0.753 -0.802 880s -0.862 -0.555 -0.377 0.160 880s 0.527 0.174 -0.272 0.800 880s -0.134 -0.844 -0.953 0.130 880s 880s y: [3, 4, 3, 1] 880s -0.392 -1.275 -0.260 880s 0.317 -0.108 0.511 880s 0.487 1.015 -0.214 880s 1.285 0.046 -0.064 880s testing: mul_mat, [3, 3, 3, 1] = [4, 3, 3, 1] * [4, 3, 3, 1] 880s x0: [4, 3, 3, 1] 880s -0.921 0.114 0.548 -0.414 880s -0.447 -0.239 0.900 -0.855 880s 0.043 0.028 -0.282 0.310 880s 880s x1: [4, 3, 3, 1] 880s 0.905 0.689 -0.587 -0.696 880s -0.414 0.690 0.550 -0.415 880s 0.891 -0.493 0.439 0.214 880s 880s y: [3, 3, 3, 1] 880s -0.788 -0.502 0.008 880s 0.933 0.870 -0.282 880s -0.725 -0.069 -0.033 880s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 3, 1, 1] 880s 0.082 0.303 0.078 -0.182 880s -0.914 -0.979 -0.817 -0.127 880s -0.566 0.642 -0.207 -0.528 880s 880s x1: [4, 3, 1, 1] 880s 0.705 0.255 0.491 -0.252 880s -0.716 -0.791 -0.942 0.993 880s 0.108 0.406 0.828 -0.530 880s 880s y: [3, 3, 1, 1] 880s 0.219 -1.265 -0.204 880s -0.553 2.072 -0.431 880s 0.293 -1.106 0.308 880s testing: mul_mat, [2, 4, 3, 1] = [3, 2, 3, 1] * [3, 4, 3, 1] 880s x0: [3, 2, 3, 1] 880s -0.125 0.879 -0.514 880s 0.948 -0.513 -0.808 880s 880s x1: [3, 4, 3, 1] 880s -0.786 0.642 0.266 880s 0.158 -0.002 -0.388 880s -0.136 -0.746 -0.896 880s 0.612 -0.463 -0.687 880s 880s y: [2, 4, 3, 1] 880s 0.527 -1.290 880s 0.178 0.464 880s -0.179 0.978 880s -0.130 1.373 880s testing: mul_mat, [3, 3, 3, 1] = [2, 3, 3, 1] * [2, 3, 3, 1] 880s x0: [2, 3, 3, 1] 880s -0.600 0.632 880s -0.442 0.932 880s 0.358 -0.137 880s 880s x1: [2, 3, 3, 1] 880s 0.411 0.379 880s 0.662 -0.164 880s -0.983 -0.284 880s 880s y: [3, 3, 3, 1] 880s -0.007 0.171 0.096 880s -0.501 -0.445 0.260 880s 0.410 0.169 -0.313 880s test-mul-mat0: iter:290/500 880s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 2, 1, 1] 880s -0.250 880s 0.209 880s 880s x1: [1, 1, 1, 1] 880s 0.452 880s 880s y: [2, 1, 1, 1] 880s -0.113 0.095 880s testing: mul_mat, [4, 2, 4, 1] = [1, 4, 4, 1] * [1, 2, 4, 1] 880s x0: [1, 4, 4, 1] 880s 0.811 880s -0.731 880s -0.095 880s -0.443 880s 880s x1: [1, 2, 4, 1] 880s 0.779 880s -0.697 880s 880s y: [4, 2, 4, 1] 880s 0.631 -0.569 -0.074 -0.345 880s -0.565 0.509 0.066 0.309 880s testing: mul_mat, [3, 4, 4, 3] = [1, 3, 4, 3] * [1, 4, 4, 3] 880s x0: [1, 3, 4, 3] 880s 0.872 880s 0.992 880s 0.447 880s 880s x1: [1, 4, 4, 3] 880s 0.748 880s 0.418 880s -0.733 880s -0.529 880s 880s y: [3, 4, 4, 3] 880s 0.652 0.742 0.334 880s 0.364 0.414 0.187 880s -0.639 -0.727 -0.327 880s -0.461 -0.524 -0.236 880s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 3, 1, 1] 880s 0.720 880s -0.770 880s 0.495 880s 880s x1: [1, 3, 1, 1] 880s 0.827 880s -0.516 880s -0.498 880s 880s y: [3, 3, 1, 1] 880s 0.595 -0.637 0.409 880s -0.372 0.397 -0.255 880s -0.358 0.383 -0.246 880s testing: mul_mat, [4, 1, 4, 1] = [3, 4, 4, 1] * [3, 1, 4, 1] 880s x0: [3, 4, 4, 1] 880s 0.094 -0.634 0.305 880s 0.513 0.915 -0.737 880s 0.972 0.549 0.295 880s 0.386 0.210 0.131 880s 880s x1: [3, 1, 4, 1] 880s 0.237 0.014 -0.516 880s 880s y: [4, 1, 4, 1] 880s -0.144 0.515 0.086 0.027 880s testing: mul_mat, [3, 3, 4, 3] = [4, 3, 4, 3] * [4, 3, 4, 3] 880s x0: [4, 3, 4, 3] 880s -0.371 -0.522 -0.234 -0.063 880s 0.700 0.874 -0.654 -0.728 880s 0.569 0.441 -0.751 -0.549 880s 880s x1: [4, 3, 4, 3] 880s -0.070 0.853 0.335 -0.545 880s -0.142 0.703 0.531 0.250 880s 0.490 0.772 -0.931 -0.865 880s 880s y: [3, 3, 4, 3] 880s -0.464 0.874 0.385 880s -0.454 -0.015 -0.307 880s -0.313 2.256 1.793 880s test-mul-mat0: iter:291/500 880s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 3, 1, 1] 880s -0.941 880s 0.722 880s 0.535 880s 880s x1: [1, 4, 1, 1] 880s -0.504 880s 0.982 880s -0.843 880s 0.058 880s 880s y: [3, 4, 1, 1] 880s 0.475 -0.364 -0.270 880s -0.924 0.709 0.525 880s 0.793 -0.608 -0.451 880s -0.055 0.042 0.031 880s testing: mul_mat, [2, 3, 1, 1] = [1, 2, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 2, 1, 1] 880s 0.166 880s -0.573 880s 880s x1: [1, 3, 1, 1] 880s -0.136 880s 0.523 880s 0.169 880s 880s y: [2, 3, 1, 1] 880s -0.023 0.078 880s 0.087 -0.299 880s 0.028 -0.097 880s testing: mul_mat, [4, 2, 1, 3] = [1, 4, 1, 3] * [1, 2, 1, 3] 880s x0: [1, 4, 1, 3] 880s -0.115 880s 0.644 880s -0.660 880s -0.669 880s 880s x1: [1, 2, 1, 3] 880s 0.206 880s 0.275 880s 880s y: [4, 2, 1, 3] 880s -0.024 0.132 -0.136 -0.138 880s -0.032 0.177 -0.182 -0.184 880s testing: mul_mat, [4, 4, 1, 1] = [1, 4, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 4, 1, 1] 880s 0.210 880s 0.638 880s -0.235 880s -0.621 880s 880s x1: [1, 4, 1, 1] 880s -0.284 880s 0.520 880s 0.675 880s -0.226 880s 880s y: [4, 4, 1, 1] 880s -0.060 -0.181 0.067 0.176 880s 0.109 0.332 -0.122 -0.323 880s 0.142 0.431 -0.159 -0.419 880s -0.047 -0.144 0.053 0.140 880s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 4, 1, 1] 880s -0.898 0.163 0.264 0.948 880s 0.887 -0.067 0.982 -0.117 880s -0.952 0.241 0.073 0.438 880s 0.597 0.503 -0.120 -0.590 880s 880s x1: [4, 1, 1, 1] 880s 0.389 0.931 -0.194 -0.405 880s 880s y: [4, 1, 1, 1] 880s -0.632 0.140 -0.337 0.963 880s testing: mul_mat, [4, 4, 1, 3] = [4, 4, 1, 3] * [4, 4, 1, 3] 880s x0: [4, 4, 1, 3] 880s 0.613 -0.068 -0.308 -0.398 880s 0.828 -0.506 -0.421 0.901 880s 0.177 -0.035 -0.832 -0.883 880s 0.298 0.108 -0.672 -0.445 880s 880s x1: [4, 4, 1, 3] 880s -0.326 -0.846 0.930 -0.651 880s -0.072 0.172 0.559 -0.434 880s 0.936 0.938 0.955 0.867 880s -0.256 -0.450 0.073 -0.154 880s 880s y: [4, 4, 1, 3] 880s -0.169 -0.821 -0.227 -0.524 880s -0.055 -0.772 -0.101 -0.186 880s -0.129 0.679 -1.428 -0.647 880s -0.088 -0.154 0.046 -0.106 880s test-mul-mat0: iter:292/500 880s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 4, 1, 1] 880s 0.064 -0.932 880s -0.653 -0.560 880s -0.140 -0.344 880s 0.644 -0.172 880s 880s x1: [2, 2, 1, 1] 880s -0.271 -0.876 880s 0.914 -0.602 880s 880s y: [4, 2, 1, 1] 880s 0.799 0.667 0.339 -0.024 880s 0.619 -0.260 0.079 0.692 880s testing: mul_mat, [4, 4, 3, 1] = [2, 4, 3, 1] * [2, 4, 3, 1] 880s x0: [2, 4, 3, 1] 880s -0.913 0.938 880s -0.912 -0.257 880s 0.582 -0.083 880s 0.006 0.362 880s 880s x1: [2, 4, 3, 1] 880s -0.737 0.780 880s 0.291 -0.165 880s 0.024 -0.554 880s 0.985 -0.564 880s 880s y: [4, 4, 3, 1] 880s 1.404 0.472 -0.494 0.278 880s -0.421 -0.223 0.183 -0.058 880s -0.541 0.121 0.060 -0.200 880s -1.428 -0.753 0.620 -0.198 880s testing: mul_mat, [4, 4, 3, 4] = [2, 4, 3, 4] * [2, 4, 3, 4] 880s x0: [2, 4, 3, 4] 880s -0.141 0.187 880s 0.441 0.614 880s 0.766 0.822 880s 0.569 -0.554 880s 880s x1: [2, 4, 3, 4] 880s -0.597 0.670 880s 0.031 0.563 880s 0.411 -0.472 880s -0.208 0.498 880s 880s y: [4, 4, 3, 4] 880s 0.210 0.148 0.093 -0.711 880s 0.101 0.359 0.486 -0.295 880s -0.147 -0.109 -0.073 0.495 880s 0.123 0.214 0.249 -0.395 880s testing: mul_mat, [3, 4, 1, 1] = [2, 3, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 3, 1, 1] 880s 0.746 0.280 880s -0.656 0.051 880s -0.719 0.103 880s 880s x1: [2, 4, 1, 1] 880s 0.367 -0.899 880s -0.479 0.882 880s 0.712 -0.643 880s -0.293 0.749 880s 880s y: [3, 4, 1, 1] 880s 0.022 -0.286 -0.357 880s -0.111 0.359 0.436 880s 0.351 -0.500 -0.579 880s -0.009 0.230 0.288 880s testing: mul_mat, [4, 2, 3, 1] = [3, 4, 3, 1] * [3, 2, 3, 1] 880s x0: [3, 4, 3, 1] 880s 0.334 0.636 0.768 880s 0.887 0.487 0.066 880s 0.402 0.787 0.031 880s -0.959 0.980 -0.129 880s 880s x1: [3, 2, 3, 1] 880s -0.109 0.948 -0.293 880s -0.537 0.551 0.095 880s 880s y: [4, 2, 3, 1] 880s 0.341 0.345 0.693 1.072 880s 0.244 -0.202 0.221 1.044 880s testing: mul_mat, [4, 3, 3, 4] = [4, 4, 3, 4] * [4, 3, 3, 4] 880s x0: [4, 4, 3, 4] 880s -0.056 -0.377 -0.830 -0.293 880s 0.243 0.742 0.676 -0.702 880s 0.059 0.340 0.741 -0.381 880s 0.968 0.021 -0.040 -0.481 880s 880s x1: [4, 3, 3, 4] 880s -0.187 0.218 -0.789 -0.420 880s -0.716 0.242 0.452 -0.759 880s -0.779 0.030 -0.339 -0.249 880s 880s y: [4, 3, 3, 4] 880s 0.706 -0.123 -0.362 0.057 880s -0.204 0.844 0.664 -0.340 880s 0.387 -0.222 -0.192 -0.620 880s test-mul-mat0: iter:293/500 880s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s 1, 1] 880s x0: [4, 2, 1, 1] 880s 0.898 -0.354 0.184 -0.986 880s 0.842 0.098 0.575 -0.074 880s 880s x1: [4, 1, 1, 1] 880s 0.265 0.012 -0.370 0.050 880s 880s y: [2, 1, 1, 1] 880s 0.116 0.008 880s testing: mul_mat, [4, 2, 2, 1] = [4, 4, 2, 1] * [4, 2, 2, 1] 880s x0: [4, 4, 2, 1] 880s -0.453 0.088 0.243 0.177 880s -0.862 0.387 0.075 -0.216 880s -0.429 0.089 -0.375 0.668 880s -0.336 0.551 -0.331 -0.611 880s 880s x1: [4, 2, 2, 1] 880s 0.001 0.726 0.299 -0.213 880s 0.557 -0.703 0.690 -0.876 880s 880s y: [4, 2, 2, 1] 880s 0.098 0.349 -0.190 0.431 880s -0.301 -0.512 -1.145 -0.268 880s testing: mul_mat, [3, 4, 2, 4] = [4, 3, 2, 4] * [4, 4, 2, 4] 880s x0: [4, 3, 2, 4] 880s 0.097 0.836 0.193 -0.264 880s -0.794 -0.780 0.836 0.277 880s -0.919 -0.010 -0.580 0.903 880s 880s x1: [4, 4, 2, 4] 880s -0.152 -0.140 -0.509 -0.015 880s -0.753 0.566 0.769 -0.182 880s -0.345 -0.605 -0.514 0.319 880s 0.946 0.155 0.708 -0.204 880s 880s y: [3, 4, 2, 4] 880s -0.226 -0.200 0.423 880s 0.597 0.750 0.076 880s -0.723 0.405 0.909 880s 0.412 -0.337 -1.465 880s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 3, 1, 1] 880s 0.956 0.437 -0.016 0.840 880s -0.835 0.913 0.858 -0.176 880s -0.757 -0.034 -0.614 0.764 880s 880s x1: [4, 3, 1, 1] 880s -0.522 -0.771 0.524 0.495 880s -0.895 -0.437 -0.551 -0.863 880s -0.254 0.785 0.273 -0.357 880s 880s y: [3, 3, 1, 1] 880s -0.429 0.094 0.478 880s -1.763 0.028 0.371 880s -0.205 1.226 -0.274 880s testing: mul_mat, [4, 4, 2, 1] = [3, 4, 2, 1] * [3, 4, 2, 1] 880s x0: [3, 4, 2, 1] 880s 0.430 -0.196 0.783 880s 0.981 -0.063 -0.753 880s 0.174 -0.445 0.337 880s 0.270 -0.972 -0.649 880s 880s x1: [3, 4, 2, 1] 880s 0.285 0.224 -0.155 880s 0.691 0.309 -0.431 880s -0.831 0.538 -0.908 880s 0.664 0.643 -0.344 880s 880s y: [4, 4, 2, 1] 880s -0.042 0.382 -0.102 -0.040 880s -0.101 0.983 -0.163 0.166 880s -1.173 -0.165 -0.690 -0.159 880s -0.109 0.871 -0.287 -0.223 880s testing: mul_mat, [2, 3, 2, 4] = [4, 2, 2, 4] * [4, 3, 2, 4] 880s x0: [4, 2, 2, 4] 880s 0.019 0.853 0.323 -0.422 880s 0.288 0.303 -0.621 -0.369 880s 880s x1: [4, 3, 2, 4] 880s 0.670 -0.638 -0.244 0.492 880s -0.192 -0.121 -0.720 -0.762 880s -0.141 0.454 0.508 0.664 880s 880s y: [2, 3, 2, 4] 880s -0.819 -0.030 880s -0.018 0.636 880s 0.268 -0.463 880s test-mul-mat0: iter:294/500 880s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 4, 1, 1] 880s -0.196 -0.091 -0.228 -0.306 880s -0.317 0.719 -0.382 0.605 880s 0.181 -0.207 -0.460 -0.527 880s -0.762 -0.224 0.448 -0.744 880s 880s x1: [4, 3, 1, 1] 880s 0.168 -0.886 -0.616 0.225 880s -0.725 -0.524 0.882 0.610 880s 0.851 -0.077 -0.711 0.095 880s 880s y: [4, 3, 1, 1] 880s 0.119 -0.319 0.379 -0.373 880s -0.198 -0.115 -0.750 0.612 880s -0.026 0.003 0.448 -1.020 880s testing: mul_mat, [4, 4, 2, 1] = [4, 4, 2, 1] * [4, 4, 2, 1] 880s x0: [4, 4, 2, 1] 880s -0.249 -0.813 0.430 0.399 880s -0.789 -0.226 0.930 0.845 880s 0.970 0.384 0.768 -0.740 880s 0.865 -0.505 0.460 0.119 880s 880s x1: [4, 4, 2, 1] 880s 0.635 -0.515 0.424 0.749 880s -0.131 -0.351 -0.976 0.345 880s -0.469 0.634 0.196 0.454 880s 0.923 -0.710 -0.519 -0.273 880s 880s y: [4, 4, 2, 1] 880s 0.742 0.642 0.189 1.093 880s 0.037 -0.434 -1.266 -0.344 880s -0.133 0.792 -0.397 -0.582 880s 0.015 -1.282 0.426 0.887 880s testing: mul_mat, [3, 4, 2, 4] = [4, 3, 2, 4] * [4, 4, 2, 4] 880s x0: [4, 3, 2, 4] 880s 0.618 0.541 0.273 -0.822 880s -0.362 0.829 -0.409 0.872 880s -0.910 -0.494 0.761 0.788 880s 880s x1: [4, 4, 2, 4] 880s -0.718 -0.061 0.336 -0.507 880s 0.714 0.266 -0.662 0.684 880s -0.349 -0.895 0.944 -0.484 880s -0.400 0.404 0.635 -0.483 880s 880s y: [3, 4, 2, 4] 880s 0.032 -0.370 0.539 880s -0.158 0.829 -0.746 880s -0.044 -1.423 1.097 880s 0.542 -0.202 0.267 880s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 3, 1, 1] 880s 0.836 -0.091 0.239 -0.548 880s -0.411 -0.755 0.826 0.632 880s -0.430 0.539 0.846 0.262 880s 880s x1: [4, 3, 1, 1] 880s -0.956 0.017 0.623 -0.170 880s -0.471 0.121 0.034 -0.268 880s -0.195 0.952 -0.076 -0.092 880s 880s y: [3, 3, 1, 1] 880s -0.558 0.787 0.903 880s -0.250 -0.039 0.227 880s -0.217 -0.760 0.508 880s testing: mul_mat, [3, 4, 2, 1] = [3, 3, 2, 1] * [3, 4, 2, 1] 880s x0: [3, 3, 2, 1] 880s -0.838 -0.386 0.686 880s -0.367 -0.736 -0.194 880s -0.053 -0.791 0.202 880s 880s x1: [3, 4, 2, 1] 880s 0.072 -0.458 -0.007 880s 0.244 -0.024 0.516 880s 0.288 0.993 0.139 880s -0.882 -0.478 -0.740 880s 880s y: [3, 4, 2, 1] 880s 0.112 0.312 0.357 880s 0.159 -0.172 0.110 880s -0.530 -0.863 -0.772 880s 0.417 0.819 0.276 880s testing: mul_mat, [1, 3, 2, 4] = [3, 1, 2, 4] * [3, 3, 2, 4] 880s x0: [3, 1, 2, 4] 880s 0.945 -0.811 0.013 880s 880s x1: [3, 3, 2, 4] 880s 0.486 0.551 0.182 880s -0.410 0.728 0.155 880s 0.581 0.742 0.717 880s 880s y: [1, 3, 2, 4] 880s 0.015 880s -0.976 880s -0.043 880s test-mul-mat0: iter:295/500 880s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 1, 1, 1] 880s 0.630 880s 880s x1: [1, 4, 1, 1] 880s -0.858 880s 0.718 880s -0.383 880s -0.913 880s 880s y: [1, 4, 1, 1] 880s -0.541 880s 0.452 880s -0.241 880s -0.575 880s testing: mul_mat, [2, 1, 3, 1] = [1, 2, 3, 1] * [1, 1, 3, 1] 880s x0: [1, 2, 3, 1] 880s -0.793 880s -0.119 880s 880s x1: [1, 1, 3, 1] 880s 0.059 880s 880s y: [2, 1, 3, 1] 880s -0.046 -0.007 880s testing: mul_mat, [4, 2, 3, 3] = [1, 4, 3, 3] * [1, 2, 3, 3] 880s x0: [1, 4, 3, 3] 880s -0.767 880s -0.590 880s 0.132 880s 0.170 880s 880s x1: [1, 2, 3, 3] 880s -0.406 880s 0.203 880s 880s y: [4, 2, 3, 3] 880s 0.312 0.240 -0.054 -0.069 880s -0.156 -0.120 0.027 0.034 880s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 3, 1, 1] 880s 0.642 880s 0.099 880s -0.825 880s 880s x1: [1, 4, 1, 1] 880s 0.364 880s -0.813 880s 0.375 880s 0.552 880s 880s y: [3, 4, 1, 1] 880s 0.234 0.036 -0.301 880s -0.522 -0.080 0.670 880s 0.241 0.037 -0.309 880s 0.354 0.055 -0.455 880s testing: mul_mat, [1, 1, 3, 1] = [3, 1, 3, 1] * [3, 1, 3, 1] 880s x0: [3, 1, 3, 1] 880s 0.820 -0.119 0.386 880s 880s x1: [3, 1, 3, 1] 880s 0.111 0.260 -0.330 880s 880s y: [1, 1, 3, 1] 880s -0.067 880s testing: mul_mat, [4, 3, 3, 3] = [1, 4, 3, 3] * [1, 3, 3, 3] 880s x0: [1, 4, 3, 3] 880s 0.413 880s -0.556 880s -0.748 880s -0.398 880s 880s x1: [1, 3, 3, 3] 880s 0.189 880s 0.801 880s 0.245 880s 880s y: [4, 3, 3, 3] 880s 0.078 -0.105 -0.141 -0.075 880s 0.331 -0.445 -0.599 -0.319 880s 0.101 -0.136 -0.183 -0.097 880s test-mul-mat0: iter:296/500 880s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 3, 1, 1] 880s 0.483 -0.463 880s -0.683 0.380 880s 0.552 -0.737 880s 880s x1: [2, 2, 1, 1] 880s -0.758 0.347 880s 0.748 0.027 880s 880s y: [3, 2, 1, 1] 880s -0.527 0.649 -0.674 880s 0.349 -0.500 0.393 880s testing: mul_mat, [4, 3, 4, 1] = [2, 4, 4, 1] * [2, 3, 4, 1] 880s x0: [2, 4, 4, 1] 880s -0.890 0.171 880s -0.830 0.490 880s -0.277 -0.566 880s -0.466 -0.376 880s 880s x1: [2, 3, 4, 1] 880s 0.044 0.901 880s 0.982 -0.205 880s 0.886 0.293 880s 880s y: [4, 3, 4, 1] 880s 0.115 0.405 -0.522 -0.359 880s -0.909 -0.915 -0.156 -0.381 880s -0.739 -0.591 -0.412 -0.524 880s testing: mul_mat, [2, 4, 4, 4] = [2, 2, 4, 4] * [2, 4, 4, 4] 880s x0: [2, 2, 4, 4] 880s -0.386 0.511 880s 0.874 0.160 880s 880s x1: [2, 4, 4, 4] 880s -0.651 0.456 880s -0.594 0.071 880s 0.890 -0.060 880s 0.695 -0.694 880s 880s y: [2, 4, 4, 4] 880s 0.485 -0.496 880s 0.266 -0.507 880s -0.374 0.768 880s -0.623 0.496 880s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.746 0.722 880s -0.987 0.110 880s 880s x1: [2, 2, 1, 1] 880s -0.220 -0.358 880s 0.608 -0.111 880s 880s y: [2, 2, 1, 1] 880s -0.422 0.177 880s 0.374 -0.612 880s testing: mul_mat, [1, 2, 4, 1] = [2, 1, 4, 1] * [2, 2, 4, 1] 880s x0: [2, 1, 4, 1] 880s -0.361 -0.905 880s 880s x1: [2, 2, 4, 1] 880s -0.089 0.593 880s -0.028 -0.540 880s 880s y: [1, 2, 4, 1] 880s -0.504 880s 0.499 880s testing: mul_mat, [4, 2, 4, 4] = [1, 4, 4, 4] * [1, 2, 4, 4] 880s x0: [1, 4, 4, 4] 880s 0.797 880s -0.400 880s -0.502 880s 0.828 880s 880s x1: [1, 2, 4, 4] 880s 0.686 880s 0.453 880s 880s y: [4, 2, 4, 4] 880s 0.547 -0.275 -0.345 0.568 880s 0.361 -0.181 -0.228 0.375 880s test-mul-mat0: iter:297/500 880s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 4, 1, 1] 880s -0.086 880s 0.546 880s 0.747 880s -0.375 880s 880s x1: [1, 1, 1, 1] 880s -0.532 880s 880s y: [4, 1, 1, 1] 880s 0.046 -0.290 -0.397 0.199 880s testing: mul_mat, [3, 4, 3, 1] = [1, 3, 3, 1] * [1, 4, 3, 1] 880s x0: [1, 3, 3, 1] 880s -0.522 880s 0.669 880s 0.102 880s 880s x1: [1, 4, 3, 1] 880s 0.885 880s 0.618 880s -0.226 880s -0.327 880s 880s y: [3, 4, 3, 1] 880s -0.462 0.592 0.090 880s -0.323 0.413 0.063 880s 0.118 -0.151 -0.023 880s 0.171 -0.219 -0.033 880s testing: mul_mat, [1, 3, 3, 2] = [1, 1, 3, 2] * [1, 3, 3, 2] 880s x0: [1, 1, 3, 2] 880s 0.416 880s 880s x1: [1, 3, 3, 2] 880s -0.257 880s 0.462 880s 0.854 880s 880s y: [1, 3, 3, 2] 880s -0.107 880s 0.192 880s 0.355 880s testing: mul_mat, [3, 1, 1, 1] = [1, 3, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 3, 1, 1] 880s -0.327 880s -0.217 880s 0.464 880s 880s x1: [1, 1, 1, 1] 880s -0.401 880s 880s y: [3, 1, 1, 1] 880s 0.131 0.087 -0.186 880s testing: mul_mat, [2, 1, 3, 1] = [3, 2, 3, 1] * [3, 1, 3, 1] 880s x0: [3, 2, 3, 1] 880s -0.863 0.005 -0.831 880s 0.968 0.452 -0.963 880s 880s x1: [3, 1, 3, 1] 880s -0.176 0.526 -0.074 880s 880s y: [2, 1, 3, 1] 880s 0.217 0.139 880s testing: mul_mat, [1, 3, 3, 2] = [2, 1, 3, 2] * [2, 3, 3, 2] 880s x0: [2, 1, 3, 2] 880s -0.280 0.621 880s 880s x1: [2, 3, 3, 2] 880s -0.432 -0.428 880s 0.370 0.392 880s -0.902 -0.704 880s 880s y: [1, 3, 3, 2] 880s -0.145 880s 0.140 880s -0.185 880s test-mul-mat0: iter:298/500 880s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.031 -0.294 -0.975 880s 0.085 0.260 0.283 880s 0.773 0.183 0.933 880s -0.441 0.403 -0.347 880s 880s x1: [3, 1, 1, 1] 880s -0.460 -0.358 -0.941 880s 880s y: [4, 1, 1, 1] 880s 1.037 -0.399 -1.300 0.385 880s testing: mul_mat, [3, 4, 4, 1] = [3, 3, 4, 1] * [3, 4, 4, 1] 880s x0: [3, 3, 4, 1] 880s 0.948 -0.744 -0.119 880s -0.984 0.215 -0.233 880s 0.367 0.948 0.836 880s 880s x1: [3, 4, 4, 1] 880s -0.820 0.487 -0.113 880s -0.870 0.395 0.474 880s -0.454 0.097 0.419 880s -0.121 -0.240 -0.242 880s 880s y: [3, 4, 4, 1] 880s -1.126 0.938 0.066 880s -1.175 0.830 0.452 880s -0.552 0.370 0.275 880s 0.093 0.123 -0.474 880s testing: mul_mat, [1, 3, 4, 1] = [3, 1, 4, 1] * [3, 3, 4, 1] 880s x0: [3, 1, 4, 1] 880s -0.842 -0.217 -0.738 880s 880s x1: [3, 3, 4, 1] 880s -0.855 0.920 -0.446 880s 0.981 -0.428 0.793 880s -0.712 0.161 -0.495 880s 880s y: [1, 3, 4, 1] 880s 0.849 880s -1.318 880s 0.929 880s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 2, 1, 1] 880s -0.143 -0.656 0.194 880s -0.383 0.222 0.149 880s 880s x1: [3, 1, 1, 1] 880s -0.347 -0.011 -0.423 880s 880s y: [2, 1, 1, 1] 880s -0.025 0.068 880s testing: mul_mat, [2, 3, 4, 1] = [2, 2, 4, 1] * [2, 3, 4, 1] 880s x0: [2, 2, 4, 1] 880s 0.990 0.542 880s -0.701 0.212 880s 880s x1: [2, 3, 4, 1] 880s -0.034 -0.677 880s -0.441 -0.145 880s 0.373 -0.477 880s 880s y: [2, 3, 4, 1] 880s -0.400 -0.120 880s -0.515 0.279 880s 0.111 -0.363 880s testing: mul_mat, [3, 2, 4, 1] = [2, 3, 4, 1] * [2, 2, 4, 1] 880s x0: [2, 3, 4, 1] 880s 0.339 -0.970 880s -0.643 -0.464 880s -0.223 -0.407 880s 880s x1: [2, 2, 4, 1] 880s 0.962 -0.405 880s -0.761 0.097 880s 880s y: [3, 2, 4, 1] 880s 0.720 -0.430 -0.050 880s -0.353 0.444 0.130 880s test-mul-mat0: iter:299/500 880s testing: mul_mat, [4, 4, 1, 1] = [4, 4, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 4, 1, 1] 880s -0.270 -0.837 0.196 0.756 880s -0.923 -0.872 -0.340 -0.812 880s 0.394 0.585 0.602 0.261 880s 0.666 -0.998 -0.075 -0.108 880s 880s x1: [4, 4, 1, 1] 880s 0.665 0.226 0.684 0.004 880s 0.584 -0.539 0.034 -0.881 880s 0.053 -0.688 0.688 -0.165 880s 0.363 0.016 -0.198 -0.535 880s 880s y: [4, 4, 1, 1] 880s -0.231 -1.047 0.807 0.166 880s -0.365 0.635 -0.295 1.019 880s 0.572 0.450 -0.010 0.688 880s -0.554 0.153 -0.107 0.298 880s testing: mul_mat, [4, 4, 3, 1] = [4, 4, 3, 1] * [4, 4, 3, 1] 880s x0: [4, 4, 3, 1] 880s 0.935 0.707 -0.842 0.919 880s 0.238 0.161 0.489 -0.629 880s 0.425 -0.275 0.506 -0.386 880s -0.147 -0.672 0.733 0.458 880s 880s x1: [4, 4, 3, 1] 880s 0.686 0.930 -0.524 -0.853 880s -0.037 -0.405 0.200 0.275 880s -0.717 -0.964 -0.362 0.299 880s -0.162 0.103 -0.714 0.568 880s 880s y: [4, 4, 3, 1] 880s 0.956 0.594 0.100 -1.500 880s -0.236 -0.149 0.091 0.550 880s -0.773 -0.691 -0.338 0.625 880s 1.045 -0.729 -0.678 -0.309 880s testing: mul_mat, [1, 4, 3, 4] = [4, 1, 3, 4] * [4, 4, 3, 4] 880s x0: [4, 1, 3, 4] 880s -0.789 0.030 0.427 -0.440 880s 880s x1: [4, 4, 3, 4] 880s 0.444 -0.642 -0.458 -0.384 880s -0.503 -0.789 0.783 0.418 880s -0.078 -0.951 0.407 -0.757 880s -0.321 0.945 0.500 0.018 880s 880s y: [1, 4, 3, 4] 880s -0.396 880s 0.523 880s 0.539 880s 0.487 880s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 4, 1, 1] 880s -0.557 0.240 0.862 -0.476 880s 0.408 -0.346 -0.307 -0.924 880s 0.536 0.421 -0.279 -0.199 880s -0.639 -0.535 -0.502 0.148 880s 880s x1: [4, 1, 1, 1] 880s -0.399 -0.756 -0.900 0.449 880s 880s y: [4, 1, 1, 1] 880s -0.949 -0.039 -0.370 1.178 880s testing: mul_mat, [4, 4, 3, 1] = [4, 4, 3, 1] * [4, 4, 3, 1] 880s x0: [4, 4, 3, 1] 880s -0.117 -0.310 0.359 -0.101 880s 0.294 -0.765 -0.901 0.777 880s 0.609 -0.553 -0.002 -0.926 880s 0.318 -0.194 0.283 -0.250 880s 880s x1: [4, 4, 3, 1] 880s -0.414 -0.133 0.050 -0.976 880s -0.467 -0.459 -0.909 -0.768 880s 0.763 -0.889 -0.166 -0.992 880s -0.789 -0.717 -0.426 -0.347 880s 880s y: [4, 4, 3, 1] 880s 0.206 -0.822 0.725 0.152 880s -0.052 0.436 0.682 -0.125 880s 0.227 0.284 1.876 0.617 880s 0.197 0.432 0.238 -0.146 880s testing: mul_mat, [1, 4, 3, 4] = [4, 1, 3, 4] * [4, 4, 3, 4] 880s x0: [4, 1, 3, 4] 880s 0.751 -0.612 -0.239 -0.475 880s 880s x1: [4, 4, 3, 4] 880s 0.001 0.100 0.999 0.956 880s -0.602 -0.523 0.302 0.857 880s -0.830 0.321 -0.946 0.732 880s 0.703 0.096 -0.823 0.379 880s 880s y: [1, 4, 3, 4] 880s -0.754 880s -0.612 880s -0.941 880s 0.486 880s test-mul-mat0: iter:300/500 880s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 3, 1, 1] 880s 0.468 0.785 880s -0.862 0.166 880s 0.120 -0.483 880s 880s x1: [2, 1, 1, 1] 880s -0.306 -0.046 880s 880s y: [3, 1, 1, 1] 880s -0.179 0.256 -0.015 880s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 1, 1, 1] 880s 0.315 0.618 880s 880s x1: [2, 3, 1, 1] 880s -0.774 -0.794 880s 0.284 -0.953 880s 0.816 -0.512 880s 880s y: [1, 3, 1, 1] 880s -0.735 880s -0.500 880s -0.059 880s testing: mul_mat, [1, 1, 1, 4] = [2, 1, 1, 4] * [2, 1, 1, 4] 880s x0: [2, 1, 1, 4] 880s 0.099 0.481 880s 880s x1: [2, 1, 1, 4] 880s -0.722 0.530 880s 880s y: [1, 1, 1, 4] 880s 0.184 880s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 3, 1, 1] 880s -0.549 -0.343 880s 0.093 -0.622 880s 0.036 0.083 880s 880s x1: [2, 1, 1, 1] 880s 0.331 -0.424 880s 880s y: [3, 1, 1, 1] 880s -0.036 0.295 -0.023 880s testing: mul_mat, [3, 2, 1, 1] = [3, 3, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 3, 1, 1] 880s -0.186 0.103 0.911 880s -0.199 0.359 -0.542 880s 0.265 -0.133 -0.651 880s 880s x1: [3, 2, 1, 1] 880s -0.527 -0.134 -0.539 880s 0.789 -0.516 -0.261 880s 880s y: [3, 2, 1, 1] 880s -0.407 0.348 0.229 880s -0.437 -0.201 0.447 880s testing: mul_mat, [2, 3, 1, 4] = [3, 2, 1, 4] * [3, 3, 1, 4] 880s x0: [3, 2, 1, 4] 880s -0.249 -0.783 0.116 880s 0.848 -0.419 -0.771 880s 880s x1: [3, 3, 1, 4] 880s -0.498 0.160 -0.476 880s -0.952 0.598 -0.833 880s 0.379 -0.825 -0.023 880s 880s y: [2, 3, 1, 4] 880s -0.057 -0.122 880s -0.328 -0.416 880s 0.549 0.685 880s test-mul-mat0: iter:301/500 880s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 2, 1, 1] 880s -0.608 0.891 0.712 -0.340 880s 0.068 -0.444 -0.263 0.342 880s 880s x1: [4, 3, 1, 1] 880s -0.491 0.393 0.238 0.260 880s 0.241 0.455 0.841 -0.643 880s 0.685 -0.091 0.119 0.009 880s 880s y: [2, 3, 1, 1] 880s 0.730 -0.181 880s 1.076 -0.627 880s -0.416 0.059 880s testing: mul_mat, [3, 2, 2, 1] = [4, 3, 2, 1] * [4, 2, 2, 1] 880s x0: [4, 3, 2, 1] 880s -0.570 -0.590 0.193 0.549 880s 0.420 -0.871 0.941 0.310 880s 0.841 -0.399 -0.622 -0.604 880s 880s x1: [4, 2, 2, 1] 880s -0.268 0.726 -0.927 0.109 880s 0.537 0.719 -0.966 -0.327 880s 880s y: [3, 2, 2, 1] 880s -0.394 -1.583 -0.004 880s -1.095 -1.412 0.962 880s testing: mul_mat, [2, 3, 2, 1] = [4, 2, 2, 1] * [4, 3, 2, 1] 880s x0: [4, 2, 2, 1] 880s 0.382 -0.155 -0.621 -0.343 880s 0.797 -0.318 0.113 0.581 880s 880s x1: [4, 3, 2, 1] 880s -0.544 0.784 0.574 -0.101 880s 0.857 -0.701 0.183 -0.712 880s -0.291 -0.624 0.837 -0.871 880s 880s y: [2, 3, 2, 1] 880s -0.651 -0.676 880s 0.567 0.513 880s -0.235 -0.444 880s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 4, 1, 1] 880s 0.436 0.521 0.517 0.370 880s -0.811 0.139 -0.744 0.744 880s 0.031 0.600 -0.837 -0.545 880s -0.782 -0.634 -0.801 0.382 880s 880s x1: [4, 2, 1, 1] 880s 0.562 0.393 -0.925 0.340 880s -0.167 -0.580 0.719 -0.350 880s 880s y: [4, 2, 1, 1] 880s 0.098 0.539 0.842 0.181 880s -0.132 -0.739 -0.765 -0.211 880s testing: mul_mat, [3, 4, 2, 1] = [4, 3, 2, 1] * [4, 4, 2, 1] 880s x0: [4, 3, 2, 1] 880s -0.633 -0.485 -0.261 0.382 880s 0.223 0.692 0.502 0.031 880s 0.431 0.099 0.600 -0.249 880s 880s x1: [4, 4, 2, 1] 880s 0.182 0.010 -0.423 -0.093 880s -0.725 -0.852 -0.531 0.668 880s -0.777 0.810 -0.499 -0.357 880s 0.529 0.151 0.805 -0.035 880s 880s y: [3, 4, 2, 1] 880s -0.045 -0.168 -0.151 880s 1.266 -0.997 -0.882 880s 0.093 0.126 -0.465 880s -0.631 0.625 0.734 880s testing: mul_mat, [4, 4, 2, 1] = [3, 4, 2, 1] * [3, 4, 2, 1] 880s x0: [3, 4, 2, 1] 880s 0.410 -0.396 -0.146 880s 0.603 -0.848 0.406 880s -0.417 -0.921 0.215 880s -0.101 -0.986 0.872 880s 880s x1: [3, 4, 2, 1] 880s 0.180 0.410 -0.298 880s 0.327 0.200 0.018 880s -0.517 0.568 -0.759 880s 0.914 -0.917 0.933 880s 880s y: [4, 4, 2, 1] 880s -0.045 -0.360 -0.517 -0.682 880s 0.052 0.034 -0.317 -0.215 880s -0.326 -1.102 -0.470 -1.170 880s 0.601 1.708 0.663 1.625 880s test-mul-mat0: iter:302/500 880s testing: mul_mat, [1, 4, 1, 1] = [3, 1, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 1, 1, 1] 880s 0.485 -0.889 -0.624 880s 880s x1: [3, 4, 1, 1] 880s -0.307 -0.950 -0.350 880s -0.897 0.653 0.233 880s 0.002 -0.743 0.385 880s 0.081 -0.729 -0.761 880s 880s y: [1, 4, 1, 1] 880s 0.914 880s -1.161 880s 0.422 880s 1.163 880s testing: mul_mat, [1, 1, 2, 1] = [3, 1, 2, 1] * [3, 1, 2, 1] 880s x0: [3, 1, 2, 1] 880s 0.750 -0.575 -0.289 880s 880s x1: [3, 1, 2, 1] 880s -0.732 -0.776 -0.820 880s 880s y: [1, 1, 2, 1] 880s 0.134 880s testing: mul_mat, [3, 1, 2, 2] = [3, 3, 2, 2] * [3, 1, 2, 2] 880s x0: [3, 3, 2, 2] 880s -0.030 0.359 0.821 880s 0.081 0.735 -0.911 880s 0.304 0.915 0.222 880s 880s x1: [3, 1, 2, 2] 880s -0.894 0.463 -0.314 880s 880s y: [3, 1, 2, 2] 880s -0.065 0.555 0.082 880s testing: mul_mat, [1, 3, 1, 1] = [3, 1, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 1, 1, 1] 880s 0.479 -0.308 -0.836 880s 880s x1: [3, 3, 1, 1] 880s 0.399 0.562 0.891 880s -0.379 0.135 -0.198 880s -0.336 -0.541 -0.862 880s 880s y: [1, 3, 1, 1] 880s -0.726 880s -0.058 880s 0.726 880s testing: mul_mat, [4, 3, 2, 1] = [1, 4, 2, 1] * [1, 3, 2, 1] 880s x0: [1, 4, 2, 1] 880s -0.686 880s 0.828 880s -0.870 880s -0.479 880s 880s x1: [1, 3, 2, 1] 880s -0.395 880s -0.292 880s 0.597 880s 880s y: [4, 3, 2, 1] 880s 0.271 -0.327 0.343 0.189 880s 0.200 -0.242 0.254 0.140 880s -0.409 0.494 -0.519 -0.285 880s testing: mul_mat, [4, 1, 2, 2] = [4, 4, 2, 2] * [4, 1, 2, 2] 880s x0: [4, 4, 2, 2] 880s 0.159 -0.390 -0.810 -0.468 880s -0.207 -0.940 -0.167 -0.898 880s 0.084 -0.689 0.149 -0.261 880s 0.997 -0.562 0.809 -0.512 880s 880s x1: [4, 1, 2, 2] 880s -0.070 0.957 -0.071 -0.671 880s 880s y: [4, 1, 2, 2] 880s -0.012 -0.269 -0.500 -0.322 880s test-mul-mat0: iter:303/500 880s testing: mul_mat, [1, 3, 1, 1] = [1, 1, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 1, 1, 1] 880s 0.159 880s 880s x1: [1, 3, 1, 1] 880s 0.049 880s -0.692 880s 0.630 880s 880s y: [1, 3, 1, 1] 880s 0.008 880s -0.110 880s 0.100 880s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 4, 1, 1] 880s -0.888 880s 0.785 880s -0.451 880s -0.905 880s 880s x1: [1, 1, 1, 1] 880s -0.662 880s 880s y: [4, 1, 1, 1] 880s 0.588 -0.520 0.298 0.599 880s testing: mul_mat, [4, 4, 1, 1] = [1, 4, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 4, 1, 1] 880s 0.232 880s -0.001 880s -0.327 880s 0.767 880s 880s x1: [1, 4, 1, 1] 880s -0.699 880s 0.559 880s 0.928 880s -0.166 880s 880s y: [4, 4, 1, 1] 880s -0.162 0.001 0.229 -0.536 880s 0.129 -0.001 -0.183 0.428 880s 0.215 -0.001 -0.303 0.711 880s -0.038 0.000 0.054 -0.127 880s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 1, 1, 1] 880s 0.044 880s 880s x1: [1, 4, 1, 1] 880s 0.828 880s -0.517 880s 0.985 880s -0.482 880s 880s y: [1, 4, 1, 1] 880s 0.037 880s -0.023 880s 0.043 880s -0.021 880s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 1, 1, 1] 880s -0.057 880s 880s x1: [1, 1, 1, 1] 880s 0.861 880s 880s y: [1, 1, 1, 1] 880s -0.049 880s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 1, 1, 1] 880s 0.991 880s 880s x1: [1, 1, 1, 1] 880s -0.532 880s 880s y: [1, 1, 1, 1] 880s -0.528 880s test-mul-mat0: iter:304/500 880s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.323 -0.930 0.498 880s 0.906 0.302 -0.503 880s -0.421 0.068 -0.674 880s 0.062 0.053 -0.156 880s 880s x1: [3, 2, 1, 1] 880s -0.422 -0.430 -0.952 880s -0.636 0.119 -0.858 880s 880s y: [4, 2, 1, 1] 880s 0.062 -0.033 0.790 0.100 880s -0.332 -0.110 0.854 0.101 880s testing: mul_mat, [1, 4, 1, 1] = [3, 1, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 1, 1, 1] 880s -0.376 0.932 -0.409 880s 880s x1: [3, 4, 1, 1] 880s -0.081 -0.903 -0.295 880s -0.202 0.040 0.173 880s 0.408 0.031 0.949 880s 0.649 -0.946 0.884 880s 880s y: [1, 4, 1, 1] 880s -0.690 880s 0.042 880s -0.513 880s -1.487 880s testing: mul_mat, [2, 1, 1, 2] = [3, 2, 1, 2] * [3, 1, 1, 2] 880s x0: [3, 2, 1, 2] 880s 0.446 0.251 -0.095 880s -0.485 0.576 0.967 880s 880s x1: [3, 1, 1, 2] 880s 0.742 -0.926 0.255 880s 880s y: [2, 1, 1, 2] 880s 0.075 -0.647 880s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.869 0.559 -0.425 880s -0.032 -0.844 -0.718 880s -0.587 -0.863 0.568 880s -0.937 -0.186 -0.098 880s 880s x1: [3, 2, 1, 1] 880s -0.296 -0.702 0.092 880s 0.735 -0.754 -0.259 880s 880s y: [4, 2, 1, 1] 880s -0.175 0.536 0.832 0.398 880s -0.950 0.799 0.071 -0.524 880s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 2, 1, 1] 880s -0.763 0.914 -0.822 -0.386 880s 0.437 -0.517 0.704 0.146 880s 880s x1: [4, 3, 1, 1] 880s 0.728 -0.182 0.807 -0.758 880s -0.606 0.774 0.810 -0.186 880s -0.340 0.475 0.939 -0.656 880s 880s y: [2, 3, 1, 1] 880s -1.093 0.869 880s 0.576 -0.122 880s 0.174 0.172 880s testing: mul_mat, [4, 4, 1, 2] = [2, 4, 1, 2] * [2, 4, 1, 2] 880s x0: [2, 4, 1, 2] 880s 0.114 -0.664 880s 0.991 -0.470 880s -0.649 0.290 880s 0.053 0.772 880s 880s x1: [2, 4, 1, 2] 880s -0.883 -0.323 880s -0.295 -0.728 880s -0.186 0.519 880s -0.152 0.096 880s 880s y: [4, 4, 1, 2] 880s 0.114 -0.723 0.480 -0.296 880s 0.450 0.050 -0.020 -0.578 880s -0.366 -0.428 0.271 0.391 880s -0.081 -0.196 0.127 0.066 880s test-mul-mat0: iter:305/500 880s testing: mul_mat, [3, 4, 1, 1] = [2, 3, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 3, 1, 1] 880s -0.979 0.742 880s 0.727 -0.988 880s -0.908 -0.220 880s 880s x1: [2, 4, 1, 1] 880s 0.384 -0.024 880s -0.592 0.134 880s 0.800 -0.687 880s -0.310 -0.134 880s 880s y: [3, 4, 1, 1] 880s -0.395 0.304 -0.343 880s 0.679 -0.562 0.508 880s -1.292 1.260 -0.574 880s 0.204 -0.093 0.311 880s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.213 -0.769 880s 0.166 0.478 880s 880s x1: [2, 3, 1, 1] 880s -0.653 -0.378 880s -0.930 -0.880 880s 0.066 0.844 880s 880s y: [2, 3, 1, 1] 880s 0.151 -0.289 880s 0.478 -0.575 880s -0.634 0.414 880s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.399 -0.849 880s -0.303 -0.468 880s 880s x1: [2, 2, 1, 1] 880s 0.296 -0.032 880s -0.986 0.176 880s 880s y: [2, 2, 1, 1] 880s 0.145 -0.075 880s -0.542 0.217 880s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.243 0.350 880s -0.442 0.255 880s 880s x1: [2, 2, 1, 1] 880s 0.951 0.010 880s 0.222 -0.183 880s 880s y: [2, 2, 1, 1] 880s 0.235 -0.417 880s -0.010 -0.145 880s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.722 -0.957 880s -0.661 0.792 880s 880s x1: [2, 2, 1, 1] 880s -0.349 -0.871 880s 0.602 0.273 880s 880s y: [2, 2, 1, 1] 880s 0.581 -0.458 880s 0.174 -0.182 880s testing: mul_mat, [1, 2, 1, 1] = [2, 1, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 1, 1, 1] 880s 0.926 0.044 880s 880s x1: [2, 2, 1, 1] 880s 0.552 -0.725 880s -0.042 0.030 880s 880s y: [1, 2, 1, 1] 880s 0.479 880s -0.037 880s test-mul-mat0: iter:306/500 880s testing: mul_mat, [3, 3, 1, 1] = [2, 3, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 3, 1, 1] 880s 0.440 -0.776 880s -0.107 -0.305 880s -0.126 0.023 880s 880s x1: [2, 3, 1, 1] 880s -0.089 -0.025 880s -0.151 0.921 880s -0.803 0.665 880s 880s y: [3, 3, 1, 1] 880s -0.019 0.017 0.011 880s -0.782 -0.265 0.040 880s -0.870 -0.117 0.116 880s testing: mul_mat, [1, 3, 4, 1] = [2, 1, 4, 1] * [2, 3, 4, 1] 880s x0: [2, 1, 4, 1] 880s 0.165 0.493 880s 880s x1: [2, 3, 4, 1] 880s -0.703 -0.853 880s 0.223 -0.981 880s -0.514 0.266 880s 880s y: [1, 3, 4, 1] 880s -0.537 880s -0.447 880s 0.046 880s testing: mul_mat, [1, 1, 4, 1] = [2, 1, 4, 1] * [2, 1, 4, 1] 880s x0: [2, 1, 4, 1] 880s -0.657 0.452 880s 880s x1: [2, 1, 4, 1] 880s -0.575 -0.824 880s 880s y: [1, 1, 4, 1] 880s 0.005 880s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 4, 1, 1] 880s -0.975 0.325 880s 0.832 0.392 880s 0.212 0.050 880s 0.190 0.693 880s 880s x1: [2, 1, 1, 1] 880s -0.700 -0.438 880s 880s y: [4, 1, 1, 1] 880s 0.541 -0.754 -0.170 -0.437 880s testing: mul_mat, [1, 2, 4, 1] = [4, 1, 4, 1] * [4, 2, 4, 1] 880s x0: [4, 1, 4, 1] 880s 0.064 -0.733 -0.501 -0.727 880s 880s x1: [4, 2, 4, 1] 880s -0.406 0.208 -0.792 0.020 880s 0.384 0.209 -0.289 0.371 880s 880s y: [1, 2, 4, 1] 880s 0.204 880s -0.254 880s testing: mul_mat, [3, 4, 4, 1] = [1, 3, 4, 1] * [1, 4, 4, 1] 880s x0: [1, 3, 4, 1] 880s -0.518 880s -0.664 880s -0.901 880s 880s x1: [1, 4, 4, 1] 880s 0.137 880s 0.644 880s -0.566 880s 0.143 880s 880s y: [3, 4, 4, 1] 880s -0.071 -0.091 -0.123 880s -0.333 -0.427 -0.580 880s 0.293 0.376 0.510 880s -0.074 -0.095 -0.128 880s test-mul-mat0: iter:307/500 880s testing: mul_mat, [1, 3, 1, 1] = [1, 1, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 1, 1, 1] 880s 0.177 880s 880s x1: [1, 3, 1, 1] 880s -0.594 880s -0.803 880s -0.213 880s 880s y: [1, 3, 1, 1] 880s -0.105 880s -0.142 880s -0.038 880s testing: mul_mat, [2, 1, 2, 1] = [1, 2, 2, 1] * [1, 1, 2, 1] 880s x0: [1, 2, 2, 1] 880s 0.348 880s -0.665 880s 880s x1: [1, 1, 2, 1] 880s 0.741 880s 880s y: [2, 1, 2, 1] 880s 0.258 -0.493 880s testing: mul_mat, [1, 2, 2, 1] = [1, 1, 2, 1] * [1, 2, 2, 1] 880s x0: [1, 1, 2, 1] 880s 0.421 880s 880s x1: [1, 2, 2, 1] 880s -0.682 880s -0.617 880s 880s y: [1, 2, 2, 1] 880s -0.287 880s -0.260 880s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 4, 1, 1] 880s 0.505 880s 0.310 880s -0.335 880s 0.435 880s 880s x1: [1, 1, 1, 1] 880s 0.577 880s 880s y: [4, 1, 1, 1] 880s 0.291 0.179 -0.193 0.251 880s testing: mul_mat, [3, 1, 2, 1] = [4, 3, 2, 1] * [4, 1, 2, 1] 880s x0: [4, 3, 2, 1] 880s 0.376 -0.157 0.178 0.495 880s 0.012 -0.668 -0.348 0.035 880s -0.493 0.854 -0.944 0.379 880s 880s x1: [4, 1, 2, 1] 880s -0.510 -0.417 -0.115 -0.538 880s 880s y: [3, 1, 2, 1] 880s -0.413 0.293 -0.200 880s testing: mul_mat, [2, 4, 2, 1] = [3, 2, 2, 1] * [3, 4, 2, 1] 880s x0: [3, 2, 2, 1] 880s -0.950 0.760 0.849 880s -0.282 -0.561 -0.751 880s 880s x1: [3, 4, 2, 1] 880s 0.274 0.593 -0.874 880s 0.253 -0.539 0.961 880s -0.981 0.836 -0.027 880s -0.474 -0.321 0.305 880s 880s y: [2, 4, 2, 1] 880s -0.552 0.246 880s 0.164 -0.491 880s 1.545 -0.172 880s 0.466 0.085 880s test-mul-mat0: iter:308/500 880s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 4, 1, 1] 880s -0.671 0.627 880s 0.562 -0.856 880s 0.274 -0.692 880s -0.241 -0.630 880s 880s x1: [2, 2, 1, 1] 880s -0.582 -0.387 880s 0.892 -0.725 880s 880s y: [4, 2, 1, 1] 880s 0.148 0.004 0.108 0.384 880s -1.053 1.122 0.746 0.242 880s testing: mul_mat, [4, 4, 4, 1] = [2, 4, 4, 1] * [2, 4, 4, 1] 880s x0: [2, 4, 4, 1] 880s -0.581 0.183 880s -0.852 0.278 880s -0.866 -0.229 880s 0.849 -0.472 880s 880s x1: [2, 4, 4, 1] 880s -0.615 -0.386 880s -0.580 0.103 880s -0.625 -0.141 880s -0.048 -0.377 880s 880s y: [4, 4, 4, 1] 880s 0.286 0.417 0.620 -0.340 880s 0.356 0.523 0.479 -0.541 880s 0.337 0.494 0.574 -0.464 880s -0.041 -0.063 0.128 0.137 880s testing: mul_mat, [4, 4, 4, 4] = [2, 4, 4, 4] * [2, 4, 4, 4] 880s x0: [2, 4, 4, 4] 880s -0.271 0.434 880s 0.102 0.837 880s -0.958 0.898 880s -0.137 0.050 880s 880s x1: [2, 4, 4, 4] 880s 0.164 0.611 880s 0.719 0.299 880s -0.618 0.569 880s 0.827 -0.848 880s 880s y: [4, 4, 4, 4] 880s 0.220 0.528 0.391 0.008 880s -0.065 0.323 -0.420 -0.084 880s 0.414 0.413 1.102 0.113 880s -0.592 -0.626 -1.553 -0.156 880s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 4, 1, 1] 880s 0.783 -0.013 ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s 880s 0.998 -0.202 880s -0.760 0.544 880s 0.913 0.515 880s 880s x1: [2, 4, 1, 1] 880s -0.936 -0.062 880s -0.867 0.094 880s -0.426 -0.305 880s -0.930 -0.293 880s 880s y: [4, 4, 1, 1] 880s -0.733 -0.922 0.678 -0.887 880s -0.680 -0.884 0.710 -0.742 880s -0.330 -0.363 0.158 -0.546 880s -0.725 -0.869 0.548 -1.000 880s testing: mul_mat, [1, 2, 4, 1] = [4, 1, 4, 1] * [4, 2, 4, 1] 880s x0: [4, 1, 4, 1] 880s -0.331 -0.223 0.838 -0.040 880s 880s x1: [4, 2, 4, 1] 880s -0.476 0.993 0.870 -0.272 880s -0.465 -0.708 0.511 0.096 880s 880s y: [1, 2, 4, 1] 880s 0.676 880s 0.736 880s testing: mul_mat, [4, 4, 4, 4] = [1, 4, 4, 4] * [1, 4, 4, 4] 880s x0: [1, 4, 4, 4] 880s 0.801 880s 0.727 880s 0.209 880s -0.603 880s 880s x1: [1, 4, 4, 4] 880s -0.699 880s -0.071 880s 0.618 880s -0.402 880s 880s y: [4, 4, 4, 4] 880s -0.560 -0.508 -0.146 0.421 880s -0.057 -0.051 -0.015 0.043 880s 0.495 0.450 0.129 -0.373 880s -0.322 -0.292 -0.084 0.242 880s test-mul-mat0: iter:309/500 880s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.054 0.839 0.459 880s 0.536 0.762 0.292 880s 0.775 -0.075 0.539 880s 880s x1: [3, 4, 1, 1] 880s 0.026 0.634 0.486 880s 0.168 0.521 0.051 880s 0.619 -0.811 -0.244 880s -0.625 0.335 0.782 880s 880s y: [3, 4, 1, 1] 880s 0.756 0.639 0.234 880s 0.470 0.502 0.118 880s -0.759 -0.357 0.409 880s 0.606 0.149 -0.087 880s testing: mul_mat, [1, 3, 3, 1] = [3, 1, 3, 1] * [3, 3, 3, 1] 880s x0: [3, 1, 3, 1] 880s 0.882 0.497 0.734 880s 880s x1: [3, 3, 3, 1] 880s 0.868 -0.831 -0.830 880s 0.617 0.221 -0.606 880s 0.419 -0.162 0.895 880s 880s y: [1, 3, 3, 1] 880s -0.257 880s 0.209 880s 0.947 880s testing: mul_mat, [2, 1, 3, 4] = [3, 2, 3, 4] * [3, 1, 3, 4] 880s x0: [3, 2, 3, 4] 880s -0.572 0.713 0.941 880s -0.495 -0.914 0.137 880s 880s x1: [3, 1, 3, 4] 880s -0.214 0.725 -0.806 880s 880s y: [2, 1, 3, 4] 880s -0.119 -0.667 880s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 1, 1, 1] 880s 0.913 0.440 -0.409 880s 880s x1: [3, 2, 1, 1] 880s -0.764 -0.677 -0.150 880s 0.497 -0.321 0.376 880s 880s y: [1, 2, 1, 1] 880s -0.934 880s 0.159 880s testing: mul_mat, [3, 3, 3, 1] = [1, 3, 3, 1] * [1, 3, 3, 1] 880s x0: [1, 3, 3, 1] 880s 0.720 880s 0.622 880s -0.809 880s 880s x1: [1, 3, 3, 1] 880s -0.261 880s -0.094 880s -0.810 880s 880s y: [3, 3, 3, 1] 880s -0.188 -0.162 0.211 880s -0.067 -0.058 0.076 880s -0.583 -0.504 0.656 880s testing: mul_mat, [2, 1, 3, 4] = [3, 2, 3, 4] * [3, 1, 3, 4] 880s x0: [3, 2, 3, 4] 880s 0.616 0.076 0.416 880s 0.914 -0.773 0.952 880s 880s x1: [3, 1, 3, 4] 880s 0.695 0.837 -0.427 880s 880s y: [2, 1, 3, 4] 880s 0.314 -0.418 880s test-mul-mat0: iter:310/500 880s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 1, 1, 1] 880s 0.350 0.356 0.966 0.485 880s 880s x1: [4, 3, 1, 1] 880s 0.153 -0.631 -0.307 -0.184 880s 0.418 -0.402 0.198 -0.929 880s 0.407 0.626 -0.525 -0.979 880s 880s y: [1, 3, 1, 1] 880s -0.557 880s -0.257 880s -0.617 880s testing: mul_mat, [2, 1, 4, 1] = [4, 2, 4, 1] * [4, 1, 4, 1] 880s x0: [4, 2, 4, 1] 880s 0.570 -0.234 -0.550 -0.233 880s -0.163 0.857 -0.607 0.312 880s 880s x1: [4, 1, 4, 1] 880s 0.031 0.643 0.956 0.270 880s 880s y: [2, 1, 4, 1] 880s -0.722 0.050 880s testing: mul_mat, [1, 2, 4, 4] = [4, 1, 4, 4] * [4, 2, 4, 4] 880s x0: [4, 1, 4, 4] 880s 0.732 0.596 0.943 -0.435 880s 880s x1: [4, 2, 4, 4] 880s -0.859 0.107 -0.712 -0.022 880s -0.036 -0.319 -0.711 -0.159 880s 880s y: [1, 2, 4, 4] 880s -1.226 880s -0.817 880s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 4, 1, 1] 880s -0.421 0.288 -0.367 0.014 880s 0.326 0.122 0.581 0.905 880s 0.380 0.582 0.982 -0.792 880s 0.919 0.354 0.184 0.428 880s 880s x1: [4, 1, 1, 1] 880s -0.397 -0.209 -0.973 -0.514 880s 880s y: [4, 1, 1, 1] 880s 0.457 -1.186 -0.821 -0.838 880s testing: mul_mat, [1, 4, 4, 1] = [4, 1, 4, 1] * [4, 4, 4, 1] 880s x0: [4, 1, 4, 1] 880s -0.622 -0.197 -0.169 0.887 880s 880s x1: [4, 4, 4, 1] 880s -0.566 -0.782 -0.536 0.414 880s 0.121 -0.924 0.813 -0.097 880s 0.995 0.227 0.506 -0.213 880s 0.254 0.992 -0.544 0.833 880s 880s y: [1, 4, 4, 1] 880s 0.963 880s -0.117 880s -0.938 880s 0.478 880s testing: mul_mat, [3, 4, 4, 4] = [1, 3, 4, 4] * [1, 4, 4, 4] 880s x0: [1, 3, 4, 4] 880s 0.266 880s -0.756 880s -0.094 880s 880s x1: [1, 4, 4, 4] 880s -0.822 880s 0.445 880s -0.079 880s -0.212 880s 880s y: [3, 4, 4, 4] 880s -0.218 0.622 0.077 880s 0.118 -0.337 -0.042 880s -0.021 0.060 0.007 880s -0.056 0.160 0.020 880s test-mul-mat0: iter:311/500 880s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.767 -0.313 880s 0.898 -0.896 880s 880s x1: [2, 3, 1, 1] 880s -0.487 -0.709 880s -0.921 -0.030 880s -0.304 0.246 880s 880s y: [2, 3, 1, 1] 880s -0.151 0.199 880s -0.697 -0.800 880s -0.311 -0.494 880s testing: mul_mat, [2, 2, 4, 1] = [2, 2, 4, 1] * [2, 2, 4, 1] 880s x0: [2, 2, 4, 1] 880s -0.069 -0.171 880s -0.430 0.445 880s 880s x1: [2, 2, 4, 1] 880s -0.629 0.701 880s 0.647 0.536 880s 880s y: [2, 2, 4, 1] 880s -0.077 0.582 880s -0.136 -0.040 880s testing: mul_mat, [1, 2, 4, 3] = [2, 1, 4, 3] * [2, 2, 4, 3] 880s x0: [2, 1, 4, 3] 880s -0.301 0.319 880s 880s x1: [2, 2, 4, 3] 880s 0.333 -0.512 880s 0.962 0.411 880s 880s y: [1, 2, 4, 3] 880s -0.264 880s -0.158 880s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 3, 1, 1] 880s 0.619 -0.521 880s -0.142 0.557 880s 0.320 -0.360 880s 880s x1: [2, 1, 1, 1] 880s -0.224 0.962 880s 880s y: [3, 1, 1, 1] 880s -0.640 0.568 -0.418 880s testing: mul_mat, [1, 2, 4, 1] = [3, 1, 4, 1] * [3, 2, 4, 1] 880s x0: [3, 1, 4, 1] 880s -0.836 0.645 0.723 880s 880s x1: [3, 2, 4, 1] 880s -0.996 -0.527 -0.607 880s -0.910 0.022 -0.838 880s 880s y: [1, 2, 4, 1] 880s 0.054 880s 0.169 880s testing: mul_mat, [3, 3, 4, 3] = [1, 3, 4, 3] * [1, 3, 4, 3] 880s x0: [1, 3, 4, 3] 880s 0.400 880s -0.062 880s -0.942 880s 880s x1: [1, 3, 4, 3] 880s -0.945 880s 0.213 880s -0.447 880s 880s y: [3, 3, 4, 3] 880s -0.378 0.058 0.891 880s 0.085 -0.013 -0.201 880s -0.179 0.028 0.422 880s test-mul-mat0: iter:312/500 880s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 1, 1, 1] 880s 0.008 -0.638 880s 880s x1: [2, 4, 1, 1] 880s 0.680 -0.807 880s 0.275 0.010 880s 0.336 0.767 880s 0.332 -0.490 880s 880s y: [1, 4, 1, 1] 880s 0.520 880s -0.004 880s -0.487 880s 0.315 880s testing: mul_mat, [4, 1, 3, 1] = [2, 4, 3, 1] * [2, 1, 3, 1] 880s x0: [2, 4, 3, 1] 880s -0.990 0.272 880s -0.335 -0.317 880s 0.576 -0.276 880s 0.414 0.995 880s 880s x1: [2, 1, 3, 1] 880s 0.362 -0.206 880s 880s y: [4, 1, 3, 1] 880s -0.414 -0.056 0.265 -0.055 880s testing: mul_mat, [4, 4, 3, 3] = [2, 4, 3, 3] * [2, 4, 3, 3] 880s x0: [2, 4, 3, 3] 880s -0.223 0.376 880s -0.389 0.582 880s 0.596 0.387 880s -0.684 -0.789 880s 880s x1: [2, 4, 3, 3] 880s -0.079 0.437 880s 0.104 0.758 880s 0.047 0.473 880s 0.303 0.057 880s 880s y: [4, 4, 3, 3] 880s 0.182 0.285 0.122 -0.291 880s 0.262 0.401 0.355 -0.670 880s 0.167 0.257 0.211 -0.406 880s -0.046 -0.085 0.202 -0.252 880s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 4, 1, 1] 880s 0.726 -0.322 880s 0.145 0.353 880s 0.755 0.069 880s 0.150 0.498 880s 880s x1: [2, 4, 1, 1] 880s -0.152 0.416 880s -0.930 0.300 880s 0.082 0.926 880s -0.324 -0.375 880s 880s y: [4, 4, 1, 1] 880s -0.244 0.125 -0.086 0.184 880s -0.772 -0.029 -0.681 0.010 880s -0.239 0.339 0.126 0.473 880s -0.114 -0.179 -0.270 -0.235 880s testing: mul_mat, [4, 2, 3, 1] = [4, 4, 3, 1] * [4, 2, 3, 1] 880s x0: [4, 4, 3, 1] 880s 0.252 -0.536 -0.187 0.725 880s 0.395 -0.271 0.424 0.712 880s 0.231 0.291 -0.730 -0.097 880s -0.395 -0.765 -0.375 -0.694 880s 880s x1: [4, 2, 3, 1] 880s -0.314 0.771 -0.916 -0.868 880s -0.965 0.812 -0.698 0.442 880s 880s y: [4, 2, 3, 1] 880s -0.950 -1.339 0.905 0.480 880s -0.227 -0.582 0.480 -0.284 880s testing: mul_mat, [4, 4, 3, 3] = [4, 4, 3, 3] * [4, 4, 3, 3] 880s x0: [4, 4, 3, 3] 880s -0.621 0.257 -0.734 0.247 880s -0.501 -0.646 -0.425 -0.995 880s 0.129 -0.255 -0.277 -0.573 880s -0.655 -0.722 -0.191 0.254 880s 880s x1: [4, 4, 3, 3] 880s 0.937 0.487 -0.236 0.076 880s 0.780 -0.339 -0.943 -0.144 880s 0.457 0.557 0.342 0.749 880s 0.302 -0.619 0.294 -0.559 880s 880s y: [4, 4, 3, 3] 880s -0.265 -0.759 0.019 -0.900 880s 0.085 0.372 0.531 -0.122 880s -0.206 -1.480 -0.607 -0.576 880s -0.701 0.680 0.435 0.051 880s test-mul-mat0: iter:313/500 880s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.902 0.166 880s 0.573 0.522 880s 880s x1: [2, 1, 1, 1] 880s -0.244 0.548 880s 880s y: [2, 1, 1, 1] 880s -0.129 0.146 880s testing: mul_mat, [3, 2, 3, 1] = [2, 3, 3, 1] * [2, 2, 3, 1] 880s x0: [2, 3, 3, 1] 880s 0.248 0.364 880s -0.319 -0.599 880s 0.862 -0.136 880s 880s x1: [2, 2, 3, 1] 880s 0.147 0.061 880s -0.943 -0.177 880s 880s y: [3, 2, 3, 1] 880s 0.059 -0.083 0.118 880s -0.299 0.406 -0.789 880s testing: mul_mat, [2, 3, 3, 1] = [2, 2, 3, 1] * [2, 3, 3, 1] 880s x0: [2, 2, 3, 1] 880s 0.179 0.550 880s 0.373 0.351 880s 880s x1: [2, 3, 3, 1] 880s -0.968 -0.250 880s -0.420 -0.145 880s -0.960 -0.280 880s 880s y: [2, 3, 3, 1] 880s -0.311 -0.449 880s -0.155 -0.208 880s -0.326 -0.456 880s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.298 -0.482 880s 0.150 -0.118 880s 880s x1: [2, 2, 1, 1] 880s 0.986 0.424 880s -0.422 -0.159 880s 880s y: [2, 2, 1, 1] 880s 0.090 0.098 880s -0.049 -0.045 880s testing: mul_mat, [4, 2, 3, 1] = [2, 4, 3, 1] * [2, 2, 3, 1] 880s x0: [2, 4, 3, 1] 880s -0.557 0.389 880s -0.595 -0.741 880s -0.215 0.468 880s 0.513 -0.938 880s 880s x1: [2, 2, 3, 1] 880s -0.739 -0.983 880s 0.054 0.745 880s 880s y: [4, 2, 3, 1] 880s 0.029 1.168 -0.301 0.542 880s 0.260 -0.584 0.337 -0.671 880s testing: mul_mat, [4, 2, 3, 1] = [4, 4, 3, 1] * [4, 2, 3, 1] 880s x0: [4, 4, 3, 1] 880s 0.582 0.675 0.270 0.310 880s 0.425 -0.375 -0.303 -0.204 880s 0.967 -0.604 0.096 -0.964 880s 0.034 0.839 0.451 -0.292 880s 880s x1: [4, 2, 3, 1] 880s 0.362 -0.836 -0.356 -0.699 880s 0.257 -0.819 -0.387 0.700 880s 880s y: [4, 2, 3, 1] 880s -0.666 0.718 1.495 -0.645 880s -0.291 0.391 0.032 -1.058 880s test-mul-mat0: iter:314/500 880s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 1, 1, 1] 880s 0.186 0.298 -0.573 0.329 880s 880s x1: [4, 3, 1, 1] 880s 0.501 -0.663 -0.214 -0.470 880s 0.593 -0.172 -0.559 -0.276 880s -0.191 -0.750 0.438 -0.784 880s 880s y: [1, 3, 1, 1] 880s -0.136 880s 0.288 880s -0.768 880s testing: mul_mat, [2, 1, 2, 1] = [4, 2, 2, 1] * [4, 1, 2, 1] 880s x0: [4, 2, 2, 1] 880s 0.872 0.546 -0.604 0.384 880s -0.238 0.897 0.721 0.548 880s 880s x1: [4, 1, 2, 1] 880s -0.780 -0.902 -0.832 -0.763 880s 880s y: [2, 1, 2, 1] 880s -0.963 -1.642 880s testing: mul_mat, [3, 2, 2, 2] = [4, 3, 2, 2] * [4, 2, 2, 2] 880s x0: [4, 3, 2, 2] 880s -0.326 -0.220 0.072 -0.336 880s 0.699 -0.260 -0.177 -0.145 880s 0.628 0.299 0.060 -0.107 880s 880s x1: [4, 2, 2, 2] 880s 0.401 0.144 -0.338 -0.301 880s 0.571 0.991 -0.081 0.668 880s 880s y: [3, 2, 2, 2] 880s -0.086 0.346 0.307 880s -0.634 0.059 0.578 880s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 1, 1, 1] 880s -0.655 0.981 -0.343 0.824 880s 880s x1: [4, 3, 1, 1] 880s 0.718 0.171 -0.435 -0.887 880s -0.519 -0.157 -0.997 0.031 880s 0.657 -0.426 0.937 -0.518 880s 880s y: [1, 3, 1, 1] 880s -0.885 880s 0.553 880s -1.596 880s testing: mul_mat, [2, 4, 2, 1] = [1, 2, 2, 1] * [1, 4, 2, 1] 880s x0: [1, 2, 2, 1] 880s -0.350 880s 0.004 880s 880s x1: [1, 4, 2, 1] 880s -0.856 880s 0.788 880s -0.957 880s 0.130 880s 880s y: [2, 4, 2, 1] 880s 0.299 -0.004 880s -0.276 0.003 880s 0.335 -0.004 880s -0.045 0.001 880s testing: mul_mat, [4, 1, 2, 2] = [2, 4, 2, 2] * [2, 1, 2, 2] 880s x0: [2, 4, 2, 2] 880s -0.148 -0.296 880s 0.479 0.846 880s 0.501 0.314 880s 0.333 0.361 880s 880s x1: [2, 1, 2, 2] 880s 0.787 -0.211 880s 880s y: [4, 1, 2, 2] 880s -0.054 0.198 0.328 0.186 880s test-mul-mat0: iter:315/500 880s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 2, 1, 1] 880s 0.442 880s -0.301 880s 880s x1: [1, 1, 1, 1] 880s 0.029 880s 880s y: [2, 1, 1, 1] 880s 0.013 -0.009 880s testing: mul_mat, [3, 2, 2, 1] = [1, 3, 2, 1] * [1, 2, 2, 1] 880s x0: [1, 3, 2, 1] 880s -0.987 880s 0.631 880s 0.259 880s 880s x1: [1, 2, 2, 1] 880s 0.040 880s -0.409 880s 880s y: [3, 2, 2, 1] 880s -0.039 0.025 0.010 880s 0.403 -0.258 -0.106 880s testing: mul_mat, [4, 3, 2, 2] = [1, 4, 2, 2] * [1, 3, 2, 2] 880s x0: [1, 4, 2, 2] 880s -0.607 880s -0.285 880s 0.987 880s 0.422 880s 880s x1: [1, 3, 2, 2] 880s 0.546 880s 0.820 880s -0.458 880s 880s y: [4, 3, 2, 2] 880s -0.332 -0.156 0.539 0.231 880s -0.498 -0.234 0.809 0.346 880s 0.278 0.130 -0.452 -0.193 880s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 1, 1, 1] 880s 0.973 880s 880s x1: [1, 4, 1, 1] 880s -0.119 880s -0.199 880s -0.665 880s -0.299 880s 880s y: [1, 4, 1, 1] 880s -0.116 880s -0.194 880s -0.647 880s -0.291 880s testing: mul_mat, [2, 1, 2, 1] = [1, 2, 2, 1] * [1, 1, 2, 1] 880s x0: [1, 2, 2, 1] 880s 0.634 880s -0.714 880s 880s x1: [1, 1, 2, 1] 880s -0.892 880s 880s y: [2, 1, 2, 1] 880s -0.566 0.637 880s testing: mul_mat, [1, 1, 2, 2] = [2, 1, 2, 2] * [2, 1, 2, 2] 880s x0: [2, 1, 2, 2] 880s 0.082 -0.525 880s 880s x1: [2, 1, 2, 2] 880s -0.267 0.040 880s 880s y: [1, 1, 2, 2] 880s -0.043 880s test-mul-mat0: iter:316/500 880s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 3, 1, 1] 880s 0.722 880s 0.558 880s 0.966 880s 880s x1: [1, 3, 1, 1] 880s -0.776 880s -0.051 880s -0.047 880s 880s y: [3, 3, 1, 1] 880s -0.560 -0.433 -0.750 880s -0.037 -0.029 -0.050 880s -0.034 -0.026 -0.045 880s testing: mul_mat, [2, 3, 4, 1] = [1, 2, 4, 1] * [1, 3, 4, 1] 880s x0: [1, 2, 4, 1] 880s 0.688 880s -0.205 880s 880s x1: [1, 3, 4, 1] 880s -0.992 880s -0.309 880s -0.190 880s 880s y: [2, 3, 4, 1] 880s -0.683 0.203 880s -0.212 0.063 880s -0.131 0.039 880s testing: mul_mat, [3, 2, 4, 2] = [1, 3, 4, 2] * [1, 2, 4, 2] 880s x0: [1, 3, 4, 2] 880s 0.967 880s 0.308 880s 0.651 880s 880s x1: [1, 2, 4, 2] 880s 0.226 880s -0.892 880s 880s y: [3, 2, 4, 2] 880s 0.219 0.070 0.147 880s -0.862 -0.275 -0.580 880s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 3, 1, 1] 880s -0.486 880s -0.200 880s 0.046 880s 880s x1: [1, 3, 1, 1] 880s 0.801 880s 0.304 880s -0.804 880s 880s y: [3, 3, 1, 1] 880s -0.389 -0.160 0.037 880s -0.148 -0.061 0.014 880s 0.390 0.161 -0.037 880s testing: mul_mat, [4, 1, 4, 1] = [3, 4, 4, 1] * [3, 1, 4, 1] 880s x0: [3, 4, 4, 1] 880s -0.741 0.285 -0.408 880s 0.178 0.578 -0.067 880s 0.343 0.107 -0.186 880s -0.931 0.175 -0.607 880s 880s x1: [3, 1, 4, 1] 880s -0.519 -0.891 -0.303 880s 880s y: [4, 1, 4, 1] 880s 0.253 -0.587 -0.217 0.510 880s testing: mul_mat, [2, 3, 4, 2] = [4, 2, 4, 2] * [4, 3, 4, 2] 880s x0: [4, 2, 4, 2] 880s -0.283 0.071 -0.565 -0.438 880s -0.741 -0.814ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s 0.154 0.090 880s 880s x1: [4, 3, 4, 2] 880s 0.084 -0.374 0.632 -0.450 880s 0.520 -0.548 0.379 -0.055 880s 0.759 0.298 0.465 0.793 880s 880s y: [2, 3, 4, 2] 880s -0.210 0.299 880s -0.376 0.114 880s -0.803 -0.662 880s test-mul-mat0: iter:317/500 880s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 2, 1, 1] 880s -0.727 -0.010 0.143 -0.505 880s 0.816 -0.287 -0.216 -0.857 880s 880s x1: [4, 4, 1, 1] 880s -0.716 0.224 -0.475 -0.713 880s 0.382 0.915 -0.045 -0.924 880s -0.340 0.979 -0.537 0.515 880s -0.942 -0.059 0.348 0.475 880s 880s y: [2, 4, 1, 1] 880s 0.810 0.064 880s 0.173 0.849 880s -0.099 -0.884 880s 0.496 -1.234 880s testing: mul_mat, [3, 2, 3, 1] = [4, 3, 3, 1] * [4, 2, 3, 1] 880s x0: [4, 3, 3, 1] 880s -0.886 -0.602 0.662 -0.173 880s 0.182 0.805 -0.631 -0.422 880s -0.153 -0.814 0.348 -0.486 880s 880s x1: [4, 2, 3, 1] 880s 0.542 0.397 0.042 0.817 880s -0.230 0.667 -0.899 0.994 880s 880s y: [3, 2, 3, 1] 880s -0.833 0.047 -0.788 880s -0.965 0.644 -1.304 880s testing: mul_mat, [2, 3, 3, 3] = [4, 2, 3, 3] * [4, 3, 3, 3] 880s x0: [4, 2, 3, 3] 880s -0.190 0.673 -0.535 0.502 880s 0.113 -0.098 -0.567 0.607 880s 880s x1: [4, 3, 3, 3] 880s -0.109 0.221 0.550 0.739 880s 0.407 -0.102 -0.747 0.694 880s -0.760 -0.042 -0.631 -0.042 880s 880s y: [2, 3, 3, 3] 880s 0.246 0.103 880s 0.602 0.901 880s 0.433 0.251 880s testing: mul_mat, [3, 2, 1, 1] = [4, 3, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 3, 1, 1] 880s -0.223 0.211 -0.161 0.985 880s 0.602 0.804 0.323 0.381 880s -0.533 0.358 0.197 0.423 880s 880s x1: [4, 2, 1, 1] 880s -0.291 -0.808 -0.869 0.859 880s -0.317 0.841 0.951 0.105 880s 880s y: [3, 2, 1, 1] 880s 0.880 -0.778 0.058 880s 0.198 0.832 0.701 880s testing: mul_mat, [1, 4, 3, 1] = [3, 1, 3, 1] * [3, 4, 3, 1] 880s x0: [3, 1, 3, 1] 880s 0.655 0.536 0.628 880s 880s x1: [3, 4, 3, 1] 880s 0.459 0.653 -0.498 880s -0.239 0.250 0.335 880s -0.675 0.780 -0.236 880s 0.210 -0.511 -0.044 880s 880s y: [1, 4, 3, 1] 880s 0.337 880s 0.188 880s -0.172 880s -0.164 880s testing: mul_mat, [3, 3, 3, 3] = [1, 3, 3, 3] * [1, 3, 3, 3] 880s x0: [1, 3, 3, 3] 880s -0.107 880s 0.984 880s -0.121 880s 880s x1: [1, 3, 3, 3] 880s 0.386 880s -0.328 880s -0.034 880s 880s y: [3, 3, 3, 3] 880s -0.041 0.380 -0.047 880s 0.035 -0.323 0.040 880s 0.004 -0.034 0.004 880s test-mul-mat0: iter:318/500 880s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 4, 1, 1] 880s 0.518 0.238 -0.198 880s 0.921 -0.606 -0.937 880s -0.642 0.011 -0.255 880s 0.955 -0.439 -0.547 880s 880s x1: [3, 1, 1, 1] 880s -0.722 -0.138 0.392 880s 880s y: [4, 1, 1, 1] 880s -0.484 -0.949 0.362 -0.843 880s testing: mul_mat, [4, 4, 3, 1] = [3, 4, 3, 1] * [3, 4, 3, 1] 880s x0: [3, 4, 3, 1] 880s 0.716 -0.058 0.479 880s 0.813 0.420 -0.329 880s 0.296 -0.224 -0.625 880s 0.057 -0.006 0.472 880s 880s x1: [3, 4, 3, 1] 880s 0.821 0.899 -0.788 880s -0.474 0.531 0.551 880s 0.400 0.212 0.335 880s -0.902 -0.522 0.191 880s 880s y: [4, 4, 3, 1] 880s 0.157 1.305 0.534 -0.331 880s -0.106 -0.344 -0.604 0.230 880s 0.435 0.304 -0.138 0.180 880s -0.524 -1.016 -0.270 0.042 880s testing: mul_mat, [1, 4, 3, 1] = [3, 1, 3, 1] * [3, 4, 3, 1] 880s x0: [3, 1, 3, 1] 880s 0.852 -0.262 0.137 880s 880s x1: [3, 4, 3, 1] 880s -0.125 0.300 0.641 880s 0.250 -0.642 -0.365 880s -0.278 0.733 0.364 880s -0.472 -0.772 0.749 880s 880s y: [1, 4, 3, 1] 880s -0.098 880s 0.331 880s -0.379 880s -0.097 880s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 4, 1, 1] 880s 0.879 0.382 0.190 880s -0.770 -0.441 0.766 880s 0.373 -0.240 -0.928 880s 0.957 -0.275 -0.313 880s 880s x1: [3, 1, 1, 1] 880s -0.221 -0.619 0.553 880s 880s y: [4, 1, 1, 1] 880s -0.325 0.866 -0.447 -0.214 880s testing: mul_mat, [1, 3, 3, 1] = [4, 1, 3, 1] * [4, 3, 3, 1] 880s x0: [4, 1, 3, 1] 880s -0.903 0.524 0.886 0.107 880s 880s x1: [4, 3, 3, 1] 880s 0.647 -0.280 0.824 -0.222 880s 0.725 0.597 -0.370 -0.537 880s -0.266 0.010 0.428 0.620 880s 880s y: [1, 3, 3, 1] 880s -0.024 880s -0.727 880s 0.692 880s testing: mul_mat, [3, 4, 3, 1] = [1, 3, 3, 1] * [1, 4, 3, 1] 880s x0: [1, 3, 3, 1] 880s -0.981 880s 0.126 880s -0.803 880s 880s x1: [1, 4, 3, 1] 880s -0.284 880s -0.507 880s -0.193 880s -0.989 880s 880s y: [3, 4, 3, 1] 880s 0.278 -0.036 0.228 880s 0.498 -0.064 0.408 880s 0.189 -0.024 0.155 880s 0.970 -0.124 0.794 880s test-mul-mat0: iter:319/500 880s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 3, 1, 1] 880s 0.761 -0.658 880s -0.921 0.773 880s 0.351 -0.294 880s 880s x1: [2, 2, 1, 1] 880s 0.917 -0.097 880s 0.045 0.849 880s 880s y: [3, 2, 1, 1] 880s 0.761 -0.919 0.351 880s -0.525 0.615 -0.234 880s testing: mul_mat, [3, 3, 3, 1] = [2, 3, 3, 1] * [2, 3, 3, 1] 880s x0: [2, 3, 3, 1] 880s -0.005 0.273 880s 0.616 -0.961 880s 0.122 -0.112 880s 880s x1: [2, 3, 3, 1] 880s 0.638 0.638 880s 0.878 -0.452 880s -0.033 0.667 880s 880s y: [3, 3, 3, 1] 880s 0.171 -0.219 0.006 880s -0.128 0.975 0.158 880s 0.182 -0.661 -0.079 880s testing: mul_mat, [2, 3, 3, 1] = [2, 2, 3, 1] * [2, 3, 3, 1] 880s x0: [2, 2, 3, 1] 880s -0.492 -0.103 880s 0.225 -0.919 880s 880s x1: [2, 3, 3, 1] 880s -0.051 -0.595 880s -0.990 -0.665 880s -0.124 -0.010 880s 880s y: [2, 3, 3, 1] 880s 0.087 0.535 880s 0.555 0.388 880s 0.062 -0.019 880s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 2, 1, 1] 880s -0.333 0.793 880s 0.408 -0.651 880s 880s x1: [2, 2, 1, 1] 880s -0.998 0.988 880s 0.677 -0.663 880s 880s y: [2, 2, 1, 1] 880s 1.116 -1.050 880s -0.751 0.707 880s testing: mul_mat, [2, 2, 3, 1] = [2, 2, 3, 1] * [2, 2, 3, 1] 880s x0: [2, 2, 3, 1] 880s 0.432 0.956 880s 0.739 -0.275 880s 880s x1: [2, 2, 3, 1] 880s -0.550 0.467 880s 0.387 -0.796 880s 880s y: [2, 2, 3, 1] 880s 0.209 -0.535 880s -0.594 0.505 880s testing: mul_mat, [1, 2, 3, 1] = [2, 1, 3, 1] * [2, 2, 3, 1] 880s x0: [2, 1, 3, 1] 880s -0.425 -0.020 880s 880s x1: [2, 2, 3, 1] 880s -0.884 -0.662 880s -0.586 0.524 880s 880s y: [1, 2, 3, 1] 880s 0.389 880s 0.239 880s test-mul-mat0: iter:320/500 880s testing: mul_mat, [1, 1, 1, 1] = [2, 1, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 1, 1, 1] 880s -0.863 0.077 880s 880s x1: [2, 1, 1, 1] 880s -0.349 0.006 880s 880s y: [1, 1, 1, 1] 880s 0.302 880s testing: mul_mat, [3, 1, 4, 1] = [2, 3, 4, 1] * [2, 1, 4, 1] 880s x0: [2, 3, 4, 1] 880s 0.252 0.678 880s -0.847 0.430 880s 0.411 -0.998 880s 880s x1: [2, 1, 4, 1] 880s 0.638 0.611 880s 880s y: [3, 1, 4, 1] 880s 0.576 -0.278 -0.348 880s testing: mul_mat, [2, 3, 4, 4] = [2, 2, 4, 4] * [2, 3, 4, 4] 880s x0: [2, 2, 4, 4] 880s 0.645 -0.548 880s 0.200 0.706 880s 880s x1: [2, 3, 4, 4] 880s -0.076 -0.396 880s 0.086 -0.837 880s 0.883 0.166 880s 880s y: [2, 3, 4, 4] 880s 0.168 -0.294 880s 0.514 -0.574 880s 0.478 0.293 880s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 2, 1, 1] 880s -0.459 -0.367 880s 0.370 -0.147 880s 880s x1: [2, 2, 1, 1] 880s 0.128 0.079 880s 0.366 0.832 880s 880s y: [2, 2, 1, 1] 880s -0.088 0.036 880s -0.473 0.013 880s testing: mul_mat, [3, 2, 4, 1] = [2, 3, 4, 1] * [2, 2, 4, 1] 880s x0: [2, 3, 4, 1] 880s -0.459 -0.044 880s -0.324 -0.819 880s 0.045 -0.827 880s 880s x1: [2, 2, 4, 1] 880s 0.305 -0.473 880s 0.946 -0.244 880s 880s y: [3, 2, 4, 1] 880s -0.119 0.289 0.405 880s -0.423 -0.107 0.244 880s testing: mul_mat, [1, 2, 4, 4] = [3, 1, 4, 4] * [3, 2, 4, 4] 880s x0: [3, 1, 4, 4] 880s -0.752 0.163 0.580 880s 880s x1: [3, 2, 4, 4] 880s -0.748 -0.379 0.222 880s 0.345 0.707 0.088 880s 880s y: [1, 2, 4, 4] 880s 0.630 880s -0.093 880s test-mul-mat0: iter:321/500 880s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 2, 1, 1] 880s -0.507 0.710 0.514 -0.755 880s 0.415 0.387 -0.437 -0.004 880s 880s x1: [4, 3, 1, 1] 880s -0.171 0.358 0.205 -0.577 880s 0.939 -0.281 0.085 -0.200 880s 0.625 -0.813 0.909 0.694 880s 880s y: [2, 3, 1, 1] 880s 0.882 -0.019 880s -0.481 0.244 880s -0.951 -0.456 880s testing: mul_mat, [2, 2, 1, 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 2, 1, 1] 880s -0.397 -0.863 -0.335 0.808 880s -0.440 -0.397 -0.472 0.645 880s 880s x1: [4, 2, 1, 1] 880s -0.698 -0.647 -0.330 0.636 880s 0.041 0.201 -0.954 0.307 880s 880s y: [2, 2, 1, 1] 880s 1.461 1.130 880s 0.378 0.551 880s testing: mul_mat, [4, 2, 1, 2] = [4, 4, 1, 2] * [4, 2, 1, 2] 880s x0: [4, 4, 1, 2] 880s 0.541 -0.993 0.839 0.588 880s 0.314 0.145 -0.809 0.451 880s 0.810 0.999 -0.989 -0.587 880s -0.473 0.655 0.817 -0.320 880s 880s x1: [4, 2, 1, 2] 880s 0.403 -0.847 0.832 0.312 880s 0.847 -0.297 0.805 0.557 880s 880s y: [4, 2, 1, 2] 880s 1.942 -0.529 -1.526 -0.166 880s 1.757 -0.177 -0.735 -0.116 880s testing: mul_mat, [3, 4, 1, 1] = [4, 3, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 3, 1, 1] 880s -0.031 0.053 -0.890 0.134 880s -0.322 0.084 0.094 0.010 880s -0.251 0.733 0.330 0.892 880s 880s x1: [4, 4, 1, 1] 880s -0.431 0.501 0.044 0.883 880s -0.353 0.235 0.334 -0.543 880s 0.235 0.344 -0.130 0.762 880s -0.000 -0.314 -0.558 -0.513 880s 880s y: [3, 4, 1, 1] 880s 0.119 0.194 1.279 880s -0.347 0.160 -0.114 880s 0.229 -0.051 0.831 880s 0.411 -0.084 -0.871 880s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 4, 1, 1] 880s 0.743 -0.490 0.522 880s -0.977 -0.227 -0.793 880s -0.042 0.154 -0.728 880s -0.815 0.188 0.255 880s 880s x1: [3, 4, 1, 1] 880s -0.203 -0.533 -0.539 880s -0.702 0.511 -0.655 880s -0.055 -0.254 0.678 880s 0.401 0.981 0.023 880s 880s y: [4, 4, 1, 1] 880s -0.171 0.747 0.319 -0.072 880s -1.114 1.090 0.585 0.501 880s 0.437 -0.426 -0.530 0.170 880s -0.171 -0.633 0.117 -0.137 880s testing: mul_mat, [4, 3, 1, 2] = [4, 4, 1, 2] * [4, 3, 1, 2] 880s x0: [4, 4, 1, 2] 880s -0.838 0.977 -0.744 -0.615 880s 0.705 0.671 -0.863 -0.437 880s -0.584 -0.314 0.720 0.673 880s -0.631 0.344 -0.266 -0.070 880s 880s x1: [4, 3, 1, 2] 880s -0.683 0.366 -0.415 0.451 880s -0.624 -0.522 -0.752 -0.157 880s -0.061 -0.454 -0.646 0.284 880s 880s y: [4, 3, 1, 2] 880s 0.962 -0.075 0.289 0.636 880s 0.669 -0.072 -0.119 0.425 880s -0.087 0.086 -0.096 0.034 880s test-mul-mat0: iter:322/500 880s testing: mul_mat, [3, 1, 1, 1] = [1, 3, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 3, 1, 1] 880s 0.455 880s 0.834 880s -0.719 880s 880s x1: [1, 1, 1, 1] 880s 0.578 880s 880s y: [3, 1, 1, 1] 880s 0.263 0.482 -0.415 880s testing: mul_mat, [1, 3, 4, 1] = [1, 1, 4, 1] * [1, 3, 4, 1] 880s x0: [1, 1, 4, 1] 880s 0.021 880s 880s x1: [1, 3, 4, 1] 880s 0.175 880s -0.432 880s -0.334 880s 880s y: [1, 3, 4, 1] 880s 0.004 880s -0.009 880s -0.007 880s testing: mul_mat, [2, 1, 4, 4] = [1, 2, 4, 4] * [1, 1, 4, 4] 880s x0: [1, 2, 4, 4] 880s 0.975 880s -0.977 880s 880s x1: [1, 1, 4, 4] 880s -0.598 880s 880s y: [2, 1, 4, 4] 880s -0.582 0.584 880s testing: mul_mat, [3, 2, 1, 1] = [1, 3, 1, 1] * [1, 2, 1, 1] 880s x0: [1, 3, 1, 1] 880s -0.283 880s 0.118 880s 0.957 880s 880s x1: [1, 2, 1, 1] 880s 0.496 880s 0.248 880s 880s y: [3, 2, 1, 1] 880s -0.140 0.058 0.474 880s -0.070 0.029 0.237 880s testing: mul_mat, [1, 1, 4, 1] = [3, 1, 4, 1] * [3, 1, 4, 1] 880s x0: [3, 1, 4, 1] 880s 0.132 -0.546 -0.730 880s 880s x1: [3, 1, 4, 1] 880s 0.056 -0.683 0.022 880s 880s y: [1, 1, 4, 1] 880s 0.364 880s testing: mul_mat, [1, 3, 4, 4] = [1, 1, 4, 4] * [1, 3, 4, 4] 880s x0: [1, 1, 4, 4] 880s 0.634 880s 880s x1: [1, 3, 4, 4] 880s -0.729 880s 0.782 880s -0.168 880s 880s y: [1, 3, 4, 4] 880s -0.462 880s 0.496 880s -0.106 880s test-mul-mat0: iter:323/500 880s testing: mul_mat, [3, 1, 1, 1] = [3, 3, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.421 -0.837 -0.266 880s -0.107 -0.250 -0.181 880s 0.310 -0.616 0.336 880s 880s x1: [3, 1, 1, 1] 880s -0.567 -0.321 0.108 880s 880s y: [3, 1, 1, 1] 880s 0.001 0.121 0.058 880s testing: mul_mat, [3, 3, 4, 1] = [3, 3, 4, 1] * [3, 3, 4, 1] 880s x0: [3, 3, 4, 1] 880s -0.191 -0.707 -0.108 880s 0.755 0.514 0.136 880s -0.601 -0.085 0.179 880s 880s x1: [3, 3, 4, 1] 880s 0.127 0.335 0.895 880s -0.653 0.691 0.800 880s -0.892 0.595 0.826 880s 880s y: [3, 3, 4, 1] 880s -0.358 0.390 0.055 880s -0.450 -0.029 0.477 880s -0.339 -0.255 0.634 880s testing: mul_mat, [2, 3, 4, 4] = [3, 2, 4, 4] * [3, 3, 4, 4] 880s x0: [3, 2, 4, 4] 880s 0.499 0.641 -0.379 880s 0.563 0.357 0.599 880s 880s x1: [3, 3, 4, 4] 880s -0.495 0.630 -0.258 880s 0.685 -0.870 -0.418 880s -0.870 0.163 -0.975 880s 880s y: [2, 3, 4, 4] 880s 0.255 -0.208 880s -0.058 -0.176 880s 0.040 -1.016 880s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 2, 1, 1] 880s -0.351 0.855 -0.039 880s -0.492 -0.673 -0.020 880s 880s x1: [3, 2, 1, 1] 880s -0.696 0.735 0.421 880s 0.683 -0.280 -0.220 880s 880s y: [2, 2, 1, 1] 880s 0.856 -0.161 880s -0.471 -0.143 880s testing: mul_mat, [4, 3, 4, 1] = [2, 4, 4, 1] * [2, 3, 4, 1] 880s x0: [2, 4, 4, 1] 880s -0.445 -0.822 880s 0.217 -0.391 880s 0.629 0.317 880s -0.118 -0.835 880s 880s x1: [2, 3, 4, 1] 880s -0.565 0.987 880s -0.142 0.171 880s -0.490 -0.008 880s 880s y: [4, 3, 4, 1] 880s -0.559 -0.509 -0.043 -0.757 880s -0.077 -0.098 -0.035 -0.126 880s 0.225 -0.103 -0.311 0.064 880s testing: mul_mat, [1, 2, 4, 4] = [4, 1, 4, 4] * [4, 2, 4, 4] 880s x0: [4, 1, 4, 4] 880s 0.317 0.542 0.796 -0.290 880s 880s x1: [4, 2, 4, 4] 880s 0.053 0.896 -0.987 0.232 880s -0.495 0.329 0.397 0.972 880s 880s y: [1, 2, 4, 4] 880s -0.351 880s 0.056 880s test-mul-mat0: iter:324/500 880s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 1, 1, 1] 880s 0.568 -0.302 -0.226 0.868 880s 880s x1: [4, 3, 1, 1] 880s 0.223 0.438 -0.671 0.983 880s -0.677 0.009 -0.309 0.003 880s -0.590 -0.151 -0.597 0.333 880s 880s y: [1, 3, 1, 1] 880s 1.000 880s -0.315 880s 0.134 880s testing: mul_mat, [4, 1, 2, 1] = [4, 4, 2, 1] * [4, 1, 2, 1] 880s x0: [4, 4, 2, 1] 880s 0.440 -0.474 0.179 0.689 880s 0.790 -0.598 0.127 -0.881 880s -0.615 0.450 0.128 0.077 880s -0.547 0.538 0.925 -0.143 880s 880s x1: [4, 1, 2, 1] 880s -0.785 -0.378 0.239 -0.636 880s 880s y: [4, 1, 2, 1] 880s -0.562 0.197 0.295 0.538 880s testing: mul_mat, [1, 4, 2, 4] = [4, 1, 2, 4] * [4, 4, 2, 4] 880s x0: [4, 1, 2, 4] 880s 0.544 -0.303 -0.548 -0.806 880s 880s x1: [4, 4, 2, 4] 880s 0.750 -0.938 -0.798 0.540 880s -0.536 0.328 0.659 -0.151 880s -0.222 -0.213 0.925 0.231 880s -0.675 0.851 -0.912 0.195 880s 880s y: [1, 4, 2, 4] 880s 0.694 880s -0.630 880s -0.750 880s -0.283 880s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 4, 1, 1] 880s -0.207 -0.375 -0.059 0.522 880s 0.119 0.093 0.805 0.266 880s -0.624 -0.871 0.628 0.950 880s -0.924 -0.806 0.216 0.602 880s 880s x1: [4, 1, 1, 1] 880s 0.734 0.695 -0.343 0.713 880s 880s y: [4, 1, 1, 1] 880s -0.020 0.065 -0.602 -0.884 880s testing: mul_mat, [2, 4, 2, 1] = [4, 2, 2, 1] * [4, 4, 2, 1] 880s x0: [4, 2, 2, 1] 880s 0.758 -0.065 -0.429 0.105 880s 0.353 -0.190 0.964 -0.748 880s 880s x1: [4, 4, 2, 1] 880s 0.747 0.186 0.493 -0.701 880s 0.992 0.052 0.218 0.029 880s -0.705 0.681 -0.237 0.990 880s -0.662 -0.524 -0.819 0.131 880s 880s y: [2, 4, 2, 1] 880s 0.269 1.228 880s 0.658 0.529 880s -0.373 -1.348 880s -0.102 -1.022 880s testing: mul_mat, [1, 4, 2, 4] = [2, 1, 2, 4] * [2, 4, 2, 4] 880s x0: [2, 1, 2, 4] 880s 0.349 -0.437 880s 880s x1: [2, 4, 2, 4] 880s -0.072 0.016 880s 0.482 -0.736 880s 0.429 -0.926 880s 0.404 0.470 880s 880s y: [1, 4, 2, 4] 880s -0.032 880s 0.490 880s 0.555 880s -0.065 880s test-mul-mat0: iter:325/500 880s testing: mul_mat, [3, 4, 1, 1] = [2, 3, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 3, 1, 1] 880s -0.397 0.363 880s 0.637 0.166 880s -0.211 0.900 880s 880s x1: [2, 4, 1, 1] 880s 0.525 0.952 880s 0.856 -0.995 880s 0.724 -0.551 880s -0.307 0.700 880s 880s y: [3, 4, 1, 1] 880s 0.138 0.493 0.746 880s -0.701 0.380 -1.076 880s -0.487 0.369 -0.649 880s 0.376 -0.079 0.695 880s testing: mul_mat, [3, 3, 4, 1] = [2, 3, 4, 1] * [2, 3, 4, 1] 880s x0: [2, 3, 4, 1] 880s 0.291 0.199 880s 0.700 -0.543 880s 0.988 0.601 880s 880s x1: [2, 3, 4, 1] 880s 0.852 0.856 880s 0.144 0.438 880s -0.795 0.698 880s 880s y: [3, 3, 4, 1] 880s 0.418 0.132 1.356 880s 0.129 -0.137 0.405 880s -0.092 -0.936 -0.366 880s testing: mul_mat, [3, 3, 4, 1] = [2, 3, 4, 1] * [2, 3, 4, 1] 880s x0: [2, 3, 4, 1] 880s 0.785 0.806 880s 0.061 -0.365 880s -0.265 0.935 880s 880s x1: [2, 3, 4, 1] 880s -0.824 -0.076 880s 0.010 0.952 880s 0.611 -0.154 880s 880s y: [3, 3, 4, 1] 880s -0.708 -0.023 0.147 880s 0.775 -0.347 0.887 880s 0.356 0.094 -0.306 880s testing: mul_mat, [3, 3, 1, 1] = [2, 3, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 3, 1, 1] 880s 0.725 -0.641 880s -0.265 0.469 880s -0.368 -0.433 880s 880s x1: [2, 3, 1, 1] 880s -0.627 0.534 880s -0.938 -0.029 880s 0.940 -0.071 880s 880s y: [3, 3, 1, 1] 880s -0.797 0.417 -0.000 880s -0.661 0.235 0.358 880s 0.727 -0.283 -0.315 880s testing: mul_mat, [3, 2, 4, 1] = [3, 3, 4, 1] * [3, 2, 4, 1] 880s x0: [3, 3, 4, 1] 880s 0.117 0.476 -0.353 880s -0.848 0.622 -0.250 880s -0.570 -0.003 -0.491 880s 880s x1: [3, 2, 4, 1] 880s 0.170 0.128 0.512 880s 0.933 -0.602 0.304 880s 880s y: [3, 2, 4, 1] 880s -0.100 -0.193 -0.349 880s -0.285 -1.242 -0.679 880s testing: mul_mat, [3, 3, 4, 1] = [3, 3, 4, 1] * [3, 3, 4, 1] 880s x0: [3, 3, 4, 1] 880s -0.172 0.204 0.542 880s -0.909 0.581 -0.420 880s 0.653 0.910 0.288 880s 880s x1: [3, 3, 4, 1] 880s 0.100 0.124 0.575 880s 0.609 0.704 -0.278 880s 0.422 -0.005 -0.561 880s 880s y: [3, 3, 4, 1] 880s 0.320 -0.261 0.344 880s -0.112 -0.028 0.958 880s -0.377 -0.152 0.109 880s test-mul-mat0: iter:326/500 880s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 1, 1, 1] 880s 0.401 -0.973 0.131 -0.193 880s 880s x1: [4, 1, 1, 1] 880s -0.405 -0.815 -0.213 0.393 880s 880s y: [1, 1, 1, 1] 880s 0.528 880s testing: mul_mat, [1, 1, 2, 1] = [4, 1, 2, 1] * [4, 1, 2, 1] 880s x0: [4, 1, 2, 1] 880s -0.302 -0.684 -0.705 -0.072 880s 880s x1: [4, 1, 2, 1] 880s 0.546 0.655 -0.994 0.156 880s 880s y: [1, 1, 2, 1] 880s 0.077 880s testing: mul_mat, [4, 1, 2, 3] = [4, 4, 2, 3] * [4, 1, 2, 3] 880s x0: [4, 4, 2, 3] 880s -0.208 -0.956 0.010 0.137 880s -0.146 0.034 0.113 0.459 880s 0.054 0.301 0.145 0.670 880s 0.674 0.618 0.679 0.246 880s 880s x1: [4, 1, 2, 3] 880s 0.976 -0.396 -0.980 -0.812 880s 880s y: [4, 1, 2, 3] 880s 0.054 -0.639 -0.752 -0.451 880s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 2, 1, 1] 880s -0.349 0.453 -0.904 -0.944 880s 0.488 0.330 -0.201 0.944 880s 880s x1: [4, 4, 1, 1] 880s -0.310 0.251 -0.722 -0.230 880s -0.138 -0.900 -0.286 0.540 880s -0.203 -0.805 -0.095 -0.951 880s -0.541 -0.552 0.568 0.455 880s 880s y: [2, 4, 1, 1] 880s 1.092 -0.140 880s -0.610 0.203 880s 0.690 -1.243 880s -1.004 -0.131 880s testing: mul_mat, [3, 4, 2, 1] = [2, 3, 2, 1] * [2, 4, 2, 1] 880s x0: [2, 3, 2, 1] 880s -0.975 0.593 880s 0.902 0.357 880s -0.036 0.307 880s 880s x1: [2, 4, 2, 1] 880s 0.902 -0.604 880s 0.140 0.074 880s 0.959 -0.199 880s 0.765 0.210 880s 880s y: [3, 4, 2, 1] 880s -1.238 0.598 -0.218 880s -0.092 0.153 0.018 880s -1.053 0.794 -0.096 880s -0.621 0.765 0.037 880s testing: mul_mat, [1, 2, 2, 3] = [3, 1, 2, 3] * [3, 2, 2, 3] 880s x0: [3, 1, 2, 3] 880s -0.860 -0.696 -0.721 880s 880s x1: [3, 2, 2, 3] 880s 0.631 -0.116 0.461 880s -0.973 -0.976 -0.465 880s 880s y: [1, 2, 2, 3] 880s -0.794 880s 1.851 880s test-mul-mat0: iter:327/500 880s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 1, 1, 1] 880s -0.242 880s 880s x1: [1, 4, 1, 1] 880s 0.532 880s 0.135 880s 0.313 880s 0.660 880s 880s y: [1, 4, 1, 1] 880s -0.129 880s -0.033 880s -0.076 880s -0.160 880s testing: mul_mat, [2, 1, 4, 1] = [1, 2, 4, 1] * [1, 1, 4, 1] 880s x0: [1, 2, 4, 1] 880s 0.017 880s 0.291 880s 880s x1: [1, 1, 4, 1] 880s 0.814 880s 880s y: [2, 1, 4, 1] 880s 0.014 0.237 880s testing: mul_mat, [2, 2, 4, 3] = [1, 2, 4, 3] * [1, 2, 4, 3] 880s x0: [1, 2, 4, 3] 880s -0.552 880s 0.681 880s 880s x1: [1, 2, 4, 3] 880s -0.777 880s -0.950 880s 880s y: [2, 2, 4, 3] 880s 0.429 -0.529 880s 0.524 -0.647 880s testing: mul_mat, [3, 2, 1, 1] = [1, 3, 1, 1] * [1, 2, 1, 1] 880s x0: [1, 3, 1, 1] 880s -0.293 880s 0.721 880s 0.113 880s 880s x1: [1, 2, 1, 1] 880s -0.062 880s 0.853 880s 880s y: [3, 2, 1, 1] 880s 0.018 -0.045 -0.007 880s -0.250 0.615 0.096 880s testing: mul_mat, [1, 1, 4, 1] = [3, 1, 4, 1] * [3, 1, 4, 1] 880s x0: [3, 1, 4, 1] 880s 0.208 -0.488 -0.812 880s 880s x1: [3, 1, 4, 1] 880s 0.767 -0.831 -0.206 880s 880s y: [1, 1, 4, 1] 880s 0.733 880s testing: mul_mat, [1, 3, 4, 3] = [1, 1, 4, 3] * [1, 3, 4, 3] 880s x0: [1, 1, 4, 3] 880s 0.642 880s 880s x1: [1, 3, 4, 3] 880s 0.173 880s 0.751 880s -0.791 880s 880s y: [1, 3, 4, 3] 880s 0.111 880s 0.482 880s -0.508 880s test-mul-mat0: iter:328/500 880s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.276 -0.845 -0.035 880s -0.521 0.977 -0.040 880s 880s x1: [3, 2, 1, 1] 880s 0.202 -0.729 0.578 880s 0.909 0.825 -0.457 880s 880s y: [2, 2, 1, 1] 880s 0.652 -0.841 880s -0.429 0.351 880s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 1, 1, 1] 880s 0.556 -0.953 0.523 880s 880s x1: [3, 2, 1, 1] 880s 0.154 0.121 -0.398 880s 0.469 -0.620 -0.387 880s 880s y: [1, 2, 1, 1] 880s -0.238 880s 0.650 880s testing: mul_mat, [2, 1, 1, 4] = [3, 2, 1, 4] * [3, 1, 1, 4] 880s x0: [3, 2, 1, 4] 880s -0.841 0.097 0.987 880s -0.685 -0.938 -0.534 880s 880s x1: [3, 1, 1, 4] 880s 0.510 -0.461 0.528 880s 880s y: [2, 1, 1, 4] 880s 0.047 -0.199 880s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.556 -0.064 0.501 880s -0.126 0.881 -0.537 880s 0.415 0.050 0.582 880s -0.241 0.958 -0.411 880s 880s x1: [3, 2, 1, 1] 880s 0.832 -0.331 0.529 880s -0.282 -0.715 0.776 880s 880s y: [4, 2, 1, 1] 880s -0.177 -0.681 0.637 -0.735 880s 0.591 -1.011 0.299 -0.936 880s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 1, 1, 1] 880s -0.774 -0.105 -0.771 -0.489 880s 880s x1: [4, 3, 1, 1] 880s -0.745 0.819 0.593 0.604 880s -0.897 -0.880 -0.538 -0.074 880s -0.825 0.679 0.557 0.366 880s 880s y: [1, 3, 1, 1] 880s -0.262 880s 1.238 880s -0.041 880s testing: mul_mat, [3, 4, 1, 4] = [1, 3, 1, 4] * [1, 4, 1, 4] 880s x0: [1, 3, 1, 4] 880s 0.085 880s 0.943 880s -0.243 880s 880s x1: [1, 4, 1, 4] 880s -0.329 880s 0.657 880s -0.045 880s 0.545 880s 880s y: [3, 4, 1, 4] 880s -0.028 -0.310 0.080 880s 0.056 0.620 -0.160 880s -0.004 -0.043 0.011 880s 0.046 0.513 -0.132 880s test-mul-mat0: iter:329/500 880s testing: mul_mat, [1, 1, 1, 1] = [2, 1, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 1, 1, 1] 880s 0.831 0.889 880s 880s x1: [2, 1, 1, 1] 880s -0.883 0.408 880s 880s y: [1, 1, 1, 1] 880s -0.371 880s testing: mul_mat, [1, 1, 4, 1] = [2, 1, 4, 1] * [2, 1, 4, 1] 880s x0: [2, 1, 4, 1] 880s -0.216 -0.612 880s 880s x1: [2, 1, 4, 1] 880s -0.350 -0.405 880s 880s y: [1, 1, 4, 1] 880s 0.324 880s testing: mul_mat, [3, 1, 4, 1] = [2, 3, 4, 1] * [2, 1, 4, 1] 880s x0: [2, 3, 4, 1] 880s 0.357 0.235 880s 0.185 0.188 880s 0.124 0.835 880s 880s x1: [2, 1, 4, 1] 880s 0.425 -0.229 880s 880s y: [3, 1, 4, 1] 880s 0.098 0.036 -0.138 880s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 1, 1, 1] 880s 0.214 0.684 880s 880s x1: [2, 3, 1, 1] 880s -0.712 -0.382 880s -0.378 0.152 880s 0.858 -0.551 880s 880s y: [1, 3, 1, 1] 880s -0.414 880s 0.023 880s -0.193 880s testing: mul_mat, [4, 2, 4, 1] = [1, 4, 4, 1] * [1, 2, 4, 1] 880s x0: [1, 4, 4, 1] 880s 0.688 880s -0.306 880s -0.952 880s 0.559 880s 880s x1: [1, 2, 4, 1] 880s 0.826 880s -0.598 880s 880s y: [4, 2, 4, 1] 880s 0.569 -0.253 -0.787 0.462 880s -0.411 0.183 0.569 -0.334 880s testing: mul_mat, [3, 1, 4, 1] = [4, 3, 4, 1] * [4, 1, 4, 1] 880s x0: [4, 3, 4, 1] 880s 0.449 0.614 -0.103 -0.280 880s 0.024 0.996 -0.146 -0.630 880s -0.429 0.788 0.395 0.289 880s 880s x1: [4, 1, 4, 1] 880s 0.047 -0.700 -0.414 0.688 880s 880s y: [3, 1, 4, 1] 880s -0.559 -1.069 -0.536 880s test-mul-mat0: iter:330/500 880s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 2, 1, 1] 880s 0.461 0.522 0.557 -0.394 880s 0.352 -0.225 -0.529 0.552 880s 880s x1: [4, 3, 1, 1] 880s -0.640 0.610 0.120 0.353 880s 0.394 0.611 -0.766 -0.421 880s 0.566 0.698 0.213 -0.454 880s 880s y: [2, 3, 1, 1] 880s -0.049 -0.231 880s 0.240 0.174 880s 0.923 -0.321 880s testing: mul_mat, [3, 2, 3, 1] = [4, 3, 3, 1] * [4, 2, 3, 1] 880s x0: [4, 3, 3, 1] 880s -0.607 -0.910 0.117 0.168 880s -0.439 -0.330 -0.549 -0.939 880s -0.071 -0.423 -0.950 -0.009 880s 880s x1: [4, 2, 3, 1] 880s 0.283 0.499 -0.740 -0.874 880s 0.990 -0.939 -0.006 0.457 880s 880s y: [3, 2, 3, 1] 880s -0.859 0.938 0.480 880s 0.329 -0.550 0.328 880s testing: mul_mat, [1, 3, 3, 2] = [4, 1, 3, 2] * [4, 3, 3, 2] 880s x0: [4, 1, 3, 2] 880s -0.637 -0.510 0.719 0.200 880s 880s x1: [4, 3, 3, 2] 880s 0.114 0.847 -0.681 -0.957 880s -0.576 -0.631 0.034 0.995 880s 0.876 0.252 -0.033 0.200 880s 880s y: [1, 3, 3, 2] 880s -1.186 880s 0.913 880s -0.671 880s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 4, 1, 1] 880s -0.616 0.167 0.469 0.698 880s 0.447 0.839 0.775 -0.601 880s -0.361 0.439 0.465 -0.786 880s -0.126 -0.293 -0.882 -0.368 880s 880s x1: [4, 1, 1, 1] 880s -0.222 0.057 -0.626 0.470 880s 880s y: [4, 1, 1, 1] 880s 0.182 -0.819 -0.555 0.390 880s testing: mul_mat, [1, 4, 3, 1] = [4, 1, 3, 1] * [4, 4, 3, 1] 880s x0: [4, 1, 3, 1] 880s 0.675 -0.747 -0.887 0.281 880s 880s x1: [4, 4, 3, 1] 880s 0.794 -0.588 0.696 -0.620 880s -0.237 -0.729 0.998 0.166 880s -0.947 -0.465 0.944 0.110 880s -0.091 0.414 0.386 0.293 880s 880s y: [1, 4, 3, 1] 880s 0.183 880s -0.454 880s -1.099 880s -0.630 880s testing: mul_mat, [2, 4, 3, 2] = [1, 2, 3, 2] * [1, 4, 3, 2] 880s x0: [1, 2, 3, 2] 880s -0.628 880s 0.443 880s 880s x1: [1, 4, 3, 2] 880s 0.452 880s -0.819 880s -0.367 880s -0.141 880s 880s y: [2, 4, 3, 2] 880s -0.284 0.200 880s 0.515 -0.363 880s 0.231 -0.163 880s 0.089 -0.063 880s test-mul-mat0: iter:331/500 880s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 1, 1, 1] 880s -0.085 -0.923 0.631 -0.642 880s 880s x1: [4, 3, 1, 1] 880s 0.121 0.594 -0.668 -0.633 880s -0.618 0.664 0.121 0.394 880s 0.245 0.549 -0.340 -0.061 880s 880s y: [1, 3, 1, 1] 880s -0.573 880s -0.737 880s -0.703 880s testing: mul_mat, [3, 1, 4, 1] = [4, 3, 4, 1] * [4, 1, 4, 1] 880s x0: [4, 3, 4, 1] 880s -0.055 -0.236 0.913 -0.934 880s -0.842 0.158 0.615 -0.182 880s -0.903 -0.125 0.733 -0.826 880s 880s x1: [4, 1, 4, 1] 880s -0.890 0.743 0.589 -0.300 880s 880s y: [3, 1, 4, 1] 880s 0.691 1.284 1.390 880s testing: mul_mat, [3, 3, 4, 2] = [4, 3, 4, 2] * [4, 3, 4, 2] 880s x0: [4, 3, 4, 2] 880s -0.980 0.137 0.466 0.634 880s 0.667 -0.192 0.639 0.976 880s -0.533 0.231 0.351 -0.909 880s 880s x1: [4, 3, 4, 2] 880s -0.247 0.894 -0.380 0.660 880s 0.181 -0.332 0.185 0.158 880s -0.827 0.497 0.052 -0.743 880s 880s y: [3, 3, 4, 2] 880s 0.606 0.065 -0.395 880s -0.037 0.457 -0.252 880s 0.432 -1.338 1.248 880s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 2, 1, 1] 880s 0.403 0.930 0.165 -0.003 880s 0.207 0.185 0.108 0.487 880s 880s x1: [4, 3, 1, 1] 880s 0.412 -0.662 -0.945 0.049 880s -0.706 -0.925 0.287 0.782 880s -0.321 0.786 -0.027 -0.186 880s 880s y: [2, 3, 1, 1] 880s -0.606 -0.115 880s -1.100 0.094 880s 0.598 -0.015 880s testing: mul_mat, [1, 4, 4, 1] = [2, 1, 4, 1] * [2, 4, 4, 1] 880s x0: [2, 1, 4, 1] 880s -0.466 -0.197 880s 880s x1: [2, 4, 4, 1] 880s 0.315 0.169 880s 0.443 -0.944 880s 0.779 -0.566 880s 0.082 -0.076 880s 880s y: [1, 4, 4, 1] 880s -0.180 880s -0.021 880s -0.251 880s -0.023 880s testing: mul_mat, [4, 2, 4, 2] = [1, 4, 4, 2] * [1, 2, 4, 2] 880s x0: [1, 4, 4, 2] 880s -0.413 880s -0.160 880s 0.448 880s 0.678 880s 880s x1: [1, 2, 4, 2] 880s -0.824 880s 0.298 880s 880s y: [4, 2, 4, 2] 880s 0.340 0.132 -0.369 -0.559 880s -0.123 -0.048 0.134 0.202 880s test-mul-mat0: iter:332/500 880s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.586 0.288 880s 0.471 0.357 880s 880s x1: [2, 2, 1, 1] 880s -0.552 0.146 880s 0.310 0.949 880s 880s y: [2, 2, 1, 1] 880s -0.281 -0.207 880s 0.455 0.484 880s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 4, 1, 1] 880s -0.888 0.188 880s -0.010 0.252 880s 0.188 0.419 880s -0.063 -0.728 880s 880s x1: [2, 2, 1, 1] 880s -0.197 0.913 880s -0.714 0.879 880s 880s y: [4, 2, 1, 1] 880s 0.346 0.232 0.346 -0.652 880s 0.800 0.228 0.235 -0.595 880s testing: mul_mat, [1, 4, 1, 4] = [2, 1, 1, 4] * [2, 4, 1, 4] 880s x0: [2, 1, 1, 4] 880s -0.583 0.475 880s 880s x1: [2, 4, 1, 4] 880s -0.644 0.607 880s -0.235 -0.228 880s -0.137 -0.903 880s -0.364 0.798 880s 880s y: [1, 4, 1, 4] 880s 0.664 880s 0.029 880s -0.349 880s 0.591 880s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 2, 1, 1] 880s -0.661 -0.450 880s 0.771 0.080 880s 880s x1: [2, 1, 1, 1] 880s 0.068 0.858 880s 880s y: [2, 1, 1, 1] 880s -0.432 0.121 880s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 4, 1, 1] 880s 0.671 0.183 880s -0.736 -0.750 880s -0.130 -0.054 880s -0.401 0.040 880s 880s x1: [2, 2, 1, 1] 880s 0.041 0.674 880s 0.734 -0.403 880s 880s y: [4, 2, 1, 1] 880s 0.151 -0.536 -0.042 0.010 880s 0.419 -0.238 -0.073 -0.311 880s testing: mul_mat, [4, 2, 1, 4] = [4, 4, 1, 4] * [4, 2, 1, 4] 880s x0: [4, 4, 1, 4] 880s -0.380 0.732 -0.174 -0.214 880s -0.100 -0.604 -0.575 -0.691 880s 0.162 -0.597 -0.684 -0.307 880s 0.450 -0.905 0.485 -0.774 880s 880s x1: [4, 2, 1, 4] 880s 0.902 0.648 -0.497 -0.682 880s 0.123 -0.171 -0.168 0.234 880s 880s y: [4, 2, 1, 4] 880s 0.365 0.275 0.308 0.106 880s -0.193 0.025 0.164 -0.053 880s test-mul-mat0: iter:333/500 880s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 2, 1, 1] 880s -0.763 0.717 0.522 0.439 880s 0.108 -0.105 0.470 -0.383 880s 880s x1: [4, 4, 1, 1] 880s -0.273 -0.224 -0.033 0.689 880s -0.352 -0.045 0.437 -0.171 880s 0.634 0.411 -0.647 0.890 880s -0.214 -0.151 0.280 -0.594 880s 880s y: [2, 4, 1, 1] 880s 0.333 -0.285 880s 0.389 0.238 880s -0.137 -0.620 880s -0.059 0.352 880s testing: mul_mat, [2, 2, 3, 1] = [4, 2, 3, 1] * [4, 2, 3, 1] 880s x0: [4, 2, 3, 1] 880s 0.638 -0.728 -0.579 -0.467 880s 0.742 0.037 0.484 -0.029 880s 880s x1: [4, 2, 3, 1] 880s -0.049 0.229 0.406 0.141 880s -0.552 0.871 0.868 0.224 880s 880s y: [2, 2, 3, 1] 880s -0.498 0.164 880s -1.593 0.036 880s testing: mul_mat, [4, 2, 3, 4] = [4, 4, 3, 4] * [4, 2, 3, 4] 880s x0: [4, 4, 3, 4] 880s 0.221 -0.913 -0.155 0.117 880s 0.433 -0.022 -0.978 -0.378 880s -0.901 0.878 0.758 0.655 880s 0.249 -0.635 0.408 0.196 880s 880s x1: [4, 2, 3, 4] 880s -0.363 -0.828 -0.715 -0.766 880s -0.713 0.118 0.547 0.925 880s 880s y: [4, 2, 3, 4] 880s 0.697 0.851 -1.444 -0.007 880s -0.242 -1.195 1.767 0.152 880s testing: mul_mat, [1, 4, 1, 1] = [4, 1, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 1, 1, 1] 880s 0.319 0.670 0.175 -0.657 880s 880s x1: [4, 4, 1, 1] 880s 0.213 0.933 -0.194 0.976 880s -0.756 0.537 -0.053 -0.907 880s 0.093 -0.906 -0.507 0.178 880s -0.175 0.172 -0.572 -0.243 880s 880s y: [1, 4, 1, 1] 880s 0.018 880s 0.706 880s -0.784 880s 0.119 880s testing: mul_mat, [4, 4, 3, 1] = [1, 4, 3, 1] * [1, 4, 3, 1] 880s x0: [1, 4, 3, 1] 880s 0.614 880s 0.257 880s -0.339 880s -0.439 880s 880s x1: [1, 4, 3, 1] 880s 0.200 880s 0.231 880s -0.102 880s -0.033 880s 880s y: [4, 4, 3, 1] 880s 0.123 0.051 -0.068 -0.088 880s 0.142 0.059 -0.078 -0.101 880s -0.063 -0.026 0.035 0.045 880s -0.020 -0.009 0.011 0.015 880s testing: mul_mat, [4, 1, 3, 4] = [4, 4, 3, 4] * [4, 1, 3, 4] 880s x0: [4, 4, 3, 4] 880s 0.124 -0.263 -0.032 -0.735 880s 0.015 0.529 -0.945 0.068 880s 0.933 0.205 -0.027 -0.828 880s 0.287 0.115 0.423 0.155 880s 880s x1: [4, 1, 3, 4] 880s -0.762 -0.707 -0.321 0.908 880s 880s y: [4, 1, 3, 4] 880s -0.566 -0.020 -1.599 -0.295 880s test-mul-mat0: iter:334/500 880s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.193 0.395 0.690 880s 0.506 0.923 -0.784 880s 880s x1: [3, 4, 1, 1] 880s -0.353 -0.821 0.623 880s 0.670 0.575 0.283 880s 0.541 -0.781 -0.569 880s 0.252 -0.560 -0.418 880s 880s y: [2, 4, 1, 1] 880s 0.038 -1.425 880s 0.551 0.648 880s -0.596 -0.001 880s -0.461 -0.061 880s testing: mul_mat, [3, 2, 3, 1] = [3, 3, 3, 1] * [3, 2, 3, 1] 880s x0: [3, 3, 3, 1] 880s 0.709 -0.885 0.220 880s -0.851 -0.303 0.318 880s 0.342 -0.909 0.008 880s 880s x1: [3, 2, 3, 1] 880s 0.798 0.874 -0.944 880s -0.928 -0.188 -0.953 880s 880s y: [3, 2, 3, 1] 880s -0.416 -1.244 -0.528 880s -0.702 0.543 -0.155 880s testing: mul_mat, [4, 3, 3, 2] = [3, 4, 3, 2] * [3, 3, 3, 2] 880s x0: [3, 4, 3, 2] 880s 0.653 -0.304 -0.488 880s 0.392 0.114 0.839 880s 0.248 0.273 0.851 880s -0.095 0.143 -0.743 880s 880s x1: [3, 3, 3, 2] 880s 0.742 -0.316 0.801 880s 0.710 -0.607 0.916 880s -0.070 -0.457 -0.387 880s 880s y: [4, 3, 3, 2] 880s 0.189 0.927 0.779 -0.711 880s 0.201 0.978 0.790 -0.835 880s 0.282 -0.404 -0.472 0.229 880s testing: mul_mat, [1, 4, 1, 1] = [3, 1, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 1, 1, 1] 880s 0.194 -0.621 0.925 880s 880s x1: [3, 4, 1, 1] 880s -0.483 0.529 0.680 880s 0.877 -0.524 -0.736 880s -0.773 0.462 -0.599 880s 0.870 -0.948 0.701 880s 880s y: [1, 4, 1, 1] 880s 0.207 880s -0.185 880s -0.991 880s 1.406 880s testing: mul_mat, [1, 3, 3, 1] = [1, 1, 3, 1] * [1, 3, 3, 1] 880s x0: [1, 1, 3, 1] 880s 0.792 880s 880s x1: [1, 3, 3, 1] 880s 0.936 880s 0.962 880s -0.490 880s 880s y: [1, 3, 3, 1] 880s 0.742 880s 0.762 880s -0.388 880s testing: mul_mat, [2, 1, 3, 2] = [1, 2, 3, 2] * [1, 1, 3, 2] 880s x0: [1, 2, 3, 2] 880s -0.212 880s -0.764 880s 880s x1: [1, 1, 3, 2] 880s 0.578 880s 880s y: [2, 1, 3, 2] 880s -0.123 -0.441 880s test-mul-mat0: iter:335/500 880s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 3, 1, 1] 880s -0.948 -0.970 -0.654 880s -0.939 0.613 0.134 880s -0.703 -0.343 -0.677 880s 880s x1: [3, 3, 1, 1] 880s 0.124 0.789 0.554 880s 0.894 -0.695 0.346 880s -0.021 0.920 -0.076 880s 880s y: [3, 3, 1, 1] 880s -1.245 0.442 -0.733 880s -0.400 -1.218 -0.624 880s -0.822 0.574 -0.249 880s testing: mul_mat, [3, 3, 4, 1] = [3, 3, 4, 1] * [3, 3, 4, 1] 880s x0: [3, 3, 4, 1] 880s 0.010 0.967 -0.126 880s -0.143 -0.666 -0.352 880s 0.828 -0.509 0.976 880s 880s x1: [3, 3, 4, 1] 880s -0.597 0.753 0.213 880s -0.903 -0.956 -0.519 880s 0.717 -0.729 0.713 880s 880s y: [3, 3, 4, 1] 880s 0.695 -0.491 -0.670 880s -0.868 0.948 -0.767 880s -0.787 0.132 1.660 880s testing: mul_mat, [3, 3, 4, 4] = [3, 3, 4, 4] * [3, 3, 4, 4] 880s x0: [3, 3, 4, 4] 880s 0.612 0.766 -0.100 880s 0.090 -0.241 -0.111 880s 0.124 -0.757 0.102 880s 880s x1: [3, 3, 4, 4] 880s -0.812 0.243 0.157 880s -0.836 0.935 -0.889 880s -0.185 -0.531 0.447 880s 880s y: [3, 3, 4, 4] 880s -0.326 -0.149 -0.268 880s 0.294 -0.202 -0.902 880s -0.565 0.062 0.425 880s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.043 0.140 0.149 880s 0.341 -0.312 -0.313 880s -0.095 -0.232 -0.285 880s 880s x1: [3, 3, 1, 1] 880s 0.133 1.000 -0.130 880s 0.332 0.474 -0.132 880s 0.072 -0.964 0.101 880s 880s y: [3, 3, 1, 1] 880s 0.126 -0.226 -0.207 880s 0.061 0.007 -0.104 880s -0.117 0.294 0.188 880s testing: mul_mat, [2, 3, 4, 1] = [3, 2, 4, 1] * [3, 3, 4, 1] 880s x0: [3, 2, 4, 1] 880s -0.538 -0.076 -0.442 880s 0.596 0.179 0.912 880s 880s x1: [3, 3, 4, 1] 880s 0.172 0.538 -0.298 880s 0.118 0.196 0.386 880s -0.632 -0.681 0.423 880s 880s y: [2, 3, 4, 1] 880s -0.002 -0.073 880s -0.249 0.458 880s 0.204 -0.112 880s testing: mul_mat, [1, 3, 4, 4] = [2, 1, 4, 4] * [2, 3, 4, 4] 880s x0: [2, 1, 4, 4] 880s 0.279 0.853 880s 880s x1: [2, 3, 4, 4] 880s 0.528 0.377 880s 0.969 0.643 880s 0.472 0.246 880s 880s y: [1, 3, 4, 4] 880s 0.469 880s 0.818 880s 0.341 880s test-mul-mat0: iter:336/500 880s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 2, 1, 1] 880s -0.858 -0.644 0.706 -0.305 880s 0.529 0.602 -0.308 0.889 880s 880s x1: [4, 1, 1, 1] 880s 0.076 -0.327 0.189 0.605 880s 880s y: [2, 1, 1, 1] 880s 0.095 0.323 880s testing: mul_mat, [1, 2, 3, 1] = [4, 1, 3, 1] * [4, 2, 3, 1] 880s x0: [4, 1, 3, 1] 880s -0.907 -0.659 -0.687 0.695 880s 880s x1: [4, 2, 3, 1] 880s -0.342 0.215 -0.736 -0.648 880s 0.189 -0.117 -0.997 0.781 880s 880s y: [1, 2, 3, 1] 880s 0.224 880s 1.134 880s testing: mul_mat, [1, 1, 3, 4] = [4, 1, 3, 4] * [4, 1, 3, 4] 880s x0: [4, 1, 3, 4] 880s 0.416 0.204 -0.808 0.441 880s 880s x1: [4, 1, 3, 4] 880s -0.292 0.218 -0.379 0.004 880s 880s y: [1, 1, 3, 4] 880s 0.231 880s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 4, 1, 1] 880s 0.368 -0.134 -0.845 0.846 880s 0.758 -0.040 0.011 -0.289 880s -0.706 0.618 -0.705 0.816 880s 0.641 -0.022 0.542 -0.650 880s 880s x1: [4, 1, 1, 1] 880s -0.482 -0.094 0.871 0.549 880s 880s y: [4, 1, 1, 1] 880s -0.436 -0.511 0.116 -0.192 880s testing: mul_mat, [1, 4, 3, 1] = [4, 1, 3, 1] * [4, 4, 3, 1] 880s x0: [4, 1, 3, 1] 880s 0.413 -0.649 -0.592 0.118 880s 880s x1: [4, 4, 3, 1] 880s -0.527 0.920 0.216 -0.464 880s -0.766 -0.514 -0.441 -0.646 880s -0.177 0.207 -0.129 0.730 880s 0.078 -0.580 -0.436 -0.554 880s 880s y: [1, 4, 3, 1] 880s -0.998 880s 0.202 880s -0.046 880s 0.602 880s testing: mul_mat, [2, 4, 3, 4] = [1, 2, 3, 4] * [1, 4, 3, 4] 880s x0: [1, 2, 3, 4] 880s -0.080 880s 0.697 880s 880s x1: [1, 4, 3, 4] 880s 0.308 880s 0.217 880s -0.525 880s 0.436 880s 880s y: [2, 4, 3, 4] 880s -0.025 0.214 880s -0.017 0.151 880s 0.042 -0.366 880s -0.035 0.304 880s test-mul-mat0: iter:337/500 880s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 2, 1, 1] 880s -0.398 -0.872 0.316 880s -0.090 0.635 0.046 880s 880s x1: [3, 2, 1, 1] 880s -0.981 -0.181 0.485 880s -0.060 -0.483 -0.025 880s 880s y: [2, 2, 1, 1] 880s 0.701 -0.004 880s 0.437 -0.303 880s testing: mul_mat, [3, 2, 3, 1] = [3, 3, 3, 1] * [3, 2, 3, 1] 880s x0: [3, 3, 3, 1] 880s -0.700 -0.853 -0.926 880s -0.183 0.121 0.319 880s 0.419 0.250 -0.365 880s 880s x1: [3, 2, 3, 1] 880s 0.835 0.212 0.662 880s -0.316 -0.926 0.638 880s 880s y: [3, 2, 3, 1] 880s -1.378 0.084 0.161 880s 0.420 0.149 -0.597 880s testing: mul_mat, [1, 3, 3, 4] = [3, 1, 3, 4] * [3, 3, 3, 4] 880s x0: [3, 1, 3, 4] 880s 0.407 0.164 -0.446 880s 880s x1: [3, 3, 3, 4] 880s -0.586 0.477 0.292 880s 0.548 0.777 0.438 880s 0.622 -0.406 -0.440 880s 880s y: [1, 3, 3, 4] 880s -0.291 880s 0.155 880s 0.383 880s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.199 0.963 -0.961 880s -0.769 -0.359 0.232 880s 0.001 0.199 0.397 880s 0.172 -0.759 0.789 880s 880s x1: [3, 1, 1, 1] 880s 0.724 -0.651 0.484 880s 880s y: [4, 1, 1, 1] 880s -1.236 -0.210 0.063 1.000 880s testing: mul_mat, [1, 3, 3, 1] = [4, 1, 3, 1] * [4, 3, 3, 1] 880s x0: [4, 1, 3, 1] 880s -0.267 0.360 0.595 -0.269 880s 880s x1: [4, 3, 3, 1] 880s 0.860 0.120 0.157 -0.151 880s -0.526 0.885 -0.282 -0.762 880s -0.107 0.998 0.409 0.063 880s 880s y: [1, 3, 3, 1] 880s -0.053 880s 0.496 880s 0.614 880s testing: mul_mat, [1, 4, 3, 4] = [1, 1, 3, 4] * [1, 4, 3, 4] 880s x0: [1, 1, 3, 4] 880s -0.209 880s 880s x1: [1, 4, 3, 4] 880s -0.691 880s 0.929 880s -0.505 880s 0.501 880s 880s y: [1, 4, 3, 4] 880s 0.145 880s -0.195 880s 0.106 880s -0.105 880s test-mul-mat0: iter:338/500 880s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.417 0.694 -0.344 880s -0.408 0.715 0.446 880s -0.297 -0.090 -0.786 880s 0.645 -0.950 0.618 880s 880s x1: [3, 3, 1, 1] 880s 0.001 0.547 -0.494 880s 0.659 -0.349 0.975 880s 0.522 0.941 0.488 880s 880s y: [4, 3, 1, 1] 880s 0.549 0.170 0.339 -0.824 880s -0.853 -0.083 -0.931 1.359 880s 0.268 0.678 -0.623 -0.256 880s testing: mul_mat, [3, 4, 3, 1] = [3, 3, 3, 1] * [3, 4, 3, 1] 880s x0: [3, 3, 3, 1] 880s -0.334 0.891 -0.020 880s 0.179 0.114 -0.450 880s 0.351 -0.011 -0.906 880s 880s x1: [3, 4, 3, 1] 880s 0.982 -0.544 -0.472 880s 0.835 -0.644 0.512 880s 0.511 0.677 0.064 880s -0.488 0.223 0.570 880s 880s y: [3, 4, 3, 1] 880s -0.803 0.326 0.778 880s -0.863 -0.155 -0.163 880s 0.431 0.140 0.114 880s 0.350 -0.318 -0.689 880s testing: mul_mat, [2, 3, 3, 1] = [3, 2, 3, 1] * [3, 3, 3, 1] 880s x0: [3, 2, 3, 1] 880s 0.785 -0.032 -0.400 880s 0.952 -0.333 0.725 880s 880s x1: [3, 3, 3, 1] 880s -0.833 0.698 0.125 880s -0.917 -0.635 0.016 880s 0.063 0.543 -0.870 880s 880s y: [2, 3, 3, 1] 880s -0.726 -0.935 880s -0.706 -0.650 880s 0.380 -0.752 880s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 2, 1, 1] 880s -0.024 0.562 0.982 880s 0.557 -0.001 0.251 880s 880s x1: [3, 2, 1, 1] 880s 0.251 -0.668 0.038 880s 0.940 -0.160 -0.645 880s 880s y: [2, 2, 1, 1] 880s -0.344 0.150 880s -0.746 0.362 880s testing: mul_mat, [1, 3, 3, 1] = [2, 1, 3, 1] * [2, 3, 3, 1] 880s x0: [2, 1, 3, 1] 880s -0.570 -0.869 880s 880s x1: [2, 3, 3, 1] 880s -0.216 -0.049 880s 0.851 -0.264 880s 0.617 0.577 880s 880s y: [1, 3, 3, 1] 880s 0.166 880s -0.256 880s -0.852 880s testing: mul_mat, [2, 2, 3, 1] = [1, 2, 3, 1] * [1, 2, 3, 1] 880s x0: [1, 2, 3, 1] 880s -0.024 880s 0.778 880s 880s x1: [1, 2, 3, 1] 880s -0.753 880s -0.956 880s 880s y: [2, 2, 3, 1] 880s 0.018 -0.586 880s 0.022 -0.744 880s test-mul-mat0: iter:339/500 880s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.018 0.255 0.135 880s 0.229 0.685 0.267 880s 0.415 -0.045 0.753 880s -0.472 0.202 0.797 880s 880s x1: [3, 1, 1, 1] 880s -0.138 0.985 0.156 880s 880s y: [4, 1, 1, 1] 880s 0.275 0.685 0.016 0.389 880s testing: mul_mat, [2, 4, 2, 1] = [3, 2, 2, 1] * [3, 4, 2, 1] 880s x0: [3, 2, 2, 1] 880s 0.798 -0.849 -0.007 880s -0.674 0.353 -0.210 880s 880s x1: [3, 4, 2, 1] 880s 0.203 -0.553 0.823 880s 0.129 -0.523 -0.201 880s -0.093 0.243 0.393 880s -0.739 -0.220 -0.365 880s 880s y: [2, 4, 2, 1] 880s 0.626 -0.504 880s 0.549 -0.229 880s -0.284 0.066 880s -0.400 0.497 880s testing: mul_mat, [3, 2, 2, 4] = [3, 3, 2, 4] * [3, 2, 2, 4] 880s x0: [3, 3, 2, 4] 880s 0.577 -0.945 0.458 880s 0.176 -0.626 -0.537 880s 0.565 -0.777 0.268 880s 880s x1: [3, 2, 2, 4] 880s 0.565 0.521 -0.194 880s 0.826 -0.699 0.441 880s 880s y: [3, 2, 2, 4] 880s -0.255 -0.122 -0.138 880s 1.339 0.346 1.128 880s testing: mul_mat, [1, 3, 1, 1] = [3, 1, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 1, 1, 1] 880s -0.546 -0.085 0.875 880s 880s x1: [3, 3, 1, 1] 880s -0.130 0.513 0.485 880s -0.723 -0.827 -0.301 880s -0.618 -0.311 -0.225 880s 880s y: [1, 3, 1, 1] 880s 0.452 880s 0.202 880s 0.168 880s testing: mul_mat, [1, 3, 2, 1] = [1, 1, 2, 1] * [1, 3, 2, 1] 880s x0: [1, 1, 2, 1] 880s -0.586 880s 880s x1: [1, 3, 2, 1] 880s 0.518 880s 0.116 880s 0.081 880s 880s y: [1, 3, 2, 1] 880s -0.303 880s -0.068 880s -0.047 880s testing: mul_mat, [4, 1, 2, 4] = [1, 4, 2, 4] * [1, 1, 2, 4] 880s x0: [1, 4, 2, 4] 880s 0.454 880s -0.457 880s 0.889 880s 0.730 880s 880s x1: [1, 1, 2, 4] 880s 0.932 880s 880s y: [4, 1, 2, 4] 880s 0.423 -0.426 0.829 0.681 880s test-mul-mat0: iter:340/500 880s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 1, 1, 1] 880s 0.495 0.857 0.759 0.352 880s 880s x1: [4, 3, 1, 1] 880s -0.842 -0.547 0.181 0.521 880s 0.408 0.039 0.799 -0.762 880s -0.585 0.194 -0.851 0.536 880s 880s y: [1, 3, 1, 1] 880s -0.565 880s 0.574 880s -0.580 880s testing: mul_mat, [2, 1, 3, 1] = [4, 2, 3, 1] * [4, 1, 3, 1] 880s x0: [4, 2, 3, 1] 880s -0.133 0.383 0.243 -0.951 880s -0.096 -0.349 0.088 -0.297 880s 880s x1: [4, 1, 3, 1] 880s -0.890 -0.977 -0.302 -0.317 880s 880s y: [2, 1, 3, 1] 880s -0.028 0.494 880s testing: mul_mat, [4, 2, 3, 3] = [4, 4, 3, 3] * [4, 2, 3, 3] 880s x0: [4, 4, 3, 3] 880s -0.184 -0.515 0.021 0.601 880s 0.693 -0.475 -0.287 0.492 880s 0.154 -0.680 -0.587 0.802 880s -0.654 -0.657 0.476 -0.261 880s 880s x1: [4, 2, 3, 3] 880s 0.528 0.488 -0.786 0.977 880s -0.421 0.440 -0.232 0.447 880s 880s y: [4, 2, 3, 3] 880s 0.222 0.839 0.994 -1.294 880s 0.114 -0.214 0.131 -0.241 880s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 2, 1, 1] 880s -0.877 -0.891 -0.310 0.298 880s 0.099 -0.291 0.251 0.297 880s 880s x1: [4, 4, 1, 1] 880s 0.382 -0.394 0.721 0.165 880s -0.783 -0.428 -0.341 0.276 880s -0.718 -0.024 -0.479 -0.806 880s 0.704 0.995 0.932 -0.397 880s 880s y: [2, 4, 1, 1] 880s -0.159 0.382 880s 1.256 0.043 880s 0.559 -0.424 880s -1.911 -0.103 880s testing: mul_mat, [3, 4, 3, 1] = [2, 3, 3, 1] * [2, 4, 3, 1] 880s x0: [2, 3, 3, 1] 880s -0.813 0.438 880s 0.729 0.027 880s 0.377 -0.326 880s 880s x1: [2, 4, 3, 1] 880s -0.096 0.901 880s 0.124 0.099 880s 0.893 -0.806 880s -0.519 -0.501 880s 880s y: [3, 4, 3, 1] 880s 0.473 -0.045 -0.330 880s -0.058 0.093 0.014 880s -1.079 0.629 0.599 880s 0.202 -0.392 -0.032 880s testing: mul_mat, [3, 2, 3, 3] = [3, 3, 3, 3] * [3, 2, 3, 3] 880s x0: [3, 3, 3, 3] 880s -0.874 -0.709 -0.148 880s -0.219 -0.193 -0.292 880s 0.656 0.565 0.500 880s 880s x1: [3, 2, 3, 3] 880s 0.875 -0.973 0.166 880s 0.156 -0.461 -0.870 880s 880s y: [3, 2, 3, 3] 880s -0.100 -0.052 0.108 880s 0.319 0.309 -0.593 880s test-mul-mat0: iter:341/500 880s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 2, 1, 1] 880s -0.744 -0.423 0.667 880s 0.619 -0.958 -0.992 880s 880s x1: [3, 2, 1, 1] 880s -0.590 0.247 -0.868 880s 0.663 -0.983 0.371 880s 880s y: [2, 2, 1, 1] 880s -0.244 0.259 880s 0.170 0.983 880s testing: mul_mat, [1, 2, 3, 1] = [3, 1, 3, 1] * [3, 2, 3, 1] 880s x0: [3, 1, 3, 1] 880s -0.953 -0.053 -0.953 880s 880s x1: [3, 2, 3, 1] 880s 0.671 -0.381 0.551 880s 0.913 -0.114 -0.938 880s 880s y: [1, 2, 3, 1] 880s -1.145 880s 0.030 880s testing: mul_mat, [4, 1, 3, 3] = [3, 4, 3, 3] * [3, 1, 3, 3] 880s x0: [3, 4, 3, 3] 880s -0.295 0.475 -0.733 880s -0.356 -0.589 0.810 880s -0.745 0.967 -0.095 880s 0.778 -0.590 -0.938 880s 880s x1: [3, 1, 3, 3] 880s 0.939 -0.063 0.543 880s 880s y: [4, 1, 3, 3] 880s -0.705 0.144 -0.813 0.258 880s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.366 0.186 0.697 880s 0.252 -0.443 -0.450 880s -0.438 0.999 -0.520 880s 880s x1: [3, 4, 1, 1] 880s -0.077 0.199 0.604 880s -0.290 0.612 0.636 880s 0.799 0.355 -0.661 880s 0.740 0.465 0.941 880s 880s y: [3, 4, 1, 1] 880s 0.430 -0.380 -0.082 880s 0.451 -0.631 0.407 880s -0.103 0.342 0.348 880s 1.013 -0.443 -0.349 880s testing: mul_mat, [2, 3, 3, 1] = [3, 2, 3, 1] * [3, 3, 3, 1] 880s x0: [3, 2, 3, 1] 880s -0.058 0.545 0.191 880s 0.296 -0.130 0.676 880s 880s x1: [3, 3, 3, 1] 880s 0.927 0.895 -0.870 880s -0.318 0.665 0.140 880s -0.613 0.063 0.418 880s 880s y: [2, 3, 3, 1] 880s 0.268 -0.430 880s 0.407 -0.086 880s 0.150 0.093 880s testing: mul_mat, [3, 3, 3, 3] = [2, 3, 3, 3] * [2, 3, 3, 3] 880s x0: [2, 3, 3, 3] 880s 0.699 -0.491 880s 0.224 -0.675 880s -0.938 -0.657 880s 880s x1: [2, 3, 3, 3] 880s 0.088 0.282 880s -0.089 -0.353 880s 0.976 0.849 880s 880s y: [3, 3, 3, 3] 880s -0.077 -0.171 -0.268 880s 0.111 0.218 0.315 880s 0.266 -0.354 -1.473 880s test-mul-mat0: iter:342/500 880s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 1, 1, 1] 880s -0.464 -0.656 0.632 880s 880s x1: [3, 1, 1, 1] 880s 0.860 -0.489 0.803 880s 880s y: [1, 1, 1, 1] 880s 0.429 880s testing: mul_mat, [2, 1, 4, 1] = [3, 2, 4, 1] * [3, 1, 4, 1] 880s x0: [3, 2, 4, 1] 880s 0.009 -0.201 -0.644 880s 0.660 -0.753 0.965 880s 880s x1: [3, 1, 4, 1] 880s 0.070 0.210 0.891 880s 880s y: [2, 1, 4, 1] 880s -0.616 0.748 880s testing: mul_mat, [2, 2, 4, 2] = [3, 2, 4, 2] * [3, 2, 4, 2] 880s x0: [3, 2, 4, 2] 880s -0.954 0.576 0.377 880s 0.269 -0.013 0.993 880s 880s x1: [3, 2, 4, 2] 880s -0.722 -0.435 -0.372 880s -0.265 0.958 -0.075 880s 880s y: [2, 2, 4, 2] 880s 0.298 -0.557 880s 0.777 -0.158 880s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.503 -0.668 0.644 880s -0.028 0.581 -0.798 880s 0.095 -0.352 -0.586 880s -0.529 0.739 -0.306 880s 880s x1: [3, 2, 1, 1] 880s 0.296 -0.846 -0.313 880s 0.172 -0.348 0.816 880s 880s y: [4, 2, 1, 1] 880s 0.215 -0.251 0.510 -0.686 880s 0.672 -0.859 -0.339 -0.598 880s testing: mul_mat, [4, 3, 4, 1] = [4, 4, 4, 1] * [4, 3, 4, 1] 880s x0: [4, 4, 4, 1] 880s -0.823 0.331 -0.258 -0.950 880s 0.547 -0.096 -0.656 -0.220 880s -0.194 0.331 -0.602 0.838 880s 0.963 0.764 -0.007 0.867 880s 880s x1: [4, 3, 4, 1] 880s 0.748 -0.406 -0.889 -0.445 880s 0.416 -0.917 0.131 -0.021 880s 0.076 -0.449 -0.989 -0.709 880s 880s y: [4, 3, 4, 1] 880s -0.098 1.130 -0.118 0.031 880s -0.660 0.234 -0.481 -0.319 880s 0.717 0.889 -0.162 -0.878 880s testing: mul_mat, [4, 4, 4, 2] = [4, 4, 4, 2] * [4, 4, 4, 2] 880s x0: [4, 4, 4, 2] 880s -0.693 0.517 -0.325 -0.188 880s 0.340 -0.410 -0.331 0.586 880s 0.327 -0.326 -0.032 0.205 880s 0.796 -0.996 -0.684 0.402 880s 880s x1: [4, 4, 4, 2] 880s -0.744 -0.791 0.753 0.260 880s -0.966 -0.366 0.006 -0.798 880s 0.912 0.744 -0.579 0.970 880s -0.678 -0.492 0.744 -0.002 880s 880s y: [4, 4, 4, 2] 880s -0.188 -0.026 0.043 -0.216 880s 0.628 -0.648 -0.360 -0.729 880s -0.241 0.765 0.273 0.770 880s -0.026 -0.277 -0.086 -0.560 880s test-mul-mat0: iter:343/500 880s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 1, 1, 1] 880s -0.543 -0.600 880s 880s x1: [2, 3, 1, 1] 880s -0.761 0.729 880s -0.473 -0.786 880s 0.254 -0.805 880s 880s y: [1, 3, 1, 1] 880s -0.025 880s 0.728 880s 0.345 880s testing: mul_mat, [4, 1, 2, 1] = [2, 4, 2, 1] * [2, 1, 2, 1] 880s x0: [2, 4, 2, 1] 880s 0ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s .142 0.324 880s 0.104 0.895 880s 0.220 0.765 880s 0.369 0.346 880s 880s x1: [2, 1, 2, 1] 880s 0.523 -0.080 880s 880s y: [4, 1, 2, 1] 880s 0.048 -0.017 0.054 0.165 880s testing: mul_mat, [3, 4, 2, 4] = [2, 3, 2, 4] * [2, 4, 2, 4] 880s x0: [2, 3, 2, 4] 880s -0.904 -0.144 880s -0.876 -0.741 880s -0.589 0.976 880s 880s x1: [2, 4, 2, 4] 880s -0.069 0.872 880s -0.409 0.459 880s -0.914 0.845 880s 0.654 -0.760 880s 880s y: [3, 4, 2, 4] 880s -0.064 -0.586 0.891 880s 0.303 0.018 0.688 880s 0.704 0.174 1.363 880s -0.482 -0.010 -1.127 880s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 1, 1, 1] 880s -0.913 0.780 880s 880s x1: [2, 3, 1, 1] 880s -0.251 0.929 880s 0.947 -0.222 880s -0.871 0.594 880s 880s y: [1, 3, 1, 1] 880s 0.954 880s -1.038 880s 1.260 880s testing: mul_mat, [2, 2, 2, 1] = [1, 2, 2, 1] * [1, 2, 2, 1] 880s x0: [1, 2, 2, 1] 880s 0.424 880s 0.578 880s 880s x1: [1, 2, 2, 1] 880s 0.054 880s -0.084 880s 880s y: [2, 2, 2, 1] 880s 0.023 0.031 880s -0.036 -0.049 880s testing: mul_mat, [3, 1, 2, 4] = [2, 3, 2, 4] * [2, 1, 2, 4] 880s x0: [2, 3, 2, 4] 880s -0.181 -0.095 880s 0.858 0.638 880s -0.937 0.117 880s 880s x1: [2, 1, 2, 4] 880s 0.679 0.352 880s 880s y: [3, 1, 2, 4] 880s -0.156 0.807 -0.595 880s test-mul-mat0: iter:344/500 880s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.947 -0.291 0.521 880s -0.935 -0.369 -0.742 880s 880s x1: [3, 2, 1, 1] 880s 0.629 -0.962 -0.156 880s -0.524 -0.353 -0.355 880s 880s y: [2, 2, 1, 1] 880s 0.794 -0.117 880s -0.579 0.884 880s testing: mul_mat, [4, 2, 4, 1] = [3, 4, 4, 1] * [3, 2, 4, 1] 880s x0: [3, 4, 4, 1] 880s -0.466 0.141 -0.769 880s -0.400 0.772 -0.511 880s 0.688 0.964 -0.399 880s 0.621 -0.092 0.571 880s 880s x1: [3, 2, 4, 1] 880s 0.089 -0.808 -0.888 880s 0.933 -0.056 -0.030 880s 880s y: [4, 2, 4, 1] 880s 0.528 -0.206 -0.364 -0.378 880s -0.419 -0.401 0.600 0.567 880s testing: mul_mat, [3, 4, 4, 4] = [3, 3, 4, 4] * [3, 4, 4, 4] 880s x0: [3, 3, 4, 4] 880s -0.151 -0.698 -0.949 880s 0.283 0.145 -0.889 880s -0.960 0.722 -0.191 880s 880s x1: [3, 4, 4, 4] 880s 0.314 0.255 0.699 880s -0.012 0.643 -0.614 880s 0.606 0.418 0.013 880s -0.989 0.056 -0.772 880s 880s y: [3, 4, 4, 4] 880s -0.888 -0.495 -0.251 880s 0.136 0.636 0.593 880s -0.396 0.220 -0.283 880s 0.843 0.414 1.138 880s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 3, 1, 1] 880s -0.584 -0.893 -0.763 880s -0.292 0.303 0.405 880s -0.964 0.085 0.115 880s 880s x1: [3, 3, 1, 1] 880s -0.945 0.803 0.021 880s 0.846 -0.652 -0.160 880s -0.317 -0.431 -0.489 880s 880s y: [3, 3, 1, 1] 880s -0.181 0.528 0.982 880s 0.210 -0.510 -0.889 880s 0.943 -0.237 0.212 880s testing: mul_mat, [4, 3, 4, 1] = [3, 4, 4, 1] * [3, 3, 4, 1] 880s x0: [3, 4, 4, 1] 880s -0.376 0.303 0.695 880s 0.140 -0.391 -0.524 880s -0.456 -0.390 0.582 880s -0.410 0.269 0.622 880s 880s x1: [3, 3, 4, 1] 880s -0.412 -0.408 -0.798 880s 0.861 0.731 -0.202 880s -0.466 -0.037 0.769 880s 880s y: [4, 3, 4, 1] 880s -0.524 0.519 -0.118 -0.437 880s -0.242 -0.059 -0.795 -0.282 880s 0.699 -0.453 0.675 0.659 880s testing: mul_mat, [4, 3, 4, 4] = [4, 4, 4, 4] * [4, 3, 4, 4] 880s x0: [4, 4, 4, 4] 880s 0.856 -0.694 -0.671 -0.250 880s -0.697 0.344 0.021 0.216 880s -0.379 0.899 0.795 0.458 880s -0.817 0.912 0.789 -0.473 880s 880s x1: [4, 3, 4, 4] 880s 0.064 0.166 -0.180 -0.283 880s 0.968 0.630 -0.777 -0.453 880s -0.330 -0.824 0.102 0.801 880s 880s y: [4, 3, 4, 4] 880s 0.132 -0.053 -0.148 0.091 880s 1.027 -0.572 -0.626 -0.615 880s 0.021 0.121 -0.168 -0.781 880s test-mul-mat0: iter:345/500 880s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 2, 1, 1] 880s -0.316 0.435 0.550 880s -0.547 0.817 0.259 880s 880s x1: [3, 3, 1, 1] 880s 0.376 0.053 0.155 880s 0.529 0.112 -0.319 880s -0.980 -0.862 0.804 880s 880s y: [2, 3, 1, 1] 880s -0.011 -0.123 880s -0.294 -0.280 880s 0.377 0.040 880s testing: mul_mat, [2, 2, 2, 1] = [3, 2, 2, 1] * [3, 2, 2, 1] 880s x0: [3, 2, 2, 1] 880s 0.169 -0.413 0.014 880s -0.778 0.742 -0.456 880s 880s x1: [3, 2, 2, 1] 880s 0.934 0.810 0.717 880s 0.251 0.892 -0.994 880s 880s y: [2, 2, 2, 1] 880s -0.166 -0.453 880s -0.340 0.920 880s testing: mul_mat, [3, 2, 2, 1] = [3, 3, 2, 1] * [3, 2, 2, 1] 880s x0: [3, 3, 2, 1] 880s 0.353 -0.163 0.061 880s 0.164 0.350 -0.301 880s -0.667 0.937 0.713 880s 880s x1: [3, 2, 2, 1] 880s -0.844 0.661 -0.623 880s -0.391 0.478 0.636 880s 880s y: [3, 2, 2, 1] 880s -0.444 0.281 0.738 880s -0.178 -0.088 1.162 880s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.934 0.287 -0.549 880s -0.628 0.209 -0.441 880s 0.098 -0.842 -0.143 880s 880s x1: [3, 3, 1, 1] 880s 0.518 0.988 -0.975 880s -0.872 0.466 0.661 880s -0.329 -0.246 0.013 880s 880s y: [3, 3, 1, 1] 880s 1.303 0.311 -0.641 880s -1.043 0.354 -0.572 880s -0.385 0.149 0.173 880s testing: mul_mat, [3, 3, 2, 1] = [3, 3, 2, 1] * [3, 3, 2, 1] 880s x0: [3, 3, 2, 1] 880s -0.189 -0.256 -0.969 880s 0.992 -0.636 0.573 880s 0.374 -0.528 -0.370 880s 880s x1: [3, 3, 2, 1] 880s -0.216 -0.504 -0.430 880s 0.339 -0.825 -0.174 880s 0.229 -0.725 0.646 880s 880s y: [3, 3, 2, 1] 880s 0.587 -0.141 0.345 880s 0.315 0.762 0.627 880s -0.484 1.058 0.229 880s testing: mul_mat, [4, 3, 2, 1] = [3, 4, 2, 1] * [3, 3, 2, 1] 880s x0: [3, 4, 2, 1] 880s 0.914 -0.819 0.700 880s -0.952 0.644 -0.732 880s -0.843 0.663 0.612 880s -0.603 -0.191 -0.417 880s 880s x1: [3, 3, 2, 1] 880s 0.835 0.079 -0.472 880s -0.574 0.681 0.822 880s -0.464 -0.728 0.140 880s 880s y: [4, 3, 2, 1] 880s 0.369 -0.399 -0.940 -0.322 880s -0.508 0.384 1.439 -0.126 880s 0.270 -0.129 -0.005 0.361 880s test-mul-mat0: iter:346/500 880s testing: mul_mat, [3, 2, 1, 1] = [1, 3, 1, 1] * [1, 2, 1, 1] 880s x0: [1, 3, 1, 1] 880s 0.910 880s -0.174 880s -0.742 880s 880s x1: [1, 2, 1, 1] 880s 0.996 880s -0.222 880s 880s y: [3, 2, 1, 1] 880s 0.906 -0.173 -0.739 880s -0.202 0.039 0.165 880s testing: mul_mat, [1, 3, 1, 1] = [1, 1, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 1, 1, 1] 880s 0.816 880s 880s x1: [1, 3, 1, 1] 880s -0.693 880s 0.007 880s 0.902 880s 880s y: [1, 3, 1, 1] 880s -0.566 880s 0.006 880s 0.736 880s testing: mul_mat, [2, 1, 1, 4] = [1, 2, 1, 4] * [1, 1, 1, 4] 880s x0: [1, 2, 1, 4] 880s -0.880 880s 0.296 880s 880s x1: [1, 1, 1, 4] 880s 0.602 880s 880s y: [2, 1, 1, 4] 880s -0.530 0.179 880s testing: mul_mat, [4, 2, 1, 1] = [1, 4, 1, 1] * [1, 2, 1, 1] 880s x0: [1, 4, 1, 1] 880s 0.289 880s -0.778 880s -0.202 880s 0.863 880s 880s x1: [1, 2, 1, 1] 880s 0.771 880s -0.890 880s 880s y: [4, 2, 1, 1] 880s 0.223 -0.600 -0.156 0.666 880s -0.258 0.692 0.180 -0.768 880s testing: mul_mat, [3, 1, 1, 1] = [4, 3, 1, 1] * [4, 1, 1, 1] 880s x0: [4, 3, 1, 1] 880s -0.778 -0.876 0.726 0.911 880s -0.565 -0.596 -0.358 -0.018 880s 0.409 0.225 -0.346 -0.227 880s 880s x1: [4, 1, 1, 1] 880s -0.782 0.576 0.964 -0.872 880s 880s y: [3, 1, 1, 1] 880s 0.009 -0.231 -0.327 880s testing: mul_mat, [4, 4, 1, 4] = [3, 4, 1, 4] * [3, 4, 1, 4] 880s x0: [3, 4, 1, 4] 880s 0.929 -0.119 -0.980 880s 0.862 0.459 -0.705 880s 0.539 0.995 -0.718 880s -0.650 0.626 0.564 880s 880s x1: [3, 4, 1, 4] 880s 0.207 0.852 0.266 880s 0.437 0.786 -0.597 880s 0.740 0.557 -0.487 880s -0.765 -0.154 -0.265 880s 880s y: [4, 4, 1, 4] 880s -0.169 0.382 0.769 0.548 880s 0.898 1.159 1.447 -0.130 880s 1.099 1.237 1.303 -0.408 880s -0.433 -0.543 -0.375 0.251 880s test-mul-mat0: iter:347/500 880s testing: mul_mat, [1, 4, 1, 1] = [3, 1, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 1, 1, 1] 880s 0.462 -0.776 0.609 880s 880s x1: [3, 4, 1, 1] 880s -0.218 -0.785 0.685 880s -0.528 -0.714 -0.557 880s 0.105 0.275 -0.832 880s 0.761 -0.807 0.755 880s 880s y: [1, 4, 1, 1] 880s 0.926 880s -0.028 880s -0.671 880s 1.437 880s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.305 -0.479 0.210 880s 0.216 -0.280 0.843 880s 0.183 0.060 -0.077 880s 0.636 -0.719 0.705 880s 880s x1: [3, 1, 1, 1] 880s -0.274 -0.681 0.283 880s 880s y: [4, 1, 1, 1] 880s 0.470 0.370 -0.113 0.515 880s testing: mul_mat, [4, 4, 1, 3] = [3, 4, 1, 3] * [3, 4, 1, 3] 880s x0: [3, 4, 1, 3] 880s 0.006 0.424 -0.739 880s -0.389 0.757 0.260 880s 0.678 -0.687 0.868 880s 0.471 0.188 0.758 880s 880s x1: [3, 4, 1, 3] 880s 0.851 0.966 -0.823 880s -0.863 -0.591 0.283 880s 0.412 -0.423 0.043 880s 0.605 -0.668 -0.001 880s 880s y: [4, 4, 1, 3] 880s 1.023 0.186 -0.800 -0.041 880s -0.465 -0.038 0.066 -0.303 880s -0.209 -0.469 0.608 0.147 880s -0.279 -0.741 0.868 0.159 880s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.282 0.299 0.074 880s 0.748 -0.132 0.445 880s 0.037 0.003 0.406 880s 880s x1: [3, 4, 1, 1] 880s -0.551 0.466 -0.255 880s -0.684 -0.063 0.933 880s -0.926 -0.516 -0.309 880s -0.756 0.318 -0.765 880s 880s y: [3, 4, 1, 1] 880s -0.035 -0.587 -0.123 880s -0.143 -0.088 0.354 880s -0.438 -0.762 -0.161 880s -0.175 -0.948 -0.338 880s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 3, 1, 1] 880s -0.953 -0.016 0.468 880s -0.489 -0.556 0.135 880s 0.665 0.739 0.983 880s 880s x1: [3, 3, 1, 1] 880s -0.217 0.616 -0.342 880s 0.711 -0.207 0.333 880s -0.100 -0.419 0.766 880s 880s y: [3, 3, 1, 1] 880s 0.036 -0.283 -0.025 880s -0.518 -0.187 0.647 880s 0.461 0.385 0.376 880s testing: mul_mat, [3, 3, 1, 3] = [3, 3, 1, 3] * [3, 3, 1, 3] 880s x0: [3, 3, 1, 3] 880s -0.081 0.706 0.775 880s -0.180 -0.810 0.308 880s -0.430 -0.903 -0.338 880s 880s x1: [3, 3, 1, 3] 880s -0.213 0.370 0.527 880s -0.931 0.118 -0.436 880s 0.368 0.986 0.567 880s 880s y: [3, 3, 1, 3] 880s 0.687 -0.099 -0.421 880s -0.179 -0.063 0.441 880s 1.106 -0.690 -1.240 880s test-mul-mat0: iter:348/500 880s testing: mul_mat, [3, 4, 1, 1] = [2, 3, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 3, 1, 1] 880s 0.493 0.068 880s 0.763 0.078 880s 0.793 -0.035 880s 880s x1: [2, 4, 1, 1] 880s -0.133 -0.725 880s -0.225 -0.427 880s -0.535 -0.127 880s -0.651 0.773 880s 880s y: [3, 4, 1, 1] 880s -0.115 -0.158 -0.080 880s -0.140 -0.205 -0.163 880s -0.272 -0.418 -0.420 880s -0.268 -0.436 -0.543 880s testing: mul_mat, [1, 3, 3, 1] = [2, 1, 3, 1] * [2, 3, 3, 1] 880s x0: [2, 1, 3, 1] 880s 0.091 -0.915 880s 880s x1: [2, 3, 3, 1] 880s 0.890 0.565 880s 0.798 0.637 880s 0.176 0.596 880s 880s y: [1, 3, 3, 1] 880s -0.435 880s -0.510 880s -0.529 880s testing: mul_mat, [4, 1, 3, 4] = [2, 4, 3, 4] * [2, 1, 3, 4] 880s x0: [2, 4, 3, 4] 880s 0.629 -0.815 880s -0.782 0.069 880s -0.956 0.214 880s -0.998 -0.029 880s 880s x1: [2, 1, 3, 4] 880s -0.067 -0.073 880s 880s y: [4, 1, 3, 4] 880s 0.018 0.047 0.048 0.069 880s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 1, 1, 1] 880s -0.691 0.718 880s 880s x1: [2, 4, 1, 1] 880s 0.766 0.620 880s 0.262 0.567 880s -0.191 -0.597 880s 0.109 -0.817 880s 880s y: [1, 4, 1, 1] 880s -0.085 880s 0.226 880s -0.296 880s -0.662 880s testing: mul_mat, [2, 2, 3, 1] = [1, 2, 3, 1] * [1, 2, 3, 1] 880s x0: [1, 2, 3, 1] 880s -0.831 880s -0.321 880s 880s x1: [1, 2, 3, 1] 880s 0.957 880s -0.382 880s 880s y: [2, 2, 3, 1] 880s -0.795 -0.307 880s 0.317 0.123 880s testing: mul_mat, [3, 1, 3, 4] = [2, 3, 3, 4] * [2, 1, 3, 4] 880s x0: [2, 3, 3, 4] 880s 0.172 -0.133 880s -0.667 -0.327 880s -0.055 0.063 880s 880s x1: [2, 1, 3, 4] 880s -0.684 0.689 880s 880s y: [3, 1, 3, 4] 880s -0.209 0.231 0.081 880s test-mul-mat0: iter:349/500 880s testing: mul_mat, [2, 2, 1, 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 2, 1, 1] 880s 0.677 -0.047 0.334 0.573 880s 0.012 0.147 0.005 -0.801 880s 880s x1: [4, 2, 1, 1] 880s -0.771 -0.865 -0.907 0.555 880s 0.051 -0.315 0.662 0.133 880s 880s y: [2, 2, 1, 1] 880s -0.467 -0.585 880s 0.347 -0.150 880s testing: mul_mat, [3, 2, 3, 1] = [4, 3, 3, 1] * [4, 2, 3, 1] 880s x0: [4, 3, 3, 1] 880s -0.809 0.303 0.665 0.338 880s -0.693 0.865 -0.202 -0.387 880s 0.326 0.936 -0.238 0.186 880s 880s x1: [4, 2, 3, 1] 880s 0.460 -0.694 0.461 0.138 880s -0.851 0.860 0.367 -0.104 880s 880s y: [3, 2, 3, 1] 880s -0.229 -1.065 -0.584 880s 1.158 1.300 0.422 880s testing: mul_mat, [4, 3, 3, 2] = [4, 4, 3, 2] * [4, 3, 3, 2] 880s x0: [4, 4, 3, 2] 880s 0.534 0.707 -0.895 0.918 880s 0.894 -0.938 0.717 -0.188 880s -0.665 0.446 0.872 -0.002 880s -0.526 -0.789 -0.224 -0.637 880s 880s x1: [4, 3, 3, 2] 880s -0.744 0.697 0.362 0.581 880s 0.633 -0.876 -0.232 -0.064 880s 0.782 -0.714 0.847 0.732 880s 880s y: [4, 3, 3, 2] 880s 0.304 -1.168 1.121 -0.610 880s -0.131 1.233 -1.014 0.451 880s -0.173 1.839 -0.101 -0.504 880s testing: mul_mat, [3, 4, 1, 1] = [4, 3, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 3, 1, 1] 880s -0.841 -0.018 -0.642 0.178 880s -0.976 0.920 -0.217 -0.765 880s 0.374 -0.355 -0.409 -0.701 880s 880s x1: [4, 4, 1, 1] 880s -0.524 0.395 0.870 0.968 880s 0.233 -0.903 0.628 -0.753 880s 0.445 0.931 -0.353 -0.310 880s 0.240 -0.295 0.893 0.567 880s 880s y: [3, 4, 1, 1] 880s 0.047 -0.054 -1.371 880s -0.717 -0.618 0.679 880s -0.219 0.736 0.198 880s -0.670 -1.133 -0.568 880s testing: mul_mat, [2, 4, 3, 1] = [3, 2, 3, 1] * [3, 4, 3, 1] 880s x0: [3, 2, 3, 1] 880s -0.187 -0.711 0.322 880s 0.909 0.728 -0.410 880s 880s x1: [3, 4, 3, 1] 880s -0.961 -0.999 -0.225 880s 0.435 0.871 -0.257 880s -0.332 0.967 -0.630 880s -0.085 0.412 -0.699 880s 880s y: [2, 4, 3, 1] 880s 0.817 -1.509 880s -0.783 1.135 880s -0.829 0.660 880s -0.502 0.509 880s testing: mul_mat, [2, 3, 3, 2] = [2, 2, 3, 2] * [2, 3, 3, 2] 880s x0: [2, 2, 3, 2] 880s -0.055 0.053 880s -0.636 -0.955 880s 880s x1: [2, 3, 3, 2] 880s 0.422 -0.797 880s 0.259 0.524 880s -0.125 0.329 880s 880s y: [2, 3, 3, 2] 880s -0.065 0.492 880s 0.013 -0.665 880s 0.024 -0.235 880s test-mul-mat0: iter:350/500 880s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 4, 1, 1] 880s -0.760 880s 0.294 880s 0.371 880s 0.293 880s 880s x1: [1, 1, 1, 1] 880s -0.651 880s 880s y: [4, 1, 1, 1] 880s 0.495 -0.191 -0.242 -0.191 880s testing: mul_mat, [1, 4, 3, 1] = [1, 1, 3, 1] * [1, 4, 3, 1] 880s x0: [1, 1, 3, 1] 880s 0.086 880s 880s x1: [1, 4, 3, 1] 880s 0.339 880s 0.467 880s 0.689 880s -0.599 880s 880s y: [1, 4, 3, 1] 880s 0.029 880s 0.040 880s 0.059 880s -0.051 880s testing: mul_mat, [2, 1, 3, 4] = [1, 2, 3, 4] * [1, 1, 3, 4] 880s x0: [1, 2, 3, 4] 880s -0.025 880s -0.516 880s 880s x1: [1, 1, 3, 4] 880s -0.159 880s 880s y: [2, 1, 3, 4] 880s 0.004 0.082 880s testing: mul_mat, [4, 2, 1, 1] = [1, 4, 1, 1] * [1, 2, 1, 1] 880s x0: [1, 4, 1, 1] 880s -0.995 880s -0.133 880s -0.986 880s 0.646 880s 880s x1: [1, 2, 1, 1] 880s 0.657 880s 0.263 880s 880s y: [4, 2, 1, 1] 880s -0.654 -0.088 -0.648 0.425 880s -0.262 -0.035 -0.259 0.170 880s testing: mul_mat, [3, 1, 3, 1] = [4, 3, 3, 1] * [4, 1, 3, 1] 880s x0: [4, 3, 3, 1] 880s -0.908 -0.650 -0.351 0.316 880s 0.868 0.282 -0.208 -0.336 880s 0.608 -0.4ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s 04 -0.745 0.912 880s 880s x1: [4, 1, 3, 1] 880s 0.842 -0.502 -0.374 -0.716 880s 880s y: [3, 1, 3, 1] 880s -0.534 0.908 0.341 880s testing: mul_mat, [1, 4, 3, 4] = [3, 1, 3, 4] * [3, 4, 3, 4] 880s x0: [3, 1, 3, 4] 880s -0.018 -0.514 0.701 880s 880s x1: [3, 4, 3, 4] 880s -0.530 0.188 0.645 880s -0.275 -0.496 -0.691 880s -0.363 0.082 0.796 880s -0.446 -0.472 0.296 880s 880s y: [1, 4, 3, 4] 880s 0.366 880s -0.225 880s 0.523 880s 0.459 880s test-mul-mat0: iter:351/500 880s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.618 0.077 0.632 880s -0.821 0.710 -0.274 880s 0.226 0.486 0.748 880s 0.255 0.749 -0.529 880s 880s x1: [3, 4, 1, 1] 880s 0.857 0.249 0.994 880s 0.891 -0.300 -0.147 880s 0.230 -0.925 -0.723 880s -0.474 -0.222 -0.169 880s 880s y: [4, 4, 1, 1] 880s 0.117 -0.799 1.059 -0.121 880s -0.667 -0.905 -0.054 0.080 880s -0.671 -0.647 -0.939 -0.251 880s 0.169 0.277 -0.342 -0.198 880s testing: mul_mat, [4, 4, 4, 1] = [3, 4, 4, 1] * [3, 4, 4, 1] 880s x0: [3, 4, 4, 1] 880s 0.176 0.220 -0.535 880s 0.211 0.646 0.052 880s 0.973 0.298 0.126 880s -0.517 -0.796 -0.052 880s 880s x1: [3, 4, 4, 1] 880s -0.856 0.809 0.465 880s 0.526 0.063 0.844 880s 0.384 -0.687 0.839 880s 0.275 0.013 -0.308 880s 880s y: [4, 4, 4, 1] 880s -0.221 0.366 -0.534 -0.225 880s -0.345 0.196 0.637 -0.366 880s -0.533 -0.319 0.274 0.305 880s 0.216 0.050 0.232 -0.136 880s testing: mul_mat, [2, 4, 4, 2] = [3, 2, 4, 2] * [3, 4, 4, 2] 880s x0: [3, 2, 4, 2] 880s -0.023 0.129 -0.381 880s -0.689 0.132 -0.924 880s 880s x1: [3, 4, 4, 2] 880s 0.968 0.951 0.633 880s -0.749 -0.723 -0.338 880s -0.445 0.422 0.431 880s 0.967 -0.645 -0.029 880s 880s y: [2, 4, 4, 2] 880s -0.141 -1.127 880s 0.053 0.733 880s -0.100 -0.036 880s -0.094 -0.725 880s testing: mul_mat, [3, 2, 1, 1] = [3, 3, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.086 0.587 0.864 880s 0.977 -0.596 0.808 880s 0.901 -0.836 0.556 880s 880s x1: [3, 2, 1, 1] 880s 0.306 -0.287 0.035 880s 0.741 -0.837 -0.659 880s 880s y: [3, 2, 1, 1] 880s -0.112 0.498 0.535 880s -0.997 0.690 1.001 880s testing: mul_mat, [1, 3, 4, 1] = [3, 1, 4, 1] * [3, 3, 4, 1] 880s x0: [3, 1, 4, 1] 880s -0.767 -0.510 0.034 880s 880s x1: [3, 3, 4, 1] 880s -0.240 0.523 -0.920 880s 0.732 -0.615 -0.206 880s 0.202 0.609 0.502 880s 880s y: [1, 3, 4, 1] 880s -0.114 880s -0.254 880s -0.449 880s testing: mul_mat, [1, 3, 4, 2] = [1, 1, 4, 2] * [1, 3, 4, 2] 880s x0: [1, 1, 4, 2] 880s 0.619 880s 880s x1: [1, 3, 4, 2] 880s -0.468 880s -0.887 880s -0.970 880s 880s y: [1, 3, 4, 2] 880s -0.290 880s -0.549 880s -0.600 880s test-mul-mat0: iter:352/500 880s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.374 0.250 0.582 880s 0.102 0.189 -0.446 880s -0.279 0.400 -0.670 880s 880s x1: [3, 4, 1, 1] 880s -0.427 0.513 0.799 880s -0.815 0.930 -0.273 880s -0.544 0.647 -0.848 880s 0.465 -0.606 -0.544 880s 880s y: [3, 4, 1, 1] 880s 0.433 -0.303 -0.211 880s -0.231 0.215 0.782 880s -0.535 0.446 0.979 880s -0.294 0.175 -0.007 880s testing: mul_mat, [2, 3, 4, 1] = [3, 2, 4, 1] * [3, 3, 4, 1] 880s x0: [3, 2, 4, 1] 880s 0.562 -0.114 -0.612 880s 0.371 -0.811 -0.566 880s 880s x1: [3, 3, 4, 1] 880s -0.319 0.721 0.352 880s 0.174 -0.845 -0.596 880s 0.236 0.790 -0.753 880s 880s y: [2, 3, 4, 1] 880s -0.477 -0.902 880s 0.559 1.087 880s 0.503 -0.127 880s testing: mul_mat, [4, 2, 4, 3] = [3, 4, 4, 3] * [3, 2, 4, 3] 880s x0: [3, 4, 4, 3] 880s -0.950 -0.419 -0.268 880s 0.951 -0.569 0.112 880s 0.872 0.918 0.259 880s 0.391 -0.445 -0.176 880s 880s x1: [3, 2, 4, 3] 880s 0.596 0.599 -0.876 880s 0.333 0.925 0.219 880s 880s y: [4, 2, 4, 3] 880s -0.583 0.127 0.842 0.120 880s -0.763 -0.185 1.197 -0.320 880s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 4, 1, 1] 880s 0.966 0.556 -0.500 880s -0.241 -0.225 0.656 880s -0.006 -0.652 -0.570 880s 0.417 0.549 -0.630 880s 880s x1: [3, 4, 1, 1] 880s -0.694 0.435 -0.454 880s 0.024 0.936 0.577 880s -0.234 0.689 0.924 880s -0.213 0.218 -0.199 880s 880s y: [4, 4, 1, 1] 880s -0.202 -0.228 -0.021 0.235 880s 0.254 0.162 -0.939 0.160 880s -0.305 0.507 -0.975 -0.301 880s 0.015 -0.128 -0.027 0.156 880s testing: mul_mat, [3, 3, 4, 1] = [4, 3, 4, 1] * [4, 3, 4, 1] 880s x0: [4, 3, 4, 1] 880s -0.414 0.252 -0.307 -0.366 880s -0.790 -0.170 0.798 -0.577 880s 0.253 -0.760 -0.223 0.743 880s 880s x1: [4, 3, 4, 1] 880s 0.319 0.102 -0.178 0.271 880s -0.041 0.825 0.577 -0.606 880s -0.629 -0.399 -0.670 0.948 880s 880s y: [3, 3, 4, 1] 880s -0.151 -0.568 0.244 880s 0.270 0.703 -1.216 880s 0.019 -0.517 0.998 880s testing: mul_mat, [2, 4, 4, 3] = [3, 2, 4, 3] * [3, 4, 4, 3] 880s x0: [3, 2, 4, 3] 880s 0.068 -0.610 0.428 880s 0.433 -0.280 -0.832 880s 880s x1: [3, 4, 4, 3] 880s 0.278 -0.335 -0.635 880s 0.538 0.032 -0.870 880s 0.549 -0.324 -0.347 880s 0.330 0.470 -ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s 0.910 880s 880s y: [2, 4, 4, 3] 880s -0.048 0.742 880s -0.355 0.948 880s 0.086 0.617 880s -0.653 0.769 880s test-mul-mat0: iter:353/500 880s testing: mul_mat, [1, 4, 1, 1] = [4, 1, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 1, 1, 1] 880s 0.718 0.003 -0.473 0.574 880s 880s x1: [4, 4, 1, 1] 880s -0.615 0.894 0.189 -0.069 880s 0.463 -0.003 -0.721 0.854 880s 0.131 -0.298 0.510 -0.609 880s -0.133 -0.214 -0.769 0.823 880s 880s y: [1, 4, 1, 1] 880s -0.568 880s 1.164 880s -0.498 880s 0.740 880s testing: mul_mat, [4, 1, 2, 1] = [4, 4, 2, 1] * [4, 1, 2, 1] 880s x0: [4, 4, 2, 1] 880s 0.478 -0.992 0.395 -0.333 880s -0.061 -0.141 0.664 0.218 880s -0.287 -0.206 0.920 -0.777 880s 0.185 -0.213 0.009 0.416 880s 880s x1: [4, 1, 2, 1] 880s 0.834 0.550 -0.693 -0.031 880s 880s y: [4, 1, 2, 1] 880s -0.410 -0.595 -0.966 0.018 880s testing: mul_mat, [4, 4, 2, 3] = [4, 4, 2, 3] * [4, 4, 2, 3] 880s x0: [4, 4, 2, 3] 880s 0.644 0.163 -0.844 -0.748 880s -0.530 -0.590 -0.935 0.722 880s -0.181 0.284 -0.037 -0.012 880s -0.221 0.371 -0.862 -0.382 880s 880s x1: [4, 4, 2, 3] 880s -0.866 0.396 -0.015 0.073 880s -0.746 -0.351 -0.709 -0.033 880s 0.443 -0.789 0.190 -0.372 880s -0.002 -0.800 -0.956 0.608 880s 880s y: [4, 4, 2, 3] 880s -0.535 0.292 0.269 0.323 880s 0.085 1.241 0.062 0.658 880s 0.274 -0.216 -0.307 -0.412 880s 0.221 1.806 -0.199 0.296 880s testing: mul_mat, [1, 4, 1, 1] = [4, 1, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 1, 1, 1] 880s -0.879 0.629 0.198 0.811 880s 880s x1: [4, 4, 1, 1] 880s -0.267 0.486 -0.630 0.623 880s 0.751 0.062 0.320 0.197 880s -0.426 -0.395 0.193 0.045 880s -0.906 -0.429 0.408 0.405 880s 880s y: [1, 4, 1, 1] 880s 0.920 880s -0.399 880s 0.201 880s 0.936 880s testing: mul_mat, [1, 4, 2, 1] = [1, 1, 2, 1] * [1, 4, 2, 1] 880s x0: [1, 1, 2, 1] 880s -0.147 880s 880s x1: [1, 4, 2, 1] 880s 0.256 880s 0.960 880s -0.833 880s 0.678 880s 880s y: [1, 4, 2, 1] 880s -0.038 880s -0.141 880s 0.122 880s -0.100 880s testing: mul_mat, [2, 1, 2, 3] = [1, 2, 2, 3] * [1, 1, 2, 3] 880s x0: [1, 2, 2, 3] 880s -0.141 880s 0.071 880s 880s x1: [1, 1, 2, 3] 880s -0.027 880s 880s y: [2, 1, 2, 3] 880s 0.004 -0.002 880s test-mul-mat0: iter:354/500 880s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 4, 1, 1] 880s -0.525 -0.373 880s -0.053 0.328 880s -0.850 0.920 880s -0.449 0.296 880s 880s x1: [2, 1, 1, 1] 880s 0.132 0.838 880s 880s y: [4, 1, 1, 1] 880s -0.382 0.268 0.659 0.188 880s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 4, 1, 1] 880s 0.366 -0.091 880s -0.433 -0.376 880s -0.659 -0.235 880s 0.080 0.584 880s 880s x1: [2, 4, 1, 1] 880s -0.355 -0.165 880s 0.762 -0.434 880s 0.694 -0.168 880s 0.680 0.598 880s 880s y: [4, 4, 1, 1] 880s -0.115 0.216 0.273 -0.125 880s 0.318 -0.167 -0.400 -0.192 880s 0.269 -0.237 -0.418 -0.042 880s 0.195 -0.519 -0.588 0.404 880s testing: mul_mat, [1, 4, 1, 2] = [2, 1, 1, 2] * [2, 4, 1, 2] 880s x0: [2, 1, 1, 2] 880s -0.469 -0.766 880s 880s x1: [2, 4, 1, 2] 880s -0.780 0.497 880s -0.006 0.352 880s 0.335 -0.547 880s 0.827 0.962 880s 880s y: [1, 4, 1, 2] 880s -0.015 880s -0.266 880s 0.262 880s -1.124 880s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 2, 1, 1] 880s -0.259 -0.798 880s -0.435 0.082 880s 880s x1: [2, 1, 1, 1] 880s 0.832 0.998 880s 880s y: [2, 1, 1, 1] 880s -1.013 -0.280 880s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.660 -0.887 880s -0.098 -0.887 880s 880s x1: [2, 2, 1, 1] 880s 0.331 0.282 880s -0.334 0.550 880s 880s y: [2, 2, 1, 1] 880s -0.032 -0.282 880s -0.708 -0.455 880s testing: mul_mat, [1, 2, 1, 2] = [2, 1, 1, 2] * [2, 2, 1, 2] 880s x0: [2, 1, 1, 2] 880s -0.872 -0.236 880s 880s x1: [2, 2, 1, 2] 880s -0.271 -0.925 880s 0.514 0.883 880s 880s y: [1, 2, 1, 2] 880s 0.455 880s -0.656 880s test-mul-mat0: iter:355/500 880s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 2, 1, 1] 880s -0.543 0.841 0.572 0.971 880s 0.724 0.759 -0.194 0.312 880s 880s x1: [4, 3, 1, 1] 880s 0.121 -0.065 0.329 0.323 880s -0.983 -0.340 -0.395 -0.316 880s -0.790 0.383 -0.656 0.112 880s 880s y: [2, 3, 1, 1] 880s 0.381 0.075 880s -0.286 -0.992 880s 0.485 -0.119 880s testing: mul_mat, [4, 2, 3, 1] = [4, 4, 3, 1] * [4, 2, 3, 1] 880s x0: [4, 4, 3, 1] 880s 0.981 -0.595 -0.477 0.741 880s 0.210 0.835 0.095 -0.184 880s 0.275 0.213 0.754 0.077 880s -0.903 -0.764 0.074 0.745 880s 880s x1: [4, 2, 3, 1] 880s 0.354 0.606 0.440 -0.881 880s -0.062 0.802 -0.116 -0.519 880s 880s y: [4, 2, 3, 1] 880s -0.876 0.784 0.491 -1.408 880s -0.867 0.741 0.027 -0.952 880s testing: mul_mat, [4, 4, 3, 4] = [4, 4, 3, 4] * [4, 4, 3, 4] 880s x0: [4, 4, 3, 4] 880s -0.743 0.777 0.403 0.184 880s -0.787 -0.646 -0.465 0.679 880s 0.405 0.732 -0.292 0.132 880s 0.114 -0.141 0.696 -0.968 880s 880s x1: [4, 4, 3, 4] 880s 0.750 -0.792 -0.981 -0.296 880s -0.434 -0.689 0.291 -0.230 880s 0.516 0ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s .854 0.932 0.131 880s -0.149 0.893 -0.274 -0.748 880s 880s y: [4, 4, 3, 4] 880s -1.623 0.176 -0.028 -0.199 880s -0.137 0.495 -0.796 0.473 880s 0.679 -1.303 0.579 0.460 880s 0.556 -0.839 0.575 0.391 880s testing: mul_mat, [1, 4, 1, 1] = [4, 1, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 1, 1, 1] 880s 0.113 -0.979 0.662 0.442 880s 880s x1: [4, 4, 1, 1] 880s 0.136 -0.856 0.470 0.334 880s -0.450 0.874 0.378 -0.725 880s 0.117 -0.257 -0.303 -0.833 880s 0.847 -0.845 -0.946 0.803 880s 880s y: [1, 4, 1, 1] 880s 1.311 880s -0.977 880s -0.304 880s 0.652 880s testing: mul_mat, [4, 4, 3, 1] = [1, 4, 3, 1] * [1, 4, 3, 1] 880s x0: [1, 4, 3, 1] 880s -0.285 880s 0.556 880s 0.585 880s -0.907 880s 880s x1: [1, 4, 3, 1] 880s 0.935 880s -0.213 880s 0.043 880s 0.844 880s 880s y: [4, 4, 3, 1] 880s -0.267 0.520 0.547 -0.848 880s 0.061 -0.118 -0.125 0.193 880s -0.012 0.024 0.025 -0.039 880s -0.241 0.469 0.493 -0.765 880s testing: mul_mat, [3, 1, 3, 4] = [4, 3, 3, 4] * [4, 1, 3, 4] 880s x0: [4, 3, 3, 4] 880s -0.377 -0.277 0.525 0.197 880s 0.824 0.325 0.662 -0.318 880s -0.426 0.776 0.508 0.922 880s 880s x1: [4, 1, 3, 4] 880s -0.514 -0.060 -0.963 -0.493 880s 880s y: [3, 1, 3, 4] 880s -0.393 -0.924 -0.771 880s test-mul-mat0: iter:356/500 880s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 1, 1, 1] 880s 0.280 880s 880s x1: [1, 4, 1, 1] 880s 0.245 880s 0.852 880s -0.611 880s 0.630 880s 880s y: [1, 4, 1, 1] 880s 0.069 880s 0.238 880s -0.171 880s 0.176 880s testing: mul_mat, [1, 1, 2, 1] = [1, 1, 2, 1] * [1, 1, 2, 1] 880s x0: [1, 1, 2, 1] 880s -0.664 880s 880s x1: [1, 1, 2, 1] 880s 0.030 880s 880s y: [1, 1, 2, 1] 880s -0.020 880s testing: mul_mat, [1, 1, 2, 3] = [1, 1, 2, 3] * [1, 1, 2, 3] 880s x0: [1, 1, 2, 3] 880s -0.728 880s 880s x1: [1, 1, 2, 3] 880s 0.679 880s 880s y: [1, 1, 2, 3] 880s -0.495 880s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 1, 1, 1] 880s -0.722 880s 880s x1: [1, 1, 1, 1] 880s 0.528 880s 880s y: [1, 1, 1, 1] 880s -0.381 880s testing: mul_mat, [1, 1, 2, 1] = [1, 1, 2, 1] * [1, 1, 2, 1] 880s x0: [1, 1, 2, 1] 880s -0.734 880s 880s x1: [1, 1, 2, 1] 880s -0.394 880s 880s y: [1, 1, 2, 1] 880s 0.289 880s testing: mul_mat, [1, 1, 2, 3] = [1, 1, 2, 3] * [1, 1, 2, 3] 880s x0: [1, 1, 2, 3] 880s 0.232 880s 880s x1: [1, 1, 2, 3] 880s -0.221 880s 880s y: [1, 1, 2, 3] 880s -0.051 880s test-mul-mat0: iter:357/500 880s testing: mul_mat, [1, 3, 1, 1] = [3, 1, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 1, 1, 1] 880s 0.295 0.361 0.471 880s 880s x1: [3, 3, 1, 1] 880s 0.296 0.403 0.401 880s 0.071 -0.866 -0.844 880s -0.401 -0.230 -0.566 880s 880s y: [1, 3, 1, 1] 880s 0.422 880s -0.690 880s -0.468 880s testing: mul_mat, [4, 1, 2, 1] = [3, 4, 2, 1] * [3, 1, 2, 1] 880s x0: [3, 4, 2, 1] 880s 0.235 0.293 0.663 880s 0.237 -0.796 0.548 880s -0.577 -0.285 -0.427 880s 0.866 -0.920 0.869 880s 880s x1: [3, 1, 2, 1] 880s 0.065 -0.859 -0.874 880s 880s y: [4, 1, 2, 1] 880s -0.816 0.221 0.580 0.088 880s testing: mul_mat, [2, 4, 2, 3] = [3, 2, 2, 3] * [3, 4, 2, 3] 880s x0: [3, 2, 2, 3] 880s -0.387 -0.090 0.670 880s -0.455 0.933 -0.147 880s 880s x1: [3, 4, 2, 3] 880s -0.467 0.574 -0.659 880s -0.002 -0.815 -0.598 880s 0.894 0.421 0.695 880s 0.557 -0.342 0.899 880s 880s y: [2, 4, 2, 3] 880s -0.313 0.845 880s -0.327 -0.671 880s 0.082 -0.116 880s 0.417 -0.705 880s testing: mul_mat, [3, 2, 1, 1] = [3, 3, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.530 -0.389 0.505 880s 0.064 0.553 -0.128 880s 0.202 0.336 0.323 880s 880s x1: [3, 2, 1, 1] 880s 0.614 -0.025 -0.136 880s 0.865 -0.054 -0.576 880s 880s y: [3, 2, 1, 1] 880s 0.267 0.043 0.072 880s 0.188 0.099 -0.030 880s testing: mul_mat, [3, 3, 2, 1] = [3, 3, 2, 1] * [3, 3, 2, 1] 880s x0: [3, 3, 2, 1] 880s 0.321 -0.148 0.463 880s 0.498 -0.438 -0.885 880s -0.395 0.807 0.143 880s 880s x1: [3, 3, 2, 1] 880s 0.104 -0.656 0.952 880s -0.516 0.693 0.905 880s -0.858 -0.662 0.422 880s 880s y: [3, 3, 2, 1] 880s 0.571 -0.503 -0.434 880s 0.150 -1.362 0.893 880s 0.017 -0.512 -0.134 880s testing: mul_mat, [2, 3, 2, 3] = [3, 2, 2, 3] * [3, 3, 2, 3] 880s x0: [3, 2, 2, 3] 880s 0.693 0.576 0.873 880s 0.905 0.140 0.973 880s 880s x1: [3, 3, 2, 3] 880s -0.302 0.363 0.744 880s 0.931 -0.040 0.594 880s 0.926 -0.640 0.804 880s 880s y: [2, 3, 2, 3] 880s 0.650 0.501 880s 1.140 1.415 880s 0.975 1.531 880s test-mul-mat0: iter:358/500 880s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 2, 1, 1] 880s 0.142 880s -0.637 880s 880s x1: [1, 4, 1, 1] 880s -0.611 880s 0.097 880s 0.611 880s 0.920 880s 880s y: [2, 4, 1, 1] 880s -0.087 0.389 880s 0.014 -0.062 880s 0.087 -0.389 880s 0.131 -0.586 880s testing: mul_mat, [3, 2, 3, 1] = [1, 3, 3, 1] * [1, 2, 3, 1] 880s x0: [1, 3, 3, 1] 880s 0.704 880s -0.490 880s -0.721 880s 880s x1: [1, 2, 3, 1] 880s -0.527 880s -0.512 880s 880s y: [3, 2, 3, 1] 880s -0.371 0.258 0.380 880s -0.360 0.251 0.369 880s testing: mul_mat, [1, 3, 3, 3] = [1, 1, 3, 3] * [1, 3, 3, 3] 880s x0: [1, 1, 3, 3] 880s -0ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s .898 880s 880s x1: [1, 3, 3, 3] 880s -0.955 880s 0.515 880s 0.478 880s 880s y: [1, 3, 3, 3] 880s 0.858 880s -0.462 880s -0.429 880s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 2, 1, 1] 880s 0.687 880s 0.564 880s 880s x1: [1, 1, 1, 1] 880s 0.040 880s 880s y: [2, 1, 1, 1] 880s 0.027 0.023 880s testing: mul_mat, [2, 1, 3, 1] = [2, 2, 3, 1] * [2, 1, 3, 1] 880s x0: [2, 2, 3, 1] 880s -0.728 0.984 880s 0.608 0.985 880s 880s x1: [2, 1, 3, 1] 880s -0.261 0.441 880s 880s y: [2, 1, 3, 1] 880s 0.624 0.276 880s testing: mul_mat, [2, 2, 3, 3] = [2, 2, 3, 3] * [2, 2, 3, 3] 880s x0: [2, 2, 3, 3] 880s -0.064 0.819 880s 0.377 0.951 880s 880s x1: [2, 2, 3, 3] 880s 0.081 -0.223 880s -0.410 0.064 880s 880s y: [2, 2, 3, 3] 880s -0.188 -0.182 880s 0.079 -0.094 880s test-mul-mat0: iter:359/500 880s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 3, 1, 1] 880s -0.234 0.701 880s -0.693 -0.635 880s 0.669 0.884 880s 880s x1: [2, 1, 1, 1] 880s 0.432 0.713 880s 880s y: [3, 1, 1, 1] 880s 0.398 -0.752 0.919 880s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 1, 1, 1] 880s 0.627 -0.889 880s 880s x1: [2, 3, 1, 1] 880s -0.034 0.826 880s 0.990 -0.093 880s -0.215 -0.527 880s 880s y: [1, 3, 1, 1] 880s -0.756 880s 0.703 880s 0.333 880s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.405 -0.859 880s 0.440 0.898 880s 880s x1: [2, 1, 1, 1] 880s -0.269 0.416 880s 880s y: [2, 1, 1, 1] 880s -0.466 0.255 880s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 4, 1, 1] 880s -0.589 0.396 880s -0.370 -0.971 880s -0.671 -0.359 880s -0.666 0.762 880s 880s x1: [2, 2, 1, 1] 880s -0.183 -0.106 880s 0.016 0.198 880s 880s y: [4, 2, 1, 1] 880s 0.066 0.170 0.160 0.041 880s 0.069 -0.198 -0.082 0.140 880s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 4, 1, 1] 880s 0.937 0.186 -0.699 0.510 880s 0.099 -0.597 -0.689 -0.864 880s -0.002 0.494 0.521 0.729 880s 0.353 -0.374 0.318 -0.860 880s 880s x1: [4, 2, 1, 1] 880s 0.699 -0.475 -0.273 0.525 880s -0.486 0.635 -0.690 -0.012 880s 880s y: [4, 2, 1, 1] 880s 1.024 0.087 0.005 -0.114 880s 0.139 0.059 -0.054 -0.617 880s testing: mul_mat, [4, 4, 1, 1] = [4, 4, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 4, 1, 1] 880s -0.075 -0.188 0.162 -0.324 880s 0.318 0.641 -0.053 -0.227 880s 0.015 0.629 -0.367 0.763 880s 0.328 0.123 0.298 -0.261 880s 880s x1: [4, 4, 1, 1] 880s -0.535 -0.937 0.536 -0.506 880s -0.296 0.298 -0.807 0.228 880s -0.975 0.718 0.743 0.660 880s -0.972 -0.270 0.927 0.965 880s 880s y: [4, 4, 1, 1] 880s 0.467 -0.684 -1.180 0.001 880s -0.238 0.088 0.654 -0.361 880s -0.156 -0.039 0.668 -0.183 880s -0.039 -0.750 0.212 -0.328 880s test-mul-mat0: iter:360/500 880s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 3, 1, 1] 880s 0.987 0.472 -0.709 0.628 880s 0.101 0.414 -0.210 -0.952 880s -0.953 -0.912 -0.276 -0.180 880s 880s x1: [4, 3, 1, 1] 880s -0.549 -0.329 0.632 -0.831 880s -0.587 0.292 -0.803 0.144 880s 0.219 -0.838 -0.027 -0.856 880s 880s y: [3, 3, 1, 1] 880s -1.668 0.466 0.799 880s 0.218 0.093 0.488 880s -0.698 0.496 0.717 880s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 4, 1, 1] 880s 0.133 -0.121 0.087 -0.894 880s 0.023 -0.433 -0.907 -0.505 880s -0.141 0.722 0.597 -0.727 880s -0.488 0.645 -0.680 -0.401 880s 880s x1: [4, 3, 1, 1] 880s 0.851 0.463 0.129 -0.892 880s -0.095 -0.264 -0.441 0.576 880s -0.632 -0.272 0.990 0.660 880s 880s y: [4, 3, 1, 1] 880s 0.866 0.153 0.939 0.153 880s -0.534 0.221 -0.859 -0.055 880s -0.555 -1.127 0.003 -0.804 880s testing: mul_mat, [2, 4, 1, 4] = [4, 2, 1, 4] * [4, 4, 1, 4] 880s x0: [4, 2, 1, 4] 880s -0.420 -0.489 -0.074 -0.811 880s -0.199 0.519 -0.889 -0.537 880s 880s x1: [4, 4, 1, 4] 880s -0.631 0.140 -0.550 0.832 880s -0.731 -0.442 -0.263 0.004 880s 0.117 -0.687 0.372 0.845 880s -0.698 0.032 -0.230 0.436 880s 880s y: [2, 4, 1, 4] 880s -0.437 0.241 880s 0.539 0.148 880s -0.426 -1.164 880s -0.059 0.126 880s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 4, 1, 1] 880s 0.195 -0.928 0.090 -0.194 880s 0.842 0.056 -0.465 0.116 880s 0.859 -0.739 -0.711 0.612 880s -0.042 0.794 0.320 0.746 880s 880s x1: [4, 2, 1, 1] 880s -0.227 0.200 0.417 0.574 880s -0.281 0.528 -0.963 -0.324 880s 880s y: [4, 2, 1, 1] 880s -0.303 -0.308 -0.288 0.730 880s -0.569 0.203 -0.145 -0.118 880s testing: mul_mat, [1, 4, 1, 1] = [4, 1, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 1, 1, 1] 880s -0.710 -0.051 0.219 0.346 880s 880s x1: [4, 4, 1, 1] 880s -0.849 -0.564 -0.607 -0.587 880s -0.290 0.622 0.185 0.910 880s 0.039 -0.241 -0.371 -0.432 880s -0.204 0.306 -0.569 0.991 880s 880s y: [1, 4, 1, 1] 880s 0.295 880s 0.529 880s -0.246 880s 0.347 880s testing: mul_mat, [2, 4, 1, 4] = [1, 2, 1, 4] * [1, 4ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s , 1, 4] 880s x0: [1, 2, 1, 4] 880s -0.576 880s 0.352 880s 880s x1: [1, 4, 1, 4] 880s 0.210 880s 0.013 880s -0.564 880s 0.746 880s 880s y: [2, 4, 1, 4] 880s -0.121 0.074 880s -0.008 0.005 880s 0.325 -0.199 880s -0.429 0.262 880s test-mul-mat0: iter:361/500 880s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 2, 1, 1] 880s 0.873 880s 0.739 880s 880s x1: [1, 1, 1, 1] 880s -0.274 880s 880s y: [2, 1, 1, 1] 880s -0.239 -0.203 880s testing: mul_mat, [3, 2, 4, 1] = [1, 3, 4, 1] * [1, 2, 4, 1] 880s x0: [1, 3, 4, 1] 880s 0.349 880s 0.160 880s 0.010 880s 880s x1: [1, 2, 4, 1] 880s -0.578 880s 0.619 880s 880s y: [3, 2, 4, 1] 880s -0.202 -0.092 -0.006 880s 0.216 0.099 0.006 880s testing: mul_mat, [2, 3, 4, 1] = [1, 2, 4, 1] * [1, 3, 4, 1] 880s x0: [1, 2, 4, 1] 880s -0.681 880s 0.865 880s 880s x1: [1, 3, 4, 1] 880s 0.700 880s 0.094 880s 0.540 880s 880s y: [2, 3, 4, 1] 880s -0.476 0.605 880s -0.064 0.082 880s -0.368 0.467 880s testing: mul_mat, [1, 2, 1, 1] = [1, 1, 1, 1] * [1, 2, 1, 1] 880s x0: [1, 1, 1, 1] 880s -0.614 880s 880s x1: [1, 2, 1, 1] 880s 0.209 880s 0.075 880s 880s y: [1, 2, 1, 1] 880s -0.128 880s -0.046 880s testing: mul_mat, [1, 1, 4, 1] = [1, 1, 4, 1] * [1, 1, 4, 1] 880s x0: [1, 1, 4, 1] 880s -0.683 880s 880s x1: [1, 1, 4, 1] 880s 0.362 880s 880s y: [1, 1, 4, 1] 880s -0.247 880s testing: mul_mat, [2, 1, 4, 1] = [1, 2, 4, 1] * [1, 1, 4, 1] 880s x0: [1, 2, 4, 1] 880s -0.864 880s 0.210 880s 880s x1: [1, 1, 4, 1] 880s 0.608 880s 880s y: [2, 1, 4, 1] 880s -0.526 0.128 880s test-mul-mat0: iter:362/500 880s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.884 0.059 -0.725 880s -0.626 0.762 -0.192 880s 880s x1: [3, 1, 1, 1] 880s 0.520 0.497 0.697 880s 880s y: [2, 1, 1, 1] 880s -0.016 -0.081 880s testing: mul_mat, [1, 2, 4, 1] = [3, 1, 4, 1] * [3, 2, 4, 1] 880s x0: [3, 1, 4, 1] 880s -0.798 -0.565 -0.422 880s 880s x1: [3, 2, 4, 1] 880s -0.309 -0.836 -0.564 880s -0.452 0.772 0.798 880s 880s y: [1, 2, 4, 1] 880s 0.957 880s -0.413 880s testing: mul_mat, [3, 1, 4, 4] = [3, 3, 4, 4] * [3, 1, 4, 4] 880s x0: [3, 3, 4, 4] 880s -0.842 -0.609 -0.316 880s 0.166 0.252 0.686 880s 0.559 0.379 0.049 880s 880s x1: [3, 1, 4, 4] 880s -0.576 0.446 -0.499 880s 880s y: [3, 1, 4, 4] 880s 0.371 -0.326 -0.177 880s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.276 0.347 -0.157 880s 0.355 -0.771 0.689 880s 880s x1: [3, 3, 1, 1] 880s -0.816 -0.674 0.213 880s -0.730 -0.328 -0.732 880s 0.650 0.985 0.149 880s 880s y: [2, 3, 1, 1] 880s -0.493 0.376 880s -0.201 -0.511 880s 0.498 -0.426 880s testing: mul_mat, [3, 3, 4, 1] = [2, 3, 4, 1] * [2, 3, 4, 1] 880s x0: [2, 3, 4, 1] 880s 0.969 -0.802 880s 0.228 -0.929 880s -0.879 0.809 880s 880s x1: [2, 3, 4, 1] 880s -0.113 0.581 880s -0.993 -0.301 880s 0.326 0.773 880s 880s y: [3, 3, 4, 1] 880s -0.576 -0.566 0.570 880s -0.720 0.053 0.629 880s -0.304 -0.644 0.339 880s testing: mul_mat, [1, 2, 4, 4] = [3, 1, 4, 4] * [3, 2, 4, 4] 880s x0: [3, 1, 4, 4] 880s 0.585 0.220 -0.946 880s 880s x1: [3, 2, 4, 4] 880s -0.542 -0.552 -0.919 880s 0.268 0.141 0.954 880s 880s y: [1, 2, 4, 4] 880s 0.430 880s -0.714 880s test-mul-mat0: iter:363/500 880s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 4, 1, 1] 880s -0.774 880s -0.203 880s -0.595 880s 0.195 880s 880s x1: [1, 1, 1, 1] 880s -0.838 880s 880s y: [4, 1, 1, 1] 880s 0.649 0.170 0.499 -0.163 880s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 2, 1, 1] 880s -0.225 880s -0.801 880s 880s x1: [1, 4, 1, 1] 880s -0.671 880s -0.981 880s -0.625 880s 0.237 880s 880s y: [2, 4, 1, 1] 880s 0.151 0.537 880s 0.220 0.785 880s 0.140 0.501 880s -0.053 -0.190 880s testing: mul_mat, [1, 2, 1, 2] = [1, 1, 1, 2] * [1, 2, 1, 2] 880s x0: [1, 1, 1, 2] 880s -0.211 880s 880s x1: [1, 2, 1, 2] 880s 0.581 880s -0.890 880s 880s y: [1, 2, 1, 2] 880s -0.122 880s 0.188 880s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 1, 1, 1] 880s -0.382 880s 880s x1: [1, 1, 1, 1] 880s 0.138 880s 880s y: [1, 1, 1, 1] 880s -0.053 880s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 2, 1, 1] 880s 0.813 880s -0.285 880s 880s x1: [1, 1, 1, 1] 880s -0.349 880s 880s y: [2, 1, 1, 1] 880s -0.284 0.099 880s testing: mul_mat, [4, 1, 1, 2] = [2, 4, 1, 2] * [2, 1, 1, 2] 880s x0: [2, 4, 1, 2] 880s 0.407 -0.269 880s 0.536 0.206 880s -0.123 0.831 880s 0.812 0.106 880s 880s x1: [2, 1, 1, 2] 880s 0.328 0.918 880s 880s y: [4, 1, 1, 2] 880s -0.114 0.365 0.723 0.364 880s test-mul-mat0: iter:364/500 880s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 2, 1, 1] 880s -0.659 -0.854 -0.327 880s 0.547 0.977 0.779 880s 880s x1: [3, 4, 1, 1] 880s -0.015 0.651 0.833 880s 0.768 0.464 -0.452 880s 0.096 0.383 -0.073 880s -0.594 0.463 -0.666 880s 880s y: [2, 4, 1, 1] 880s -0.819 1.276 880s -0.755 0.522 880s -0.367 0.370 880s 0.213 -0.390 880s testing: mul_mat, [4, 2, 4, 1] = [3, 4, 4, 1] * [3, 2, 4, 1] 880s x0: [3, 4, 4, 1] 880s 0.846 -0.337 -0.907 880s 0.393 -0.360 0ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png.872 880s 0.383 0.667 0.531 880s -0.984 0.326 0.181 880s 880s x1: [3, 2, 4, 1] 880s 0.990 0.027 0.659 880s -0.367 0.659 0.650 880s 880s y: [4, 2, 4, 1] 880s 0.230 0.954 0.748 -0.846 880s -1.122 0.186 0.644 0.694 880s testing: mul_mat, [4, 4, 4, 1] = [3, 4, 4, 1] * [3, 4, 4, 1] 880s x0: [3, 4, 4, 1] 880s -0.899 -0.513 0.780 880s -0.013 -0.077 0.033 880s -0.416 -0.042 0.253 880s -0.272 0.467 0.210 880s 880s x1: [3, 4, 4, 1] 880s -0.775 -0.285 -0.500 880s -0.317 0.667 0.733 880s -0.603 0.691 0.008 880s -0.707 0.193 0.788 880s 880s y: [4, 4, 4, 1] 880s 0.453 0.015 0.208 -0.027 880s 0.515 -0.023 0.289 0.552 880s 0.194 -0.046 0.223 0.489 880s 1.151 0.020 0.485 0.448 880s testing: mul_mat, [1, 4, 1, 1] = [3, 1, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 1, 1, 1] 880s 0.367 -0.737 -0.504 880s 880s x1: [3, 4, 1, 1] 880s -0.088 0.186 -0.633 880s 0.162 -0.835 0.905 880s -0.365 -0.757 -0.742 880s 0.447 0.832 0.210 880s 880s y: [1, 4, 1, 1] 880s 0.150 880s 0.218 880s 0.798 880s -0.555 880s testing: mul_mat, [4, 3, 4, 1] = [1, 4, 4, 1] * [1, 3, 4, 1] 880s x0: [1, 4, 4, 1] 880s 0.945 880s -0.797 880s 0.045 880s 0.131 880s 880s x1: [1, 3, 4, 1] 880s -0.319 880s -0.697 880s 0.187 880s 880s y: [4, 3, 4, 1] 880s -0.302 0.254 -0.014 -0.042 880s -0.659 0.555 -0.031 -0.092 880s 0.177 -0.149 0.008 0.025 880s testing: mul_mat, [4, 1, 4, 1] = [4, 4, 4, 1] * [4, 1, 4, 1] 880s x0: [4, 4, 4, 1] 880s 0.697 -0.481 -0.261 -0.951 880s 0.222 -0.995 0.033 -0.831 880s 0.529 -0.255 0.430 -0.110 880s 0.904 -0.941 -0.317 0.561 880s 880s x1: [4, 1, 4, 1] 880s -0.793 0.512 0.419 -0.490 880s 880s y: [4, 1, 4, 1] 880s -0.443 -0.265 -0.316 -1.607 880s test-mul-mat0: iter:365/500 880s testing: mul_mat, [1, 4, 1, 1] = [3, 1, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 1, 1, 1] 880s -0.943 -0.882 -0.119 880s 880s x1: [3, 4, 1, 1] 880s 0.353 -0.742 0.993 880s 0.019 -0.722 0.948 880s -0.266 -0.533 -0.208 880s -0.990 -0.756 0.036 880s 880s y: [1, 4, 1, 1] 880s 0.203 880s 0.506 880s 0.746 880s 1.596 880s testing: mul_mat, [3, 1, 4, 1] = [3, 3, 4, 1] * [3, 1, 4, 1] 880s x0: [3, 3, 4, 1] 880s -0.985 -0.278 0.303 880s -0.727 -0.285 -0.678 880s -0.449 -0.337 0.056 880s 880s x1: [3, 1, 4, 1] 880s -0.015 0.753 0.499 880s 880s y: [3, 1, 4, 1] 880s -0.043 -0.542 -0.219 880s testing: mul_mat, [3, 3, 4, 1] = [3, 3, 4, 1] * [3, 3, 4, 1] 880s x0: [3, 3, 4, 1] 880s -0.174 -0.733 -0.944 880s -0.386 -0.285 0.678 880s 0.895 0.468 0.664 880s 880s x1: [3, 3, 4, 1] 880s 0.053 -0.152 0.068 880s -0.891 0.866 -0.478 880s 0.175 0.129 0.014 880s 880s y: [3, 3, 4, 1] 880s 0.038 0.069 0.021 880s -0.029 -0.227 -0.709 880s -0.138 -0.095 0.226 880s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.779 0.387 -0.022 880s 0.963 0.120 0.811 880s 880s x1: [3, 3, 1, 1] 880s 0.135 -0.968 0.709 880s -0.202 -0.799 0.781 880s -0.103 0.795 -0.293 880s 880s y: [2, 3, 1, 1] 880s -0.286 0.588 880s -0.484 0.343 880s 0.234 -0.242 880s testing: mul_mat, [3, 3, 4, 1] = [2, 3, 4, 1] * [2, 3, 4, 1] 880s x0: [2, 3, 4, 1] 880s 0.690 -0.190 880s 0.964 -0.059 880s 0.871 0.682 880s 880s x1: [2, 3, 4, 1] 880s 0.508 -0.518 880s -0.341 0.348 880s 0.527 -0.221 880s 880s y: [3, 3, 4, 1] 880s 0.449 0.520 0.089 880s -0.301 -0.349 -0.060 880s 0.406 0.521 0.308 880s testing: mul_mat, [1, 2, 4, 1] = [3, 1, 4, 1] * [3, 2, 4, 1] 880s x0: [3, 1, 4, 1] 880s -0.876 0.505 -0.471 880s 880s x1: [3, 2, 4, 1] 880s 0.240 -0.168 0.329 880s -0.366 -0.258 0.329 880s 880s y: [1, 2, 4, 1] 880s -0.450 880s 0.036 880s test-mul-mat0: iter:366/500 880s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 1, 1, 1] 880s 0.015 -0.341 -0.144 880s 880s x1: [3, 2, 1, 1] 880s 0.470 -0.250 0.517 880s 0.030 -0.593 -0.472 880s 880s y: [1, 2, 1, 1] 880s 0.018 880s 0.271 880s testing: mul_mat, [3, 1, 1, 1] = [3, 3, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.953 -0.941 0.003 880s -0.518 -0.075 0.225 880s 0.046 0.031 -0.721 880s 880s x1: [3, 1, 1, 1] 880s -0.204 0.946 0.935 880s 880s y: [3, 1, 1, 1] 880s -1.081 0.245 -0.655 880s testing: mul_mat, [1, 3, 1, 4] = [3, 1, 1, 4] * [3, 3, 1, 4] 880s x0: [3, 1, 1, 4] 880s -0.614 0.738 0.803 880s 880s x1: [3, 3, 1, 4] 880s -0.060 0.771 -0.925 880s -0.837 0.628 -0.697 880s -0.321 -0.526 0.773 880s 880s y: [1, 3, 1, 4] 880s -0.137 880s 0.418 880s 0.430 880s testing: mul_mat, [3, 1, 1, 1] = [3, 3, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.625 0.183 0.787 880s 0.343 -0.822 -0.957 880s 0.442 -0.183 -0.531 880s 880s x1: [3, 1, 1, 1] 880s -0.852 -0.452 0.433 880s 880s y: [3, 1, 1, 1] 880s -0.274 -0.335 -0.524 880s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.590 -0.698 0.538 880s -0.364 0.390 -0.370 880s -0.111 -0.918 0.318 880s -0.343 0.885 -0.168 880s 880s x1: [3, 3, 1, 1] 880s 0.332 0.680 -0.803 880s 0.039 -0.056 -0.773 880s 0.425 -0.318 -0.970 880s 880s y: [4, 3, 1, 1] 880s -1.102 0.442 -0.916 0.623 880s -0.400 0.250 -0.199 0.067 880s -0.551 0.080 -0.063 -0.263 880s testing: mul_mat, [2, 3, 1, 4] = [4, 2, 1, 4] * [4, 3, 1, 4] 880s x0: [4, 2, 1, 4] 880s -0.683 -0.762 -0.357 -0.995 880s 0.388 0.466 0.190 0.981 880s 880s x1: [4, 3, 1, 4] 880s -0.745 -0.339 -0.726 0.437 880s -0.162 0.091 0.225 -0.118 880s 0.560 -0.443 -0.439 0.757 880s 880s y: [2, 3, 1, 4] 880s 0.592 -0.156 880s 0.079 -0.094 880s -0.641 0.670 880s test-mul-mat0: iter:367/500 880s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.118 0.635 -0.329 880s -0.890 0.806 0.668 880s 880s x1: [3, 1, 1, 1] 880s -0.538 0.035 0.687 880s 880s y: [2, 1, 1, 1] 880s -0.267 0.966 880s testing: mul_mat, [2, 2, 2, 1] = [3, 2, 2, 1] * [3, 2, 2, 1] 880s x0: [3, 2, 2, 1] 880s -0.056 -0.655 0.371 880s -0.491 0.408 -0.357 880s 880s x1: [3, 2, 2, 1] 880s -0.715 0.636 -0.682 880s -0.266 -0.650 0.504 880s 880s y: [2, 2, 2, 1] 880s -0.629 0.853 880s 0.627 -0.314 880s testing: mul_mat, [4, 2, 2, 2] = [3, 4, 2, 2] * [3, 2, 2, 2] 880s x0: [3, 4, 2, 2] 880s 0.419 0.080 -0.815 880s 0.993 0.880 0.780 880s 0.686 -0.685 -0.954 880s 0.488 -0.444 0.979 880s 880s x1: [3, 2, 2, 2] 880s 0.693 -0.565 -0.734 880s 0.803 -0.759 0.934 880s 880s y: [4, 2, 2, 2] 880s 0.843 -0.382 1.562 -0.130 880s -0.486 0.857 0.179 1.644 880s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 2, 1, 1] 880s -0.125 -0.050 0.610 880s 0.132 -0.395 -0.821 880s 880s x1: [3, 4, 1, 1] 880s 0.754 0.865 -0.421 880s 0.284 -0.169 -0.605 880s 0.028 -0.138 0.581 880s 0.331 0.329 -0.353 880s 880s y: [2, 4, 1, 1] 880s -0.395 0.103 880s -0.396 0.601 880s 0.358 -0.419 880s -0.273 0.203 880s testing: mul_mat, [4, 3, 2, 1] = [2, 4, 2, 1] * [2, 3, 2, 1] 880s x0: [2, 4, 2, 1] 880s -0.854 -0.479 880s -0.874 0.041 880s -0.988 0.839 880s -0.023 -0.898 880s 880s x1: [2, 3, 2, 1] 880s -0.885 0.773 880s 0.724 0.671 880s 0.425 0.825 880s 880s y: [4, 3, 2, 1] 880s 0.386 0.804 1.522 -0.674 880s -0.940 -0.605 -0.152 -0.620 880s -0.758 -0.338 0.273 -0.750 880s testing: mul_mat, [1, 2, 2, 2] = [4, 1, 2, 2] * [4, 2, 2, 2] 880s x0: [4, 1, 2, 2] 880s 0.205 -0.309 0.585 0.640 880s 880s x1: [4, 2, 2, 2] 880s 0.492 0.100 -0.237 0.265 880s -0.175 -0.565 -0.311 -0.351 880s 880s y: [1, 2, 2, 2] 880s 0.100 880s -0.268 880s test-mul-mat0: iter:368/500 880s testing: mul_mat, [3, 1, 1, 1] = [3, 3, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 3, 1, 1] 880s -0.502 -0.011 -0.034 880s -0.842 0.431 -0.607 880s -0.081 -0.364 0.084 880s 880s x1: [3, 1, 1, 1] 880s -0.021 0.759 0.194 880s 880s y: [3, 1, 1, 1] 880s -0.004 0.227 -0.258 880s testing: mul_mat, [4, 3, 3, 1] = [3, 4, 3, 1] * [3, 3, 3, 1] 880s x0: [3, 4, 3, 1] 880s 0.614 -0.806 -0.625 880s -0.882 -0.530 -0.161 880s -0.143 -0.973 -0.975 880s -0.164 0.481 0.556 880s 880s x1: [3, 3, 3, 1] 880s -0.496 -0.724 -0.536 880s -0.261 0.557 0.186 880s 0.979 0.454 0.531 880s 880s y: [4, 3, 3, 1] 880s 0.614 0.907 1.298 -0.565 880s -0.726 -0.095 -0.686 0.414 880s -0.096 -1.189 -1.099 0.352 880s testing: mul_mat, [4, 4, 3, 1] = [3, 4, 3, 1] * [3, 4, 3, 1] 880s x0: [3, 4, 3, 1] 880s 0.134 0.581 0.689 880s -0.068 0.483 0.964 880s -0.570 -0.835 0.981 880s 0.573 -0.473 0.779 880s 880s x1: [3, 4, 3, 1] 880s 0.019 0.601 0.156 880s 0.044 -0.563 -0.362 880s -0.401 0.046 -0.188 880s 0.798 0.902 -0.725 880s 880s y: [4, 4, 3, 1] 880s 0.459 0.440 -0.359 -0.152 880s -0.571 -0.625 0.090 0.009 880s -0.157 -0.132 0.006 -0.398 880s 0.131 -0.318 -1.920 -0.535 880s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 4, 1, 1] 880s 0.294 -0.807 0.532 880s -0.562 -0.841 -0.568 880s -0.115 0.532 0.219 880s -0.129 0.439 0.440 880s 880s x1: [3, 4, 1, 1] 880s 0.432 -0.723 0.137 880s 0.413 0.849 0.664 880s 0.192 0.345 -0.834 880s 0.086 0.672 0.156 880s 880s y: [4, 4, 1, 1] 880s 0.783 0.288 -0.404 -0.313 880s -0.211 -1.323 0.549 0.611 880s -0.666 0.076 -0.022 -0.240 880s -0.434 -0.702 0.381 0.352 880s testing: mul_mat, [1, 3, 3, 1] = [4, 1, 3, 1] * [4, 3, 3, 1] 880s x0: [4, 1, 3, 1] 880s -0.752 0.683 -0.208 -0.230 880s 880s x1: [4, 3, 3, 1] 880s -0.790 -0.866 -0.071 0.677 880s 0.980 0.901 0.109 -0.743 880s 0.039 -0.478 -0.894 -0.298 880s 880s y: [1, 3, 3, 1] 880s -0.139 880s 0.028 880s -0.101 880s testing: mul_mat, [2, 4, 3, 1] = [1, 2, 3, 1] * [1, 4, 3, 1] 880s x0: [1, 2, 3, 1] 880s 0.602 880s -0.643 880s 880s x1: [1, 4, 3, 1] 880s -0.808 880s -0.220 880s -0.152 880s 0.985 880s 880s y: [2, 4, 3, 1] 880s -0.487 0.520 880s -0.133 0.141 880s -0.092 0.098 880s 0.593 -0.633 880s test-mul-mat0: iter:369/500 880s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.831 0.704 0.707 880s 0.334 0.640 -0.405 880s -0.069 0.059 -0.265 880s 880s x1: [3, 3, 1, 1] 880s 0.256 -0.766 -0.756 880s -0.504 -0.403 0.528 880s -0.141 -0.017 0.719 880s 880s y: [3, 3, 1, 1] 880s -0.860 -0.098 0.137 880s -0.329 -0.640 -0.129 880s 0.379 -0.349 -0.182 880s testing: mul_mat, [1, 3, 3, 1] = [3, 1, 3, 1] * [3, 3, 3, 1] 880s x0: [3, 1, 3, 1] 880s -0.807 0.724 -0.262 880s 880s x1: [3, 3, 3, 1] 880s 0.841 0.401 -0.662 880s -0.802 0.596 0.079 880s 0.982 0.544 -0.930 880s 880s y: [1, 3, 3, 1] 880s -0.215 880s 1.057 880s -0.155 880s testing: mul_mat, [2, 1, 3, 3] = [3, 2, 3, 3] * [3, 1, 3, 3] 880s x0: [3, 2, 3, 3] 880s 0.930 -0.689 -0.289 880s -0.855 -0.424 -0.064 880s 880s x1: [3, 1, 3, 3] 880s -0.786 -0.735 -0.775 880s 880s y: [2, 1, 3, 3] 880s 0.000 1.034 880s testing: mul_mat, [3, 2, 1, 1] = [3, 3, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.153 -0.906 0.970 880s -0.475 -0.704 0.876 880s -0.501 -0.837 -0.049 880s 880s x1: [3, 2, 1, 1] 880s 0.446 0.249 0.202 880s -0.367 -0.077 -0.102 880s 880s y: [3, 2, 1, 1] 880s 0.039 -0.210 -0.441 880s -0.086 0.139 0.253 880s testing: mul_mat, [3, 3, 3, 1] = [3, 3, 3, 1] * [3, 3, 3, 1] 880s x0: [3, 3, 3, 1] 880s -0.924 -0.087 0.738 880s -0.837 0.797 -0.062 880s -0.765 -0.835 0.449 880s 880s x1: [3, 3, 3, 1] 880s -0.163 0.634 0.931 880s -0.175 0.142 0.284 880s 0.664 -0.970 0.291 880s 880s y: [3, 3, 3, 1] 880s 0.782 0.584 0.013 880s 0.359 0.242 0.144 880s -0.314 -1.347 0.432 880s testing: mul_mat, [4, 3, 3, 3] = [3, 4, 3, 3] * [3, 3, 3, 3] 880s x0: [3, 4, 3, 3] 880s 0.040 0.701 -0.467 880s 0.661 -0.849 -0.036 880s -0.575 0.962 -0.985 880s -0.315 -0.611 -0.962 880s 880s x1: [3, 3, 3, 3] 880s -0.534 -0.803 -0.422 880s 0.749 -0.728 -0.259 880s 0.984 0.185 -0.462 880s 880s y: [4, 3, 3, 3] 880s -0.388 0.344 -0.051 1.065 880s -0.360 1.122 -0.876 0.458 880s 0.385 0.510 0.068 0.021 880s test-mul-mat0: iter:370/500 880s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 2, 1, 1] 880s -0.421 -0.202 880s -0.673 0.877 880s 880s x1: [2, 4, 1, 1] 880s -0.174 -0.412 880s 0.854 0.935 880s -0.047 -0.590 880s 0.399 0.533 880s 880s y: [2, 4, 1, 1] 880s 0.157 -0.244 880s -0.549 0.245 880s 0.139 -0.485 880s -0.276 0.199 880s testing: mul_mat, [1, 2, 1, 1] = [2, 1, 1, 1] * [2, 2, 1, 1] 880s x0: [2, 1, 1, 1] 880s 0.130 0.205 880s 880s x1: [2, 2, 1, 1] 880s -0.122 0.923 880s -0.623 0.607 880s 880s y: [1, 2, 1, 1] 880s 0.174 880s 0.044 880s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 3, 1, 1] 880s -0.094 0.323 880s 0.112 0.182 880s 0.565 -0.230 880s 880s x1: [2, 1, 1, 1] 880s 0.179 -0.855 880s 880s y: [3, 1, 1, 1] 880s -0.293 -0.135 0.298 880s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 880s x0: [2, 1, 1, 1] 880s 0.532 -0.071 880s 880s x1: [2, 3, 1, 1] 880s 0.118 0.268 880s 0.596 0.706 880s 0.122 0.530 880s 880s y: [1, 3, 1, 1] 880s 0.044 880s 0.267 880s 0.027 880s testing: mul_mat, [3, 2, 1, 1] = [1, 3, 1, 1] * [1, 2, 1, 1] 880s x0: [1, 3, 1, 1] 880s -0.011 880s 0.125 880s 0.385 880s 880s x1: [1, 2, 1, 1] 880s -0.808 880s -0.202 880s 880s y: [3, 2, 1, 1] 880s 0.009 -0.101 -0.311 880s 0.002 -0.025 -0.078 880s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 1, 1, 1] 880s 0.584 -0.108 -0.320 880s 880s x1: [3, 1, 1, 1] 880s 0.867 0.049 0.762 880s 880s y: [1, 1, 1, 1] 880s 0.257 880s test-mul-mat0: iter:371/500 880s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 3, 1, 1] 880s 0.597 -0.667 0.846 880s -0.474 -0.475 -0.355 880s -0.966 0.514 0.770 880s 880s x1: [3, 3, 1, 1] 880s -0.640 0.701 -0.150 880s 0.925 -0.530 0.968 880s 0.193 -0.934 0.674 880s 880s y: [3, 3, 1, 1] 880s -0.975 0.024 0.863 880s 1.725 -0.531 -0.421 880s 1.308 0.113 -0.147 880s testing: mul_mat, [4, 3, 3, 1] = [3, 4, 3, 1] * [3, 3, 3, 1] 880s x0: [3, 4, 3, 1] 880s -0.482 0.700 0.165 880s 0.937 0.081 0.983 880s 0.118 0.937 -0.614 880s 0.191 0.473 0.553 880s 880s x1: [3, 3, 3, 1] 880s 0.419 0.381 -0.181 880s 0.181 -0.144 -0.597 880s -0.927 0.536 0.166 880s 880s y: [4, 3, 3, 1] 880s 0.035 0.245 0.518 0.160 880s -0.286 -0.429 0.253 -0.364 880s 0.849 -0.661 0.291 0.169 880s testing: mul_mat, [4, 4, 3, 4] = [3, 4, 3, 4] * [3, 4, 3, 4] 880s x0: [3, 4, 3, 4] 880s 0.275 0.343 -0.753 880s 0.852 0.871 0.091 880s -0.994 0.524 -0.940 880s 0.887 -0.429 -0.551 880s 880s x1: [3, 4, 3, 4] 880s 0.952 -0.906 0.620 880s -0.661 0.285 0.093 880s 0.891 0.586 -0.310 880s -0.720 0.358 -0.887 880s 880s y: [4, 4, 3, 4] 880s -0.516 0.079 -2.003 0.892 880s -0.154 -0.306 0.720 -0.760 880s 0.679 1.242 -0.288 0.709 880s 0.593 -0.383 1.738 -0.304 880s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 3, 1, 1] 880s -0.102 0.916 -0.600 880s -0.182 -0.398 0.442 880s 0.843 -0.975 -0.161 880s 880s x1: [3, 4, 1, 1] 880s -0.523 -0.912 0.716 880s -0.785 -0.791 -0.928 880s 0.357 -0.035 -0.639 880s -0.385 -0.764 -0.354 880s 880s y: [3, 4, 1, 1] 880s -1.212 0.775 0.333 880s -0.089 0.048 0.259 880s 0.315 -0.334 0.439 880s -0.449 0.218 0.478 880s testing: mul_mat, [2, 3, 3, 1] = [3, 2, 3, 1] * [3, 3, 3, 1] 880s x0: [3, 2, 3, 1] 880s -0.848 -0.759 0.355 880s -0.447 -0.790 -0.216 880s 880s x1: [3, 3, 3, 1] 880s -0.941 -0.198 -0.457 880s -0.380 0.968 0.693 880s 0.440 0.440 -0.103 880s 880s y: [2, 3, 3, 1] 880s 0.787 0.676 880s -0.167 -0.745 880s -0.743 -0.522 880s testing: mul_mat, [3, 3, 3, 4] = [2, 3, 3, 4] * [2, 3, 3, 4] 880s x0: [2, 3, 3, 4] 880s 0.834 0.257 880s 0.758 0.657 880s 0.699 0.192 880s 880s x1: [2, 3, 3, 4] 880s 0.808 -0.273 880s 0.719 0.913 880s 0.664 0.410 880s 880s y: [3, 3, 3, 4] 880s 0.604 0.433 0.513 880s 0.834 1.145 0.678 880s 0.659 0.773 0.543 880s test-mul-mat0: iter:372/500 880s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 1, 1, 1] 880s 0.609 880s 880s x1: [1, 1, 1, 1] 880s -0.720 880s 880s y: [1, 1, 1, 1] 880s -0.439 880s testing: mul_mat, [1, 1, 2, 1] = [1, 1, 2, 1] * [1, 1, 2, 1] 880s x0: [1, 1, 2, 1] 880s -0.876 880s 880s x1: [1, 1, 2, 1] 880s -0.735 880s 880s y: [1, 1, 2, 1] 880s 0.644 880s testing: mul_mat, [2, 1, 2, 2] = [1, 2, 2, 2] * [1, 1, 2, 2] 880s x0: [1, 2, 2, 2] 880s 0.280 880s 0.577 880s 880s x1: [1, 1, 2, 2] 880s 0.144 880s 880s y: [2, 1, 2, 2] 880s 0.040 0.083 880s testing: mul_mat, [4, 2, 1, 1] = [1, 4, 1, 1] * [1, 2, 1, 1] 880s x0: [1, 4, 1, 1] 880s -0.741 880s -0.833 880s 0.640 880s 0.955 880s 880s x1: [1, 2, 1, 1] 880s -0.492 880s -0.628 880s 880s y: [4, 2, 1, 1] 880s 0.365 0.410 -0.315 -0.470 880s 0.465 0.523 -0.402 -0.600 880s testing: mul_mat, [3, 1, 2, 1] = [4, 3, 2, 1] * [4, 1, 2, 1] 880s x0: [4, 3, 2, 1] 880s 0.213 0.799 -0.433 0.561 880s 0.889 0.154 -0.566 -0.477 880s 0.463 0.315 -0.108 0.046 880s 880s x1: [4, 1, 2, 1] 880s -0.455 0.883 0.251 -0.175 880s 880s y: [3, 1, 2, 1] 880s 0.402 -0.326 0.032 880s testing: mul_mat, [2, 4, 2, 2] = [3, 2, 2, 2] * [3, 4, 2, 2] 880s x0: [3, 2, 2, 2] 880s 0.651 0.776 -0.526 880s -0.248 0.990 0.139 880s 880s x1: [3, 4, 2, 2] 880s -0.444 0.820 0.506 880s 0.416 -0.091 0.169 880s 0.735 -0.878 0.058 880s 0.198 0.921 -0.788 880s 880s y: [2, 4, 2, 2] 880s 0.081 0.993 880s 0.111 -0.169 880s -0.234 -1.043 880s 1.259 0.753 880s test-mul-mat0: iter:373/500 880s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 3, 1, 1] 880s -0.716 880s 0.204 880s -0.415 880s 880s x1: [1, 4, 1, 1] 880s 0.563 880s 0.359 880s -0.492 880s 0.214 880s 880s y: [3, 4, 1, 1] 880s -0.403 0.115 -0.233 880s -0.257 0.073 -0.149 880s 0.352 -0.100 0.204 880s -0.153 0.044 -0.089 880s testing: mul_mat, [3, 3, 4, 1] = [1, 3, 4, 1] * [1, 3, 4, 1] 880s x0: [1, 3, 4, 1] 880s -0.047 880s 0.402 880s -0.954 880s 880s x1: [1, 3, 4, 1] 880s 0.424 880s -0.322 880s -0.868 880s 880s y: [3, 3, 4, 1] 880s -0.020 0.171 -0.405 880s 0.015 -0.130 0.308 880s 0.041 -0.349 0.829 880s testing: mul_mat, [1, 3, 4, 1] = [1, 1, 4, 1] * [1, 3, 4, 1] 880s x0: [1, 1, 4, 1] 880s -0.488 880s 880s x1: [1, 3, 4, 1] 880s -0.041 880s 0.112 880s 0.926 880s 880s y: [1, 3, 4, 1] 880s 0.020 880s -0.055 880s -0.452 880s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 4, 1, 1] 880s 0.685 880s 0.126 880s -0.785 880s -0.959 880s 880s x1: [1, 1, 1, 1] 880s 0.724 880s 880s y: [4, 1, 1, 1] 880s 0.496 0.091 -0.568 -0.694 880s testing: mul_mat, [2, 1, 4, 1] = [4, 2, 4, 1] * [4, 1, 4, 1] 880s x0: [4, 2, 4, 1] 880s 0.047 0.479 0.676 0.943 880s 0.165 -0.770 0.088 -0.592 880s 880s x1: [4, 1, 4, 1] 880s 0.097 0.430 -0.319 0.432 880s 880s y: [2, 1, 4, 1] 880s 0.402 -0.599 880s testing: mul_mat, [3, 4, 4, 1] = [2, 3, 4, 1] * [2, 4, 4, 1] 880s x0: [2, 3, 4, 1] 880s -0.141 0.456 880s 0.996 0.995 880s 0.570 -0.162 880s 880s x1: [2, 4, 4, 1] 880s 0.914 0.368 880s -0.345 0.590 880s -0.544 -0.403 880s 0.999 0.268 880s 880s y: [3, 4, 4, 1] 880s 0.039 1.277 0.461 880s 0.318 0.243 -0.293 880s -0.107 -0.943 -0.245 880s -0.019 1.261 0.526 880s test-mul-mat0: iter:374/500 880s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 3, 1, 1] 880s -0.118 880s 0.240 880s -0.923 880s 880s x1: [1, 3, 1, 1] 880s 0.762 880s -0.115 880s 0.670 880s 880s y: [3, 3, 1, 1] 880s -0.090 0.183 -0.704 880s 0.014 -0.028 0.106 880s -0.079 0.161 -0.618 880s testing: mul_mat, [2, 3, 3, 1] = [1, 2, 3, 1] * [1, 3, 3, 1] 880s x0: [1, 2, 3, 1] 880s 0.396 880s 0.860 880s 880s x1: [1, 3, 3, 1] 880s -0.123 880s -0.922 880s -0.759 880s 880s y: [2, 3, 3, 1] 880s -0.049 -0.106 880s -0.365 -0.794 880s -0.301 -0.653 880s testing: mul_mat, [1, 2, 3, 1] = [1, 1, 3, 1] * [1, 2, 3, 1] 880s x0: [1, 1, 3, 1] 880s -0.615 880s 880s x1: [1, 2, 3, 1] 880s 0.136 880s 0.019 880s 880s y: [1, 2, 3, 1] 880s -0.084 880s -0.012 880s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 2, 1, 1] 880s 0.368 880s 0.295 880s 880s x1: [1, 1, 1, 1] 880s -0.945 880s 880s y: [2, 1, 1, 1] 880s -0.348 -0.279 880s testing: mul_mat, [4, 1, 3, 1] = [2, 4, 3, 1] * [2, 1, 3, 1] 880s x0: [2, 4, 3, 1] 880s -0.513 0.294 880s 0.898 0.128 880s 0.268 -0.899 880s 0.121 -0.813 880s 880s x1: [2, 1, 3, 1] 880s -0.847 -0.755 880s 880s y: [4, 1, 3, 1] 880s 0.213 -0.857 0.452 0.511 880s testing: mul_mat, [4, 2, 3, 1] = [4, 4, 3, 1] * [4, 2, 3, 1] 880s x0: [4, 4, 3, 1] 880s -0.866 -0.720 0.882 -0.529 880s -0.624 -0.269 -0.725 0.630 880s -0.047 0.915 0.786 0.067 880s 0.948 -0.572 -0.474 0.934 880s 880s x1: [4, 2, 3, 1] 880s 0.967 -0.851 0.151 -0.639 880s 0.358 -0.356 -0.805 0.844 880s 880s y: [4, 2, 3, 1] 880s 0.247 -0.886 -0.749 0.735 880s -1.209 0.988 -0.918 1.713 880s test-mul-mat0: iter:375/500 880s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 880s x0: [4, 4, 1, 1] 880s -0.315 -0.341 -0.665 -0.541 880s -0.567 -0.102 -0.036 -0.877 880s 0.889 0.351 -0.913 -0.334 880s -0.737 -0.478 0.072 -0.758 880s 880s x1: [4, 2, 1, 1] 880s 0.205 0.845 -0.920 0.514 880s 0.141 -0.855 0.865 -0.167 880s 880s y: [4, 2, 1, 1] 880s -0.019 -0.620 1.147 -1.010 880s -0.237 0.122 -0.908 0.493 880s testing: mul_mat, [2, 4, 2, 1] = [4, 2, 2, 1] * [4, 4, 2, 1] 880s x0: [4, 2, 2, 1] 880s 0.456 0.055 -0.121 -0.612 880s -0.955 0.460 0.399 -0.451 880s 880s x1: [4, 4, 2, 1] 880s -0.109 -0.926 0.124 0.785 880s -0.264 -0.068 -0.010 -0.419 880s 0.011 -0.497 0.722 0.157 880s -0.632 -0.445 0.209 0.053 880s 880s y: [2, 4, 2, 1] 880s -0.597 -0.627 880s 0.134 0.406 880s -0.206 -0.022 880s -0.371 0.458 880s testing: mul_mat, [4, 2, 2, 3] = [4, 4, 2, 3] * [4, 2, 2, 3] 880s x0: [4, 4, 2, 3] 880s 0.431 0.078 -0.766 -0.120 880s -0.397 0.047 -0.115 0.044 880s 0.596 -0.346 0.455 0.096 880s 0.732 0.348 0.195 -0.440 880s 880s x1: [4, 2, 2, 3] 880s 0.482 -0.402 0.751 0.958 880s -0.632 -0.918 0.785 -0.775 880s 880s y: [4, 2, 2, 3] 880s -0.514 -0.255 0.860 -0.063 880s -0.853 0.084 0.224 -0.288 880s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 2, 1, 1] 880s -0.666 -0.258 -0.608 -0.911 880s -0.833 0.566 -0.349 -0.232 880s 880s x1: [4, 4, 1, 1] 880s -0.543 0.783 -0.623 -0.922 880s -0.741 0.453 -0.005 0.840 880s -0.863 -0.061 0.873 0.622 880s -0.530 0.559 0.570 0.059 880s 880s y: [2, 4, 1, 1] 880s 1.378 1.327 880s -0.386 0.681 880s -0.507 0.236 880s -0.192 0.546 880s testing: mul_mat, [3, 4, 2, 1] = [2, 3, 2, 1] * [2, 4, 2, 1] 880s x0: [2, 3, 2, 1] 880s -0.674 0.896 880s 0.325 -0.616 880s 0.098 -0.168 880s 880s x1: [2, 4, 2, 1] 880s -0.093 -0.173 880s -0.073 -0.616 880s 0.784 -0.263 880s -0.158 0.567 880s 880s y: [3, 4, 2, 1] 880s -0.092 0.076 0.020 880s -0.502 0.355 0.096 880s -0.764 0.417 0.121 880s 0.615 -0.401 -0.111 880s testing: mul_mat, [1, 2, 2, 3] = [3, 1, 2, 3] * [3, 2, 2, 3] 880s x0: [3, 1, 2, 3] 880s 0.098 -0.027 0.829 880s 880s x1: [3, 2, 2, 3] 880s 0.884 0.710 0.132 880s -0.289 -0.363 0.516 880s 880s y: [1, 2, 2, 3] 880s 0.177 880s 0.409 880s test-mul-mat0: iter:376/500 880s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 2, 1, 1] 880s 0.998 -0.714 0.059 880s 0.596 0.757 -0.843 880s 880s x1: [3, 1, 1, 1] 880s -0.436 0.036 -0.668 880s 880s y: [2, 1, 1, 1] 880s -0.500 0.331 880s testing: mul_mat, [3, 2, 3, 1] = [3, 3, 3, 1] * [3, 2, 3, 1] 880s x0: [3, 3, 3, 1] 880s -0.674 -0.020 -0.348 880s 0.362 0.312 0.107 880s 0.360 0.598 -0.834 880s 880s x1: [3, 2, 3, 1] 880s -0.432 0.586 0.318 880s -0.971 -0.307 -0.632 880s 880s y: [3, 2, 3, 1] 880s 0.168 0.060 -0.070 880s 0.881 -0.515 -0.005 880s testing: mul_mat, [1, 3, 3, 1] = [3, 1, 3, 1] * [3, 3, 3, 1] 880s x0: [3, 1, 3, 1] 880s 0.783 0.116 -0.539 880s 880s x1: [3, 3, 3, 1] 880s 0.641 0.569 0.677 880s 0.728 0.895 -0.344 880s -0.620 0.257 0.968 880s 880s y: [1, 3, 3, 1] 880s 0.203 880s 0.859 880s -0.978 880s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.546 0.247 -0.734 880s -0.673 -0.351 0.773 880s -0.328 0.123 -0.398 880s -0.974 0.108 0.891 880s 880s x1: [3, 1, 1, 1] 880s 0.532 0.837 -0.239 880s 880s y: [4, 1, 1, 1] 880s 0.091 -0.837 0.024 -0.641 880s testing: mul_mat, [4, 3, 3, 1] = [4, 4, 3, 1] * [4, 3, 3, 1] 880s x0: [4, 4, 3, 1] 880s 0.102 -0.748 -0.464 -0.744 880s 0.642 -0.606 0.149 0.945 880s 0.045 0.061 -0.534 -0.330 880s -0.495 0.440 0.440 -0.964 880s 880s x1: [4, 3, 3, 1] 880s -0.477 -0.278 -0.503 -0.803 880s -0.191 -0.249 0.980 0.925 880s 0.212 0.404 -0.391 0.349 880s 880s y: [4, 3, 3, 1] 880s 0.990 -0.971 0.495 0.666 880s -0.975 1.048 -0.852 -0.475 880s -0.359 0.163 0.128 -0.435 880s testing: mul_mat, [1, 4, 3, 1] = [4, 1, 3, 1] * [4, 4, 3, 1] 880s x0: [4, 1, 3, 1] 880s -0.258 0.925 0.888 0.167 880s 880s x1: [4, 4, 3, 1] 880s 0.892 0.100 -0.180 -0.376 880s -0.741 0.300 0.543 -0.997 880s -0.296 0.371 -0.171 0.230 880s -0.083 0.488 -0.142 -0.895 880s 880s y: [1, 4, 3, 1] 880s -0.360 880s 0.785 880s 0.306 880s 0.197 880s test-mul-mat0: iter:377/500 880s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 1, 1, 1] 880s -0.653 880s 880s 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.pnx1: [1, 1, 1, 1] 880s -0.506 880s 880s y: [1, 1, 1, 1] 880s 0.330 880s testing: mul_mat, [2, 1, 3, 1] = [1, 2, 3, 1] * [1, 1, 3, 1] 880s x0: [1, 2, 3, 1] 880s -0.632 880s -0.170 880s 880s x1: [1, 1, 3, 1] 880s 0.298 880s 880s y: [2, 1, 3, 1] 880s -0.189 -0.051 880s testing: mul_mat, [1, 2, 3, 2] = [1, 1, 3, 2] * [1, 2, 3, 2] 880s x0: [1, 1, 3, 2] 880s -0.600 880s 880s x1: [1, 2, 3, 2] 880s -0.557 880s 0.247 880s 880s y: [1, 2, 3, 2] 880s 0.334 880s -0.148 880s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 1, 1, 1] 880s 0.695 880s 880s x1: [1, 1, 1, 1] 880s 0.859 880s 880s y: [1, 1, 1, 1] 880s 0.597 880s testing: mul_mat, [2, 1, 3, 1] = [1, 2, 3, 1] * [1, 1, 3, 1] 880s x0: [1, 2, 3, 1] 880s -0.016 880s 0.733 880s 880s x1: [1, 1, 3, 1] 880s -0.726 880s 880s y: [2, 1, 3, 1] 880s 0.012 -0.533 880s testing: mul_mat, [2, 1, 3, 2] = [2, 2, 3, 2] * [2, 1, 3, 2] 880s x0: [2, 2, 3, 2] 880s 0.299 -0.474 880s -0.878 -0.867 880s 880s x1: [2, 1, 3, 2] 880s 0.775 -0.999 880s 880s y: [2, 1, 3, 2] 880s 0.705 0.186 880s test-mul-mat0: iter:378/500 880s testing: mul_mat, [1, 4, 1, 1] = [4, 1, 1, 1] * [4, 4, 1, 1] 880s x0: [4, 1, 1, 1] 880s -0.741 0.105 0.118 -0.098 880s 880s x1: [4, 4, 1, 1] 880s 0.458 0.157 0.640 -0.243 880s 0.279 -0.834 -0.110 0.798 880s -0.449 -0.796 -0.988 0.209 880s 0.673 0.393 -0.491 -0.875 880s 880s y: [1, 4, 1, 1] 880s -0.223 880s -0.386 880s 0.112 880s -0.429 880s testing: mul_mat, [3, 1, 3, 1] = [4, 3, 3, 1] * [4, 1, 3, 1] 880s x0: [4, 3, 3, 1] 880s -0.202 0.563 0.140 0.688 880s 0.362 0.691 0.892 0.373 880s -0.100 0.565 -0.234 0.409 880s 880s x1: [4, 1, 3, 1] 880s 0.859 0.730 -0.194 0.524 880s 880s y: [3, 1, 3, 1] 880s 0.571 0.838 0.587 880s testing: mul_mat, [3, 3, 3, 2] = [4, 3, 3, 2] * [4, 3, 3, 2] 880s x0: [4, 3, 3, 2] 880s -0.340 0.659 0.335 0.391 880s 0.547 -0.109 -0.911 0.890 880s -0.703 0.475 0.381 -0.285 880s 880s x1: [4, 3, 3, 2] 880s -0.032 0.140 0.275 0.868 880s -0.295 -0.959 0.277 -0.605 880s 0.315 -0.055 0.190 0.707 880s 880s y: [3, 3, 3, 2] 880s 0.535 0.489 -0.053 880s -0.676 -0.848 0.029 880s 0.197 0.634 -0.376 880s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 4, 1, 1] 880s 0.633 -0.007 -0.518 0.076 880s -0.138 -0.587 0.271 -0.936 880s -0.874 -0.188 -0.519 -0.774 880s -0.847 -0.579 0.723 -0.884 880s 880s x1: [4, 3, 1, 1] 880s 0.657 0.940 -0.709 0.269 880s -0.243 0.500 -0.845 0.847 880s -0.023 0.158 -0.409 -0.773 880s 880s y: [4, 3, 1, 1] 880s 0.798 -1.086 -0.590 -1.851 880s 0.345 -1.282 -0.098 -1.444 880s 0.138 0.523 0.801 0.316 880s testing: mul_mat, [3, 4, 3, 1] = [4, 3, 3, 1] * [4, 4, 3, 1] 880s x0: [4, 3, 3, 1] 880s 0.564 -0.234 0.239 -0.213 880s 0.109 0.848 0.067 -0.650 880s -0.362 -0.026 0.033 0.319 880s 880s x1: [4, 4, 3, 1] 880s 0.468 0.467 0.773 0.408 880s 0.758 0.041 -0.835 0.258 880s 0.196 -0.988 -0.765 -0.646 880s -0.397 -0.538 0.315 -0.764 880s 880s y: [3, 4, 3, 1] 880s 0.252 0.234 -0.026 880s 0.164 -0.106 -0.221 880s 0.297 -0.448 -0.277 880s 0.139 0.018 -0.076 880s testing: mul_mat, [1, 4, 3, 2] = [3, 1, 3, 2] * [3, 4, 3, 2] 880s x0: [3, 1, 3, 2] 880s -0.576 0.214 0.773 880s 880s x1: [3, 4, 3, 2] 880s 0.809 0.602 -0.364 880s -0.158 -0.610 -0.014 880s -0.839 -0.159 -0.912 880s 0.232 -0.271 0.349 880s 880s y: [1, 4, 3, 2] 880s -0.618 880s -0.050 880s -0.256 880s 0.078 880s test-mul-mat0: iter:379/500 880s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 3, 1, 1] 880s -0.142 -0.315 -0.880 880s -0.057 -0.333 0.674 880s -0.434 0.975 -0.226 880s 880s x1: [3, 4, 1, 1] 880s -0.001 -0.961 -0.911 880s -0.837 -0.950 -0.770 880s -0.717 0.755 0.639 880s -0.293 -0.031 0.411 880s 880s y: [3, 4, 1, 1] 880s 1.104 -0.293 -0.730 880s 1.095 -0.154 -0.388 880s -0.698 0.219 0.903 880s -0.311 0.304 0.004 880s testing: mul_mat, [2, 3, 2, 1] = [3, 2, 2, 1] * [3, 3, 2, 1] 880s x0: [3, 2, 2, 1] 880s 0.966 0.203 0.193 880s -0.176 0.888 0.313 880s 880s x1: [3, 3, 2, 1] 880s 0.683 0.424 -0.238 880s 0.969 -0.337 0.346 880s -0.328 0.153 0.834 880s 880s y: [2, 3, 2, 1] 880s 0.700 0.182 880s 0.935 -0.361 880s -0.126 0.454 880s testing: mul_mat, [2, 2, 2, 3] = [3, 2, 2, 3] * [3, 2, 2, 3] 880s x0: [3, 2, 2, 3] 880s -0.255 -0.918 -0.553 880s 0.073 0.042 -0.928 880s 880s x1: [3, 2, 2, 3] 880s -0.985 0.953 -0.478 880s 0.985 -0.384 0.868 880s 880s y: [2, 2, 2, 3] 880s -0.360 0.412 880s -0.379 -0.750 880s testing: mul_mat, [3, 2, 1, 1] = [3, 3, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 3, 1, 1] 880s -0.188 0.723 -0.758 880s -0.765 0.830 0.181 880s 0.071 0.923 0.990 880s 880s x1: [3, 2, 1, 1] 880s 0.609 -0.316 -0.813 880s 0.459 -0.804 -0.371 880s 880s y: [3, 2, 1, 1] 880s 0.273 -0.876 -1.054 880s -0.386 -1.086 -1.077 880s testing: mul_mat, [4, 3, 2, 1] = [3, 4, 2, 1] * [3, 3, 2, 1] 880s x0: [3, 4, 2, 1] 880s -0.498 0.023 -0.266 880s -0.212 -0.189 0.280 880s 0.740 0.037 -0.966 880s 0.314 -0.147 -0.276 880s 880s x1: [3, 3, 2, 1] 880s 0.356 -0.379 0.800 880s -0.176 -0.836 0.307 880s 0.168 0.796 0.127 880s 880s y: [4, 3, 2, 1] 880s -0.399 0.220 -0.524 -0.053 880s -0.013 0.281 -0.459 -0.017 880s -0.099 -0.150 0.031 -0.099 880s testing: mul_mat, [1, 3, 2, 3] = [4, 1, 2, 3] * [4, 3, 2, 3] 880s x0: [4, 1, 2, 3] 880s 0.808 0.147 -0.175 0.770 880s 880s x1: [4, 3, 2, 3] 880s -0.321 -0.590 0.961 -0.919 880s -0.284 -0.880 -0.058 0.218 880s -0.092 -0.317 -0.468 0.931 880s 880s y: [1, 3, 2, 3] 880s -1.221 880s -0.182 880s 0.678 880s test-mul-mat0: iter:380/500 880s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 4, 1, 1] 880s 0.413 0.015 0.917 880s 0.637 -0.277 0.268 880s -0.835 0.423 -0.928 880s 0.265 0.575 -0.974 880s 880s x1: [3, 3, 1, 1] 880s -0.014 0.322 -0.914 880s -0.206 -0.531 -0.089 880s -0.437 -0.818 -0.274 880s 880s y: [4, 3, 1, 1] 880s -0.840 -0.343 0.997 1.072 880s -0.174 -0.008 0.030 -0.273 880s -0.444 -0.125 0.273 -0.318 880s testing: mul_mat, [2, 4, 2, 1] = [3, 2, 2, 1] * [3, 4, 2, 1] 880s x0: [3, 2, 2, 1] 880s -0.440 -0.196 0.234 880s 0.825 -0.622 0.260 880s 880s x1: [3, 4, 2, 1] 880s 0.192 -0.686 0.235 880s -0.836 -0.141 -0.480 880s -0.480 0.371 0.920 880s 0.506 -0.307 -0.994 880s 880s y: [2, 4, 2, 1] 880s 0.105 0.646 880s 0.283 -0.726 880s 0.353 -0.388 880s -0.395 0.350 880s testing: mul_mat, [4, 2, 2, 4] = [3, 4, 2, 4] * [3, 2, 2, 4] 880s x0: [3, 4, 2, 4] 880s -0.372 -0.612 -0.316 880s 0.264 -0.953 0.405 880s 0.159 -0.278 -0.182 880s 0.549 -0.422 0.782 880s 880s x1: [3, 2, 2, 4] 880s 0.700 -0.078 -0.065 880s 0.206 0.615 -0.059 880s 880s y: [4, 2, 2, 4] 880s -0.192 0.233 0.145 0.367 880s -0.435 -0.555 -0.127 -0.192 880s testing: mul_mat, [1, 4, 1, 1] = [3, 1, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 1, 1, 1] 880s 0.549 -0.117 0.887 880s 880s x1: [3, 4, 1, 1] 880s -0.460 0.252 -0.879 880s -0.650 0.383 -0.517 880s -0.229 -0.074 0.036 880s -0.243 -0.983 0.447 880s 880s y: [1, 4, 1, 1] 880s -1.062 880s -0.860 880s -0.085 880s 0.379 880s testing: mul_mat, [3, 3, 2, 1] = [1, 3, 2, 1] * [1, 3, 2, 1] 880s x0: [1, 3, 2, 1] 880s -0.335 880s 0.904 880s -0.077 880s 880s x1: [1, 3, 2, 1] 880s -0.312 880s -0.212 880s 0.325 880s 880s y: [3, 3, 2, 1] 880s 0.104 -0.282 0.024 880s 0.071 -0.192 0.016 880s -0.109 0.294 -0.025 880s testing: mul_mat, [1, 1, 2, 4] = [3, 1, 2, 4] * [3, 1, 2, 4] 880s x0: [3, 1, 2, 4] 880s -0.694 -0.869 0.345 880s 880s x1: [3, 1, 2, 4] 880s 0.396 0.875 0.255 880s 880s y: [1, 1, 2, 4] 880s -0.948 880s test-mul-mat0: iter:381/500 880s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 3, 1, 1] 880s -0.903 880s 0.701 880s -0.160 880s 880s x1: [1, 3, 1, 1] 880s -0.213 880s 0.967 880s -0.644 880s 880s y: [3, 3, 1, 1] 880s 0.192 -0.149 0.034 880s -0.873 0.678 -0.155 880s 0.581 -0.451 0.103 880s testing: mul_mat, [2, 3, 2, 1] = [1, 2, 2, 1] * [1, 3, 2, 1] 880s x0: [1, 2, 2, 1] 880s 0.996 880s 0.592 880s 880s x1: [1, 3, 2, 1] 880s 0.936 880s 0.761 880s 0.981 880s 880s y: [2, 3, 2, 1] 880s 0.933 0.554 880s 0.758 0.450 880s 0.978 0.581 880s testing: mul_mat, [2, 2, 2, 3] = [1, 2, 2, 3] * [1, 2, 2, 3] 880s x0: [1, 2, 2, 3] 880s -0.634 880s -0.613 880s 880s x1: [1, 2, 2, 3] 880s -0.630 880s 0.252 880s 880s y: [2, 2, 2, 3] 880s 0.400 0.386 880s -0.160 -0.155 880s testing: mul_mat, [4, 2, 1, 1] = [1, 4, 1, 1] * [1, 2, 1, 1] 880s x0: [1, 4, 1, 1] 880s 0.644 880s -0.021 880s -0.999 880s -0.987 880s 880s x1: [1, 2, 1, 1] 880s -0.948 880s 0.431 880s 880s y: [4, 2, 1, 1] 880s -0.610 0.020 0.948 0.935 880s 0.277 -0.009 -0.430 -0.425 880s testing: mul_mat, [3, 1, 2, 1] = [4, 3, 2, 1] * [4, 1, 2, 1] 880s x0: [4, 3, 2, 1] 880s -0.839 -0.473 0.987 -0.714 880s 0.712 -0.900 -0.416 0.005 880s -0.766 -0.592 -0.717 0.918 880s 880s x1: [4, 1, 2, 1] 880s -0.769 0.723 -0.865 -0.849 880s 880s y: [3, 1, 2, 1] 880s 0.056 -0.843 0.001 880s testing: mul_mat, [4, 4, 2, 3] = [3, 4, 2, 3] * [3, 4, 2, 3] 880s x0: [3, 4, 2, 3] 880s -0.000 -0.184 -0.965 880s 0.092 -0.707 0.248 880s -0.470 0.414 0.499 880s 0.024 -0.258 -0.627 880s 880s x1: [3, 4, 2, 3] 880s -0.589 0.868 0.673 880s -0.057 -0.549 -0.035 880s 0.359 -0.388 -0.322 880s 0.593 0.139 -0.223 880s 880s y: [4, 4, 2, 3] 880s -0.809 -0.501 0.973 -0.660 880s 0.134 0.374 -0.218 0.162 880s 0.382 0.227 -0.490 0.311 880s 0.189 -0.099 -0.332 0.118 880s test-mul-mat0: iter:382/500 880s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 3, 1, 1] 880s -0.489 -0.380 -0.423 880s 0.766 -0.508 0.611 880s 0.047 0.260 0.751 880s 880s x1: [3, 3, 1, 1] 880s 0.650 -0.910 -0.079 880s -0.619 -0.415 -0.771 880s -0.106 0.102 -0.355 880s 880s y: [3, 3, 1, 1] 880s 0.062 0.913 -0.266 880s 0.786 -0.734 -0.715 880s 0.163 -0.350 -0.245 880s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 880s x0: [3, 4, 1, 1] 880s -0.202 -0.104 0.605 880s -0.112 0.817 0.986 880s 0.472 -0.953 -0.120 880s -0.426 -0.308 -0.564 880s 880s x1: [3, 3, 1, 1] 880s -0.118 0.948 0.812 880s 0.566 0.971 -0.464 880s 0.132 -0.423 0.422 880s 880s y: [4, 3, 1, 1] 880s 0.417 1.588 -1.057 -0.700 880s -0.496 0.272 -0.603 -0.278 880s 0.273 0.056 0.415 -0.164 880s testing: mul_mat, [1, 4, 1, 3] = [3, 1, 1, 3] * [3, 4, 1, 3] 880s x0: [3, 1, 1, 3] 880s 0.127 0.774 -0.649 880s 880s x1: [3, 4, 1, 3] 880s 0.086 0.312 0.013 880s -0.148 0.803 -0.376 880s 0.899 0.063 -0.626 880s -0.219 0.011 -0.814 880s 880s y: [1, 4, 1, 3] 880s 0.244 880s 0.847 880s 0.569 880s 0.509 880s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 1, 1, 1] 880s 0.682 -0.556 -0.795 880s 880s x1: [3, 1, 1, 1] 880s 0.733 0.325 0.403 880s 880s y: [1, 1, 1, 1] 880s -0.002 880s testing: mul_mat, [4, 3, 1, 1] = [1, 4, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 4, 1, 1] 880s -0.793 880s 0.134 880s -0.065 880s 0.803 880s 880s x1: [1, 3, 1, 1] 880s -0.073 880s 0.302 880s 0.229 880s 880s y: [4, 3, 1, 1] 880s 0.058 -0.010 0.005 -0.059 880s -0.240 0.040 -0.020 0.243 880s -0.182 0.031 -0.015 0.184 880s testing: mul_mat, [4, 1, 1, 3] = [4, 4, 1, 3] * [4, 1, 1, 3] 880s x0: [4, 4, 1, 3] 880s 0.829 0.072 -0.202 -0.838 880s 0.747 -0.646 -0.067 0.325 880s -0.049 -0.272 0.681 -0.693 880s 0.562 -0.246 -0.900 0.296 880s 880s x1: [4, 1, 1, 3] 880s 0.474 0.311 -0.289 -0.173 880s 880s y: [4, 1, 1, 3] 880s 0.619 0.117 -0.185 0.399 880s test-mul-mat0: iter:383/500 880s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 4, 1, 1] 880s -0.009 0.194 -0.808 -0.538 880s -0.813 0.545 -0.135 -0.140 880s -0.002 0.431 -0.142 -0.242 880s 0.293 0.116 -0.372 0.926 880s 880s x1: [4, 3, 1, 1] 880s -0.769 0.938 -0.066 -0.367 880s 0.715 -0.628 -0.811 0.067 880s 0.791 0.438 -0.752 -0.043 880s 880s y: [4, 3, 1, 1] 880s 0.441 1.197 0.504 -0.433 880s 0.490 -0.824 -0.174 0.501 880s 0.709 -0.297 0.304 0.522 880s testing: mul_mat, [4, 4, 3, 1] = [4, 4, 3, 1] * [4, 4, 3, 1] 880s x0: [4, 4, 3, 1] 880s 0.853 0.484 0.318 -0.918 880s 0.098 -0.892 0.743 0.570 880s 0.657 -0.302 0.159 0.834 880s 0.578 0.690 0.330 -0.385 880s 880s x1: [4, 4, 3, 1] 880s -0.056 0.319 -0.843 -0.118 880s -0.747 -0.211 -0.403 -0.375 880s -0.021 0.664 -0.584 -0.583 880s 0.912 0.373 -0.945 -0.097 880s 880s y: [4, 4, 3, 1] 880s -0.053 -0.984 -0.366 -0.045 880s -0.523 -0.399 -0.804 -0.566 880s 0.653 -1.360 -0.793 0.478 880s 0.747 -1.001 0.255 0.510 880s testing: mul_mat, [3, 4, 3, 2] = [4, 3, 3, 2] * [4, 4, 3, 2] 880s x0: [4, 3, 3, 2] 880s -0.342 -0.813 0.269 0.988 880s 0.724 0.106 0.148 0.944 880s -0.620 -0.253 0.940 0.548 880s 880s x1: [4, 4, 3, 2] 880s -0.403 0.873 -0.645 -0.950 880s 0.120 0.143 -0.706 0.590 880s 0.516 -0.300 0.337 -0.623 880s 0.837 -0.391 0.091 -0.542 880s 880s y: [3, 4, 3, 2] 880s -1.683 -1.191 -1.098 880s 0.235 0.555 -0.451 880s -0.457 -0.197 -0.269 880s -0.479 0.066 -0.631 880s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 2, 1, 1] 880s 0.427 0.913 0.894 0.883 880s 0.147 0.163 0.882 -0.598 880s 880s x1: [4, 3, 1, 1] 880s -0.600 -0.747 0.672 -0.224 880s 0.105 0.258 -0.499 0.518 880s -0.847 0.079 0.718 0.130 880s 880s y: [2, 3, 1, 1] 880s -0.535 0.516 880s 0.292 -0.693 880s 0.467 0.443 880s testing: mul_mat, [1, 4, 3, 1] = [2, 1, 3, 1] * [2, 4, 3, 1] 880s x0: [2, 1, 3, 1] 880s 0.100 -0.014 880s 880s x1: [2, 4, 3, 1] 880s -0.184 0.773 880s 0.939 -0.295 880s -0.444 0.612 880s -0.305 -0.822 880s 880s y: [1, 4, 3, 1] 880s -0.030 880s 0.098 880s -0.053 880s -0.019 880s testing: mul_mat, [2, 2, 3, 2] = [1, 2, 3, 2] * [1, 2, 3, 2] 880s x0: [1, 2, 3, 2] 880s 0.307 880s 0.756 880s 880s x1: [1, 2, 3, 2] 880s 0.799 880s -0.359 880s 880s y: [2, 2, 3, 2] 880s 0.245 0.603 880s -0.110 -0.271 880s test-mul-mat0: iter:384/500 880s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 2, 1, 1] 880s 0.163 880s 0.145 880s 880s x1: [1, 1, 1, 1] 880s -0.496 880s 880s y: [2, 1, 1, 1] 880s -0.081 -0.072 880s testing: mul_mat, [1, 2, 4, 1] = [1, 1, 4, 1] * [1, 2, 4, 1] 880s x0: [1, 1, 4, 1] 880s -0.358 880s 880s x1: [1, 2, 4, 1] 880s -0.237 880s 0.667 880s 880s y: [1, 2, 4, 1] 880s 0.085 880s -0.239 880s testing: mul_mat, [3, 1, 4, 4] = [1, 3, 4, 4] * [1, 1, 4, 4] 880s x0: [1, 3, 4, 4] 880s 0.596 880s -0.788 880s -0.906 880s 880s x1: [1, 1, 4, 4] 880s 0.934 880s 880s y: [3, 1, 4, 4] 880s 0.556 -0.735 -0.846 880s testing: mul_mat, [1, 3, 1, 1] = [1, 1, 1, 1] * [1, 3, 1, 1] 880s x0: [1, 1, 1, 1] 880s -0.858 880s 880s x1: [1, 3, 1, 1] 880s -0.521 880s 0.945 880s 0.145 880s 880s y: [1, 3, 1, 1] 880s 0.447 880s -0.811 880s -0.124 880s testing: mul_mat, [4, 1, 4, 1] = [1, 4, 4, 1] * [1, 1, 4, 1] 880s x0: [1, 4, 4, 1] 880s 0.600 880s -0.707 880s -0.584 880s 0.388 880s 880s x1: [1, 1, 4, 1] 880s -0.253 880s 880s y: [4, 1, 4, 1] 880s -0.152 0.179 0.148 -0.098 880s testing: mul_mat, [3, 1, 4, 4] = [4, 3, 4, 4] * [4, 1, 4, 4] 880s x0: [4, 3, 4, 4] 880s -0.704 0.812 -0.376 -0.032 880s -0.171 0.093 -0.051 -0.307 880s -0.702 -0.676 0.717 0.571 880s 880s x1: [4, 1, 4, 4] 880s 0.943 0.849 -0.501 0.208 880s 880s y: [3, 1, 4, 4] 880s 0.208 -0.121 -1.475 880s test-mul-mat0: iter:385/500 880s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 2, 1, 1] 880s -0.738 0.460 0.109 880s -0.823 0.513 0.359 880s 880s x1: [3, 1, 1, 1] 880s -0.380 -0.304 0.457 880s 880s y: [2, 1, 1, 1] 880s 0.190 0.321 880s testing: mul_mat, [2, 2, 4, 1] = [3, 2, 4, 1] * [3, 2, 4, 1] 880s x0: [3, 2, 4, 1] 880s -0.979 0.987 0.050 880s -0.801 0.500 -0.591 880s 880s x1: [3, 2, 4, 1] 880s 0.111 0.084 -0.085 880s -0.045 0.474 -0.387 880s 880s y: [2, 2, 4, 1] 880s -0.029 0.004 880s 0.493 0.501 880s testing: mul_mat, [2, 2, 4, 4] = [3, 2, 4, 4] * [3, 2, 4, 4] 880s x0: [3, 2, 4, 4] 880s 0.479 -0.422 0.365 880s -0.747 0.161 -0.822 880s 880s x1: [3, 2, 4, 4] 880s 0.094 -0.189 0.857 880s -0.243 -0.430 -0.616 880s 880s y: [2, 2, 4, 4] 880s 0.437 -0.805 880s -0.159 0.618 880s testing: mul_mat, [3, 2, 1, 1] = [3, 3, 1, 1] * [3, 2, 1, 1] 880s x0: [3, 3, 1, 1] 880s -1.000 0.029 0.085 880s 0.499 -0.333 0.675 880s -0.281 0.946 -0.355 880s 880s x1: [3, 2, 1, 1] 880s 0.765 0.940 0.309 880s 0.361 0.774 -0.493 880s 880s y: [3, 2, 1, 1] 880s -0.712 0.277 0.565 880s -0.381 -0.411 0.806 880s testing: mul_mat, [2, 3, 4, 1] = [3, 2, 4, 1] * [3, 3, 4, 1] 880s x0: [3, 2, 4, 1] 880s 0.033 -0.699 -0.743 880s 0.912 -0.499 0.339 880s 880s x1: [3, 3, 4, 1] 880s 0.190 0.869 0.079 880s 0.062 -0.677 0.450 880s 0.703 -0.645 0.421 880s 880s y: [2, 3, 4, 1] 880s -0.660 -0.234 880s 0.141 0.547 880s 0.161 1.106 880s testing: mul_mat, [4, 3, 4, 4] = [2, 4, 4, 4] * [2, 3, 4, 4] 880s x0: [2, 4, 4, 4] 880s -0.566 0.267 880s -0.278 -0.244 880s -0.441 -0.064 880s 0.751 -0.787 880s 880s x1: [2, 3, 4, 4] 880s -0.183 0.968 880s -0.099 -0.753 880s 0.298 -0.543 880s 880s y: [4, 3, 4, 4] 880s 0.362 -0.185 0.018 -0.899 880s -0.145 0.211 0.092 0.518 880s -0.313 0.050 -0.096 0.650 880s test-mul-mat0: iter:386/500 880s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 2, 1, 1] 880s -0.431 880s 0.104 880s 880s x1: [1, 1, 1, 1] 880s 0.769 880s 880s y: [2, 1, 1, 1] 880s -0.331 0.080 880s testing: mul_mat, [3, 2, 4, 1] = [1, 3, 4, 1] * [1, 2, 4, 1] 880s x0: [1, 3, 4, 1] 880s 0.375 880s -0.430 880s -0.613 880s 880s x1: [1, 2, 4, 1] 880s 0.998 880s -0.215 880s 880s y: [3, 2, 4, 1] 880s 0.375 -0.430 -0.612 880s -0.081 0.092 0.132 880s testing: mul_mat, [2, 3, 4, 2] = [1, 2, 4, 2] * [1, 3, 4, 2] 880s x0: [1, 2, 4, 2] 880s 0.589 880s 0.404 880s 880s x1: [1, 3, 4, 2] 880s 0.373 880s -0.130 880s 0.170 880s 880s y: [2, 3, 4, 2] 880s 0.220 0.151 880s -0.076 -0.052 880s 0.100 0.069 880s testing: mul_mat, [3, 2, 1, 1] = [1, 3, 1, 1] * [1, 2, 1, 1] 880s x0: [1, 3, 1, 1] 880s -0.315 880s 0.345 880s 0.205 880s 880s x1: [1, 2, 1, 1] 880s 0.875 880s -0.062 880s 880s y: [3, 2, 1, 1] 880s -0.276 0.302 0.180 880s 0.020 -0.021 -0.013 880s testing: mul_mat, [3, 1, 4, 1] = [3, 3, 4, 1] * [3, 1, 4, 1] 880s x0: [3, 3, 4, 1] 880s -0.372 -0.017 0.612 880s 0.485 0.935 -0.426 880s -0.166 -0.526 -0.071 880s 880s x1: [3, 1, 4, 1] 880s -0.654 -0.292 0.357 880s 880s y: [3, 1, 4, 1] 880s 0.467 -0.742 0.237 880s testing: mul_mat, [2, 3, 4, 2] = [3, 2, 4, 2] * [3, 3, 4, 2] 880s x0: [3, 2, 4, 2] 880s 0.260 0.099 0.703 880s -0.246 0.914 -0.613 880s 880s x1: [3, 3, 4, 2] 880s 0.149 0.710 -0.960 880s -0.921 -0.425 -0.991 880s 0.383 0.315 -0.084 880s 880s y: [2, 3, 4, 2] 880s -0.566 1.200 880s -0.979 0.446 880s 0.072 0.245 880s test-mul-mat0: iter:387/500 880s testing: mul_mat, [4, 2, 1, 1] = [1, 4, 1, 1] * [1, 2, 1, 1] 880s x0: [1, 4, 1, 1] 880s 0.966 880s 0.990 880s 0.784 880s -0.809 880s 880s x1: [1, 2, 1, 1] 880s -0.893 880s 0.869 880s 880s y: [4, 2, 1, 1] 880s -0.863 -0.884 -0.700 0.723 880s 0.839 0.860 0.681 -0.703 880s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 1, 1, 1] 880s -0.307 880s 880s x1: [1, 4, 1, 1] 880s -0.153 880s -0.914 880s 0.892 880s -0.035 880s 880s y: [1, 4, 1, 1] 880s 0.047 880s 0.281 880s -0.274 880s 0.011 880s testing: mul_mat, [2, 1, 1, 2] = [1, 2, 1, 2] * [1, 1, 1, 2] 880s x0: [1, 2, 1, 2] 880s 0.326 880s -0.180 880s 880s x1: [1, 1, 1, 2] 880s -0.004 880s 880s y: [2, 1, 1, 2] 880s -0.001 0.001 880s testing: mul_mat, [2, 2, 1, 1] = [1, 2, 1, 1] * [1, 2, 1, 1] 880s x0: [1, 2, 1, 1] 880s 0.752 880s -0.763 880s 880s x1: [1, 2, 1, 1] 880s 0.705 880s 0.416 880s 880s y: [2, 2, 1, 1] 880s 0.530 -0.538 880s 0.313 -0.318 880s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 2, 1, 1] 880s 0.833 -0.614 880s -0.787 0.940 880s 880s x1: [2, 1, 1, 1] 880s 0.590 0.084 880s 880s y: [2, 1, 1, 1] 880s 0.440 -0.385 880s testing: mul_mat, [3, 2, 1, 2] = [2, 3, 1, 2] * [2, 2, 1, 2] 880s x0: [2, 3, 1, 2] 880s 0.884 -0.120 880s -0.741 -0.840 880s -0.319 -0.439 880s 880s x1: [2, 2, 1, 2] 880s -0.918 0.248 880s 0.906 -0.928 880s 880s y: [3, 2, 1, 2] 880s -0.841 0.472 0.184 880s 0.912g 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 880s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 880s g 0.109 0.119 880s test-mul-mat0: iter:388/500 880s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 2, 1, 1] 880s 0.250 880s 0.816 880s 880s x1: [1, 4, 1, 1] 880s 0.521 880s -0.891 880s 0.310 880s 0.734 880s 880s y: [2, 4, 1, 1] 880s 0.130 0.426 880s -0.222 -0.727 880s 0.077 0.253 880s 0.183 0.600 880s testing: mul_mat, [2, 2, 2, 1] = [1, 2, 2, 1] * [1, 2, 2, 1] 880s x0: [1, 2, 2, 1] 880s -0.193 880s 0.892 880s 880s x1: [1, 2, 2, 1] 880s 0.744 880s 0.156 880s 880s y: [2, 2, 2, 1] 880s -0.144 0.663 880s -0.030 0.139 880s testing: mul_mat, [1, 2, 2, 4] = [1, 1, 2, 4] * [1, 2, 2, 4] 880s x0: [1, 1, 2, 4] 880s -0.931 880s 880s x1: [1, 2, 2, 4] 880s -0.849 880s 0.922 880s 880s y: [1, 2, 2, 4] 880s 0.790 880s -0.858 880s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 4, 1, 1] 880s -0.820 880s 0.223 880s -0.032 880s -0.129 880s 880s x1: [1, 1, 1, 1] 880s 0.332 880s 880s y: [4, 1, 1, 1] 880s -0.272 0.074 -0.011 -0.043 880s testing: mul_mat, [1, 1, 2, 1] = [4, 1, 2, 1] * [4, 1, 2, 1] 880s x0: [4, 1, 2, 1] 880s -0.048 -0.590 -0.103 0.365 880s 880s x1: [4, 1, 2, 1] 880s 0.374 -0.111 -0.558 0.685 880s 880s y: [1, 1, 2, 1] 880s 0.354 880s testing: mul_mat, [1, 4, 2, 4] = [1, 1, 2, 4] * [1, 4, 2, 4] 880s x0: [1, 1, 2, 4] 880s 0.599 880s 880s x1: [1, 4, 2, 4] 880s 0.726 880s -0.032 880s 0.750 880s 0.045 880s 880s y: [1, 4, 2, 4] 880s 0.435 880s -0.019 880s 0.449 880s 0.027 880s test-mul-mat0: iter:389/500 880s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 3, 1, 1] 880s -0.867 -0.582 0.029 0.983 880s 0.852 -0.492 0.581 -0.668 880s -0.559 0.487 -0.667 -0.801 880s 880s x1: [4, 3, 1, 1] 880s -0.359 -0.434 0.964 -0.438 880s 0.043 0.503 0.458 -0.074 880s -0.342 0.305 0.221 -0.291 880s 880s y: [3, 3, 1, 1] 880s 0.161 0.760 -0.303 880s -0.389 0.104 -0.025 880s -0.160 -0.120 0.425 880s testing: mul_mat, [4, 3, 4, 1] = [4, 4, 4, 1] * [4, 3, 4, 1] 880s x0: [4, 4, 4, 1] 880s -0.418 -0.435 0.357 0.532 880s 0.479 0.071 -0.011 0.974 880s 0.117 -0.113 0.956 0.495 880s -0.016 -0.032 0.453 0.047 880s 880s x1: [4, 3, 4, 1] 880s 0.030 0.580 0.965 0.436 880s 0.042 -0.391 -0.923 0.608 880s -0.428 -0.361 -0.349 -0.924 880s 880s y: [4, 3, 4, 1] 880s 0.312 0.470 1.076 0.439 880s 0.146 0.595 -0.532 -0.378 880s -0.281 -1.127 -0.800 -0.184 880s testing: mul_mat, [3, 4, 4, 4] = [4, 3, 4, 4] * [4, 4, 4, 4] 880s x0: [4, 3, 4, 4] 880s 0.410 0.694 -0.258 0.495 880s -0.162 0.405 -0.200 -0.326 880s -0.890 -0.379 -0.315 -0.018 880s 880s x1: [4, 4, 4, 4] 880s 0.362 -0.663 -0.692 0.621 880s -0.533 0.119 -0.004 0.484 880s 0.887 0.867 0.485 0.623 880s -0.810 -0.467 -0.226 -0.032 880s 880s y: [3, 4, 4, 4] 880s 0.175 -0.391 0.136 880s 0.105 -0.023 0.422 880s 1.148 -0.093 -1.282 880s -0.613 -0.002 0.969 880s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 880s x0: [4, 2, 1, 1] 880s -0.067 -0.944 0.300 0.723 880s -0.811 0.032 -0.497 -0.092 880s 880s x1: [4, 3, 1, 1] 880s 0.127 -0.892 0.843 0.267 880s -0.777 -0.225 0.465 0.568 880s 0.049 -0.553 0.746 0.097 880s 880s y: [2, 3, 1, 1] 880s 1.280 -0.575 880s 0.815 0.340 880s 0.813 -0.437 880s testing: mul_mat, [2, 4, 4, 1] = [2, 2, 4, 1] * [2, 4, 4, 1] 880s x0: [2, 2, 4, 1] 880s 0.285 -0.251 880s 0.788 0.992 880s 880s x1: [2, 4, 4, 1] 880s -0.060 0.195 880s -0.610 -0.075 880s -0.004 -0.293 880s 0.852 -0.531 880s 880s y: [2, 4, 4, 1] 880s -0.066 0.146 880s -0.155 -0.556 880s 0.072 -0.294 880s 0.376 0.144 880s testing: mul_mat, [4, 2, 4, 4] = [2, 4, 4, 4] * [2, 2, 4, 4] 880s x0: [2, 4, 4, 4] 880s 0.814 -0.377 880s -0.376 -0.703 880s -0.589 -0.961 880s -0.008 -0.889 880s 880s x1: [2, 2, 4, 4] 880s -0.301 -0.221 880s -0.900 0.196 880s 880s y: [4, 2, 4, 4] 880s -0.162 0.268 0.390 0.199 880s -0.806 0.201 0.342 -0.167 880s test-mul-mat0: iter:390/500 880s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 880s x0: [3, 1, 1, 1] 880s 0.809 0.204 -0.024 880s 880s x1: [3, 1, 1, 1] 880s 0.417 0.205 -0.163 880s 880s y: [1, 1, 1, 1] 880s 0.383 880s testing: mul_mat, [4, 1, 4, 1] = [3, 4, 4, 1] * [3, 1, 4, 1] 880s x0: [3, 4, 4, 1] 880s -0.589 -0.104 -0.455 880s 0.285 0.205 -0.702 880s -0.869 -0.983 -0.736 880s -0.997 0.187 0.681 880s 880s x1: [3, 1, 4, 1] 880s -0.740 0.349 -0.035 880s 880s y: [4, 1, 4, 1] 880s 0.415 -0.115 0.326 0.779 880s testing: mul_mat, [4, 4, 4, 3] = [3, 4, 4, 3] * [3, 4, 4, 3] 880s x0: [3, 4, 4, 3] 880s 0.665 0.655 -0.936 880s -0.656 -0.970 -0.651 880s 0.612 0.768 -0.899 880s 0.362 -0.641 -0.054 880s 880s x1: [3, 4, 4, 3] 880s 0.474 -0.926 0.088 880s 0.003 0.253 -0.376 880s -0.587 -0.755 0.505 880s 0.623 -0.075 -0.753 880s 880s y: [4, 4, 4, 3] 880s -0.373 0.530 -0.500 0.761 880s 0.520 -0.002 0.534 -0.141 880s -1.358 0.789 -1.393 0.245 880s 1.070 0.154 1.000 0.314 880s testing: mul_mat, [1, 4, 1, 1] = [3, 1, 1, 1] * [3, 4, 1, 1] 880s x0: [3, 1, 1, 1] 880s 0.491 0.419 -0.646 880s 880s x1: [3, 4, 1, 1] 880s -0.158 0.019 0.886 880s 0.462 0.425 -0.658 880s -0.994 0.927 -0.867 880s 0.272 -0.011 0.702 880s 880s y: [1, 4, 1, 1] 880s -0.642 880s 0.830 880s 0.460 880s -0.325 880s testing: mul_mat, [1, 3, 4, 1] = [1, 1, 4, 1] * [1, 3, 4, 1] 880s x0: [1, 1, 4, 1] 880s 0.029 880s 880s x1: [1, 3, 4, 1] 880s 0.086 880s 0.192 880s -0.216 880s 880s y: [1, 3, 4, 1] 880s 0.002 880s 0.006 880s -0.006 880s testing: mul_mat, [3, 1, 4, 3] = [1, 3, 4, 3] * [1, 1, 4, 3] 880s x0: [1, 3, 4, 3] 880s -0.747 880s -0.034 880s 0.603 880s 880s x1: [1, 1, 4, 3] 880s -0.184 880s 880s y: [3, 1, 4, 3] 880s 0.137 0.006 -0.111 880s test-mul-mat0: iter:391/500 880s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 880s x0: [1, 4, 1, 1] 880s 0.333 880s -0.710 880s 0.908 880s 0.470 880s 880s x1: [1, 1, 1, 1] 880s 0.386 880s 880s y: [4, 1, 1, 1] 880s 0.129 -0.274 0.351 0.181 880s testing: mul_mat, [2, 4, 2, 1] = [1, 2, 2, 1] * [1, 4, 2, 1] 880s x0: [1, 2, 2, 1] 880s 0.680 880s -0.986 880s 880s x1: [1, 4, 2, 1] 880s -0.876 880s 0.616 880s 0.655 880s -0.226 880s 880s y: [2, 4, 2, 1] 880s -0.596 0.864 880s 0.419 -0.607 880s 0.445 -0.645 880s -0.154 0.223 880s testing: mul_mat, [4, 2, 2, 2] = [1, 4, 2, 2] * [1, 2, 2, 2] 880s x0: [1, 4, 2, 2] 880s -0.289 880s 0.244 880s -0.850 880s -0.903 880s 880s x1: [1, 2, 2, 2] 880s 0.898 880s 0.140 880s 880s y: [4, 2, 2, 2] 880s -0.259 0.219 -0.763 -0.811 880s -0.040 0.034 -0.119 -0.126 880s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 880s x0: [1, 2, 1, 1] 880s 0.316 880s 0.436 880s 880s x1: [1, 4, 1, 1] 880s 0.524 880s -0.712 880s 0.545 880s 0.538 880s 880s y: [2, 4, 1, 1] 880s 0.166 0.228 880s -0.225 -0.310 880s 0.172 0.237 880s 0.170 0.234 880s testing: mul_mat, [3, 1, 2, 1] = [2, 3, 2, 1] * [2, 1, 2, 1] 880s x0: [2, 3, 2, 1] 880s -0.323 0.388 880s 0.655 -0.100 880s 0.897 -0.953 880s 880s x1: [2, 1, 2, 1] 880s 0.241 0.591 880s 880s y: [3, 1, 2, 1] 880s 0.151 0.099 -0.347 880s testing: mul_mat, [1, 2, 2, 2] = [3, 1, 2, 2] * [3, 2, 2, 2] 880s x0: [3, 1, 2, 2] 880s -0.713 0.084 0.430 880s 880s x1: [3, 2, 2, 2] 880s -0.619 0.279 0.092 880s 0.665 -0.227 0.012 880s 880s y: [1, 2, 2, 2] 880s 0.504 880s -0.488 880s test-mul-mat0: iter:392/500 880s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 3, 1, 1] 880s 0.974 0.361 880s 0.180 -0.287 880s -0.325 0.134 880s 880s x1: [2, 1, 1, 1] 880s -0.295 -0.281 880s 880s y: [3, 1, 1, 1] 880s -0.389 0.028 0.058 880s testing: mul_mat, [4, 3, 2, 1] = [2, 4, 2, 1] * [2, 3, 2, 1] 880s x0: [2, 4, 2, 1] 880s -0.076 0.234 880s -0.879 0.499 880s 0.252 0.855 880s -0.631 -0.430 880s 880s x1: [2, 3, 2, 1] 880s -0.871 0.645 880s -0.015 -0.068 880s 0.514 -0.116 880s 880s y: [4, 3, 2, 1] 880s 0.217 1.087 0.331 0.272 880s -0.015 -0.021 -0.062 0.039 880s -0.066 -0.510 0.030 -0.274 880s testing: mul_mat, [1, 4, 2, 1] = [2, 1, 2, 1] * [2, 4, 2, 1] 880s x0: [2, 1, 2, 1] 880s 0.458 -0.374 880s 880s x1: [2, 4, 2, 1] 880s 0.324 -0.826 880s 0.891 -0.031 880s 0.159 -0.178 880s -0.517 -0.958 880s 880s y: [1, 4, 2, 1] 880s 0.458 880s 0.419 880s 0.139 880s 0.122 880s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 880s x0: [2, 2, 1, 1] 880s -0.930 0.835 880s -0.071 0.118 880s 880s x1: [2, 1, 1, 1] 880s -0.005 0.640 880s 880s y: [2, 1, 1, 1] 880s 0.539 0.076 880s testing: mul_mat, [1, 2, 2, 1] = [2, 1, 2, 1] * [2, 2, 2, 1] 880s x0: [2, 1, 2, 1] 880s -0.645 -0.750 880s 880s x1: [2, 2, 2, 1] 880s 0.578 -0.125 880s -0.120 -0.097 880s 880s y: [1, 2, 2, 1] 880s -0.279 880s 0.151 880s testing: mul_mat, [3, 2, 2, 1] = [1, 3, 2, 1] * [1, 2, 2, 1] 880s x0: [1, 3, 2, 1] 880s -0.232 880s 0.006 880s 0.848 880s 880s x1: [1, 2, 2, 1] 880s -0.394 880s 0.054 880s 880s y: [3, 2, 2, 1] 880s 0.092 -0.002 -0.334 880s -0.013 0.000 0.046 880s test-mul-mat0: iter:393/500 880s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 880s x0: [2, 2, 1, 1] 880s -0.004 0.103 880s -0.180 0.352 880s 880s x1: [2, 4, 1, 1] 880s -0.356 -0.029 880s 0.577 -0.778 880s 0.847 -0.543 880s 0.125 -0.104 880s 880s y: [2, 4, 1, 1] 880s -0.002 0.054 880s -0.082 -0.377 880s -0.059 -0.344 880s -0.011 -0.059 880s testing: mul_mat, [2, 2, 3, 1] = [2, 2, 3, 1] * [2, 2, 3, 1] 880s x0: [2, 2, 3, 1] 880s -0.092 -0.021 880s 0.946 -0.956 880s 880s x1: [2, 2, 3, 1] 880s 0.353 -0.546 880s -0.212 0.959 880s 880s y: [2, 2, 3, 1] 880s -0.021 0.856 880s -0.000 -1.117 880s testing: mul_mat, [2, 2, 3, 1] = [2, 2, 3, 1] * [2, 2, 3, 1] 880s x0: [2, 2, 3, 1] 881s 0.380 -0.302 881s 0.477 0.127 881s 881s x1: [2, 2, 3, 1] 881s 0.758 -0.302 881s -0.067 -0.139 881s 881s y: [2, 2, 3, 1] 881s 0.379 0.323 881s 0.017 -0.049 881s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 881s x0: [2, 3, 1, 1] 881s -0.007 0.927 881s 0.714 -0.424 881s -0.110 -0.592 881s 881s x1: [2, 2, 1, 1] 881s 0.094 -0.280 881s 0.056 0.956 881s 881s y: [3, 2, 1, 1] 881s -0.260 0.186 0.155 881s 0.886 -0.366 -0.572 881s testing: mul_mat, [3, 2, 3, 1] = [3, 3, 3, 1] * [3, 2, 3, 1] 881s x0: [3, 3, 3, 1] 881s 0.370 -0.574 0.419 881s 0.577 0.534 0.248 881s 0.465 0.656 -0.454 881s 881s x1: [3, 2, 3, 1] 881s -0.788 -0.209 -0.619 881s 0.285 -0.035 0.862 881s 881s y: [3, 2, 3, 1] 881s -0.431 -0.720 -0.223 881s 0.487 0.359 -0.282 881s testing: mul_mat, [1, 3, 3, 1] = [3, 1, 3, 1] * [3, 3, 3, 1] 881s x0: [3, 1, 3, 1] 881s 0.820 -0.324 0.288 881s 881s x1: [3, 3, 3, 1] 881s -0.074 -0.769 0.696 881s 0.671 0.601 0.273 881s 0.136 -0.973 -0.194 881s 881s y: [1, 3, 3, 1] 881s 0.389 881s 0.434 881s 0.371 881s test-mul-mat0: iter:394/500 881s testing: mul_mat, [3, 4, 1, 1] = [4, 3, 1, 1] * [4, 4, 1, 1] 881s x0: [4, 3, 1, 1] 881s 0.775 -0.111 0.746 0.063 881s 0.635 -0.347 -0.953 -0.948 881s 0.160 -0.697 0.604 0.693 881s 881s x1: [4, 4, 1, 1] 881s 0.980 0.117 0.683 -0.228 881s -0.005 0.977 0.122 0.488 881s 0.606 -0.686 0.560 -0.567 881s 0.135 0.499 0.699 0.069 881s 881s y: [3, 4, 1, 1] 881s 1.242 0.147 0.330 881s 0.010 -0.921 -0.270 881s 0.928 0.627 0.520 881s 0.575 -0.820 0.144 881s testing: mul_mat, [2, 3, 4, 1] = [4, 2, 4, 1] * [4, 3, 4, 1] 881s x0: [4, 2, 4, 1] 881s -0.528 -0.214 0.989 0.314 881s -0.029 -0.403 -0.191 0.203 881s 881s x1: [4, 3, 4, 1] 881s 0.366 0.949 0.673 -0.517 881s 0.632 -0.554 0.478 0.609 881s 0.567 -0.035 0.215 0.881 881s 881s y: [2, 3, 4, 1] 881s 0.107 -0.626 881s 0.448 0.237 881s 0.197 0.135 881s testing: mul_mat, [2, 2, 4, 2] = [4, 2, 4, 2] * [4, 2, 4, 2] 881s x0: [4, 2, 4, 2] 881s 0.746 -0.121 -0.587 0.670 881s 0.243 0.210 0.983 0.165 881s 881s x1: [4, 2, 4, 2] 881s -0.523 -0.821 0.223 0.448 881s -0.224 -0.968 -0.349 -0.965 881s 881s y: [2, 2, 4, 2] 881s -0.122 -0.006 881s -0.492 -0.759 881s testing: mul_mat, [2, 2, 1, 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 881s x0: [4, 2, 1, 1] 881s -0.224 -0.093 0.693 -0.398 881s 0.670 0.865 -0.190 0.703 881s 881s x1: [4, 2, 1, 1] 881s 0.567 -0.572 -0.973 0.580 881s -0.610 -0.427 -0.096 -0.325 881s 881s y: [2, 2, 1, 1] 881s -0.980 0.478 881s 0.239 -0.988 881s testing: mul_mat, [2, 4, 4, 1] = [2, 2, 4, 1] * [2, 4, 4, 1] 881s x0: [2, 2, 4, 1] 881s -0.585 -0.727 881s 0.370 -0.135 881s 881s x1: [2, 4, 4, 1] 881s -0.148 0.428 881s 0.357 -0.663 881s 0.966 -0.550 881s 0.912 0.213 881s 881s y: [2, 4, 4, 1] 881s -0.225 -0.112 881s 0.273 0.222 881s -0.165 0.432 881s -0.688 0.309 881s testing: mul_mat, [2, 2, 4, 2] = [2, 2, 4, 2] * [2, 2, 4, 2] 881s x0: [2, 2, 4, 2] 881s -0.236 0.484 881s -0.313 0.974 881s 881s x1: [2, 2, 4, 2] 881s -0.927 -0.075 881s -0.203 -0.935 881s 881s y: [2, 2, 4, 2] 881s 0.182 0.217 881s -0.404 -0.848 881s test-mul-mat0: iter:395/500 881s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 881s x0: [3, 4, 1, 1] 881s -0.593 -0.521 -0.413 881s 0.859 -0.912 0.727 881s 0.745 -0.197 -0.172 881s 0.269 0.900 0.721 881s 881s x1: [3, 2, 1, 1] 881s 0.725 -0.927 -0.343 881s -0.087 0.903 -0.709 881s 881s y: [4, 2, 1, 1] 881s 0.194 1.219 0.781 -0.886 881s -0.126 -1.415 -0.121 0.278 881s testing: mul_mat, [2, 4, 3, 1] = [3, 2, 3, 1] * [3, 4, 3, 1] 881s x0: [3, 2, 3, 1] 881s 0.986 0.150 0.321 881s 0.367 -0.926 -0.541 881s 881s x1: [3, 4, 3, 1] 881s -0.998 0.805 0.162 881s 0.904 0.677 0.225 881s 0.900 -0.687 0.975 881s 0.381 -0.076 0.138 881s 881s y: [2, 4, 3, 1] 881s -0.812 -1.199 881s 1.066 -0.417 881s 1.098 0.439 881s 0.408 0.135 881s testing: mul_mat, [4, 2, 3, 4] = [3, 4, 3, 4] * [3, 2, 3, 4] 881s x0: [3, 4, 3, 4] 881s -0.295 -0.026 0.382 881s 0.675 0.400 0.098 881s -0.764 -0.056 -0.557 881s 0.459 0.862 0.907 881s 881s x1: [3, 2, 3, 4] 881s 0.676 0.356 0.013 881s -0.086 0.536 -0.631 881s 881s y: [4, 2, 3, 4] 881s -0.203 0.600 -0.544 0.629 881s -0.229 0.094 0.388 -0.150 881s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 881s x0: [3, 2, 1, 1] 881s -0.696 -0.017 0.722 881s 0.592 -0.065 0.517 881s 881s x1: [3, 4, 1, 1] 881s 0.202 0.131 0.885 881s 0.847 -0.564 0.187 881s 0.774 0.481 0.505 881s -0.044 0.544 -0.453 881s 881s y: [2, 4, 1, 1] 881s 0.496 0.569 881s -0.445 0.635 881s -0.182 0.688 881s -0.306 -0.296 881s testing: mul_mat, [1, 3, 3, 1] = [2, 1, 3, 1] * [2, 3, 3, 1] 881s x0: [2, 1, 3, 1] 881s -0.571 -0.251 881s 881s x1: [2, 3, 3, 1] 881s -0.452 0.284 881s 0.492 -0.898 881s 0.206 -0.876 881s 881s y: [1, 3, 3, 1] 881s 0.187 881s -0.055 881s 0.102 881s testing: mul_mat, [4, 2, 3, 4] = [1, 4, 3, 4] * [1, 2, 3, 4] 881s x0: [1, 4, 3, 4] 881s -0.546 881s 0.879 881s -0.391 881s -0.573 881s 881s x1: [1, 2, 3, 4] 881s -0.722 881s -0.112 881s 881s y: [4, 2, 3, 4] 881s 0.395 -0.635 0.283 0.414 881s 0.061 -0.098 0.044 0.064 881s test-mul-mat0: iter:396/500 881s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 881s x0: [3, 3, 1, 1] 881s -0.880 0.703 -0.010 881s -0.780 0.274 0.841 881s -0.762 0.520 -0.763 881s 881s x1: [3, 4, 1, 1] 881s -0.992 -0.251 -0.913 881s 0.810 0.240 0.410 881s -0.749 0.755 0.862 881s -0.162 -0.276 -0.805 881s 881s y: [3, 4, 1, 1] 881s 0.706 -0.063 1.322 881s -0.548 -0.221 -0.805 881s gml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s 1.181 1.516 0.305 881s -0.044 -0.627 0.594 881s testing: mul_mat, [1, 3, 3, 1] = [3, 1, 3, 1] * [3, 3, 3, 1] 881s x0: [3, 1, 3, 1] 881s 0.198 0.063 -0.710 881s 881s x1: [3, 3, 3, 1] 881s 0.933 -0.666 0.322 881s 0.168 -0.183 -0.638 881s -0.720 -0.848 0.012 881s 881s y: [1, 3, 3, 1] 881s -0.086 881s 0.475 881s -0.204 881s testing: mul_mat, [4, 1, 3, 1] = [3, 4, 3, 1] * [3, 1, 3, 1] 881s x0: [3, 4, 3, 1] 881s 0.303 -0.036 -0.499 881s 0.922 0.530 0.722 881s -0.338 0.799 0.934 881s 0.621 0.342 0.986 881s 881s x1: [3, 1, 3, 1] 881s -0.421 -0.634 -0.015 881s 881s y: [4, 1, 3, 1] 881s -0.097 -0.735 -0.378 -0.493 881s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 881s x0: [3, 2, 1, 1] 881s 0.580 -0.861 0.761 881s 0.380 -0.592 -0.015 881s 881s x1: [3, 4, 1, 1] 881s -0.804 -0.059 0.817 881s -0.871 -0.437 0.159 881s -0.884 0.381 0.563 881s -0.608 0.331 0.365 881s 881s y: [2, 4, 1, 1] 881s 0.205 -0.283 881s -0.008 -0.074 881s -0.413 -0.570 881s -0.361 -0.433 881s testing: mul_mat, [4, 3, 3, 1] = [2, 4, 3, 1] * [2, 3, 3, 1] 881s x0: [2, 4, 3, 1] 881s 0.985 0.845 881s -0.519 -0.679 881s -0.958 -0.103 881s 0.316 0.225 881s 881s x1: [2, 3, 3, 1] 881s -0.897 0.408 881s 0.915 -0.237 881s -0.537 0.438 881s 881s y: [4, 3, 3, 1] 881s -0.538 0.188 0.816 -0.192 881s 0.701 -0.314 -0.851 0.236 881s -0.159 -0.018 0.469 -0.071 881s testing: mul_mat, [2, 2, 3, 1] = [4, 2, 3, 1] * [4, 2, 3, 1] 881s x0: [4, 2, 3, 1] 881s 0.414 0.857 0.584 0.055 881s -0.193 0.532 -0.478 -0.597 881s 881s x1: [4, 2, 3, 1] 881s 0.523 -0.181 -0.576 0.058 881s 0.423 0.341 -0.291 0.408 881s 881s y: [2, 2, 3, 1] 881s -0.272 0.044 881s 0.320 -0.004 881s test-mul-mat0: iter:397/500 881s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 881s x0: [1, 1, 1, 1] 881s -0.374 881s 881s x1: [1, 1, 1, 1] 881s 0.349 881s 881s y: [1, 1, 1, 1] 881s -0.130 881s testing: mul_mat, [1, 1, 2, 1] = [1, 1, 2, 1] * [1, 1, 2, 1] 881s x0: [1, 1, 2, 1] 881s -0.705 881s 881s x1: [1, 1, 2, 1] 881s -0.237 881s 881s y: [1, 1, 2, 1] 881s 0.167 881s testing: mul_mat, [3, 1, 2, 2] = [1, 3, 2, 2] * [1, 1, 2, 2] 881s x0: [1, 3, 2, 2] 881s -0.431 881s -0.764 881s -0.717 881s 881s x1: [1, 1, 2, 2] 881s 0.005 881s 881s y: [3, 1, 2, 2] 881s -0.002 -0.004 -0.004 881s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 881s x0: [1, 3, 1, 1] 881s -0.847 881s 0.322 881s -0.956 881s 881s x1: [1, 3, 1, 1] 881s 0.852 881s 0.334 881s 0.830 881s 881s y: [3, 3, 1, 1] 881s -0.721 0.274 -0.815 881s -0.283 0.108 -0.320 881s -0.703 0.267 -0.794 881s testing: mul_mat, [3, 1, 2, 1] = [3, 3, 2, 1] * [3, 1, 2, 1] 881s x0: [3, 3, 2, 1] 881s 0.437 0.387 0.623 881s -0.548 0.915 -0.802 881s 0.940 -0.491 -0.686 881s 881s x1: [3, 1, 2, 1] 881s -0.898 0.948 -0.193 881s 881s y: [3, 1, 2, 1] 881s -0.146 1.514 -1.177 881s testing: mul_mat, [2, 3, 2, 2] = [3, 2, 2, 2] * [3, 3, 2, 2] 881s x0: [3, 2, 2, 2] 881s 0.175 -0.149 -0.149 881s -0.910 -0.503 -0.229 881s 881s x1: [3, 3, 2, 2] 881s -0.790 0.059 0.968 881s -0.637 -0.619 -0.989 881s -0.535 -0.671 -0.182 881s 881s y: [2, 3, 2, 2] 881s -0.292 0.468 881s 0.128 1.117 881s 0.033 0.866 881s test-mul-mat0: iter:398/500 881s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 881s x0: [4, 2, 1, 1] 881s 0.628 -0.830 -0.915 0.470 881s -0.914 0.811 -0.513 -0.459 881s 881s x1: [4, 3, 1, 1] 881s 0.476 0.005 0.974 -0.348 881s 0.095 -0.176 0.149 0.946 881s 0.596 -0.835 -0.005 -0.261 881s 881s y: [2, 3, 1, 1] 881s -0.760 -0.771 881s 0.513 -0.740 881s 0.949 -1.099 881s testing: mul_mat, [2, 2, 2, 1] = [4, 2, 2, 1] * [4, 2, 2, 1] 881s x0: [4, 2, 2, 1] 881s 0.243 0.017 0.392 -0.762 881s 0.756 0.033 0.866 0.927 881s 881s x1: [4, 2, 2, 1] 881s -0.104 -0.234 -0.914 -0.238 881s 0.184 0.952 -0.318 0.950 881s 881s y: [2, 2, 2, 1] 881s -0.206 -1.097 881s -0.788 0.775 881s testing: mul_mat, [2, 2, 2, 3] = [4, 2, 2, 3] * [4, 2, 2, 3] 881s x0: [4, 2, 2, 3] 881s -0.222 -0.487 0.426 0.413 881s 0.124 0.421 0.384 0.748 881s 881s x1: [4, 2, 2, 3] 881s -0.360 -0.412 0.773 -0.408 881s 0.270 0.722 0.789 0.893 881s 881s y: [2, 2, 2, 3] 881s 0.441 -0.227 881s 0.294 1.308 881s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 881s x0: [4, 1, 1, 1] 881s 0.384 0.692 -0.610 -0.355 881s 881s x1: [4, 2, 1, 1] 881s 0.880 -0.360 -0.189 0.420 881s 0.255 -0.651 0.951 -0.878 881s 881s y: [1, 2, 1, 1] 881s 0.054 881s -0.621 881s testing: mul_mat, [3, 4, 2, 1] = [1, 3, 2, 1] * [1, 4, 2, 1] 881s x0: [1, 3, 2, 1] 881s 0.632 881s 0.787 881s 0.535 881s 881s x1: [1, 4, 2, 1] 881s 0.811 881s -0.807 881s -0.581 881s 0.355 881s 881s y: [3, 4, 2, 1] 881s 0.513 0.638 0.434 881s -0.510 -0.635 -0.432 881s -0.367 -0.457 -0.311 881s 0.225 0.280 0.190 881s testing: mul_mat, [1, 1, 2, 3] = [3, 1, 2, 3] * [3, 1, 2, 3] 881s x0: [3, 1, 2, 3] 881s 0.048 0.829 0.894 881s 881s x1: [3, 1, 2, 3] 881s -0.123 -0.827 0.311 881s 881s y: [1, 1, 2, 3] 881s -0.413 881s test-mul-mat0: iter:399/500 881s testiggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.pngng: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 881s x0: [3, 2, 1, 1] 881s 0.207 -0.350 0.566 881s 0.626 0.227 -0.387 881s 881s x1: [3, 1, 1, 1] 881s 0.992 -0.019 -0.507 881s 881s y: [2, 1, 1, 1] 881s -0.075 0.813 881s testing: mul_mat, [4, 2, 4, 1] = [3, 4, 4, 1] * [3, 2, 4, 1] 881s x0: [3, 4, 4, 1] 881s -0.826 0.587 0.558 881s 0.800 -0.187 -0.829 881s 0.255 0.934 -0.846 881s -0.451 -0.086 0.137 881s 881s x1: [3, 2, 4, 1] 881s 0.455 0.121 0.982 881s 0.293 -0.021 -0.017 881s 881s y: [4, 2, 4, 1] 881s 0.243 -0.472 -0.602 -0.081 881s -0.264 0.253 0.070 -0.133 881s testing: mul_mat, [4, 4, 4, 4] = [3, 4, 4, 4] * [3, 4, 4, 4] 881s x0: [3, 4, 4, 4] 881s 0.482 -0.937 0.937 881s 0.523 -0.117 0.738 881s -0.098 -0.534 -0.616 881s -0.741 -0.531 -0.592 881s 881s x1: [3, 4, 4, 4] 881s 0.568 -0.032 0.057 881s 0.279 0.020 -0.036 881s -0.547 0.200 -0.144 881s -0.929 0.499 0.715 881s 881s y: [4, 4, 4, 4] 881s 0.357 0.343 -0.074 -0.438 881s 0.082 0.117 -0.016 -0.196 881s -0.586 -0.416 0.036 0.385 881s -0.246 -0.017 -0.616 0.000 881s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 881s x0: [3, 2, 1, 1] 881s 0.276 0.701 0.647 881s 0.020 0.265 0.494 881s 881s x1: [3, 4, 1, 1] 881s -0.198 0.217 0.329 881s -0.546 0.522 0.885 881s 0.771 0.708 0.249 881s -0.947 0.551 -0.295 881s 881s y: [2, 4, 1, 1] 881s 0.310 0.216 881s 0.788 0.565 881s 0.870 0.326 881s -0.066 -0.019 881s testing: mul_mat, [4, 3, 4, 1] = [2, 4, 4, 1] * [2, 3, 4, 1] 881s x0: [2, 4, 4, 1] 881s -0.308 -0.521 881s -0.168 0.233 881s 0.739 -0.686 881s 0.957 -0.243 881s 881s x1: [2, 3, 4, 1] 881s -0.643 -0.721 881s -0.914 -0.306 881s 0.004 -0.248 881s 881s y: [4, 3, 4, 1] 881s 0.574 -0.060 0.020 -0.440 881s 0.441 0.083 -0.466 -0.801 881s 0.128 -0.058 0.173 0.064 881s testing: mul_mat, [3, 2, 4, 4] = [4, 3, 4, 4] * [4, 2, 4, 4] 881s x0: [4, 3, 4, 4] 881s 0.601 0.559 -0.083 -0.871 881s 0.980 -0.091 -0.342 -0.905 881s 0.225 0.602 -0.394 -0.889 881s 881s x1: [4, 2, 4, 4] 881s 0.050 0.549 0.012 0.528 881s -0.218 0.326 -0.715 -0.899 881s 881s y: [3, 2, 4, 4] 881s -0.124 -0.484 -0.132 881s 0.893 0.815 1.228 881s test-mul-mat0: iter:400/500 881s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 881s x0: [1, 3, 1, 1] 881s 0.595 881s 0.317 881s -0.969 881s 881s x1: [1, 3, 1, 1] 881s -0.271 881s 0.259 881s -0.567 881s 881s y: [3, 3, 1, 1] 881s -0.161 -0.086 0.262 881s 0.154 0.082 -0.251 881s -0.337 -0.180 0.549 881s testing: mul_mat, [4, 3, 3, 1] = [1, 4, 3, 1] * [1, 3, 3, 1] 881s x0: [1, 4, 3, 1] 881s 0.697 881s 0.547 881s 0.354 881s 0.032 881s 881s x1: [1, 3, 3, 1] 881s 0.397 881s 0.128 881s 0.301 881s 881s y: [4, 3, 3, 1] 881s 0.277 0.217 0.140 0.013 881s 0.089 0.070 0.045 0.004 881s 0.210 0.165 0.106 0.010 881s testing: mul_mat, [4, 4, 3, 3] = [1, 4, 3, 3] * [1, 4, 3, 3] 881s x0: [1, 4, 3, 3] 881s 0.112 881s 0.966 881s 0.690 881s -0.483 881s 881s x1: [1, 4, 3, 3] 881s 0.155 881s 0.413 881s -0.384 881s -0.587 881s 881s y: [4, 4, 3, 3] 881s 0.017 0.149 0.107 -0.075 881s 0.046 0.399 0.285 -0.200 881s -0.043 -0.371 -0.265 0.186 881s -0.066 -0.567 -0.405 0.284 881s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 881s x0: [1, 1, 1, 1] 881s 0.372 881s 881s x1: [1, 4, 1, 1] 881s 0.300 881s 0.059 881s 0.981 881s -0.095 881s 881s y: [1, 4, 1, 1] 881s 0.111 881s 0.022 881s 0.364 881s -0.035 881s testing: mul_mat, [2, 1, 3, 1] = [1, 2, 3, 1] * [1, 1, 3, 1] 881s x0: [1, 2, 3, 1] 881s 0.365 881s -0.975 881s 881s x1: [1, 1, 3, 1] 881s -0.529 881s 881s y: [2, 1, 3, 1] 881s -0.193 0.515 881s testing: mul_mat, [1, 1, 3, 3] = [2, 1, 3, 3] * [2, 1, 3, 3] 881s x0: [2, 1, 3, 3] 881s -0.521 0.680 881s 881s x1: [2, 1, 3, 3] 881s 0.242 0.414 881s 881s y: [1, 1, 3, 3] 881s 0.155 881s test-mul-mat0: iter:401/500 881s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 881s x0: [4, 4, 1, 1] 881s -0.282 -0.080 0.071 0.525 881s -0.025 0.598 0.697 0.056 881s -0.574 0.852 0.088 -0.462 881s 0.250 -0.466 -0.542 0.676 881s 881s x1: [4, 2, 1, 1] 881s 0.028 -0.395 0.974 0.104 881s -0.068 0.579 -0.761 0.240 881s 881s y: [4, 2, 1, 1] 881s 0.148 0.447 -0.315 -0.266 881s 0.045 -0.169 0.354 0.288 881s testing: mul_mat, [1, 4, 3, 1] = [4, 1, 3, 1] * [4, 4, 3, 1] 881s x0: [4, 1, 3, 1] 881s -0.123 0.126 0.084 -0.430 881s 881s x1: [4, 4, 3, 1] 881s 0.547 0.669 -0.941 0.383 881s -0.548 0.727 -0.589 0.056 881s 0.701 0.514 0.989 0.279 881s 0.754 0.229 0.369 -0.529 881s 881s y: [1, 4, 3, 1] 881s -0.227 881s 0.085 881s -0.059 881s 0.194 881s testing: mul_mat, [4, 1, 3, 4] = [4, 4, 3, 4] * [4, 1, 3, 4] 881s x0: [4, 4, 3, 4] 881s 0.977 0.836 -0.982 -0.916 881s -0.179 -0.718 0.738 0.920 881s 0.119 0.941 0.930 -0.671 881s -0.847 0.322 0.260 0.501 881s 881s x1: [4, 1, 3, 4] 881s 0.383 0.721 0.137 0.914 881s 881s y: [4, 1, 3, 4] 881s 0.005 0.355 0.238 0.402 881s testing: mul_mat, [4, 4, 1, 1] = [4, 4, 1, 1] 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s * [4, 4, 1, 1] 881s x0: [4, 4, 1, 1] 881s -0.649 0.114 0.127 -0.056 881s -0.956 -0.537 -0.093 -0.546 881s 0.556 0.900 -0.434 -0.830 881s -0.820 -0.900 0.770 0.280 881s 881s x1: [4, 4, 1, 1] 881s -0.601 -0.664 0.611 -0.554 881s 0.250 -0.622 0.802 0.928 881s 0.003 0.035 0.928 -0.319 881s -0.125 0.931 -0.075 0.806 881s 881s y: [4, 4, 1, 1] 881s 0.422 1.176 -0.737 1.406 881s -0.184 -0.485 -1.540 1.233 881s 0.138 0.066 -0.105 0.592 881s 0.133 -0.814 0.132 -0.568 881s testing: mul_mat, [3, 4, 3, 1] = [4, 3, 3, 1] * [4, 4, 3, 1] 881s x0: [4, 3, 3, 1] 881s -0.471 -0.988 -0.599 0.734 881s 0.989 0.381 0.339 0.751 881s 0.336 -0.033 -0.631 0.941 881s 881s x1: [4, 4, 3, 1] 881s -0.935 -0.384 -0.470 -0.557 881s -0.582 -0.543 0.446 0.453 881s -0.614 -0.873 -0.672 -0.683 881s 0.052 -0.866 -0.514 0.403 881s 881s y: [3, 4, 3, 1] 881s 0.693 -1.649 -0.529 881s 0.875 -0.292 -0.033 881s 1.053 -1.681 -0.397 881s 1.435 -0.150 0.749 881s testing: mul_mat, [4, 4, 3, 4] = [3, 4, 3, 4] * [3, 4, 3, 4] 881s x0: [3, 4, 3, 4] 881s 0.613 -0.013 -0.545 881s 0.155 0.279 -0.421 881s 0.868 0.564 -0.572 881s 0.506 0.190 0.909 881s 881s x1: [3, 4, 3, 4] 881s 0.300 0.440 0.102 881s 0.670 0.174 -0.147 881s 0.611 0.521 0.081 881s 0.739 0.011 -0.772 881s 881s y: [4, 4, 3, 4] 881s 0.123 0.126 0.450 0.328 881s 0.488 0.214 0.764 0.238 881s 0.324 0.206 0.778 0.481 881s 0.874 0.442 1.089 -0.325 881s test-mul-mat0: iter:402/500 881s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 881s x0: [2, 2, 1, 1] 881s 0.809 0.613 881s -0.756 0.594 881s 881s x1: [2, 4, 1, 1] 881s 0.125 0.496 881s -0.211 -0.669 881s 0.174 -0.715 881s 0.404 0.289 881s 881s y: [2, 4, 1, 1] 881s 0.405 0.201 881s -0.581 -0.239 881s -0.298 -0.556 881s 0.504 -0.133 881s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 881s x0: [2, 3, 1, 1] 881s -0.407 0.369 881s 0.091 0.450 881s -0.360 0.678 881s 881s x1: [2, 2, 1, 1] 881s 0.104 -0.869 881s 0.845 -0.515 881s 881s y: [3, 2, 1, 1] 881s -0.363 -0.381 -0.627 881s -0.533 -0.155 -0.654 881s testing: mul_mat, [1, 3, 1, 2] = [2, 1, 1, 2] * [2, 3, 1, 2] 881s x0: [2, 1, 1, 2] 881s -0.770 -0.273 881s 881s x1: [2, 3, 1, 2] 881s 0.223 0.746 881s -0.340 -0.216 881s 0.371 -0.232 881s 881s y: [1, 3, 1, 2] 881s -0.375 881s 0.321 881s -0.222 881s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 881s x0: [2, 3, 1, 1] 881s 0.444 0.930 881s -0.707 0.246 881s 0.479 -0.927 881s 881s x1: [2, 1, 1, 1] 881s -0.660 -0.838 881s 881s y: [3, 1, 1, 1] 881s -1.072 0.260 0.461 881s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 881s x0: [3, 2, 1, 1] 881s -0.325 0.055 -0.922 881s 0.530 0.443 0.551 881s 881s x1: [3, 2, 1, 1] 881s 0.298 -0.663 0.522 881s 0.938 -0.985 -0.254 881s 881s y: [2, 2, 1, 1] 881s -0.615 0.152 881s -0.125 -0.079 881s testing: mul_mat, [1, 3, 1, 2] = [2, 1, 1, 2] * [2, 3, 1, 2] 881s x0: [2, 1, 1, 2] 881s 0.412 0.760 881s 881s x1: [2, 3, 1, 2] 881s 0.707 0.416 881s -0.623 0.145 881s 0.039 -0.392 881s 881s y: [1, 3, 1, 2] 881s 0.608 881s -0.146 881s -0.282 881s test-mul-mat0: iter:403/500 881s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 881s x0: [3, 4, 1, 1] 881s 0.981 -0.837 -0.901 881s 0.853 0.120 -0.255 881s -0.934 -0.761 0.026 881s 0.723 0.651 -0.214 881s 881s x1: [3, 4, 1, 1] 881s -0.177 0.011 -0.899 881s -0.492 0.686 0.631 881s 0.563 0.129 0.709 881s 0.114 -0.164 0.124 881s 881s y: [4, 4, 1, 1] 881s 0.628 0.079 0.133 0.071 881s -1.626 -0.499 -0.046 -0.044 881s -0.194 0.315 -0.605 0.340 881s 0.137 0.046 0.022 -0.051 881s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 881s x0: [3, 2, 1, 1] 881s 0.130 0.414 -0.797 881s 0.966 -0.461 0.694 881s 881s x1: [3, 4, 1, 1] 881s 0.310 -0.691 -0.381 881s -0.804 0.305 0.974 881s 0.019 -0.684 -0.925 881s 0.527 -0.999 0.705 881s 881s y: [2, 4, 1, 1] 881s 0.058 0.354 881s -0.754 -0.241 881s 0.456 -0.308 881s -0.907 1.459 881s testing: mul_mat, [4, 2, 1, 3] = [3, 4, 1, 3] * [3, 2, 1, 3] 881s x0: [3, 4, 1, 3] 881s 0.641 0.019 0.642 881s 0.642 -0.276 -0.269 881s -0.228 -0.861 -0.066 881s -0.262 -0.323 -0.371 881s 881s x1: [3, 2, 1, 3] 881s 0.947 -0.298 0.793 881s 0.800 0.821 -0.462 881s 881s y: [4, 2, 1, 3] 881s 1.110 0.477 -0.011 -0.446 881s 0.232 0.412 -0.859 -0.303 881s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 881s x0: [3, 4, 1, 1] 881s -0.369 -0.843 -0.387 881s 0.187 -0.232 0.353 881s 0.893 0.415 0.464 881s -0.427 0.148 0.402 881s 881s x1: [3, 4, 1, 1] 881s -0.575 -0.629 -0.329 881s 0.359 0.109 0.348 881s 0.988 0.794 -0.273 881s -0.591 -0.721 0.927 881s 881s y: [4, 4, 1, 1] 881s 0.869 -0.077 -0.927 0.020 881s -0.359 0.165 0.528 0.003 881s -0.928 -0.096 1.084 -0.414 881s 0.466 0.384 -0.396 0.518 881s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 881s x0: [4, 2, 1, 1] 881s -0.706 0.280 -0.584 0.205 881s -0.511 -0.426 -0.351 0.603 881s 881s x1: [4, 3, 1, 1] 881s 0.405 0.421 0.210 0.099 881s 0.020 0.743 0.524 0.391 881s -0.586 -0.117 -0.500 0.762 881s 881s y: [2, 3, 1, 1] 881s -0.270 -0.400 881s -0.032 -0.275 881s 0.829 0.984 881s testing: mul_mat, [2, 4, 1, 3] = [2, 2, 1, 3] * [2, 4, 1, 3] 881s x0: [2, 2, 1, 3] 881s 0.609 0.357 881s 0.574 -0.743 881s 881s x1: [2, 4, 1, 3] 881s -0.459 0.778 881s 0.760 0.309 881s 0.193 -0.092 881s 0.921 -0.455 881s 881s y: [2, 4, 1, 3] 881s -0.001 -0.841 881s 0.573 0.207 881s 0.084 0.179 881s 0.398 0.867 881s test-mul-mat0: iter:404/500 881s testing: mul_mat, [2, 3, 1, 1] = [1, 2, 1, 1] * [1, 3, 1, 1] 881s x0: [1, 2, 1, 1] 881s -0.392 881s -0.839 881s 881s x1: [1, 3, 1, 1] 881s -0.755 881s 0.794 881s 0.885 881s 881s y: [2, 3, 1, 1] 881s 0.296 0.634 881s -0.311 -0.666 881s -0.347 -0.743 881s testing: mul_mat, [2, 2, 1, 1] = [1, 2, 1, 1] * [1, 2, 1, 1] 881s x0: [1, 2, 1, 1] 881s 0.325 881s 0.520 881s 881s x1: [1, 2, 1, 1] 881s 0.868 881s -0.186 881s 881s y: [2, 2, 1, 1] 881s 0.282 0.451 881s -0.060 -0.097 881s testing: mul_mat, [1, 2, 1, 2] = [1, 1, 1, 2] * [1, 2, 1, 2] 881s x0: [1, 1, 1, 2] 881s -0.906 881s 881s x1: [1, 2, 1, 2] 881s 0.302 881s 0.653 881s 881s y: [1, 2, 1, 2] 881s -0.273 881s -0.591 881s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 881s x0: [1, 1, 1, 1] 881s -0.440 881s 881s x1: [1, 1, 1, 1] 881s 0.519 881s 881s y: [1, 1, 1, 1] 881s -0.228 881s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 881s x0: [1, 1, 1, 1] 881s 0.280 881s 881s x1: [1, 1, 1, 1] 881s -0.682 881s 881s y: [1, 1, 1, 1] 881s -0.191 881s testing: mul_mat, [2, 1, 1, 2] = [1, 2, 1, 2] * [1, 1, 1, 2] 881s x0: [1, 2, 1, 2] 881s -0.545 881s 0.473 881s 881s x1: [1, 1, 1, 2] 881s -0.575 881s 881s y: [2, 1, 1, 2] 881s 0.313 -0.272 881s test-mul-mat0: iter:405/500 881s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 881s x0: [2, 4, 1, 1] 881s -0.875 -0.297 881s -0.795 0.645 881s -0.995 0.857 881s -0.835 -0.752 881s 881s x1: [2, 1, 1, 1] 881s 0.311 0.853 881s 881s y: [4, 1, 1, 1] 881s -0.525 0.302 0.422 -0.901 881s testing: mul_mat, [2, 4, 3, 1] = [2, 2, 3, 1] * [2, 4, 3, 1] 881s x0: [2, 2, 3, 1] 881s 0.029 0.287 881s 0.896 -0.114 881s 881s x1: [2, 4, 3, 1] 881s 0.119 -0.741 881s 0.848 -0.362 881s 0.132 -0.592 881s -0.045 -0.439 881s 881s y: [2, 4, 3, 1] 881s -0.209 0.191 881s -0.080 0.801 881s -0.166 0.185 881s -0.127 0.010 881s testing: mul_mat, [3, 2, 3, 4] = [2, 3, 3, 4] * [2, 2, 3, 4] 881s x0: [2, 3, 3, 4] 881s -0.906 0.409 881s -0.052 -0.143 881s 0.381 0.239 881s 881s x1: [2, 2, 3, 4] 881s 0.598 0.403 881s -0.913 -0.022 881s 881s y: [3, 2, 3, 4] 881s -0.376 -0.089 0.324 881s 0.818 0.051 -0.353 881s testing: mul_mat, [4, 3, 1, 1] = [2, 4, 1, 1] * [2, 3, 1, 1] 881s x0: [2, 4, 1, 1] 881s -0.489 0.185 881s -0.901 0.579 881s 0.530 -0.024 881s -0.504 -0.421 881s 881s x1: [2, 3, 1, 1] 881s 0.143 0.181 881s -0.340 -0.452 881s 0.988 -0.325 881s 881s y: [4, 3, 1, 1] 881s -0.036 -0.024 0.072 -0.148 881s 0.082 0.045 -0.170 0.362 881s -0.543 -1.078 0.532 -0.361 881s testing: mul_mat, [4, 2, 3, 1] = [4, 4, 3, 1] * [4, 2, 3, 1] 881s x0: [4, 4, 3, 1] 881s -0.439 0.370 -0.317 0.279 881s -0.606 0.847 0.113 0.501 881s 0.268 0.877 -0.866 0.957 881s -0.861 0.588 -0.253 -0.333 881s 881s x1: [4, 2, 3, 1] 881s 0.738 -0.782 -0.164 0.236 881s -0.455 -0.936 -0.833 0.367 881s 881s y: [4, 2, 3, 1] 881s -0.496 -1.010 -0.120 -1.132 881s 0.220 -0.428 0.131 -0.071 881s testing: mul_mat, [1, 4, 3, 4] = [4, 1, 3, 4] * [4, 4, 3, 4] 881s x0: [4, 1, 3, 4] 881s -0.999 -0.855 -0.493 -0.452 881s 881s x1: [4, 4, 3, 4] 881s 0.327 -0.611 -0.508 0.016 881s -0.635 0.197 -0.895 0.650 881s 0.592 -0.421 -0.082 0.018 881s -0.048 -0.731 0.004 0.719 881s 881s y: [1, 4, 3, 4] 881s 0.440 881s 0.613 881s -0.199 881s 0.346 881s test-mul-mat0: iter:406/500 881s testing: mul_mat, [1, 2, 1, 1] = [1, 1, 1, 1] * [1, 2, 1, 1] 881s x0: [1, 1, 1, 1] 881s -0.624 881s 881s x1: [1, 2, 1, 1] 881s -0.737 881s 0.198 881s 881s y: [1, 2, 1, 1] 881s 0.460 881s -0.123 881s testing: mul_mat, [4, 1, 2, 1] = [1, 4, 2, 1] * [1, 1, 2, 1] 881s x0: [1, 4, 2, 1] 881s -0.344 881s -0.879 881s 0.863 881s -0.610 881s 881s x1: [1, 1, 2, 1] 881s -0.078 881s 881s y: [4, 1, 2, 1] 881s 0.027 0.068 -0.067 0.047 881s testing: mul_mat, [4, 4, 2, 3] = [1, 4, 2, 3] * [1, 4, 2, 3] 881s x0: [1, 4, 2, 3] 881s -0.162 881s 0.064 881s 0.232 881s 0.106 881s 881s x1: [1, 4, 2, 3] 881s 0.218 881s -0.894 881s 0.692 881s -0.881 881s 881s y: [4, 4, 2, 3] 881s -0.035 0.014 0.050 0.023 881s 0.144 -0.058 -0.207 -0.095 881s -0.112 0.045 0.160 0.073 881s 0.142 -0.057 -0.204 -0.093 881s testing: mul_mat, [4, 4, 1, 1] = [1, 4, 1, 1] * [1, 4, 1, 1] 881s x0: [1, 4, 1, 1] 881s 0.702 881s -0.220 881s -0.925 881s -0.233 881s 881s x1: [1, 4, 1, 1] 881s -0.711 881s -0.740 881s 0.028 881s 0.474 881s 881s y: [4, 4, 1, 1] 881s -0.500 0.157 0.658 0.166 881s -0.519 0.163 0.684 0.173 881s 0.020 -0.006 -0.026 -0.007 881s 0.333 -0.104 -0.438 -0.111 881s testing: mul_mat, [1, 1, 2, 1] = [4, 1, 2, 1] * [4, 1, 2, 1] 881s x0: [4, 1, 2, 1] 881s -0.189 -0.473 -0.253 -0.682 881s 881s x1: [4, 1, 2, 1] 881s -0.988 0.180 -0.497 0.952 881s 881s y: [1, 1, 2, 1] 881s -0.423 881s testing: mul_mat, [4, 4, 2, 3] = [1, 4, 2, 3] * [1, 4, 2, 3] 881s x0: [1, 4, 2, 3] 881s -0.573 881s -0.782 881s 0.548 881s 0.417 881s 881s x1: [1, 4, 2, 3] 881s 0.605 881s -0.755 881s -0.002 881s -0.145 881s 881s y: [4, 4, 2, 3] 881s -0.346 -0.473 0.332 0.252 881s 0.432 0.590 -0.414 -0.315 881s 0.001 0.001 -0.001 -0.001 881s 0.083 0.114 -0.080 -0.061 881s test-mul-mat0: iter:407/500 881s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 881s x0: [1, 4, 1, 1] 881s 0.128 881s -0.971 881s 0.834 881s -0.324 881s 881s x1: [1, 1, 1, 1] 881s 0.601 881s 881s y: [4, 1, 1, 1] 881s 0.077 -0.584 0.502 -0.195 881s testing: mul_mat, [2, 4, 4, 1] = [1, 2, 4, 1] * [1, 4, 4, 1] 881s x0: [1, 2, 4, 1] 881s 0.627 881s 0.825 881s 881s x1: [1, 4, 4, 1] 881s 0.446 881s -0.615 881s 0.659 881s -0.532 881s 881s y: [2, 4, 4, 1] 881s 0.279 0.368 881s -0.386 -0.507 881s 0.413 0.544 881s -0.334 -0.439 881s testing: mul_mat, [1, 2, 4, 3] = [1, 1, 4, 3] * [1, 2, 4, 3] 881s x0: [1, 1, 4, 3] 881s 0.858 881s 881s x1: [1, 2, 4, 3] 881s -0.213 881s 0.583 881s 881s y: [1, 2, 4, 3] 881s -0.183 881s 0.500 881s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 881s x0: [1, 4, 1, 1] 881s 0.490 881s 0.635 881s -0.058 881s -0.551 881s 881s x1: [1, 1, 1, 1] 881s -0.094 881s 881s y: [4, 1, 1, 1] 881s -0.046 -0.059 0.005 0.052 881s testing: mul_mat, [1, 1, 4, 1] = [4, 1, 4, 1] * [4, 1, 4, 1] 881s x0: [4, 1, 4, 1] 881s 0.711 -0.684 0.903 -0.549 881s 881s x1: [4, 1, 4, 1] 881s -0.291 0.154 -0.684 0.119 881s 881s y: [1, 1, 4, 1] 881s -0.995 881s testing: mul_mat, [4, 4, 4, 3] = [1, 4, 4, 3] * [1, 4, 4, 3] 881s x0: [1, 4, 4, 3] 881s 0.003 881s 0.903 881s 0.372 881s -0.500 881s 881s x1: [1, 4, 4, 3] 881s 0.255 881s -0.083 881s 0.589 881s -0.625 881s 881s y: [4, 4, 4, 3] 881s 0.001 0.230 0.095 -0.128 881s -0.000 -0.075 -0.031 0.042 881s 0.002 0.532 0.219 -0.295 881s -0.002 -0.564 -0.232 0.313 881s test-mul-mat0: iter:408/500 881s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 881s x0: [1, 4, 1, 1] 881s -0.123 881s 0.307 881s 0.110 881s 0.879 881s 881s x1: [1, 1, 1, 1] 881s -0.899 881s 881s y: [4, 1, 1, 1] 881s 0.111 -0.276 -0.099 -0.790 881s testing: mul_mat, [3, 4, 4, 1] = [1, 3, 4, 1] * [1, 4, 4, 1] 881s x0: [1, 3, 4, 1] 881s -0.446 881s -0.904 881s -0.212 881s 881s x1: [1, 4, 4, 1] 881s -0.351 881s -0.235 881s -0.130 881s -0.528 881s 881s y: [3, 4, 4, 1] 881s 0.156 0.317 0.074 881s 0.105 0.213 0.050 881s 0.058 0.118 0.028 881s 0.236 0.478 0.112 881s testing: mul_mat, [3, 3, 4, 2] = [1, 3, 4, 2] * [1, 3, 4, 2] 881s x0: [1, 3, 4, 2] 881s -0.202 881s -0.412 881s -0.019 881s 881s x1: [1, 3, 4, 2] 881s 0.731 881s -0.560 881s 0.438 881s 881s y: [3, 3, 4, 2] 881s -0.148 -0.301 -0.014 881s 0.113 0.231 0.011 881s -0.088 -0.180 -0.008 881s testing: mul_mat, [3, 3, 1, 1] = [1, 3, 1, 1] * [1, 3, 1, 1] 881s x0: [1, 3, 1, 1] 881s -0.542 881s -0.551 881s -0.233 881s 881s x1: [1, 3, 1, 1] 881s 0.150 881s -0.288 881s 0.546 881s 881s y: [3, 3, 1, 1] 881s -0.081 -0.083 -0.035 881s 0.156 0.159 0.067 881s -0.295 -0.301 -0.127 881s testing: mul_mat, [2, 1, 4, 1] = [3, 2, 4, 1] * [3, 1, 4, 1] 881s x0: [3, 2, 4, 1] 881s -0.354 0.107 0.383 881s -0.465 0.163 0.972 881s 881s x1: [3, 1, 4, 1] 881s 0.257 0.037 0.748 881s 881s y: [2, 1, 4, 1] 881s 0.199 0.614 881s testing: mul_mat, [3, 3, 4, 2] = [2, 3, 4, 2] * [2, 3, 4, 2] 881s x0: [2, 3, 4, 2] 881s 0.807 0.215 881s -0.195 -0.809 881s 0.759 -0.203 881s 881s x1: [2, 3, 4, 2] 881s -0.130 0.574 881s -0.033 -0.655 881s -0.752 -0.191 881s 881s y: [3, 3, 4, 2] 881s 0.019 -0.439 -0.215 881s -0.168 0.537 0.108 881s -0.648 0.301 -0.532 881s test-mul-mat0: iter:409/500 881s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 881s x0: [4, 1, 1, 1] 881s -0.134 -0.378 0.868 -0.851 881s 881s x1: [4, 3, 1, 1] 881s 0.928 -0.998 0.344 0.110 881s -0.946 -0.183 0.249 -0.286 881s -0.182 -0.316 0.182 0.625 881s 881s y: [1, 3, 1, 1] 881s 0.458 881s 0.656 881s -0.230 881s testing: mul_mat, [2, 1, 4, 1] = [4, 2, 4, 1] * [4, 1, 4, 1] 881s x0: [4, 2, 4, 1] 881s 0.618 -0.823 -0.754 -0.132 881s -0.109 0.064 0.552 -0.927 881s 881s x1: [4, 1, 4, 1] 881s 0.562 0.305 -0.857 0.240 881s 881s y: [2, 1, 4, 1] 881s 0.711 -0.738 881s testing: mul_mat, [4, 2, 4, 2] = [4, 4, 4, 2] * [4, 2, 4, 2] 881s x0: [4, 4, 4, 2] 881s -0.023 -0.504 0.064 0.655 881s 0.633 -0.538 0.911 -0.978 881s 0.779 -0.168 -0.463 0.617 881s 0.649 0.293 0.215 0.792 881s 881s x1: [4, 2, 4, 2] 881s 0.674 -0.813 0.637 -0.435 881s 0.251 0.189 -0.361 0.941 881s 881s y: [4, 2, 4, 2] 881s 0.150 1.870 0.099 -0.008 881s 0.492 -1.192 0.911 0.886 881s testing: mul_mat, [1, 4, 1, 1] = [4, 1, 1, 1] * [4, 4, 1, 1] 881s x0: [4, 1, 1, 1] 881s -0.886 -0.263 -0.139 0.489 881s 881s x1: [4, 4, 1, 1] 881s -0.566 -0.955 -0.207 -0.349 881s -0.554 0.621 -0.631 -0.175 881s 0.384 0.294 0.924 -0.953 881s -0.140 -0.819 -0.100 0.585 881s 881s y: [1, 4, 1, 1] 881s 0.611 881s 0.329 881s -1.013 881s 0.640 881s testing: mul_mat, [3, 4, 4, 1] = [1, 3, 4, 1] * [1, 4, 4, 1] 881s x0: [1, 3, 4, 1] 881s 0.232 881s -0.203 881s 0.093 881s 881s x1: [1, 4, 4, 1] 881s 0.794 881s 0.665 881s 0.149 881s 0.388 881s 881s y: [3, 4, 4, 1] 881s 0.184 -0.161 0.074 881s 0.154 -0.135 0.062 881s 0.035 -0.030 0.014 881s 0.090 -0.079 0.036 881s testing: mul_mat, [3, 1, 4, 2] = [3, 3, 4, 2] * [3, 1, 4, 2] 881s x0: [3, 3, 4, 2] 881s -0.668 0.169 -0.132 881s 0.323 0.181 -0.471 881s 0.345 0.621 0.690 881s 881s x1: [3, 1, 4, 2] 881s 0.698 0.347 -0.930 881s 881s y: [3, 1, 4, 2] 881s -0.285 0.727 -0.186 881s test-mul-mat0: iter:410/500 881s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 881s x0: [2, 2, 1, 1] 881s -0.583 -0.659 881s 0.119 0.698 881s 881s x1: [2, 3, 1, 1] 881s -0.003 0.007 881s 0.632 0.589 881s 0.773 -0.209 881s 881s y: [2, 3, 1, 1] 881s -0.003 0.005 881s -0.756 0.486 881s -0.313 -0.054 881s testing: mul_mat, [2, 2, 4, 1] = [2, 2, 4, 1] * [2, 2, 4, 1] 881s x0: [2, 2, 4, 1] 881s -0.856 0.691 881s 0.767 0.140 881s 881s x1: [2, 2, 4, 1] 881s -0.881 -0.272 881s 0.468 0.121 881s 881s y: [2, 2, 4, 1] 881s 0.566 -0.713 881s -0.318 0.376 881s testing: mul_mat, [2, 2, 4, 4] = [2, 2, 4, 4] * [2, 2, 4, 4] 881s x0: [2, 2, 4, 4] 881s 0.640 -0.987 881s -0.549 0.226 881s 881s x1: [2, 2, 4, 4] 881s -0.945 0.771 881s 0.617 0.590 881s 881s y: [2, 2, 4, 4] 881s -1.365 0.693 881s -0.187 -0.206 881s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 881s x0: [2, 3, 1, 1] 881s -0.027 0.702 881s 0.585 -0.075 881s 0.415 -0.818 881s 881s x1: [2, 2, 1, 1] 881s -0.655 -0.016 881s -0.086 -0.347 881s 881s y: [3, 2, 1, 1] 881s 0.007 -0.382 -0.259 881s -0.241 -0.024 0.248 881s testing: mul_mat, [4, 2, 4, 1] = [3, 4, 4, 1] * [3, 2, 4, 1] 881s x0: [3, 4, 4, 1] 881s -0.461 0.641 0.833 881s -0.285 -0.393 -0.914 881s -0.576 -0.376 0.900 881s -0.759 0.505 0.100 881s 881s x1: [3, 2, 4, 1] 881s -0.617 0.865 0.226 881s 0.461 -0.606 0.267 881s 881s y: [4, 2, 4, 1] 881s 1.027 -0.371 0.234 0.928 881s -0.378 -0.137 0.202 -0.630 881s testing: mul_mat, [3, 3, 4, 4] = [4, 3, 4, 4] * [4, 3, 4, 4] 881s x0: [4, 3, 4, 4] 881s 0.368 -0.426 -0.303 -0.655 881s -0.414 0.685 -0.908 -0.801 881s -0.301 -0.283 0.929 -0.451 881s 881s x1: [4, 3, 4, 4] 881s -0.511 -0.940 0.144 -0.047 881s -0.886 0.787 0.549 -0.187 881s 0.083 -0.378 0.141 0.945 881s 881s y: [3, 3, 4, 4] 881s 0.199 -0.525 0.575 881s -0.705 0.556 0.638 881s -0.471 -1.179 -0.213 881s test-mul-mat0: iter:411/500 881s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 881s x0: [2, 3, 1, 1] 881s 0.168 -0.284 881s 0.089 -0.401 881s 0.312 0.190 881s 881s x1: [2, 2, 1, 1] 881s 0.404 0.187 881s 0.377 -0.520 881s 881s y: [3, 2, 1, 1] 881s 0.015 -0.039 0.162 881s 0.211 0.242 0.019 881s testing: mul_mat, [3, 3, 3, 1] = [2, 3, 3, 1] * [2, 3, 3, 1] 881s x0: [2, 3, 3, 1] 881s -0.246 -0.548 881s 0.251 -0.869 881s -0.067 -0.504 881s 881s x1: [2, 3, 3, 1] 881s 0.729 0.663 881s -0.547 0.489 881s -0.621 0.819 881s 881s y: [3, 3, 3, 1] 881s -0.543 -0.394 -0.384 881s -0.134 -0.562 -0.210 881s -0.296 -0.867 -0.371 881s testing: mul_mat, [2, 3, 3, 3] = [2, 2, 3, 3] * [2, 3, 3, 3] 881s x0: [2, 2, 3, 3] 881s 0.051 0.227 881s -0.380 -0.295 881s 881s x1: [2, 3, 3, 3] 881s 0.388 -0.955 881s -0.891 -0.204 881s -0.896 0.488 881s 881s y: [2, 3, 3, 3] 881s -0.197 0.134 881s -0.092 0.399 881s 0.065 0.196 881s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 881s x0: [2, 2, 1, 1] 881s -0.160 0.084 881s -0.948 0.159 881s 881s x1: [2, 2, 1, 1] 881s -0.445 -0.650 881s 0.285 -0.573 881s 881s y: [2, 2, 1, 1] 881s 0.017 0.318 881s -0.094 -0.362 881s testing: mul_mat, [4, 2, 3, 1] = [2, 4, 3, 1] * [2, 2, 3, 1] 881s x0: [2, 4, 3, 1] 881s 0.778 -0.242 881s -0.482 -0.338 881s 0.975 0.821 881s 0.490 0.702 881s 881s x1: [2, 2, 3, 1] 881s -0.272 -0.165 881s -0.501 0.215 881s 881s y: [4, 2, 3, 1] 881s -0.172 0.187 -0.401 -0.249 881s -0.442 0.169 -0.312 -0.095 881s testing: mul_mat, [4, 2, 3, 3] = [4, 4, 3, 3] * [4, 2, 3, 3] 881s x0: [4, 4, 3, 3] 881s -0.634 -0.844 0.172 -0.630 881s -0.718 0.893 0.944 -0.508 881s 0.746 -0.911 0.394 0.278 881s 0.103 -0.569 -0.561 0.906 881s 881s x1: [4, 2, 3, 3] 881s 0.827 -0.222 0.889 0.641 881s -0.137 0.219 -0.844 -0.360 881s 881s y: [4, 2, 3, 3] 881s -0.588 -0.278 1.347 0.293 881s -0.016 -0.320 -0.734 0.009 881s test-mul-mat0: iter:412/500 881s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 881s x0: [1, 1, 1, 1] 881s 0.112 881s 881s x1: [1, 1, 1, 1] 881s 0.148 881s 881s y: [1, 1, 1, 1] 881s 0.017 881s testing: mul_mat, [1, 1, 3, 1] = [1, 1, 3, 1] * [1, 1, 3, 1] 881s x0: [1, 1, 3, 1] 881s -0.855 881s 881s x1: [1, 1, 3, 1] 881s -0.167 881s 881s y: [1, 1, 3, 1] 881s 0.143 881s testing: mul_mat, [1, 1, 3, 2] = [1, 1, 3, 2] * [1, 1, 3, 2] 881s x0: [1, 1, 3, 2] 881s 0.562 881s 881s x1: [1, 1, 3, 2] 881s -0.249 881s 881s y: [1, 1, 3, 2] 881s -0.140 881s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 881s x0: [1, 2, 1, 1] 881s -0.210 881s -0.026 881s 881s x1: [1, 1, 1, 1] 881s 0.351 881s 881s y: [2, 1, 1, 1] 881s -0.074 -0.009 881s testing: mul_mat, [2, 1, 3, 1] = [2, 2, 3, 1] * [2, 1, 3, 1] 881s x0: [2, 2, 3, 1] 881s -0.380 0.018 881s 0.542 -0.172 881s 881s x1: [2, 1, 3, 1] 881s 0.543 -0.361 881s 881s y: [2, 1, 3, 1] 881s -0.213 0.356 881s testing: mul_mat, [4, 2, 3, 2] = [2, 4, 3, 2] * [2, 2, 3, 2] 881s x0: [2, 4, 3, 2] 881s 0.316 -0.576 881s -0.306 -0.115 881s -0.804 0.397 881s -0.063 0.458 881s 881s x1: [2, 2, 3, 2] 881s -0.540 0.034 881s -0.748 -0.898 881s 881s y: [4, 2, 3, 2] 881s -0.190 0.162 0.447 0.049 881s 0.281 0.332 0.245 -0.365 881s test-mul-mat0: iter:413/500 881s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 881s x0: [2, 2, 1, 1] 881s -0.702 0.020 881s -0.295 0.493 881s 881s x1: [2, 4, 1, 1] 881s -0.700 0.055 881s 0.099 -0.127 881s 0.451 0.232 881s 0.300 0.578 881s 881s y: [2, 4, 1, 1] 881s 0.493 0.233 881s -0.072 -0.092 881s -0.312 -0.019 881s -0.199 0.196 881s testing: mul_mat, [2, 2, 3, 1] = [2, 2, 3, 1] * [2, 2, 3, 1] 881s x0: [2, 2, 3, 1] 881s 0.914 0.207 881s -0.549 0.474 881s 881s x1: [2, 2, 3, 1] 881s 0.399 -0.787 881s 0.572 0.176 881s 881s y: [2, 2, 3, 1] 881s 0.203 -0.592 881s 0.560 -0.231 881s testing: mul_mat, [1, 2, 3, 1] = [2, 1, 3, 1] * [2, 2, 3, 1] 881s x0: [2, 1, 3, 1] 881s -0.268 -0.224 881s 881s x1: [2, 2, 3, 1] 881s 0.557 -0.307 881s 0.222 -0.423 881s 881s y: [1, 2, 3, 1] 881s -0.080 881s 0.035 881s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 881s x0: [2, 4, 1, 1] 881s 0.689 -0.227 881s -0.065 0.532 881s 0.908 -0.803 881s -0.782 0.845 881s 881s x1: [2, 1, 1, 1] 881s -0.784 -0.805 881s 881s y: [4, 1, 1, 1] 881s -0.358 -0.377 -0.066 -0.067 881s testing: mul_mat, [1, 2, 3, 1] = [4, 1, 3, 1] * [4, 2, 3, 1] 881s x0: [4, 1, 3, 1] 881s 0.655 0.494 0.425 0.187 881s 881s x1: [4, 2, 3, 1] 881s -0.544 0.755 -0.462 0.678 881s -0.668 0.936 -0.607 -0.692 881s 881s y: [1, 2, 3, 1] 881s -0.054 881s -0.362 881s testing: mul_mat, [1, 4, 3, 1] = [1, 1, 3, 1] * [1, 4, 3, 1] 881s x0: [1, 1, 3, 1] 881s 0.086 881s 881s x1: [1, 4, 3, 1] 881s 0.714 881s -0.914 881s 0.290 881s -0.606 881s 881s y: [1, 4, 3, 1] 881s 0.061 881s -0.078 881s 0.025 881s -0.052 881s test-mul-mat0: iter:414/500 881s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 881s x0: [2, 1, 1, 1] 881s 0.341 -0.181 881s 881s x1: [2, 4, 1, 1] 881s -0.574 -1.000 881s 0.674 -0.882 881s 0.270 0.316 881s 0.564 0.077 881s 881s y: [1, 4, 1, 1] 881s -0.015 881s 0.389 881s 0.035 881s 0.178 881s testing: mul_mat, [4, 1, 2, 1] = [2, 4, 2, 1] * [2, 1, 2, 1] 881s x0: [2, 4, 2, 1] 881s 0.737 -0.104 881s 0.587 -0.565 881s 0.582 -0.917 881s -0.496 0.789 881s 881s x1: [2, 1, 2, 1] 881s -0.650 0.427 881s 881s y: [4, 1, 2, 1] 881s -0.523 -0.622 -0.770 0.659 881s testing: mul_mat, [2, 4, 2, 2] = [2, 2, 2, 2] * [2, 4, 2, 2] 881s x0: [2, 2, 2, 2] 881s -0.353 0.325 881s 0.913 0.846 881s 881s x1: [2, 4, 2, 2] 881s -0.712 -0.837 881s 0.636 0.558 881s 0.479 0.200 881s -0.365 -0.884 881s 881s y: [2, 4, 2, 2] 881s -0.021 -1.358 881s -0.043 1.053 881s -0.104 0.607 881s -0.158 -1.082 881s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 881s x0: [2, 2, 1, 1] 881s -0.782 -0.830 881s -0.346 0.999 881s 881s x1: [2, 2, 1, 1] 881s -0.986 0.502 881s -0.563 -0.746 881s 881s y: [2, 2, 1, 1] 881s 0.355 0.843 881s 1.059 -0.550 881s testing: mul_mat, [2, 2, 2, 1] = [2, 2, 2, 1] * [2, 2, 2, 1] 881s x0: [2, 2, 2, 1] 881s 0.650 -0.355 881s -0.299 -0.963 881s 881s x1: [2, 2, 2, 1] 881s -0.854 0.649 881s -0.731 0.157 881s 881s y: [2, 2, 2, 1] 881s -0.786 -0.370 881s -0.531 0.067 881s testing: mul_mat, [4, 2, 2, 2] = [2, 4, 2, 2] * [2, 2, 2, 2] 881s x0: [2, 4, 2, 2] 881s 0.597 0.119 881s -0.622 0.758 881s 0.142 0.051 881s 0.401 0.820 881s 881s x1: [2, 2, 2, 2] 881s -0.153 0.690 881s 0.371 -0.340 881s 881s y: [4, 2, 2, 2] 881s -0.009 0.618 0.013 0.505 881s 0.181 -0.488 0.036 -0.130 881s test-mul-mat0: iter:415/500 881s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 881s x0: [2, 2, 1, 1] 881s 0.882 -0.732 881s -0.225 -0.090 881s 881s x1: [2, 2, 1, 1] 881s -0.678 -0.794 881s 0.946 -0.276 881s 881s y: [2, 2, 1, 1] 881s -0.017 0.224 881s 1.036 -0.189 881s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 881s x0: [2, 3, 1, 1] 881s 0.094 -0.079 881s 0.272 -0.481 881s 0.276 -0.916 881s 881s x1: [2, 2, 1, 1] 881s -0.342 0.945 881s 0.309 0.708 881s 881s y: [3, 2, 1, 1] 881s -0.107 -0.548 -0.960 881s -0.027 -0.257 -0.563 881s testing: mul_mat, [4, 3, 1, 1] = [2, 4, 1, 1] * [2, 3, 1, 1] 881s x0: [2, 4, 1, 1] 881s -0.421 0.050 881s 0.305 -0.295 881s 0.372 0.511 881s -0.349 -0.904 881s 881s x1: [2, 3, 1, 1] 881s -0.763 -0.056 881s 0.066 -0.605 881s 0.573 -0.513 881s 881s y: [4, 3, 1, 1] 881s 0.318 -0.216 -0.313 0.317 881s -0.058 0.198 -0.285 0.524 881s -0.267 0.326 -0.049 0.264 881s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 881s x0: [2, 2, 1, 1] 881s -0.844 0.650 881s 0.847 0.429 881s 881s x1: [2, 4, 1, 1] 881s -0.195 -0.467 881s -0.636 0.579 881s 0.443 0.022 881s 0.524 -0.248 881s 881s y: [2, 4, 1, 1] 881s -0.139 -0.366 881s 0.913 -0.290 881s -0.360 0.384 881s -0.603 0.338 881s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 881s x0: [2, 4, 1, 1] 881s -0.422 0.146 881s -0.004 -0.978 881s 0.443 -0.885 881s 0.065 -0.549 881s 881s x1: [2, 2, 1, 1] 881s -0.635 -0.074 881s 0.513 -0.398 881s 881s y: [4, 2, 1, 1] 881s 0.257 0.074 -0.216 -0.001 881s -0.274 0.388 0.579 0.252 881s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 881s x0: [4, 4, 1, 1] 881s 0.355 0.698 0.455 -0.079 881s -0.031 -0.974 -0.681 -0.855 881s 0.870 0.954 -0.725 0.546 881s -0.893 0.349 -0.032 0.918 881s 881s x1: [4, 2, 1, 1] 881s -0.273 0.487 0.961 0.379 881s 0.583 -0.234 0.912 0.947 881s 881s y: [4, 2, 1, 1] 881s 0.651 -1.444 -0.263 0.730 881s 0.384 -1.221 0.140 0.238 881s test-mul-mat0: iter:416/500 881s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 881s x0: [3, 2, 1, 1] 881s 0.476 0.715 0.797 881s -0.758 0.627 0.745 881s 881s x1: [3, 1, 1, 1] 881s -0.273 0.514 -0.663 881s 881s y: [2, 1, 1, 1] 881s -0.291 0.036 881s testing: mul_mat, [4, 2, 3, 1] = [3, 4, 3, 1] * [3, 2, 3, 1] 881s x0: [3, 4, 3, 1] 881s 0.265 0.377 0.789 881s -0.220 0.714 -0.997 881s -0.744 0.429 0.801 881s -0.503 0.057 0.545 881s 881s x1: [3, 2, 3, 1] 881s -0.413 -0.018 -0.287 881s -0.544 0.090 -0.588 881s 881s y: [4, 2, 3, 1] 881s -0.343 0.364 0.070 0.050 881s -0.575 0.770 -0.028 -0.043 881s testing: mul_mat, [4, 4, 3, 1] = [3, 4, 3, 1] * [3, 4, 3, 1] 881s x0: [3, 4, 3, 1] 881s 0.825 0.101 -0.337 881s 0.603 -0.454 0.822 881s -0.949 -0.987 0.152 881s -0.069 0.442 -0.983 881s 881s x1: [3, 4, 3, 1] 881s -0.827 -0.678 -0.065 881s 0.973 -0.181 0.992 881s 0.519 0.903 -0.969 881s 0.778 0.444 0.159 881s 881s y: [4, 4, 3, 1] 881s -0.730 -0.244 1.444 -0.179 881s 0.450 1.484 -0.594 -1.121 881s 0.847 -0.894 -1.531 1.316 881s 0.633 0.398 -1.152 -0.013 881s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 881s x0: [3, 4, 1, 1] 881s 0.773 -0.947 -0.696 881s -0.445 -0.530 -0.544 881s 0.015 0.233 -0.980 881s -0.350 0.884 0.335 881s 881s x1: [3, 4, 1, 1] 881s -0.660 0.616 -0.308 881s 0.492 -0.453 -0.866 881s 0.509 -0.175 0.870 881s -0.789 0.095 0.217 881s 881s y: [4, 4, 1, 1] 881s -0.880 0.135 0.435 0.672 881s 1.412 0.492 0.750 -0.862 881s -0.046 -0.607 -0.885 -0.041 881s -0.851 0.183 -0.202 0.433 881s testing: mul_mat, [3, 3, 3, 1] = [4, 3, 3, 1] * [4, 3, 3, 1] 881s x0: [4, 3, 3, 1] 881s -0.945 -0.284 -0.114 0.268 881s 0.265 -0.025 0.511 0.141 881s 0.576 -0.451 0.392 0.452 881s 881s x1: [4, 3, 3, 1] 881s -0.688 0.982 0.468 -0.170 881s 0.547 -0.853 0.170 0.163 881s -0.161 -0.339 0.711 -0.027 881s 881s y: [3, 3, 3, 1] 881s 0.273 0.009 -0.733 881s -0.250 0.275 0.840 881s 0.161 0.325 0.326 881s testing: mul_mat, [1, 4, 3, 1] = [3, 1, 3, 1] * [3, 4, 3, 1] 881s x0: [3, 1, 3, 1] 881s -0.512 -0.235 0.997 881s 881s x1: [3, 4, 3, 1] 881s -0.874 -0.773 -0.735 881s 0.518 0.494 0.406 881s -0.030 -0.239 0.951 881s 0.037 0.041 -0.450 881s 881s y: [1, 4, 3, 1] 881s -0.104 881s 0.024 881s 1.020 881s -0.477 881s test-mul-mat0: iter:417/500 881s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 881s x0: [3, 2, 1, 1] 881s -0.941 -0.938 -0.536 881s -0.657 -0.563 -0.536 881s 881s x1: [3, 2, 1, 1] 881s 0.999 -0.203 -0.402 881s -0.706 0.116 -0.559 881s 881s y: [2, 2, 1, 1] 881s -0.534 -0.327 881s 0.855 0.698 881s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 881s x0: [3, 4, 1, 1] 881s 0.120 0.994 0.430 881s 0.666 0.290 -0.959 881s -0.352 0.643 0.010 881s 0.695 -0.087 0.009 881s 881s x1: [3, 2, 1, 1] 881s -0.147 0.077 0.982 881s -0.142 0.123 -0.530 881s 881s y: [4, 2, 1, 1] 881s 0.481 -1.017 0.111 -0.100 881s -0.123 0.449 0.124 -0.115 881s testing: mul_mat, [3, 4, 1, 2] = [3, 3, 1, 2] * [3, 4, 1, 2] 881s x0: [3, 3, 1, 2] 881s -0.916 -0.797 0.825 881s 0.779 0.116 -0.166 881s -0.729 -0.373 -0.862 881s 881s x1: [3, 4, 1, 2] 881s -0.508 0.511 0.304 881s 0.608 0.952 0.894 881s 0.667 -0.987 -0.641 881s -0.990 -0.550 -0.177 881s 881s y: [3, 4, 1, 2] 881s 0.309 -0.387 -0.082 881s -0.577 0.436 -1.569 881s -0.354 0.511 0.434 881s 1.199 -0.806 1.080 881s testing: mul_mat, [1, 3, 1, 1] = [3, 1, 1, 1] * [3, 3, 1, 1] 881s x0: [3, 1, 1, 1] 881s -0.575 0.124 0.069 881s 881s x1: [3, 3, 1, 1] 881s -0.582 -0.430 -0.981 881s -0.862 0.220 0.295 881s 0.481 -0.011 -0.436 881s 881s y: [1, 3, 1, 1] 881s 0.214 881s 0.544 881s -0.308 881s testing: mul_mat, [1, 3, 1, 1] = [1, 1, 1, 1] * [1, 3, 1, 1] 881s x0: [1, 1, 1, 1] 881s -0.556 881s 881s x1: [1, 3, 1, 1] 881s -0.492 881s 0.328 881s -0.107 881s 881s y: [1, 3, 1, 1] 881s 0.273 881s -0.182 881s 0.059 881s testing: mul_mat, [4, 1, 1, 2] = [1, 4, 1, 2] * [1, 1, 1, 2] 881s x0: [1, 4, 1, 2] 881s 0.865 881s -0.563 881s 0.651 881s 0.897 881s 881s x1: [1, 1, 1, 2] 881s 0.727 881s 881s y: [4, 1, 1, 2] 881s 0.629 -0.410 0.473 0.653 881s test-mul-mat0: iter:418/500 881s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 881s x0: [3, 1, 1, 1] 881s 0.828 -0.510 -0.495 881s 881s x1: [3, 2, 1, 1] 881s 0.521 -0.908 0.076 881s -0.184 0.574 -0.935 881s 881s y: [1, 2, 1, 1] 881s 0.857 881s 0.018 881s testing: mul_mat, [2, 1, 3, 1] = [3, 2, 3, 1] * [3, 1, 3, 1] 881s x0: [3, 2, 3, 1] 881s -0.719 -0.001 0.281 881s -0.737 0.242 0.569 881s 881s x1: [3, 1, 3, 1] 881s -0.103 -0.002 0.833 881s 881s y: [2, 1, 3, 1] 881s 0.308 0.549 881s testing: mul_mat, [3, 2, 3, 2] = [3, 3, 3, 2] * [3, 2, 3, 2] 881s x0: [3, 3, 3, 2] 881s -0.163 -0.789 0.371 881s -0.708 -0.647 -0.878 881s 0.129 -0.224 0.872 881s 881s x1: [3, 2, 3, 2] 881s 0.249 -0.986 0.294 881s -0.854 0.012 0.126 881s 881s y: [3, 2, 3, 2] 881s 0.846 0.204 0.509 881s 0.176 0.486 -0.003 881s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 881s x0: [3, 3, 1, 1] 881s 0.660 0.315 -0.157 881s 0.147 -0.159 0.783 881s 0.521 -0.786 -0.632 881s 881s x1: [3, 3, 1, 1] 881s 0.252 0.771 -0.126 881s -0.310 0.965 -0.690 881s -0.958 -0.873 -0.417 881s 881s y: [3, 3, 1, 1] 881s 0.429 -0.184 -0.395 881s 0.208 -0.739 -0.484 881s -0.841 -0.330 0.451 881s testing: mul_mat, [4, 3, 3, 1] = [3, 4, 3, 1] * [3, 3, 3, 1] 881s x0: [3, 4, 3, 1] 881s 0.944 -0.146 -0.294 881s 0.194 0.950 -0.735 881s 0.481 0.190 0.930 881s 0.179 -0.916 -0.978 881s 881s x1: [3, 3, 3, 1] 881s 0.235 0.660 -0.531 881s -0.989 0.230 -0.244 881s 0.075 0.845 0.316 881s 881s y: [4, 3, 3, 1] 881s 0.282 1.063 -0.255 -0.043 881s -0.896 0.206 -0.658 -0.150 881s -0.146 0.585 0.491 -1.070 881s testing: mul_mat, [3, 3, 3, 2] = [4, 3, 3, 2] * [4, 3, 3, 2] 881s x0: [4, 3, 3, 2] 881s 0.304 -0.618 -0.021 0.239 881s 0.683 0.018 0.903 0.376 881s -0.517 0.367 -0.378 0.124 881s 881s x1: [4, 3, 3, 2] 881s 0.419 0.557 -0.418 -0.875 881s 0.823 -0.488 -0.853 0.877 881s -0.295 0.825 -0.028 -0.065 881s 881s y: [3, 3, 3, 2] 881s -0.417 -0.411 0.037 881s 0.779 0.114 -0.173 881s -0.615 -0.237 0.458 881s test-mul-mat0: iter:419/500 881s testing: mul_mat, [1, 1, 1, 1] = [1, 1, 1, 1] * [1, 1, 1, 1] 881s x0: [1, 1, 1, 1] 881s 0.214 881s 881s x1: [1, 1, 1, 1] 881s 0.563 881s 881s y: [1, 1, 1, 1] 881s 0.121 881s testing: mul_mat, [1, 1, 4, 1] = [1, 1, 4, 1] * [1, 1, 4, 1] 881s x0: [1, 1, 4, 1] 881s -0.736 881s 881s x1: [1, 1, 4, 1] 881s 0.338 881s 881s y: [1, 1, 4, 1] 881s -0.249 881s testing: mul_mat, [2, 1, 4, 1] = [1, 2, 4, 1] * [1, 1, 4, 1] 881s x0: [1, 2, 4, 1] 881s 0.745 881s 0.568 881s 881s x1: [1, 1, 4, 1] 881s 0.513 881s 881s y: [2, 1, 4, 1] 881s 0.383 0.291 881s testing: mul_mat, [2, 2, 1, 1] = [1, 2, 1, 1] * [1, 2, 1, 1] 881s x0: [1, 2, 1, 1] 881s -0.839 881s -0.727 881s 881s x1: [1, 2, 1, 1] 881s 0.070 881s -0.303 881s 881s y: [2, 2, 1, 1] 881s -0.058 -0.051 881s 0.254 0.220 881s testing: mul_mat, [2, 1, 4, 1] = [2, 2, 4, 1] * [2, 1, 4, 1] 881s x0: [2, 2, 4, 1] 881s 0.702 -0.281 881s 0.345 0.295 881s 881s x1: [2, 1, 4, 1] 881s 0.282 0.724 881s 881s y: [2, 1, 4, 1] 881s -0.006 0.311 881s testing: mul_mat, [1, 2, 4, 1] = [2, 1, 4, 1] * [2, 2, 4, 1] 881s x0: [2, 1, 4, 1] 881s -0.205 -0.296 881s 881s x1: [2, 2, 4, 1] 881s -0.846 -0.751 881s 0.137 -0.509 881s 881s y: [1, 2, 4, 1] 881s 0.396 881s 0.123 881s test-mul-mat0: iter:420/500 881s testing: mul_mat, [4, 3, 1, 1] = [2, 4, 1, 1] * [2, 3, 1, 1] 881s x0: [2, 4, 1, 1] 881s -0.722 -0.199 881s -0.098 -0.092 881s 0.070 -0.901 881s -0.536 -0.223 881s 881s x1: [2, 3, 1, 1] 881s -0.892 0.731 881s -0.462 -0.643 881s -0.132 0.029 881s 881s y: [4, 3, 1, 1] 881s 0.500 0.020 -0.721 0.316 881s 0.461 0.104 0.547 0.391 881s 0.090 0.010 -0.035 0.065 881s testing: mul_mat, [2, 4, 2, 1] = [2, 2, 2, 1] * [2, 4, 2, 1] 881s x0: [2, 2, 2, 1] 881s 0.905 -0.571 881s -0.602 -0.227 881s 881s x1: [2, 4, 2, 1] 881s 0.338 -0.461 881s -0.814 0.921 881s 0.334 -0.111 881s 0.362 0.670 881s 881s y: [2, 4, 2, 1] 881s 0.569 -0.098 881s -1.263 0.281 881s 0.366 -0.176 881s -0.055 -0.370 881s testing: mul_mat, [2, 2, 2, 1] = [2, 2, 2, 1] * [2, 2, 2, 1] 881s x0: [2, 2, 2, 1] 881s 0.425 -0.705 881s 0.182 -0.213 881s 881s x1: [2, 2, 2, 1] 881s -0.932 0.959 881s 0.330 -0.833 881s 881s y: [2, 2, 2, 1] 881s -1.072 -0.373 881s 0.727 0.237 881s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 881s x0: [2, 3, 1, 1] 881s -0.054 0.345 881s 0.352 -0.875 881s 0.315 -0.227 881s 881s x1: [2, 2, 1, 1] 881s 0.272 -0.005 881s -0.095 0.447 881s 881s y: [3, 2, 1, 1] 881s -0.016 0.100 0.087 881s 0.159 -0.424 -0.132 881s testing: mul_mat, [2, 2, 2, 1] = [3, 2, 2, 1] * [3, 2, 2, 1] 881s x0: [3, 2, 2, 1] 881s -0.187 -0.537 -0.669 881s 0.942 -0.222 0.951 881s 881s x1: [3, 2, 2, 1] 881s -0.489 0.176 0.032 881s -0.421 0.134 -0.638 881s 881s y: [2, 2, 2, 1] 881s -0.024 -0.470 881s 0.434 -1.034 881s testing: mul_mat, [1, 3, 2, 1] = [2, 1, 2, 1] * [2, 3, 2, 1] 881s x0: [2, 1, 2, 1] 881s -0.606 0.781 881s 881s x1: [2, 3, 2, 1] 881s 0.600 -0.598 881s 0.147 -0.055 881s -0.473 0.919 881s 881s y: [1, 3, 2, 1] 881s -0.832 881s -0.132 881s 1.005 881s test-mul-mat0: iter:421/500 881s testing: mul_mat, [3, 1, 1, 1] = [3, 3, 1, 1] * [3, 1, 1, 1] 881s x0: [3, 3, 1, 1] 881s -0.748 -0.582 -0.931 881s -0.094 0.467 0.901 881s -0.493 0.869 0.048 881s 881s x1: [3, 1, 1, 1] 881s 0.171 -0.497 -0.185 881s 881s y: [3, 1, 1, 1] 881s 0.334 -0.415 -0.525 881s testing: mul_mat, [1, 3, 4, 1] = [3, 1, 4, 1] * [3, 3, 4, 1] 881s x0: [3, 1, 4, 1] 881s 0.434 -0.963 -0.108 881s 881s x1: [3, 3, 4, 1] 881s 0.452 -0.604 -0.033 881s -0.093 -0.902 0.919 881s -0.058 -0.065 0.232 881s 881s y: [1, 3, 4, 1] 881s 0.781 881s 0.729 881s 0.012 881s testing: mul_mat, [4, 1, 4, 1] = [3, 4, 4, 1] * [3, 1, 4, 1] 881s x0: [3, 4, 4, 1] 881s -0.898 0.977 0.834 881s -0.406 0.079 0.495 881s -0.971 0.116 -0.613 881s 0.186 -0.517 -0.708 881s 881s x1: [3, 1, 4, 1] 881s -0.992 -0.961 -0.646 881s 881s y: [4, 1, 4, 1] 881s -0.588 0.007 1.248 0.770 881s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 881s x0: [3, 2, 1, 1] 881s 0.800 0.805 -0.141 881s -0.199 -0.121 0.430 881s 881s x1: [3, 4, 1, 1] 881s -0.743 0.322 -0.327 881s 0.863 0.567 0.132 881s 0.094 0.402 0.976 881s -0.310 -0.445 0.337 881s 881s y: [2, 4, 1, 1] 881s -0.289 -0.031 881s 1.128 -0.184 881s 0.261 0.352 881s -0.654 0.260 881s testing: mul_mat, [2, 3, 4, 1] = [2, 2, 4, 1] * [2, 3, 4, 1] 881s x0: [2, 2, 4, 1] 881s -0.922 0.639 881s 0.600 -0.043 881s 881s x1: [2, 3, 4, 1] 881s 0.453 -0.054 881s -0.829 -0.586 881s -0.781 0.394 881s 881s y: [2, 3, 4, 1] 881s -0.452 0.274 881s 0.390 -0.472 881s 0.972 -0.486 881s testing: mul_mat, [1, 2, 4, 1] = [2, 1, 4, 1] * [2, 2, 4, 1] 881s x0: [2, 1, 4, 1] 881s -0.177 0.753 881s 881s x1: [2, 2, 4, 1] 881s 0.644 0.022 881s -0.906 -0.603 881s 881s y: [1, 2, 4, 1] 881s -0.098 881s -0.294 881s test-mul-mat0: iter:422/500 881s testing: mul_mat, [4, 3, 1, 1] = [2, 4, 1, 1] * [2, 3, 1, 1] 881s x0: [2, 4, 1, 1] 881s 0.624 0.185 881s -0.634 -0.603 881s 0.625 -0.310 881s 0.746 -0.656 881s 881s x1: [2, 3, 1, 1] 881s 0.106 -0.358 881s -0.931 -0.872 881s -0.263 -0.533 881s 881s y: [4, 3, 1, 1] 881s 0.000 0.148 0.178 0.314 881s -0.741 1.116 -0.311 -0.122 881s -0.263 0.489 0.001 0.153 881s testing: mul_mat, [1, 4, 4, 1] = [2, 1, 4, 1] * [2, 4, 4, 1] 881s x0: [2, 1, 4, 1] 881s 0.772 -0.317 881s 881s x1: [2, 4, 4, 1] 881s 0.812 -0.070 881s 0.140 -0.465 881s 0.753 -0.106 881s 0.718 0.645 881s 881s y: [1, 4, 4, 1] 881s 0.650 881s 0.256 881s 0.615 881s 0.350 881s testing: mul_mat, [1, 1, 4, 4] = [2, 1, 4, 4] * [2, 1, 4, 4] 881s x0: [2, 1, 4, 4] 881s -0.841 -0.502 881s 881s x1: [2, 1, 4, 4] 881s 0.772 0.766 881s 881s y: [1, 1, 4, 4] 881s -1.034 881s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 881s x0: [2, 2, 1, 1] 881s 0.436 0.010 881s -0.082 0.636 881s 881s x1: [2, 1, 1, 1] 881s -0.025 -0.986 881s 881s y: [2, 1, 1, 1] 881s -0.021 -0.625 881s testing: mul_mat, [3, 2, 4, 1] = [2, 3, 4, 1] * [2, 2, 4, 1] 881s x0: [2, 3, 4, 1] 881s 0.596 -0.378 881s 0.287 0.764 881s -0.247 0.315 881s 881s x1: [2, 2, 4, 1] 881s -0.824 0.165 881s -0.146 -0.689 881s 881s y: [3, 2, 4, 1] 881s -0.554 -0.110 0.256 881s 0.173 -0.569 -0.181 881s testing: mul_mat, [4, 2, 4, 4] = [3, 4, 4, 4] * [3, 2, 4, 4] 881s x0: [3, 4, 4, 4] 881s -0.432 0.111 0.372 881s 0.853 0.147 -0.832 881s 0.923 -0.419 -0.080 881s -0.219 0.626 0.580 881s 881s x1: [3, 2, 4, 4] 881s -0.124 -0.334 -0.263 881s 0.231 -0.828 0.731 881s 881s y: [4, 2, 4, 4] 881s -0.081 0.064 0.047 -0.334 881s 0.080 -0.533 0.502 -0.145 881s test-mul-mat0: iter:423/500 881s testing: mul_mat, [3, 1, 1, 1] = [1, 3, 1, 1] * [1, 1, 1, 1] 881s x0: [1, 3, 1, 1] 881s -0.430 881s 0.796 881s -0.779 881s 881s x1: [1, 1, 1, 1] 881s -0.130 881s 881s y: [3, 1, 1, 1] 881s 0.056 -0.103 0.101 881s testing: mul_mat, [4, 3, 2, 1] = [1, 4, 2, 1] * [1, 3, 2, 1] 881s x0: [1, 4, 2, 1] 881s -0.477 881s 0.997 881s 0.600 881s 0.440 881s 881s x1: [1, 3, 2, 1] 881s -0.897 881s -0.347 881s 0.662 881s 881s y: [4, 3, 2, 1] 881s 0.428 -0.895 -0.538 -0.395 881s 0.165 -0.346 -0.208 -0.153 881s -0.316 0.661 0.398 0.292 881s testing: mul_mat, [4, 4, 2, 2] = [1, 4, 2, 2] * [1, 4, 2, 2] 881s x0: [1, 4, 2, 2] 881s 0.933 881s -0.598 881s -0.470 881s 0.569 881s 881s x1: [1, 4, 2, 2] 881s -0.264 881s 0.470 881s -0.087 881s -0.564 881s 881s y: [4, 4, 2, 2] 881s -0.246 0.158 0.124 -0.150 881s 0.439 -0.281 -0.221 0.268 881s -0.081 0.052 0.041 -0.049 881s -0.526 0.337 0.265 -0.321 881s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 881s x0: [1, 3, 1, 1] 881s -0.997 881s 0.798 881s 0.323 881s 881s x1: [1, 4, 1, 1] 881s 0.048 881s -0.869 881s -0.351 881s 0.185 881s 881s y: [3, 4, 1, 1] 881s -0.048 0.038 0.015 881s 0.866 -0.694 -0.281 881s 0.350 -0.280 -0.113 881s -0.185 0.148 0.060 881s testing: mul_mat, [1, 1, 2, 1] = [3, 1, 2, 1] * [3, 1, 2, 1] 881s x0: [3, 1, 2, 1] 881s -0.507 -0.018 0.549 881s 881s x1: [3, 1, 2, 1] 881s -0.406 0.281 0.841 881s 881s y: [1, 1, 2, 1] 881s 0.662 881s testing: mul_mat, [3, 3, 2, 2] = [1, 3, 2, 2] * [1, 3, 2, 2] 881s x0: [1, 3, 2, 2] 881s -0.153 881s -0.395 881s 0.187 881s 881s x1: [1, 3, 2, 2] 881s -0.845 881s 0.946 881s -0.547 881s 881s y: [3, 3, 2, 2] 881s 0.129 0.333 -0.158 881s -0.144 -0.373 0.177 881s 0.083 0.216 -0.102 881s test-mul-mat0: iter:424/500 881s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 881s x0: [3, 4, 1, 1] 881s 0.514 -0.019 0.348 881s -0.880 -0.832 0.993 881s -0.952 0.950 0.920 881s 0.818 0.487 -0.592 881s 881s x1: [3, 4, 1, 1] 881s 0.797 0.306 0.198 881s 0.744 0.760 -0.114 881s -0.776 0.171 0.262 881s -0.162 -0.723 0.797 881s 881s y: [4, 4, 1, 1] 881s 0.473 -0.760 -0.286 0.685 881s 0.328 -1.400 -0.091 1.046 881s -0.311 0.801 1.142 -0.707 881s 0.208 1.535 0.201 -0.956 881s testing: mul_mat, [2, 4, 4, 1] = [3, 2, 4, 1] * [3, 4, 4, 1] 881s x0: [3, 2, 4, 1] 881s -0.856 -0.756 -0.856 881s -0.904 -0.459 -0.133 881s 881s x1: [3, 4, 4, 1] 881s -0.831 -0.957 0.389 881s 0.501 -0.852 -0.568 881s 0.298 0.454 0.629 881s 0.042 0.214 -0.485 881s 881s y: [2, 4, 4, 1] 881s 1.102 1.140 881s 0.702 0.014 881s -1.137 -0.561 881s 0.218 -0.072 881s testing: mul_mat, [3, 2, 4, 3] = [3, 3, 4, 3] * [3, 2, 4, 3] 881s x0: [3, 3, 4, 3] 881s -0.992 0.482 -0.317 881s -0.528 -0.681 0.675 881s -0.791 -0.087 -0.472 881s 881s x1: [3, 2, 4, 3] 881s 0.899 0.952 -0.513 881s 0.914 0.492 -0.446 881s 881s y: [3, 2, 4, 3] 881s -0.270 -1.469 -0.551 881s -0.528 -1.119 -0.556 881s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 881s x0: [3, 3, 1, 1] 881s 0.875 0.766 0.622 881s 0.577 0.595 -0.848 881s 0.424 0.345 -0.698 881s 881s x1: [3, 3, 1, 1] 881s -0.741 -0.481 -0.813 881s -0.018 0.487 -0.040 881s -0.828 0.823 -0.822 881s 881s y: [3, 3, 1, 1] 881s -1.523 -0.025 0.086 881s 0.332 0.313 0.188 881s -0.606 0.709 0.507 881s testing: mul_mat, [1, 3, 4, 1] = [3, 1, 4, 1] * [3, 3, 4, 1] 881s x0: [3, 1, 4, 1] 881s 0.111 -0.953 0.137 881s 881s x1: [3, 3, 4, 1] 881s -0.954 0.487 -0.730 881s 0.845 -0.249 0.752 881s -0.572 -0.733 -0.665 881s 881s y: [1, 3, 4, 1] 881s -0.670 881s 0.434 881s 0.544 881s testing: mul_mat, [1, 3, 4, 3] = [1, 1, 4, 3] * [1, 3, 4, 3] 881s x0: [1, 1, 4, 3] 881s 0.935 881s 881s x1: [1, 3, 4, 3] 881s -0.750 881s 0.204 881s 0.363 881s 881s y: [1, 3, 4, 3] 881s -0.701 881s 0.191 881s 0.339 881s test-mul-mat0: iter:425/500 881s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 881s x0: [1, 1, 1, 1] 881s -0.371 881s 881s x1: [1, 4, 1, 1] 881s 0.583 881s -0.578 881s -0.844 881s 0.440 881s 881s y: [1, 4, 1, 1] 881s -0.216 881s 0.214 881s 0.313 881s -0.163 881s testing: mul_mat, [3, 1, 4, 1] = [1, 3, 4, 1] * [1, 1, 4, 1] 881s x0: [1, 3, 4, 1] 881s 0.233 881s 0.808 881s 0.099 881s 881s x1: [1, 1, 4, 1] 881s -0.923 881s 881s y: [3, 1, 4, 1] 881s -0.215 -0.746 -0.092 881s testing: mul_mat, [4, 3, 4, 3] = [1, 4, 4, 3] * [1, 3, 4, 3] 881s x0: [1, 4, 4, 3] 881s -0.411 881s 0.328 881s -0.506 881s -0.750 881s 881s x1: [1, 3, 4, 3] 881s -0.277 881s -0.654 881s 0.435 881s 881s y: [4, 3, 4, 3] 881s 0.114 -0.091 0.140 0.208 881s 0.269 -0.215 0.331 0.490 881s -0.179 0.143 -0.220 -0.326 881s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 881s x0: [1, 3, 1, 1] 881s 0.090 881s -0.425 881s 0.277 881s 881s x1: [1, 4, 1, 1] 881s -0.786 881s -0.844 881s 0.547 881s 0.095 881s 881s y: [3, 4, 1, 1] 881s -0.071 0.334 -0.217 881s -0.076 0.359 -0.234 881s 0.049 -0.233 0.151 881s 0.009 -0.041 0.026 881s testing: mul_mat, [3, 1, 4, 1] = [3, 3, 4, 1] * [3, 1, 4, 1] 881s x0: [3, 3, 4, 1] 881s 0.228 -0.705 0.277 881s 0.452 0.164 0.999 881s -0.625 -0.649 -0.201 881s 881s x1: [3, 1, 4, 1] 881s -0.527 -0.001 0.556 881s 881s y: [3, 1, 4, 1] 881s 0.034 0.318 0.218 881s testing: mul_mat, [2, 3, 4, 3] = [3, 2, 4, 3] * [3, 3, 4, 3] 881s x0: [3, 2, 4, 3] 881s 0.674 0.300 0.906 881s 0.802 0.217 -0.188 881s 881s x1: [3, 3, 4, 3] 881s -0.275 0.730 -0.379 881s 0.524 -0.871 -0.166 881s -0.522 -0.924 -0.857 881s 881s y: [2, 3, 4, 3] 881s -0.310 0.009 881s -0.058 0.263 881s -1.405 -0.458 881s test-mul-mat0: iter:426/500 881s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 881s x0: [1, 3, 1, 1] 881s -0.879 881s 0.959 881s 0.122 881s 881s x1: [1, 4, 1, 1] 881s 0.887 881s 0.279 881s -0.659 881s -0.240 881s 881s y: [3, 4, 1, 1] 881s -0.779 0.851 0.108 881s -0.246 0.268 0.034 881s 0.579 -0.632 -0.081 881s 0.211 -0.230 -0.029 881s testing: mul_mat, [3, 3, 2, 1] = [1, 3, 2, 1] * [1, 3, 2, 1] 881s x0: [1, 3, 2, 1] 881s 0.795 881s 0.374 881s 0.789 881s 881s x1: [1, 3, 2, 1] 881s -0.509 881s -0.599 881s -0.718 881s 881s y: [3, 3, 2, 1] 881s -0.405 -0.190 -0.402 881s -0.477 -0.224 -0.473 881s -0.571 -0.269 -0.567 881s testing: mul_mat, [2, 3, 2, 3] = [1, 2, 2, 3] * [1, 3, 2, 3] 881s x0: [1, 2, 2, 3]ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s 881s -0.002 881s -0.519 881s 881s x1: [1, 3, 2, 3] 881s 0.661 881s 0.256 881s 0.080 881s 881s y: [2, 3, 2, 3] 881s -0.001 -0.343 881s -0.000 -0.133 881s -0.000 -0.042 881s testing: mul_mat, [3, 2, 1, 1] = [1, 3, 1, 1] * [1, 2, 1, 1] 881s x0: [1, 3, 1, 1] 881s -0.191 881s -0.759 881s -0.826 881s 881s x1: [1, 2, 1, 1] 881s 0.637 881s 0.678 881s 881s y: [3, 2, 1, 1] 881s -0.122 -0.483 -0.526 881s -0.130 -0.515 -0.560 881s testing: mul_mat, [4, 1, 2, 1] = [3, 4, 2, 1] * [3, 1, 2, 1] 881s x0: [3, 4, 2, 1] 881s -0.252 0.348 0.666 881s 0.149 -0.904 -0.455 881s -0.653 -0.656 0.879 881s 0.345 0.829 -0.736 881s 881s x1: [3, 1, 2, 1] 881s 0.154 -0.768 -0.517 881s 881s y: [4, 1, 2, 1] 881s -0.650 0.952 -0.050 -0.203 881s testing: mul_mat, [1, 3, 2, 3] = [4, 1, 2, 3] * [4, 3, 2, 3] 881s x0: [4, 1, 2, 3] 881s 0.123 0.773 -0.581 0.422 881s 881s x1: [4, 3, 2, 3] 881s -0.882 0.876 0.806 -0.932 881s 0.986 0.878 0.156 -0.266 881s 0.027 0.502 -0.920 -0.625 881s 881s y: [1, 3, 2, 3] 881s -0.293 881s 0.597 881s 0.663 881s test-mul-mat0: iter:427/500 881s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 881s x0: [2, 1, 1, 1] 881s -0.423 0.023 881s 881s x1: [2, 4, 1, 1] 881s -0.772 0.609 881s -0.976 0.351 881s 0.382 -0.557 881s -0.227 -0.990 881s 881s y: [1, 4, 1, 1] 881s 0.340 881s 0.421 881s -0.174 881s 0.073 881s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 881s x0: [2, 3, 1, 1] 881s 0.591 -0.409 881s 0.280 0.672 881s -0.581 -0.247 881s 881s x1: [2, 1, 1, 1] 881s 0.456 -0.853 881s 881s y: [3, 1, 1, 1] 881s 0.619 -0.446 -0.054 881s testing: mul_mat, [2, 3, 1, 2] = [2, 2, 1, 2] * [2, 3, 1, 2] 881s x0: [2, 2, 1, 2] 881s -0.871 -0.306 881s -0.954 -0.428 881s 881s x1: [2, 3, 1, 2] 881s -0.151 0.787 881s -0.974 -0.611 881s -0.276 0.916 881s 881s y: [2, 3, 1, 2] 881s -0.109 -0.193 881s 1.035 1.191 881s -0.040 -0.129 881s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 881s x0: [2, 2, 1, 1] 881s -0.218 -0.626 881s -0.042 0.201 881s 881s x1: [2, 2, 1, 1] 881s 0.079 0.112 881s -0.809 0.678 881s 881s y: [2, 2, 1, 1] 881s -0.087 0.019 881s -0.248 0.170 881s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 881s x0: [2, 4, 1, 1] 881s 0.712 -0.232 881s 0.653 -0.704 881s 0.241 0.064 881s -0.041 -0.889 881s 881s x1: [2, 2, 1, 1] 881s 0.711 0.224 881s -0.012 0.737 881s 881s y: [4, 2, 1, 1] 881s 0.454 0.307 0.185 -0.228 881s -0.179 -0.527 0.044 -0.655 881s testing: mul_mat, [2, 2, 1, 2] = [4, 2, 1, 2] * [4, 2, 1, 2] 881s x0: [4, 2, 1, 2] 881s -0.509 -0.169 -0.776 0.191 881s -0.513 0.479 -0.556 0.877 881s 881s x1: [4, 2, 1, 2] 881s -0.040 0.192 -0.195 0.005 881s 0.764 -0.727 0.061 0.420 881s 881s y: [2, 2, 1, 2] 881s 0.140 0.226 881s -0.234 -0.406 881s test-mul-mat0: iter:428/500 881s testing: mul_mat, [4, 4, 1, 1] = [1, 4, 1, 1] * [1, 4, 1, 1] 881s x0: [1, 4, 1, 1] 881s -0.342 881s -0.036 881s 0.355 881s 0.948 881s 881s x1: [1, 4, 1, 1] 881s 0.007 881s 0.590 881s -0.850 881s 0.146 881s 881s y: [4, 4, 1, 1] 881s -0.002 -0.000 0.002 0.006 881s -0.202 -0.021 0.209 0.560 881s 0.290 0.030 -0.301 -0.806 881s -0.050 -0.005 0.052 0.139 881s testing: mul_mat, [2, 4, 2, 1] = [1, 2, 2, 1] * [1, 4, 2, 1] 881s x0: [1, 2, 2, 1] 881s -0.671 881s 0.923 881s 881s x1: [1, 4, 2, 1] 881s 0.741 881s -0.038 881s -0.561 881s -0.772 881s 881s y: [2, 4, 2, 1] 881s -0.497 0.684 881s 0.025 -0.035 881s 0.376 -0.518 881s 0.518 -0.713 881s testing: mul_mat, [2, 2, 2, 2] = [1, 2, 2, 2] * [1, 2, 2, 2] 881s x0: [1, 2, 2, 2] 881s 0.796 881s -0.359 881s 881s x1: [1, 2, 2, 2] 881s 0.904 881s -0.526 881s 881s y: [2, 2, 2, 2] 881s 0.720 -0.325 881s -0.419 0.189 881s testing: mul_mat, [3, 2, 1, 1] = [1, 3, 1, 1] * [1, 2, 1, 1] 881s x0: [1, 3, 1, 1] 881s -0.681 881s 0.876 881s -0.127 881s 881s x1: [1, 2, 1, 1] 881s 0.880 881s 0.648 881s 881s y: [3, 2, 1, 1] 881s -0.599 0.771 -0.112 881s -0.441 0.568 -0.082 881s testing: mul_mat, [1, 1, 2, 1] = [3, 1, 2, 1] * [3, 1, 2, 1] 881s x0: [3, 1, 2, 1] 881s 0.631 -0.062 0.080 881s 881s x1: [3, 1, 2, 1] 881s -0.763 -0.672 0.111 881s 881s y: [1, 1, 2, 1] 881s -0.431 881s testing: mul_mat, [1, 3, 2, 2] = [1, 1, 2, 2] * [1, 3, 2, 2] 881s x0: [1, 1, 2, 2] 881s -0.134 881s 881s x1: [1, 3, 2, 2] 881s -0.349 881s -0.428 881s 0.010 881s 881s y: [1, 3, 2, 2] 881s 0.047 881s 0.058 881s -0.001 881s test-mul-mat0: iter:429/500 881s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 881s x0: [4, 2, 1, 1] 881s 0.600 0.485 0.171 -0.182 881s -0.984 -0.968 0.683 0.626 881s 881s x1: [4, 4, 1, 1] 881s -0.236 0.727 -0.371 0.370 881s 0.358 0.567 -0.550 -0.282 881s 0.349 -0.590 0.369 0.921 881s 0.420 0.143 -0.711 0.072 881s 881s y: [2, 4, 1, 1] 881s 0.080 -0.493 881s 0.447 -1.453 881s -0.182 1.057 881s 0.187 -0.992 881s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 881s x0: [4, 1, 1, 1] 881s -0.237 -0.421 0.900 0.213 881s 881s x1: [4, 2, 1, 1] 881s 0.212 -0.575 -0.890 -0.371 881s 0.495 0.704 0.393 0.221 881s 881s y: [1, 2, 1, 1] 881s -0.689 881s -0.012 881s testing: mul_mat, [3, 1, 1, 2] = [4ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s , 3, 1, 2] * [4, 1, 1, 2] 881s x0: [4, 3, 1, 2] 881s 1.000 0.060 0.601 0.171 881s 0.877 0.617 0.203 0.561 881s 0.243 -0.585 0.985 0.353 881s 881s x1: [4, 1, 1, 2] 881s 0.297 0.250 0.623 -0.334 881s 881s y: [3, 1, 1, 2] 881s 0.629 0.354 0.422 881s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 881s x0: [4, 1, 1, 1] 881s 0.426 0.669 0.657 0.841 881s 881s x1: [4, 3, 1, 1] 881s -0.963 0.244 0.712 0.080 881s 0.053 0.172 0.196 0.052 881s -0.769 -0.204 -0.777 -0.891 881s 881s y: [1, 3, 1, 1] 881s 0.288 881s 0.310 881s -1.723 881s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 881s x0: [1, 2, 1, 1] 881s 0.322 881s 0.836 881s 881s x1: [1, 4, 1, 1] 881s 0.655 881s 0.010 881s -0.115 881s 0.135 881s 881s y: [2, 4, 1, 1] 881s 0.211 0.547 881s 0.003 0.008 881s -0.037 -0.097 881s 0.043 0.113 881s testing: mul_mat, [4, 1, 1, 2] = [2, 4, 1, 2] * [2, 1, 1, 2] 881s x0: [2, 4, 1, 2] 881s 0.695 0.002 881s 0.291 -0.947 881s -0.028 0.991 881s -0.061 -0.826 881s 881s x1: [2, 1, 1, 2] 881s 0.459 -0.478 881s 881s y: [4, 1, 1, 2] 881s 0.317 0.586 -0.487 0.367 881s test-mul-mat0: iter:430/500 881s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 881s x0: [3, 1, 1, 1] 881s 0.261 0.926 0.401 881s 881s x1: [3, 2, 1, 1] 881s 0.125 0.418 -0.027 881s 0.193 -0.261 -0.191 881s 881s y: [1, 2, 1, 1] 881s 0.409 881s -0.268 881s testing: mul_mat, [3, 1, 4, 1] = [3, 3, 4, 1] * [3, 1, 4, 1] 881s x0: [3, 3, 4, 1] 881s -0.140 -0.211 -0.028 881s 0.553 -0.028 0.087 881s -0.914 -0.189 -0.788 881s 881s x1: [3, 1, 4, 1] 881s -0.804 0.621 -0.308 881s 881s y: [3, 1, 4, 1] 881s -0.010 -0.488 0.859 881s testing: mul_mat, [4, 3, 4, 3] = [3, 4, 4, 3] * [3, 3, 4, 3] 881s x0: [3, 4, 4, 3] 881s -0.932 -0.447 -0.680 881s 0.294 -0.641 0.063 881s -0.742 0.763 0.624 881s 0.844 -0.143 -0.264 881s 881s x1: [3, 3, 4, 3] 881s 0.132 0.343 -0.441 881s 0.344 -0.154 -0.657 881s -0.251 0.089 -0.064 881s 881s y: [4, 3, 4, 3] 881s 0.023 -0.209 -0.111 0.179 881s 0.195 0.158 -0.782 0.486 881s 0.238 -0.135 0.214 -0.208 881s testing: mul_mat, [4, 4, 1, 1] = [3, 4, 1, 1] * [3, 4, 1, 1] 881s x0: [3, 4, 1, 1] 881s -0.995 -0.480 -0.574 881s -0.758 -0.355 0.825 881s -0.233 0.533 -0.121 881s 0.988 0.001 0.229 881s 881s x1: [3, 4, 1, 1] 881s 0.386 -0.298 0.630 881s -0.337 0.816 0.308 881s 0.979 0.097 -0.675 881s -0.937 0.569 0.329 881s 881s y: [4, 4, 1, 1] 881s -0.602 0.333 -0.325 0.525 881s -0.232 0.220 0.476 -0.263 881s -0.633 -1.333 -0.095 0.813 881s 0.470 0.780 0.482 -0.850 881s testing: mul_mat, [3, 3, 4, 1] = [4, 3, 4, 1] * [4, 3, 4, 1] 881s x0: [4, 3, 4, 1] 881s 0.891 -0.055 -0.356 -0.230 881s -0.082 0.699 -0.988 -0.850 881s 0.856 -0.379 -0.614 0.030 881s 881s x1: [4, 3, 4, 1] 881s -0.897 0.448 -0.218 -0.238 881s 0.603 0.410 -0.852 -0.695 881s 0.041 -0.189 -0.879 -0.652 881s 881s y: [3, 3, 4, 1] 881s -0.692 0.804 -0.811 881s 0.977 1.670 0.863 881s 0.509 1.287 0.627 881s testing: mul_mat, [2, 4, 4, 3] = [3, 2, 4, 3] * [3, 4, 4, 3] 881s x0: [3, 2, 4, 3] 881s 0.420 0.304 0.151 881s -0.322 -0.888 -0.352 881s 881s x1: [3, 4, 4, 3] 881s -0.615 0.311 0.515 881s 0.201 0.708 -0.976 881s -0.862 0.040 0.285 881s -0.586 0.982 -0.850 881s 881s y: [2, 4, 4, 3] 881s -0.086 -0.260 881s 0.152 -0.351 881s -0.307 0.141 881s -0.076 -0.385 881s test-mul-mat0: iter:431/500 881s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 881s x0: [3, 1, 1, 1] 881s 0.042 -0.146 -0.560 881s 881s x1: [3, 1, 1, 1] 881s 0.879 0.136 0.131 881s 881s y: [1, 1, 1, 1] 881s -0.056 881s testing: mul_mat, [2, 1, 4, 1] = [3, 2, 4, 1] * [3, 1, 4, 1] 881s x0: [3, 2, 4, 1] 881s -0.677 0.768 -0.248 881s 0.316 -0.079 0.154 881s 881s x1: [3, 1, 4, 1] 881s 0.028 0.244 0.067 881s 881s y: [2, 1, 4, 1] 881s 0.151 0.000 881s testing: mul_mat, [3, 2, 4, 2] = [3, 3, 4, 2] * [3, 2, 4, 2] 881s x0: [3, 3, 4, 2] 881s -0.659 -0.420 0.859 881s 0.459 0.407 0.865 881s 0.487 0.445 0.958 881s 881s x1: [3, 2, 4, 2] 881s -0.076 0.670 0.469 881s -0.793 0.392 0.302 881s 881s y: [3, 2, 4, 2] 881s 0.172 0.643 0.710 881s 0.617 0.057 0.077 881s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 881s x0: [3, 3, 1, 1] 881s -0.089 0.099 -0.354 881s -0.986 -0.386 0.057 881s 0.146 0.084 -0.070 881s 881s x1: [3, 3, 1, 1] 881s 0.992 0.175 0.257 881s 0.788 -0.628 -0.447 881s -0.663 0.649 0.732 881s 881s y: [3, 3, 1, 1] 881s -0.162 -1.031 0.142 881s 0.026 -0.560 0.094 881s -0.136 0.445 -0.094 881s testing: mul_mat, [3, 3, 4, 1] = [3, 3, 4, 1] * [3, 3, 4, 1] 881s x0: [3, 3, 4, 1] 881s -0.301 0.650 -0.406 881s 0.270 -0.801 -0.697 881s 0.338 -0.141 0.637 881s 881s x1: [3, 3, 4, 1] 881s 0.490 -0.961 0.026 881s -0.789 0.420 -0.170 881s -0.783 0.064 0.831 881s 881s y: [3, 3, 4, 1] 881s -0.783 0.884 0.318 881s 0.579 -0.431 -0.434 881s -0.060 -0.841 0.255 881s testing: mul_mat, [3, 3, 4, 2] = [3, 3, 4, 2] * [3ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s , 3, 4, 2] 881s x0: [3, 3, 4, 2] 881s 0.795 0.175 0.411 881s 0.980 0.723 -0.168 881s 0.768 -0.938 -0.820 881s 881s x1: [3, 3, 4, 2] 881s 0.280 -0.452 -0.690 881s -0.083 0.874 0.089 881s 0.885 -0.731 0.992 881s 881s y: [3, 3, 4, 2] 881s -0.140 0.063 1.205 881s 0.123 0.535 -0.956 881s 0.983 0.173 0.551 881s test-mul-mat0: iter:432/500 881s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 881s x0: [1, 3, 1, 1] 881s -0.621 881s -0.093 881s 0.225 881s 881s x1: [1, 4, 1, 1] 881s 0.574 881s -0.396 881s 0.721 881s 0.407 881s 881s y: [3, 4, 1, 1] 881s -0.356 -0.053 0.129 881s 0.246 0.037 -0.089 881s -0.447 -0.067 0.162 881s -0.252 -0.038 0.091 881s testing: mul_mat, [2, 3, 1, 1] = [1, 2, 1, 1] * [1, 3, 1, 1] 881s x0: [1, 2, 1, 1] 881s -0.396 881s 0.321 881s 881s x1: [1, 3, 1, 1] 881s 0.494 881s -0.747 881s 0.549 881s 881s y: [2, 3, 1, 1] 881s -0.196 0.159 881s 0.296 -0.240 881s -0.218 0.176 881s testing: mul_mat, [4, 2, 1, 2] = [1, 4, 1, 2] * [1, 2, 1, 2] 881s x0: [1, 4, 1, 2] 881s -0.820 881s 0.311 881s 0.706 881s 0.096 881s 881s x1: [1, 2, 1, 2] 881s -0.572 881s -0.981 881s 881s y: [4, 2, 1, 2] 881s 0.469 -0.178 -0.404 -0.055 881s 0.805 -0.305 -0.692 -0.094 881s testing: mul_mat, [4, 4, 1, 1] = [1, 4, 1, 1] * [1, 4, 1, 1] 881s x0: [1, 4, 1, 1] 881s -0.862 881s -0.678 881s 0.180 881s 0.774 881s 881s x1: [1, 4, 1, 1] 881s -0.399 881s -0.800 881s 0.534 881s -0.398 881s 881s y: [4, 4, 1, 1] 881s 0.344 0.271 -0.072 -0.309 881s 0.689 0.542 -0.144 -0.620 881s -0.460 -0.362 0.096 0.413 881s 0.343 0.270 -0.072 -0.309 881s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 881s x0: [4, 4, 1, 1] 881s -0.452 -0.338 0.442 -0.913 881s -0.766 -0.037 -0.009 -0.177 881s -0.056 -0.188 -0.566 0.680 881s 0.870 0.123 0.147 0.958 881s 881s x1: [4, 1, 1, 1] 881s -0.298 -0.912 -0.001 -0.804 881s 881s y: [4, 1, 1, 1] 881s 1.177 0.405 -0.357 -1.142 881s testing: mul_mat, [3, 4, 1, 2] = [4, 3, 1, 2] * [4, 4, 1, 2] 881s x0: [4, 3, 1, 2] 881s 0.015 -0.053 -0.941 0.259 881s 0.200 0.016 -0.393 0.197 881s -0.122 -0.918 0.988 -0.910 881s 881s x1: [4, 4, 1, 2] 881s -0.856 -0.720 -0.842 0.678 881s -0.118 0.932 0.816 0.204 881s 0.112 0.591 0.906 0.200 881s -0.410 -0.898 0.540 0.138 881s 881s y: [3, 4, 1, 2] 881s 0.993 0.282 -0.684 881s -0.766 -0.290 -0.220 881s -0.830 -0.285 0.158 881s -0.431 -0.281 1.282 881s test-mul-mat0: iter:433/500 881s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 881s x0: [3, 2, 1, 1] 881s 0.015 -0.594 0.806 881s 0.350 -0.168 0.089 881s 881s x1: [3, 4, 1, 1] 881s 0.726 -0.004 0.367 881s -0.259 -0.804 -0.755 881s 0.688 0.211 -0.673 881s 0.747 0.818 -0.685 881s 881s y: [2, 4, 1, 1] 881s 0.309 0.288 881s -0.134 -0.023 881s -0.658 0.145 881s -1.027 0.063 881s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 881s x0: [3, 2, 1, 1] 881s 0.896 0.094 -0.304 881s 0.371 -0.536 -0.577 881s 881s x1: [3, 2, 1, 1] 881s -0.308 -0.745 0.820 881s 0.854 0.799 0.439 881s 881s y: [2, 2, 1, 1] 881s -0.595 -0.188 881s 0.707 -0.364 881s testing: mul_mat, [2, 2, 1, 2] = [3, 2, 1, 2] * [3, 2, 1, 2] 881s x0: [3, 2, 1, 2] 881s 0.125 0.412 0.958 881s 0.956 -0.499 -0.350 881s 881s x1: [3, 2, 1, 2] 881s -0.633 0.831 0.164 881s -0.437 -0.924 -0.148 881s 881s y: [2, 2, 1, 2] 881s 0.421 -1.077 881s -0.577 0.095 881s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 881s x0: [3, 1, 1, 1] 881s 0.149 -0.387 -0.792 881s 881s x1: [3, 2, 1, 1] 881s 0.906 -0.147 0.026 881s 0.278 0.318 0.449 881s 881s y: [1, 2, 1, 1] 881s 0.171 881s -0.437 881s testing: mul_mat, [2, 3, 1, 1] = [1, 2, 1, 1] * [1, 3, 1, 1] 881s x0: [1, 2, 1, 1] 881s -0.935 881s 0.575 881s 881s x1: [1, 3, 1, 1] 881s 0.225 881s 0.465 881s 0.982 881s 881s y: [2, 3, 1, 1] 881s -0.211 0.129 881s -0.435 0.267 881s -0.919 0.564 881s testing: mul_mat, [3, 1, 1, 2] = [2, 3, 1, 2] * [2, 1, 1, 2] 881s x0: [2, 3, 1, 2] 881s -0.860 -0.211 881s 0.428 0.639 881s 0.584 0.905 881s 881s x1: [2, 1, 1, 2] 881s -0.653 0.671 881s 881s y: [3, 1, 1, 2] 881s 0.420 0.149 0.226 881s test-mul-mat0: iter:434/500 881s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 881s x0: [4, 1, 1, 1] 881s -0.758 -0.815 0.912 -0.576 881s 881s x1: [4, 2, 1, 1] 881s -0.522 -0.016 -0.289 -0.315 881s -0.791 -0.824 -0.333 0.838 881s 881s y: [1, 2, 1, 1] 881s 0.327 881s 0.485 881s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 881s x0: [4, 2, 1, 1] 881s 0.353 -0.276 -0.518 -0.354 881s -0.627 -0.854 -0.731 0.928 881s 881s x1: [4, 1, 1, 1] 881s 0.657 -0.006 -0.436 0.085 881s 881s y: [2, 1, 1, 1] 881s 0.429 -0.010 881s testing: mul_mat, [2, 2, 1, 4] = [4, 2, 1, 4] * [4, 2, 1, 4] 881s x0: [4, 2, 1, 4] 881s -0.760 0.421 -0.974 0.479 881s -0.097 0.754 0.403 -0.405 881s 881s x1: [4, 2, 1, 4] 881s -0.336 -0.081 -0.239 -0.501 881s 0.017 0.239 0.483 0.729 881s 881s y: [2, 2, 1, 4] 881s 0.214 0.078 881s -0.034 0.078 881s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s [4, 2, 1, 1] 881s x0: [4, 4, 1, 1] 881s -0.121 0.106 0.807 -0.567 881s 0.243 0.901 0.590 0.222 881s 0.817 0.926 -0.435 0.786 881s -0.282 -0.979 -0.114 -0.173 881s 881s x1: [4, 2, 1, 1] 881s 0.257 -0.976 0.593 -0.840 881s 0.778 -0.004 -0.245 0.438 881s 881s y: [4, 2, 1, 1] 881s 0.820 -0.653 -1.612 0.960 881s -0.540 0.138 1.083 -0.264 881s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 881s x0: [4, 2, 1, 1] 881s -0.789 -0.636 0.668 0.919 881s 0.742 -0.888 -0.615 0.258 881s 881s x1: [4, 4, 1, 1] 881s -0.864 -0.225 -0.299 0.972 881s -0.226 -0.185 0.229 -0.202 881s -0.592 0.389 -0.424 0.404 881s -0.855 -0.986 -0.606 0.024 881s 881s y: [2, 4, 1, 1] 881s 1.518 -0.007 881s 0.263 -0.197 881s 0.308 -0.420 881s 0.919 0.619 881s testing: mul_mat, [3, 4, 1, 4] = [2, 3, 1, 4] * [2, 4, 1, 4] 881s x0: [2, 3, 1, 4] 881s -0.573 -0.054 881s -0.341 -0.032 881s 0.878 0.245 881s 881s x1: [2, 4, 1, 4] 881s -0.049 -0.195 881s 0.691 -0.828 881s -0.409 -0.481 881s -0.692 0.366 881s 881s y: [3, 4, 1, 4] 881s 0.039 0.023 -0.091 881s -0.351 -0.209 0.405 881s 0.261 0.155 -0.477 881s 0.376 0.224 -0.518 881s test-mul-mat0: iter:435/500 881s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 881s x0: [3, 2, 1, 1] 881s -0.799 -0.978 -0.471 881s -0.042 -0.655 0.172 881s 881s x1: [3, 3, 1, 1] 881s -0.209 0.326 0.514 881s 0.218 0.985 0.392 881s -0.836 -0.047 -0.363 881s 881s y: [2, 3, 1, 1] 881s -0.394 -0.116 881s -1.322 -0.587 881s 0.886 0.004 881s testing: mul_mat, [2, 2, 3, 1] = [3, 2, 3, 1] * [3, 2, 3, 1] 881s x0: [3, 2, 3, 1] 881s 0.154 -0.552 -0.109 881s -0.893 0.085 0.220 881s 881s x1: [3, 2, 3, 1] 881s -0.385 -0.817 0.661 881s 0.316 0.549 -0.610 881s 881s y: [2, 2, 3, 1] 881s 0.320 0.420 881s -0.188 -0.370 881s testing: mul_mat, [1, 2, 3, 2] = [3, 1, 3, 2] * [3, 2, 3, 2] 881s x0: [3, 1, 3, 2] 881s -0.298 0.869 0.032 881s 881s x1: [3, 2, 3, 2] 881s 0.282 -0.116 -0.457 881s 0.944 0.889 0.858 881s 881s y: [1, 2, 3, 2] 881s -0.199 881s 0.519 881s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 881s x0: [3, 4, 1, 1] 881s 0.633 0.809 0.894 881s 0.561 -0.739 -0.526 881s -0.513 0.772 0.814 881s 0.730 0.945 -0.074 881s 881s x1: [3, 1, 1, 1] 881s -0.003 -0.183 -0.802 881s 881s y: [4, 1, 1, 1] 881s -0.867 0.555 -0.793 -0.115 881s testing: mul_mat, [1, 3, 3, 1] = [4, 1, 3, 1] * [4, 3, 3, 1] 881s x0: [4, 1, 3, 1] 881s -0.377 -0.604 -0.227 0.859 881s 881s x1: [4, 3, 3, 1] 881s 0.346 -0.415 -0.587 0.115 881s 0.412 -0.197 -0.328 -0.597 881s 0.180 0.236 0.865 -0.349 881s 881s y: [1, 3, 3, 1] 881s 0.352 881s -0.474 881s -0.706 881s testing: mul_mat, [1, 4, 3, 2] = [1, 1, 3, 2] * [1, 4, 3, 2] 881s x0: [1, 1, 3, 2] 881s -0.847 881s 881s x1: [1, 4, 3, 2] 881s -0.224 881s -0.281 881s 0.387 881s -0.927 881s 881s y: [1, 4, 3, 2] 881s 0.190 881s 0.238 881s -0.328 881s 0.786 881s test-mul-mat0: iter:436/500 881s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 881s x0: [4, 1, 1, 1] 881s 0.681 0.692 0.711 0.140 881s 881s x1: [4, 1, 1, 1] 881s -0.757 -0.895 -0.401 0.223 881s 881s y: [1, 1, 1, 1] 881s -1.389 881s testing: mul_mat, [2, 1, 2, 1] = [4, 2, 2, 1] * [4, 1, 2, 1] 881s x0: [4, 2, 2, 1] 881s 0.872 -0.731 0.888 -0.975 881s -0.375 -0.978 -0.855 -0.814 881s 881s x1: [4, 1, 2, 1] 881s -0.002 -0.908 0.389 -0.746 881s 881s y: [2, 1, 2, 1] 881s 1.734 1.163 881s testing: mul_mat, [1, 2, 2, 3] = [4, 1, 2, 3] * [4, 2, 2, 3] 881s x0: [4, 1, 2, 3] 881s 0.225 -0.024 -0.209 -0.723 881s 881s x1: [4, 2, 2, 3] 881s 0.932 0.612 -0.711 0.624 881s 0.322 0.429 -0.379 0.415 881s 881s y: [1, 2, 2, 3] 881s -0.107 881s -0.158 881s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 881s x0: [4, 1, 1, 1] 881s 0.016 0.329 -0.594 0.993 881s 881s x1: [4, 1, 1, 1] 881s 0.152 -0.137 -0.946 -0.653 881s 881s y: [1, 1, 1, 1] 881s -0.129 881s testing: mul_mat, [3, 4, 2, 1] = [1, 3, 2, 1] * [1, 4, 2, 1] 881s x0: [1, 3, 2, 1] 881s -0.464 881s -0.292 881s -0.253 881s 881s x1: [1, 4, 2, 1] 881s -0.880 881s -0.316 881s 0.169 881s 0.836 881s 881s y: [3, 4, 2, 1] 881s 0.409 0.257 0.223 881s 0.147 0.093 0.080 881s -0.079 -0.049 -0.043 881s -0.388 -0.244 -0.212 881s testing: mul_mat, [2, 1, 2, 3] = [3, 2, 2, 3] * [3, 1, 2, 3] 881s x0: [3, 2, 2, 3] 881s -0.861 -0.963 -0.664 881s -0.189 -0.787 -0.737 881s 881s x1: [3, 1, 2, 3] 881s -0.713 0.498 0.197 881s 881s y: [2, 1, 2, 3] 881s 0.003 -0.403 881s test-mul-mat0: iter:437/500 881s testing: mul_mat, [3, 1, 1, 1] = [3, 3, 1, 1] * [3, 1, 1, 1] 881s x0: [3, 3, 1, 1] 881s 0.568 0.489 0.360 881s -0.647 -0.484 -0.665 881s 0.644 0.029 -0.841 881s 881s x1: [3, 1, 1, 1] 881s 0.311 -0.864 0.086 881s 881s y: [3, 1, 1, 1] 881s -0.215 0.160 0.103 881s testing: mul_mat, [3, 3, 2, 1] = [3, 3, 2, 1] * [3, 3, 2, 1] 881s x0: [3, 3, 2, 1] 881s -0.200 -0.593 -0.283 881s -0.633 0.896 -0.923 881s -0.280 -0.588 -0.588 881s 881s x1: [3, 3, 2, 1] 881s 0.789 -0.790 0.846 881s -0.884 -0.535 0.391 881s -0.254 -0.522 0.963 881s 881s y: [3, 3, 2, 1] 881s 0.070 -1.988 -0.254 881s 0.383 -0.282 0.331 881s 0.088 -1.197 -0.188 881s testing: mul_mat, [2, 3, 2, 4] = [3, 2, 2, 4] * [3, 3, 2, 4] 881s x0: [3, 2, 2, 4] 881s 0.709 -0.749 -0.622 881s 0.015 -0.924 0.278 881s 881s x1: [3, 3, 2, 4] 881s 0.016 -0.405 0.771 881s -0.469 0.895 -0.658 881s 0.028 -0.645 0.783 881s 881s y: [2, 3, 2, 4] 881s -0.165 0.589 881s -0.593 -1.017 881s 0.015 0.814 881s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 881s x0: [3, 2, 1, 1] 881s 0.783 0.612 0.353 881s -0.697 0.327 0.864 881s 881s x1: [3, 2, 1, 1] 881s 0.630 0.120 0.259 881s 0.390 0.006 0.168 881s 881s y: [2, 2, 1, 1] 881s 0.658 -0.176 881s 0.369 -0.125 881s testing: mul_mat, [4, 3, 2, 1] = [2, 4, 2, 1] * [2, 3, 2, 1] 881s x0: [2, 4, 2, 1] 881s -0.014 0.478 881s 0.809 0.444 881s 0.815 -0.290 881s 0.589 -0.264 881s 881s x1: [2, 3, 2, 1] 881s -0.466 -0.889 881s -0.634 -0.880 881s 0.885 -0.054 881s 881s y: [4, 3, 2, 1] 881s -0.418 -0.772 -0.121 -0.040 881s -0.411 -0.903 -0.261 -0.142 881s -0.039 0.691 0.736 0.536 881s testing: mul_mat, [2, 2, 2, 4] = [4, 2, 2, 4] * [4, 2, 2, 4] 881s x0: [4, 2, 2, 4] 881s -0.712 0.179 -0.020 -0.121 881s -0.915 0.112 0.906 0.804 881s 881s x1: [4, 2, 2, 4] 881s 0.560 0.605 0.360 0.671 881s 0.971 0.480 0.555 -0.083 881s 881s y: [2, 2, 2, 4] 881s -0.379 0.421 881s -0.607 -0.398 881s test-mul-mat0: iter:438/500 881s testing: mul_mat, [3, 1, 1, 1] = [3, 3, 1, 1] * [3, 1, 1, 1] 881s x0: [3, 3, 1, 1] 881s 0.212 0.180 -0.951 881s -0.566 -0.532 -0.936 881s -0.747 0.663 -0.595 881s 881s x1: [3, 1, 1, 1] 881s -0.650 -0.040 0.294 881s 881s y: [3, 1, 1, 1] 881s -0.424 0.115 0.285 881s testing: mul_mat, [2, 3, 4, 1] = [3, 2, 4, 1] * [3, 3, 4, 1] 881s x0: [3, 2, 4, 1] 881s -0.925 0.476 -0.795 881s -0.282 0.980 0.841 881s 881s x1: [3, 3, 4, 1] 881s -0.776 0.804 0.178 881s 0.731 -0.411 0.090 881s -0.802 -0.879 -0.729 881s 881s y: [2, 3, 4, 1] 881s 0.959 1.156 881s -0.943 -0.533 881s 0.903 -1.249 881s testing: mul_mat, [4, 2, 4, 4] = [3, 4, 4, 4] * [3, 2, 4, 4] 881s x0: [3, 4, 4, 4] 881s 0.376 -0.719 -0.660 881s -0.289 -0.106 0.581 881s -0.880 0.876 0.711 881s -0.786 0.913 -0.650 881s 881s x1: [3, 2, 4, 4] 881s -0.409 0.868 -0.928 881s 0.436 0.746 0.427 881s 881s y: [4, 2, 4, 4] 881s -0.165 -0.513 0.460 1.717 881s -0.653 0.043 0.572 0.061 881s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 881s x0: [3, 3, 1, 1] 881s -0.385 0.018 -0.087 881s -0.658 0.485 -0.216 881s -0.057 -0.027 0.595 881s 881s x1: [3, 4, 1, 1] 881s 0.581 0.299 0.776 881s 0.516 -0.671 -0.482 881s 0.979 -0.079 0.831 881s 0.417 -0.866 0.615 881s 881s y: [3, 4, 1, 1] 881s -0.286 -0.405 0.421 881s -0.169 -0.560 -0.298 881s -0.451 -0.862 0.441 881s -0.230 -0.827 0.366 881s testing: mul_mat, [2, 3, 4, 1] = [3, 2, 4, 1] * [3, 3, 4, 1] 881s x0: [3, 2, 4, 1] 881s -0.104 0.294 0.848 881s 0.414 -0.876 0.249 881s 881s x1: [3, 3, 4, 1] 881s 0.730 0.542 -0.204 881s 0.155 -0.794 -0.930 881s 0.646 0.136 -0.820 881s 881s y: [2, 3, 4, 1] 881s -0.089 -0.224 881s -1.039 0.527 881s -0.723 -0.056 881s testing: mul_mat, [3, 3, 4, 4] = [2, 3, 4, 4] * [2, 3, 4, 4] 881s x0: [2, 3, 4, 4] 881s -0.464 -0.889 881s 0.756 -0.787 881s -0.659 0.469 881s 881s x1: [2, 3, 4, 4] 881s 0.045 -0.537 881s -0.446 -0.910 881s 0.713 -0.168 881s 881s y: [3, 3, 4, 4] 881s 0.457 0.457 -0.282 881s 1.016 0.379 -0.133 881s -0.181 0.671 -0.549 881s test-mul-mat0: iter:439/500 881s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 881s x0: [4, 1, 1, 1] 881s -0.662 0.068 -0.623 0.193 881s 881s x1: [4, 1, 1, 1] 881s -0.613 -0.858 0.200 0.018 881s 881s y: [1, 1, 1, 1] 881s 0.226 881s testing: mul_mat, [1, 1, 2, 1] = [4, 1, 2, 1] * [4, 1, 2, 1] 881s x0: [4, 1, 2, 1] 881s -0.933 -0.768 0.420 -0.890 881s 881s x1: [4, 1, 2, 1] 881s 0.034 0.329 -0.606 -0.340 881s 881s y: [1, 1, 2, 1] 881s -0.236 881s testing: mul_mat, [1, 1, 2, 4] = [4, 1, 2, 4] * [4, 1, 2, 4] 881s x0: [4, 1, 2, 4] 881s -0.137 -0.203 0.775 0.930 881s 881s x1: [4, 1, 2, 4] 881s 0.215 0.257 -0.319 -0.872 881s 881s y: [1, 1, 2, 4] 881s -1.141 881s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 881s x0: [4, 1, 1, 1] 881s -0.193 -0.569 -0.718 0.978 881s 881s x1: [4, 1, 1, 1] 881s 0.457 0.944 -0.209 -0.476 881s 881s y: [1, 1, 1, 1] 881s -0.940 881s testing: mul_mat, [3, 4, 2, 1] = [1, 3, 2, 1] * [1, 4, 2, 1] 881s x0: [1, 3, 2, 1] 881s -0.622 881s 0.403 881s 0.349 881s 881s x1: [1, 4, 2, 1] 881s 0.555 881s -0.624 881s 0.336 881s 0.225 881s 881s y: [3, 4, 2, 1] 881s -0.345 0.224 0.194 881s 0.388 -0.252 -0.218 881s -0.209 0.136 0.118 881s -0.140 0.091 0.079 881s testing: mul_mat, [4, 1, 2, 4] = [3, 4, 2, 4] * [3, 1, 2, 4] 881s x0: [3, 4, 2, 4] 881s 0.333 0.074 -0.322 881s 0.134 -0.791 -0.801 881s 0.084 0.306 0.597 881s -0.589 0.005 0.316 881s 881s x1: [3, 1, 2, 4] 881s -0.105 0.931 -0.531 881s 881s y: [4, 1, 2, 4]ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s 881s 0.205 -0.325 -0.041 -0.101 881s test-mul-mat0: iter:440/500 881s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 881s x0: [4, 2, 1, 1] 881s 0.934 -0.863 0.299 -0.884 881s -0.180 -0.360 0.209 -0.336 881s 881s x1: [4, 4, 1, 1] 881s 0.728 -0.661 0.008 0.346 881s -0.978 0.478 -0.016 -0.327 881s -0.840 -0.138 -0.360 -0.098 881s -0.803 0.001 -0.120 -0.232 881s 881s y: [2, 4, 1, 1] 881s 0.947 -0.008 881s -1.042 0.111 881s -0.686 0.159 881s -0.583 0.197 881s testing: mul_mat, [1, 2, 4, 1] = [4, 1, 4, 1] * [4, 2, 4, 1] 881s x0: [4, 1, 4, 1] 881s -0.409 -0.812 0.760 -0.223 881s 881s x1: [4, 2, 4, 1] 881s -0.332 -0.920 0.930 0.155 881s 0.808 -0.815 -0.117 -0.852 881s 881s y: [1, 2, 4, 1] 881s 1.555 881s 0.433 881s testing: mul_mat, [4, 1, 4, 4] = [4, 4, 4, 4] * [4, 1, 4, 4] 881s x0: [4, 4, 4, 4] 881s 0.732 0.453 0.330 0.667 881s -0.386 -0.067 -0.322 0.177 881s 0.435 0.423 -0.823 -0.210 881s -0.504 0.799 -0.244 -0.328 881s 881s x1: [4, 1, 4, 4] 881s -0.140 -0.112 0.818 0.900 881s 881s y: [4, 1, 4, 4] 881s 0.718 -0.042 -0.971 -0.514 881s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 881s x0: [4, 2, 1, 1] 881s 0.519 0.834 -0.642 0.414 881s 0.935 0.834 -0.859 0.896 881s 881s x1: [4, 4, 1, 1] 881s -0.688 -0.333 -0.150 -0.741 881s -0.566 0.692 0.645 0.585 881s 0.114 0.724 0.612 -0.074 881s 0.297 0.584 0.021 0.599 881s 881s y: [2, 4, 1, 1] 881s -0.846 -1.457 881s 0.111 0.018 881s 0.239 0.118 881s 0.876 1.284 881s testing: mul_mat, [3, 4, 4, 1] = [2, 3, 4, 1] * [2, 4, 4, 1] 881s x0: [2, 3, 4, 1] 881s -0.938 -0.608 881s -0.690 0.981 881s 0.290 -0.812 881s 881s x1: [2, 4, 4, 1] 881s -0.095 -0.640 881s 0.680 0.359 881s -0.769 -0.670 881s 0.671 -0.102 881s 881s y: [3, 4, 4, 1] 881s 0.478 -0.562 0.492 881s -0.856 -0.117 -0.095 881s 1.129 -0.127 0.322 881s -0.567 -0.563 0.277 881s testing: mul_mat, [3, 2, 4, 4] = [3, 3, 4, 4] * [3, 2, 4, 4] 881s x0: [3, 3, 4, 4] 881s 0.053 0.185 -0.107 881s 0.701 -0.469 -0.356 881s 0.048 0.364 0.263 881s 881s x1: [3, 2, 4, 4] 881s -0.992 -0.813 0.098 881s -0.293 -0.430 -0.951 881s 881s y: [3, 2, 4, 4] 881s -0.214 -0.349 -0.318 881s 0.007 0.335 -0.420 881s test-mul-mat0: iter:441/500 881s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 881s x0: [2, 3, 1, 1] 881s -0.206 0.292 881s -0.755 0.201 881s -0.947 0.805 881s 881s x1: [2, 1, 1, 1] 881s -0.806 -0.308 881s 881s y: [3, 1, 1, 1] 881s 0.076 0.547 0.516 881s testing: mul_mat, [1, 3, 4, 1] = [2, 1, 4, 1] * [2, 3, 4, 1] 881s x0: [2, 1, 4, 1] 881s 0.375 -0.738 881s 881s x1: [2, 3, 4, 1] 881s -0.528 -0.251 881s 0.659 -0.565 881s 0.371 -0.113 881s 881s y: [1, 3, 4, 1] 881s -0.012 881s 0.664 881s 0.223 881s testing: mul_mat, [2, 1, 4, 1] = [2, 2, 4, 1] * [2, 1, 4, 1] 881s x0: [2, 2, 4, 1] 881s -0.944 0.749 881s 0.025 -0.771 881s 881s x1: [2, 1, 4, 1] 881s -0.043 0.483 881s 881s y: [2, 1, 4, 1] 881s 0.402 -0.374 881s testing: mul_mat, [1, 2, 1, 1] = [2, 1, 1, 1] * [2, 2, 1, 1] 881s x0: [2, 1, 1, 1] 881s 0.590 0.819 881s 881s x1: [2, 2, 1, 1] 881s 0.563 -0.410 881s -0.473 -0.138 881s 881s y: [1, 2, 1, 1] 881s -0.003 881s -0.392 881s testing: mul_mat, [3, 2, 4, 1] = [1, 3, 4, 1] * [1, 2, 4, 1] 881s x0: [1, 3, 4, 1] 881s 0.096 881s -0.105 881s -0.196 881s 881s x1: [1, 2, 4, 1] 881s 0.121 881s -0.975 881s 881s y: [3, 2, 4, 1] 881s 0.012 -0.013 -0.024 881s -0.094 0.103 0.191 881s testing: mul_mat, [4, 1, 4, 1] = [3, 4, 4, 1] * [3, 1, 4, 1] 881s x0: [3, 4, 4, 1] 881s -0.827 -0.973 0.596 881s -0.842 -0.208 0.901 881s -0.783 -0.384 0.488 881s 0.520 -0.078 0.105 881s 881s x1: [3, 1, 4, 1] 881s 0.790 0.279 -0.346 881s 881s y: [4, 1, 4, 1] 881s -1.131 -1.035 -0.894 0.353 881s test-mul-mat0: iter:442/500 881s testing: mul_mat, [4, 4, 1, 1] = [4, 4, 1, 1] * [4, 4, 1, 1] 881s x0: [4, 4, 1, 1] 881s 0.380 -0.759 -0.581 0.417 881s 0.564 -0.912 0.568 -0.518 881s -0.721 0.096 0.437 -0.218 881s 0.280 0.083 -0.050 0.989 881s 881s x1: [4, 4, 1, 1] 881s -0.832 -0.291 0.519 -0.953 881s -0.260 -0.837 0.961 0.399 881s 0.605 0.573 -0.769 0.204 881s 0.402 -0.417 -0.169 -0.976 881s 881s y: [4, 4, 1, 1] 881s -0.795 0.584 1.007 -1.225 881s 0.144 0.955 0.440 0.204 881s 0.327 -0.724 -0.762 0.458 881s 0.160 1.017 -0.191 -0.879 881s testing: mul_mat, [4, 4, 3, 1] = [4, 4, 3, 1] * [4, 4, 3, 1] 881s x0: [4, 4, 3, 1] 881s 0.474 0.350 0.698 -0.755 881s 0.595 -0.023 -0.888 0.277 881s 0.038 0.259 -0.958 -0.951 881s -0.606 0.410 -0.944 -0.741 881s 881s x1: [4, 4, 3, 1] 881s -0.398 -0.003 -0.271 -0.234 881s -0.042 -0.872 -0.629 -0.469 881s -0.641 0.575 0.934 -0.058 881s -0.594 0.957 -0.507 0.786 881s 881s y: [4, 4, 3, 1] 881s -0.202 -0.060 0.466 0.669 881s -0.410 0.424 0.821 0.609 881s 0.593 -1.239 -0.715 -0.214 881s -0.894 0.293 -0.037 0.649 881s testing: mul_mat, [4, 4, 3, 4] = [4, 4, 3, 4] * [4, 4, 3, 4] 881s x0: [4, 4, 3, 4] 881s 0.649 0.507 0.782 0.ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s 914 881s -0.463 0.968 -0.388 0.608 881s 0.256 -0.186 0.626 -0.261 881s 0.221 -0.004 -0.897 0.613 881s 881s x1: [4, 4, 3, 4] 881s 0.223 -0.124 -0.812 0.742 881s 0.479 -0.685 0.691 0.376 881s -0.073 0.028 0.059 0.809 881s 0.951 0.441 -0.836 -0.776 881s 881s y: [4, 4, 3, 4] 881s 0.125 0.542 -0.622 1.233 881s 0.847 -0.923 0.585 -0.281 881s 0.752 0.529 -0.198 0.426 881s -0.522 -0.161 -0.159 0.482 881s testing: mul_mat, [3, 4, 1, 1] = [4, 3, 1, 1] * [4, 4, 1, 1] 881s x0: [4, 3, 1, 1] 881s 0.357 -0.255 -0.352 0.193 881s 0.222 -0.984 0.573 -0.041 881s -0.969 0.659 -0.460 0.491 881s 881s x1: [4, 4, 1, 1] 881s 0.973 -0.715 -0.769 -0.317 881s -0.775 -0.066 -0.192 0.760 881s 0.359 0.066 -0.761 -0.234 881s -0.721 0.680 0.434 0.928 881s 881s y: [3, 4, 1, 1] 881s 0.740 0.492 -1.215 881s -0.045 -0.248 1.168 881s 0.334 -0.413 -0.069 881s -0.405 -0.619 1.402 881s testing: mul_mat, [3, 4, 3, 1] = [3, 3, 3, 1] * [3, 4, 3, 1] 881s x0: [3, 3, 3, 1] 881s -0.559 -0.305 0.188 881s -0.525 -0.975 -0.659 881s 0.998 -0.980 -0.678 881s 881s x1: [3, 4, 3, 1] 881s 0.984 0.026 0.464 881s -0.731 0.257 -0.853 881s -0.505 -0.809 -0.044 881s -0.745 0.550 -0.978 881s 881s y: [3, 4, 3, 1] 881s -0.471 -0.848 0.642 881s 0.170 0.695 -0.403 881s 0.521 1.083 0.319 881s 0.065 0.499 -0.620 881s testing: mul_mat, [1, 3, 3, 4] = [3, 1, 3, 4] * [3, 3, 3, 4] 881s x0: [3, 1, 3, 4] 881s -0.328 -0.791 0.388 881s 881s x1: [3, 3, 3, 4] 881s -0.515 -0.137 0.827 881s 0.528 0.303 -0.698 881s 0.526 0.998 -0.673 881s 881s y: [1, 3, 3, 4] 881s 0.598 881s -0.684 881s -1.224 881s test-mul-mat0: iter:443/500 881s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 881s x0: [3, 2, 1, 1] 881s 0.554 -0.132 -0.233 881s 0.534 -0.865 0.051 881s 881s x1: [3, 1, 1, 1] 881s -0.710 -0.213 -0.652 881s 881s y: [2, 1, 1, 1] 881s -0.213 -0.228 881s testing: mul_mat, [2, 2, 3, 1] = [3, 2, 3, 1] * [3, 2, 3, 1] 881s x0: [3, 2, 3, 1] 881s 0.457 0.012 0.701 881s -0.756 0.360 -0.167 881s 881s x1: [3, 2, 3, 1] 881s 0.579 -0.055 0.775 881s 0.424 0.015 -0.780 881s 881s y: [2, 2, 3, 1] 881s 0.808 -0.587 881s -0.353 -0.184 881s testing: mul_mat, [3, 2, 3, 3] = [3, 3, 3, 3] * [3, 2, 3, 3] 881s x0: [3, 3, 3, 3] 881s -0.072 -0.242 0.660 881s -0.851 0.148 -0.281 881s -0.985 -0.258 0.755 881s 881s x1: [3, 2, 3, 3] 881s 0.204 0.977 0.669 881s 0.533 -0.495 -0.730 881s 881s y: [3, 2, 3, 3] 881s 0.191 -0.217 0.052 881s -0.400 -0.322 -0.948 881s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 881s x0: [3, 3, 1, 1] 881s -0.552 -0.601 -0.078 881s 0.521 -0.160 -0.918 881s -0.014 -0.850 -0.579 881s 881s x1: [3, 3, 1, 1] 881s 0.430 -0.840 -0.106 881s -0.335 -0.940 -0.160 881s 0.781 0.921 0.408 881s 881s y: [3, 3, 1, 1] 881s 0.276 0.457 0.769 881s 0.763 0.123 0.897 881s -1.017 -0.115 -1.030 881s testing: mul_mat, [4, 3, 3, 1] = [3, 4, 3, 1] * [3, 3, 3, 1] 881s x0: [3, 4, 3, 1] 881s 0.291 -0.095 0.050 881s 0.902 -0.241 -0.198 881s 0.410 0.827 -0.479 881s -0.951 0.803 0.294 881s 881s x1: [3, 3, 3, 1] 881s -0.242 0.003 0.350 881s 0.778 -0.102 -0.709 881s -0.025 -0.282 -0.756 881s 881s y: [4, 3, 3, 1] 881s -0.053 -0.288 -0.264 0.335 881s 0.200 0.867 0.574 -1.031 881s -0.018 0.195 0.119 -0.425 881s testing: mul_mat, [1, 3, 3, 3] = [4, 1, 3, 3] * [4, 3, 3, 3] 881s x0: [4, 1, 3, 3] 881s -0.109 0.644 -0.743 0.649 881s 881s x1: [4, 3, 3, 3] 881s -0.693 -0.928 0.744 0.357 881s -0.126 -0.735 -0.350 -0.260 881s -0.833 -0.738 0.100 0.611 881s 881s y: [1, 3, 3, 3] 881s -0.844 881s -0.369 881s -0.063 881s test-mul-mat0: iter:444/500 881s testing: mul_mat, [1, 3, 1, 1] = [3, 1, 1, 1] * [3, 3, 1, 1] 881s x0: [3, 1, 1, 1] 881s 0.290 0.264 0.439 881s 881s x1: [3, 3, 1, 1] 881s 0.286 0.224 -0.879 881s -0.419 -0.543 0.485 881s 0.721 -0.853 -0.192 881s 881s y: [1, 3, 1, 1] 881s -0.244 881s -0.052 881s -0.101 881s testing: mul_mat, [2, 1, 3, 1] = [3, 2, 3, 1] * [3, 1, 3, 1] 881s x0: [3, 2, 3, 1] 881s -0.073 0.578 0.671 881s -0.558 -0.467 -0.443 881s 881s x1: [3, 1, 3, 1] 881s 0.675 0.924 0.573 881s 881s y: [2, 1, 3, 1] 881s 0.869 -1.061 881s testing: mul_mat, [3, 2, 3, 3] = [3, 3, 3, 3] * [3, 2, 3, 3] 881s x0: [3, 3, 3, 3] 881s 0.209 0.252 0.737 881s 0.680 -0.786 -0.408 881s 0.845 -0.765 -0.070 881s 881s x1: [3, 2, 3, 3] 881s 0.059 0.482 0.133 881s -0.265 0.406 -0.294 881s 881s y: [3, 2, 3, 3] 881s 0.232 -0.393 -0.328 881s -0.170 -0.380 -0.514 881s testing: mul_mat, [1, 3, 1, 1] = [3, 1, 1, 1] * [3, 3, 1, 1] 881s x0: [3, 1, 1, 1] 881s -0.153 0.231 0.700 881s 881s x1: [3, 3, 1, 1] 881s -0.027 0.580 0.304 881s 0.115 -0.008 -0.908 881s 0.641 0.395 -0.632 881s 881s y: [1, 3, 1, 1] 881s 0.351 881s -0.654 881s -0.449 881s testing: mul_mat, [3, 3, 3, 1] = [1, 3, 3, 1] * [1, 3, 3, 1] 881s x0: [1, 3, 3, 1] 881s -0.061 881s 0.965 881s -0.375 881s 881s x1: [1, 3, 3, 1] 881s -0.398 881s -0.582 881s -0.877 881s 881s y: [3, 3, 3, 1] 881s 0.024 -0.384 0.149 881s 0.036 -0.561 0.218 881s 0.054 -0.846 0.329 881s testing: mul_mat, [1, 1, 3, 3] = [3, 1, 3, 3] * [3, 1, 3, 3] 881s x0: [3, 1, 3, 3] 881s -0.660 -0.464 0.932 881s 881s x1: [3, 1, 3, 3] 881s 0.362 0.405 -0.054 881s 881s y: [1, 1, 3, 3] 881s -0.477 881s test-mul-mat0: iter:445/500 881s testing: mul_mat, [2, 2, 1, 1] = [1, 2, 1, 1] * [1, 2, 1, 1] 881s x0: [1, 2, 1, 1] 881s -0.497 881s 0.310 881s 881s x1: [1, 2, 1, 1] 881s -0.199 881s 0.369 881s 881s y: [2, 2, 1, 1] 881s 0.099 -0.062 881s -0.184 0.115 881s testing: mul_mat, [2, 2, 1, 1] = [1, 2, 1, 1] * [1, 2, 1, 1] 881s x0: [1, 2, 1, 1] 881s 0.868 881s -0.635 881s 881s x1: [1, 2, 1, 1] 881s -0.597 881s -0.440 881s 881s y: [2, 2, 1, 1] 881s -0.518 0.379 881s -0.383 0.280 881s testing: mul_mat, [3, 2, 1, 2] = [1, 3, 1, 2] * [1, 2, 1, 2] 881s x0: [1, 3, 1, 2] 881s -0.634 881s -0.631 881s 0.600 881s 881s x1: [1, 2, 1, 2] 881s -0.947 881s 0.353 881s 881s y: [3, 2, 1, 2] 881s 0.601 0.598 -0.569 881s -0.224 -0.223 0.212 881s testing: mul_mat, [4, 3, 1, 1] = [1, 4, 1, 1] * [1, 3, 1, 1] 881s x0: [1, 4, 1, 1] 881s -0.318 881s 0.365 881s 0.406 881s 0.143 881s 881s x1: [1, 3, 1, 1] 881s 0.666 881s -0.222 881s -0.034 881s 881s y: [4, 3, 1, 1] 881s -0.212 0.243 0.271 0.095 881s 0.071 -0.081 -0.090 -0.032 881s 0.011 -0.012 -0.014 -0.005 881s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 881s x0: [4, 2, 1, 1] 881s -0.553 -0.071 0.627 0.372 881s -0.122 0.007 -0.203 0.680 881s 881s x1: [4, 1, 1, 1] 881s -0.802 -0.000 0.944 0.568 881s 881s y: [2, 1, 1, 1] 881s 1.246 0.292 881s testing: mul_mat, [4, 4, 1, 2] = [2, 4, 1, 2] * [2, 4, 1, 2] 881s x0: [2, 4, 1, 2] 881s 0.425 0.437 881s 0.869 0.050 881s -0.475 0.817 881s 0.369 -0.685 881s 881s x1: [2, 4, 1, 2] 881s -0.850 -0.862 881s -0.867 -0.294 881s -0.496 -0.498 881s -0.694 -0.461 881s 881s y: [4, 4, 1, 2] 881s -0.738 -0.782 -0.300 0.277 881s -0.497 -0.768 0.171 -0.118 881s -0.428 -0.456 -0.171 0.158 881s -0.497 -0.626 -0.047 0.060 881s test-mul-mat0: iter:446/500 881s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 881s x0: [2, 3, 1, 1] 881s 0.276 0.580 881s -0.319 0.582 881s 0.698 -0.036 881s 881s x1: [2, 1, 1, 1] 881s 0.937 0.376 881s 881s y: [3, 1, 1, 1] 881s 0.476 -0.080 0.641 881s testing: mul_mat, [4, 3, 2, 1] = [2, 4, 2, 1] * [2, 3, 2, 1] 881s x0: [2, 4, 2, 1] 881s 0.553 -0.407 881s -0.564 -0.522 881s -0.074 -0.809 881s -0.891 0.763 881s 881s x1: [2, 3, 2, 1] 881s 0.252 0.224 881s -0.611 0.121 881s 0.748 0.759 881s 881s y: [4, 3, 2, 1] 881s 0.048 -0.259 -0.200 -0.054 881s -0.387 0.281 -0.053 0.636 881s 0.105 -0.818 -0.669 -0.088 881s testing: mul_mat, [1, 4, 2, 2] = [2, 1, 2, 2] * [2, 4, 2, 2] 881s x0: [2, 1, 2, 2] 881s 0.531 -0.509 881s 881s x1: [2, 4, 2, 2] 881s -0.583 0.174 881s -0.844 0.640 881s 0.564 0.277 881s 0.389 0.322 881s 881s y: [1, 4, 2, 2] 881s -0.399 881s -0.774 881s 0.158 881s 0.042 881s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 881s x0: [2, 4, 1, 1] 881s -0.410 -0.170 881s -0.128 -0.458 881s 0.547 0.295 881s -0.520 -0.022 881s 881s x1: [2, 1, 1, 1] 881s 0.294 0.350 881s 881s y: [4, 1, 1, 1] 881s -0.180 -0.198 0.264 -0.160 881s testing: mul_mat, [1, 2, 2, 1] = [4, 1, 2, 1] * [4, 2, 2, 1] 881s x0: [4, 1, 2, 1] 881s 0.015 0.626 0.313 -0.692 881s 881s x1: [4, 2, 2, 1] 881s 0.646 0.873 0.575 -0.204 881s 0.590 0.701 0.268 -0.976 881s 881s y: [1, 2, 2, 1] 881s 0.877 881s 1.205 881s testing: mul_mat, [3, 4, 2, 2] = [1, 3, 2, 2] * [1, 4, 2, 2] 881s x0: [1, 3, 2, 2] 881s -0.486 881s -0.040 881s -0.598 881s 881s x1: [1, 4, 2, 2] 881s 0.891 881s 0.378 881s -0.322 881s -0.567 881s 881s y: [3, 4, 2, 2] 881s -0.433 -0.036 -0.533 881s -0.184 -0.015 -0.226 881s 0.157 0.013 0.193 881s 0.276 0.023 0.339 881s test-mul-mat0: iter:447/500 881s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 881s x0: [4, 1, 1, 1] 881s 0.952 0.957 -0.419 -0.912 881s 881s x1: [4, 1, 1, 1] 881s 0.070 0.628 0.406 -0.274 881s 881s y: [1, 1, 1, 1] 881s 0.747 881s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 881s x0: [4, 2, 1, 1] 881s 0.284 -0.630 -0.532 0.686 881s 0.503 -0.590 0.334 0.650 881s 881s x1: [4, 1, 1, 1] 881s -0.112 -0.277 -0.752 0.856 881s 881s y: [2, 1, 1, 1] 881s 1.129 0.413 881s testing: mul_mat, [2, 2, 1, 3] = [4, 2, 1, 3] * [4, 2, 1, 3] 881s x0: [4, 2, 1, 3] 881s 0.409 -0.142 0.282 0.819 881s -0.809 -0.067 0.797 -0.514 881s 881s x1: [4, 2, 1, 3] 881s 0.978 -0.706 -0.894 -0.479 881s 0.598 0.461 -0.087 -0.207 881s 881s y: [2, 2, 1, 3] 881s -0.144 -1.209 881s -0.015 -0.478 881s testing: mul_mat, [3, 2, 1, 1] = [4, 3, 1, 1] * [4, 2, 1, 1] 881s x0: [4, 3, 1, 1] 881s -0.388 0.545 -0.416 0.084 881s 0.735 0.532 0.851 0.082 881s 0.596 -0.918 -0.834 0.044 881s 881s x1: [4, 2, 1, 1] 881s -0.860 0.566 0.375 0.550 881s -0.493 -0.453 0.330 0.916 881s 881s y: [3, 2, 1, 1] 881s 0.533 0.033 -1.321 881s -0.116 -0.248 -0.113 881s testing: mul_mat, [3, 4, 1, 1] = [3, 3, 1, 1] * [3, 4, 1, 1] 881s x0:ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s [3, 3, 1, 1] 881s -0.505 0.825 -0.563 881s 0.976 0.892 0.627 881s 0.325 -0.270 -0.674 881s 881s x1: [3, 4, 1, 1] 881s 0.778 -0.704 -0.658 881s -0.928 -0.782 0.768 881s 0.918 -0.195 0.903 881s 0.404 -0.055 0.469 881s 881s y: [3, 4, 1, 1] 881s -0.603 -0.281 0.886 881s -0.609 -1.121 -0.609 881s -1.133 1.289 -0.257 881s -0.513 0.640 -0.170 881s testing: mul_mat, [3, 3, 1, 3] = [3, 3, 1, 3] * [3, 3, 1, 3] 881s x0: [3, 3, 1, 3] 881s 0.114 0.095 0.661 881s -0.403 0.757 -0.233 881s 0.983 -0.609 -0.034 881s 881s x1: [3, 3, 1, 3] 881s 0.982 0.160 -0.592 881s -0.434 0.010 -0.425 881s 0.650 -0.908 0.618 881s 881s y: [3, 3, 1, 3] 881s -0.264 -0.137 0.888 881s -0.330 0.282 -0.419 881s 0.396 -1.093 1.170 881s test-mul-mat0: iter:448/500 881s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 881s x0: [2, 3, 1, 1] 881s 0.041 0.021 881s -0.824 -0.993 881s -0.667 0.035 881s 881s x1: [2, 2, 1, 1] 881s -0.735 0.652 881s -0.351 0.360 881s 881s y: [3, 2, 1, 1] 881s -0.016 -0.041 0.514 881s -0.007 -0.069 0.247 881s testing: mul_mat, [3, 3, 2, 1] = [2, 3, 2, 1] * [2, 3, 2, 1] 881s x0: [2, 3, 2, 1] 881s 0.535 0.500 881s -0.591 -0.456 881s -0.349 0.464 881s 881s x1: [2, 3, 2, 1] 881s -0.409 0.072 881s -0.717 0.136 881s 0.312 -0.862 881s 881s y: [3, 3, 2, 1] 881s -0.183 0.209 0.176 881s -0.315 0.361 0.313 881s -0.264 0.209 -0.509 881s testing: mul_mat, [2, 3, 2, 4] = [2, 2, 2, 4] * [2, 3, 2, 4] 881s x0: [2, 2, 2, 4] 881s -0.378 -0.035 881s -0.962 -0.282 881s 881s x1: [2, 3, 2, 4] 881s -0.099 0.297 881s -0.300 -0.093 881s 0.630 0.735 881s 881s y: [2, 3, 2, 4] 881s 0.027 0.012 881s 0.117 0.315 881s -0.264 -0.813 881s testing: mul_mat, [1, 2, 1, 1] = [2, 1, 1, 1] * [2, 2, 1, 1] 881s x0: [2, 1, 1, 1] 881s 0.226 -0.234 881s 881s x1: [2, 2, 1, 1] 881s -0.924 -0.494 881s -0.052 0.557 881s 881s y: [1, 2, 1, 1] 881s -0.093 881s -0.142 881s testing: mul_mat, [1, 2, 2, 1] = [1, 1, 2, 1] * [1, 2, 2, 1] 881s x0: [1, 1, 2, 1] 881s -0.155 881s 881s x1: [1, 2, 2, 1] 881s 0.707 881s -0.821 881s 881s y: [1, 2, 2, 1] 881s -0.110 881s 0.127 881s testing: mul_mat, [3, 1, 2, 4] = [1, 3, 2, 4] * [1, 1, 2, 4] 881s x0: [1, 3, 2, 4] 881s -0.443 881s -0.134 881s 0.743 881s 881s x1: [1, 1, 2, 4] 881s -0.615 881s 881s y: [3, 1, 2, 4] 881s 0.272 0.083 -0.457 881s test-mul-mat0: iter:449/500 881s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 881s x0: [4, 4, 1, 1] 881s -0.759 -0.115 0.809 0.095 881s -0.575 -0.488 -0.976 -0.530 881s -0.837 -0.490 0.795 0.040 881s 0.763 -0.173 -0.903 -0.423 881s 881s x1: [4, 1, 1, 1] 881s 0.256 -0.530 -0.502 0.813 881s 881s y: [4, 1, 1, 1] 881s -0.462 0.171 -0.321 0.397 881s testing: mul_mat, [1, 4, 3, 1] = [4, 1, 3, 1] * [4, 4, 3, 1] 881s x0: [4, 1, 3, 1] 881s 0.003 -0.946 0.187 0.191 881s 881s x1: [4, 4, 3, 1] 881s 0.920 0.486 -0.763 -0.347 881s 0.260 0.419 0.459 0.106 881s 0.696 -0.669 -0.639 -0.835 881s -0.171 -0.826 0.501 0.070 881s 881s y: [1, 4, 3, 1] 881s -0.667 881s -0.290 881s 0.356 881s 0.888 881s testing: mul_mat, [4, 1, 3, 3] = [4, 4, 3, 3] * [4, 1, 3, 3] 881s x0: [4, 4, 3, 3] 881s 0.982 0.721 -0.903 0.735 881s 0.244 -0.798 -0.956 -0.466 881s 0.817 -0.590 -0.337 -0.998 881s -0.857 0.793 -0.760 0.062 881s 881s x1: [4, 1, 3, 3] 881s 0.391 -0.172 0.927 -0.333 881s 881s y: [4, 1, 3, 3] 881s -0.822 -0.498 0.441 -1.196 881s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 881s x0: [4, 2, 1, 1] 881s -0.095 -0.138 0.984 0.453 881s -0.392 0.969 -0.272 0.993 881s 881s x1: [4, 4, 1, 1] 881s -0.433 0.113 -0.586 -0.217 881s -0.828 -0.304 -0.395 0.033 881s 0.128 0.198 -0.331 0.873 881s -0.435 -0.942 -0.725 -0.201 881s 881s y: [2, 4, 1, 1] 881s -0.649 0.223 881s -0.253 0.170 881s 0.031 1.098 881s -0.633 -0.745 881s testing: mul_mat, [1, 4, 3, 1] = [2, 1, 3, 1] * [2, 4, 3, 1] 881s x0: [2, 1, 3, 1] 881s 0.265 0.719 881s 881s x1: [2, 4, 3, 1] 881s 0.487 -0.820 881s -0.107 0.400 881s -0.027 -0.541 881s 0.968 -0.915 881s 881s y: [1, 4, 3, 1] 881s -0.460 881s 0.260 881s -0.396 881s -0.401 881s testing: mul_mat, [3, 2, 3, 3] = [1, 3, 3, 3] * [1, 2, 3, 3] 881s x0: [1, 3, 3, 3] 881s 0.932 881s -0.595 881s -0.453 881s 881s x1: [1, 2, 3, 3] 881s 0.480 881s 0.352 881s 881s y: [3, 2, 3, 3] 881s 0.447 -0.285 -0.217 881s 0.328 -0.209 -0.159 881s test-mul-mat0: iter:450/500 881s testing: mul_mat, [4, 4, 1, 1] = [4, 4, 1, 1] * [4, 4, 1, 1] 881s x0: [4, 4, 1, 1] 881s 0.593 -0.045 -0.361 0.655 881s 0.356 -0.564 -0.988 -0.289 881s -0.811 0.361 0.577 0.857 881s -0.729 0.071 0.457 0.586 881s 881s x1: [4, 4, 1, 1] 881s 0.600 0.315 -0.959 0.935 881s 0.470 0.114 0.536 0.344 881s -0.304 0.129 -0.611 -0.220 881s -0.798 -0.214 -0.741 -0.846 881s 881s y: [4, 4, 1, 1] 881s 1.301 0.713 -0.124 -0.306 881s 0.305 -0.526 0.264 0.112 881s -0.110 0.486 -0.248 -0.177 881s -0.751 0.813 -0.581 -0.267 881s testing: mul_mat, [3, 4, 1, 1] = [4, 3, 1, 1] * [4, 4, 1, 1] 881s x0: [4, 3, 1, 1] 881s 0.722 0.208 -0.276 -0.842 881s 0.220 0.436 -0.652 -0.419 881s 0.012 -0.795 -0.149 -0.917 881s 881s x1: [4, 4, 1, 1] 881s 0.112 0.393 0.056 -0.775 881s -0.071 -0.601 -0.079 -0.943 881s -0.211 0.701 -0.741 0.575 881s 0.960 -0.587 0.083 0.554 881s 881s y: [3, 4, 1, 1] 881s 0.799 0.484 0.391 881s 0.638 0.169 1.353 881s -0.286 0.501 -0.977 881s 0.082 -0.331 -0.042 881s testing: mul_mat, [1, 3, 1, 3] = [4, 1, 1, 3] * [4, 3, 1, 3] 881s x0: [4, 1, 1, 3] 881s -0.527 0.238 -0.707 -0.486 881s 881s x1: [4, 3, 1, 3] 881s 0.662 -0.563 0.195 0.055 881s 0.493 0.420 0.984 0.892 881s -0.659 -0.959 -0.319 -0.958 881s 881s y: [1, 3, 1, 3] 881s -0.647 881s -1.289 881s 0.810 881s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 881s x0: [4, 1, 1, 1] 881s -0.356 -0.343 0.361 0.210 881s 881s x1: [4, 1, 1, 1] 881s 0.609 -0.426 -0.341 -0.584 881s 881s y: [1, 1, 1, 1] 881s -0.317 881s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 881s x0: [1, 3, 1, 1] 881s -0.520 881s 0.779 881s 0.057 881s 881s x1: [1, 4, 1, 1] 881s 0.911 881s 0.133 881s -0.374 881s 0.248 881s 881s y: [3, 4, 1, 1] 881s -0.474 0.710 0.052 881s -0.069 0.104 0.008 881s 0.194 -0.291 -0.021 881s -0.129 0.193 0.014 881s testing: mul_mat, [1, 1, 1, 3] = [3, 1, 1, 3] * [3, 1, 1, 3] 881s x0: [3, 1, 1, 3] 881s -0.303 -0.615 -0.330 881s 881s x1: [3, 1, 1, 3] 881s 0.826 0.731 0.530 881s 881s y: [1, 1, 1, 3] 881s -0.875 881s test-mul-mat0: iter:451/500 881s testing: mul_mat, [1, 4, 1, 1] = [3, 1, 1, 1] * [3, 4, 1, 1] 881s x0: [3, 1, 1, 1] 881s -0.269 -0.918 -0.880 881s 881s x1: [3, 4, 1, 1] 881s -0.022 0.533 0.730 881s -0.774 -0.899 -0.790 881s -0.994 0.158 -0.964 881s 0.737 -0.313 0.100 881s 881s y: [1, 4, 1, 1] 881s -1.126 881s 1.729 881s 0.972 881s 0.001 881s testing: mul_mat, [4, 1, 2, 1] = [3, 4, 2, 1] * [3, 1, 2, 1] 881s x0: [3, 4, 2, 1] 881s 0.154 -0.342 -0.624 881s -0.807 -0.448 0.443 881s -0.407 0.656 -0.578 881s -0.874 0.386 -0.352 881s 881s x1: [3, 1, 2, 1] 881s 0.588 0.239 -0.183 881s 881s y: [4, 1, 2, 1] 881s 0.123 -0.663 0.023 -0.357 881s testing: mul_mat, [1, 4, 2, 4] = [3, 1, 2, 4] * [3, 4, 2, 4] 881s x0: [3, 1, 2, 4] 881s 0.302 0.413 0.640 881s 881s x1: [3, 4, 2, 4] 881s -0.609 -0.947 0.088 881s -0.636 0.963 -0.504 881s -0.669 0.117 0.154 881s -0.293 0.309 0.706 881s 881s y: [1, 4, 2, 4] 881s -0.519 881s -0.118 881s -0.056 881s 0.491 881s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 881s x0: [3, 2, 1, 1] 881s 0.837 -0.269 0.047 881s 0.633 0.139 -0.629 881s 881s x1: [3, 1, 1, 1] 881s 0.219 0.444 0.750 881s 881s y: [2, 1, 1, 1] 881s 0.098 -0.272 881s testing: mul_mat, [1, 3, 2, 1] = [2, 1, 2, 1] * [2, 3, 2, 1] 881s x0: [2, 1, 2, 1] 881s 0.944 0.624 881s 881s x1: [2, 3, 2, 1] 881s -0.993 0.412 881s -0.893 -0.597 881s 0.860 -0.603 881s 881s y: [1, 3, 2, 1] 881s -0.681 881s -1.216 881s 0.436 881s testing: mul_mat, [3, 2, 2, 4] = [1, 3, 2, 4] * [1, 2, 2, 4] 881s x0: [1, 3, 2, 4] 881s -0.616 881s -0.557 881s 0.731 881s 881s x1: [1, 2, 2, 4] 881s 0.614 881s -0.972 881s 881s y: [3, 2, 2, 4] 881s -0.378 -0.342 0.449 881s 0.599 0.542 -0.711 881s test-mul-mat0: iter:452/500 881s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 881s x0: [3, 1, 1, 1] 881s -0.919 0.762 -0.718 881s 881s x1: [3, 2, 1, 1] 881s 0.458 -0.685 -0.969 881s 0.843 -0.242 0.763 881s 881s y: [1, 2, 1, 1] 881s -0.247 881s -1.506 881s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 881s x0: [3, 1, 1, 1] 881s 0.324 0.153 -0.787 881s 881s x1: [3, 1, 1, 1] 881s 0.118 0.731 0.942 881s 881s y: [1, 1, 1, 1] 881s -0.591 881s testing: mul_mat, [1, 1, 1, 1] = [3, 1, 1, 1] * [3, 1, 1, 1] 881s x0: [3, 1, 1, 1] 881s -0.123 -0.450 -0.084 881s 881s x1: [3, 1, 1, 1] 881s -0.469 -0.608 0.883 881s 881s y: [1, 1, 1, 1] 881s 0.257 881s testing: mul_mat, [2, 1, 1, 1] = [3, 2, 1, 1] * [3, 1, 1, 1] 881s x0: [3, 2, 1, 1] 881s 0.530 0.656 0.526 881s 0.211 -0.011 0.687 881s 881s x1: [3, 1, 1, 1] 881s 0.490 -0.947 0.478 881s 881s y: [2, 1, 1, 1] 881s -0.110 0.443 881s testing: mul_mat, [3, 3, 1, 1] = [2, 3, 1, 1] * [2, 3, 1, 1] 881s x0: [2, 3, 1, 1] 881s 0.483 0.581 881s -0.057 -0.733 881s 0.477 -0.370 881s 881s x1: [2, 3, 1, 1] 881s -0.169 -0.716 881s 0.450 0.254 881s -0.635 0.212 881s 881s y: [3, 3, 1, 1] 881s -0.498 0.534 0.184 881s 0.365 -0.212 0.121 881s -0.184 -0.119 -0.381 881s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 881s x0: [3, 1, 1, 1] 881s -0.807 0.389 -0.719 881s 881s x1: [3, 2, 1, 1] 881s 0.795 -0.202 0.022 881s 0.678 0.277 0.900 881s 881s y: [1, 2, 1, 1] 881s -0.736 881s -1.086 881s test-mul-mat0: iter:453/500 881s testing: mul_mat, [2, 2, 1, 1] = [1, 2, 1, 1] * [1, 2, 1, 1] 881s x0: [1, 2, 1, 1] 881s 0.691 881s 0.631 881s 881s x1: [1, 2, 1, 1] 881s -0.996 881s -0.200 881s 881s y: [2, 2, 1, 1] 881s -0.688 -0.629 881s -0.138 -0.126 881s testing: mul_mat, [1, 2, 3, 1] = [1, 1, 3, 1] * [1, 2, 3, 1] 881s x0: [1, 1, 3, 1] 881s 0.295 881s 881s x1: [1, 2, 3, 1] 881s -0.308 881s 0.141 881s 881s y: [1, 2, 3, 1] 881s -0ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s .091 881s 0.042 881s testing: mul_mat, [4, 1, 3, 1] = [1, 4, 3, 1] * [1, 1, 3, 1] 881s x0: [1, 4, 3, 1] 881s 0.550 881s 0.594 881s 0.637 881s 0.449 881s 881s x1: [1, 1, 3, 1] 881s 0.562 881s 881s y: [4, 1, 3, 1] 881s 0.309 0.334 0.358 0.253 881s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 881s x0: [1, 2, 1, 1] 881s 0.482 881s -0.978 881s 881s x1: [1, 4, 1, 1] 881s 0.190 881s -0.286 881s -0.210 881s -0.119 881s 881s y: [2, 4, 1, 1] 881s 0.092 -0.186 881s -0.138 0.280 881s -0.101 0.206 881s -0.057 0.116 881s testing: mul_mat, [2, 1, 3, 1] = [2, 2, 3, 1] * [2, 1, 3, 1] 881s x0: [2, 2, 3, 1] 881s 0.201 -0.925 881s 0.231 0.117 881s 881s x1: [2, 1, 3, 1] 881s 0.230 -0.461 881s 881s y: [2, 1, 3, 1] 881s 0.473 -0.001 881s testing: mul_mat, [4, 2, 3, 1] = [2, 4, 3, 1] * [2, 2, 3, 1] 881s x0: [2, 4, 3, 1] 881s 0.751 0.236 881s 0.567 -0.963 881s -0.510 -0.969 881s -0.170 -0.532 881s 881s x1: [2, 2, 3, 1] 881s -0.363 -0.441 881s 0.100 -0.449 881s 881s y: [4, 2, 3, 1] 881s -0.377 0.219 0.613 0.296 881s -0.031 0.489 0.384 0.222 881s test-mul-mat0: iter:454/500 881s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 881s x0: [2, 3, 1, 1] 881s -0.459 -0.953 881s 0.786 0.372 881s 0.284 0.823 881s 881s x1: [2, 1, 1, 1] 881s -0.210 -0.520 881s 881s y: [3, 1, 1, 1] 881s 0.591 -0.358 -0.487 881s testing: mul_mat, [4, 3, 4, 1] = [2, 4, 4, 1] * [2, 3, 4, 1] 881s x0: [2, 4, 4, 1] 881s 0.108 -0.031 881s 0.805 -0.520 881s -0.747 0.628 881s 0.883 -0.996 881s 881s x1: [2, 3, 4, 1] 881s 0.403 0.751 881s 0.936 -0.448 881s 0.811 0.674 881s 881s y: [4, 3, 4, 1] 881s 0.020 -0.066 0.171 -0.393 881s 0.115 0.987 -0.981 1.273 881s 0.067 0.302 -0.183 0.045 881s testing: mul_mat, [1, 4, 4, 4] = [2, 1, 4, 4] * [2, 4, 4, 4] 881s x0: [2, 1, 4, 4] 881s -0.403 -0.690 881s 881s x1: [2, 4, 4, 4] 881s 0.463 0.721 881s -0.158 0.715 881s 0.350 -0.276 881s 0.719 -0.086 881s 881s y: [1, 4, 4, 4] 881s -0.684 881s -0.429 881s 0.049 881s -0.231 881s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 881s x0: [2, 3, 1, 1] 881s 0.832 -0.181 881s -0.680 -0.048 881s 0.860 0.931 881s 881s x1: [2, 1, 1, 1] 881s -0.391 0.547 881s 881s y: [3, 1, 1, 1] 881s -0.424 0.239 0.173 881s testing: mul_mat, [1, 2, 4, 1] = [3, 1, 4, 1] * [3, 2, 4, 1] 881s x0: [3, 1, 4, 1] 881s 0.024 0.198 -0.918 881s 881s x1: [3, 2, 4, 1] 881s -0.901 0.209 0.961 881s 0.401 0.522 -0.663 881s 881s y: [1, 2, 4, 1] 881s -0.862 881s 0.721 881s testing: mul_mat, [4, 3, 4, 4] = [1, 4, 4, 4] * [1, 3, 4, 4] 881s x0: [1, 4, 4, 4] 881s -0.564 881s -0.711 881s -0.572 881s -0.159 881s 881s x1: [1, 3, 4, 4] 881s 0.426 881s 0.790 881s 0.319 881s 881s y: [4, 3, 4, 4] 881s -0.240 -0.303 -0.244 -0.068 881s -0.445 -0.562 -0.452 -0.126 881s -0.180 -0.227 -0.183 -0.051 881s test-mul-mat0: iter:455/500 881s testing: mul_mat, [2, 2, 1, 1] = [3, 2, 1, 1] * [3, 2, 1, 1] 881s x0: [3, 2, 1, 1] 881s -0.174 0.907 -0.687 881s -0.113 0.547 0.282 881s 881s x1: [3, 2, 1, 1] 881s 0.509 0.891 0.478 881s 0.413 -0.858 -0.494 881s 881s y: [2, 2, 1, 1] 881s 0.392 0.565 881s -0.511 -0.656 881s testing: mul_mat, [4, 2, 2, 1] = [3, 4, 2, 1] * [3, 2, 2, 1] 881s x0: [3, 4, 2, 1] 881s -0.792 0.941 -0.090 881s 0.020 0.153 -0.580 881s -0.090 -0.369 0.833 881s 0.052 0.137 0.506 881s 881s x1: [3, 2, 2, 1] 881s -0.908 -0.079 0.966 881s -0.116 0.433 0.773 881s 881s y: [4, 2, 2, 1] 881s 0.559 -0.590 0.915 0.431 881s 0.430 -0.384 0.494 0.445 881s testing: mul_mat, [2, 4, 2, 4] = [3, 2, 2, 4] * [3, 4, 2, 4] 881s x0: [3, 2, 2, 4] 881s 0.451 -0.546 -0.514 881s -0.345 -0.105 -0.452 881s 881s x1: [3, 4, 2, 4] 881s 0.021 0.926 -0.692 881s -0.589 0.068 -0.643 881s -0.935 0.276 -0.703 881s -0.025 -0.705 0.451 881s 881s y: [2, 4, 2, 4] 881s -0.141 0.208 881s 0.028 0.487 881s -0.212 0.611 881s 0.142 -0.121 881s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 881s x0: [3, 4, 1, 1] 881s -0.729 -0.258 -0.992 881s 0.454 0.788 -0.315 881s -0.978 -0.910 -0.124 881s 0.983 0.482 0.122 881s 881s x1: [3, 2, 1, 1] 881s 0.057 -0.401 0.055 881s -0.588 0.863 0.996 881s 881s y: [4, 2, 1, 1] 881s 0.007 -0.307 0.302 -0.130 881s -0.782 0.099 -0.333 -0.041 881s testing: mul_mat, [3, 3, 2, 1] = [4, 3, 2, 1] * [4, 3, 2, 1] 881s x0: [4, 3, 2, 1] 881s 0.128 0.813 0.640 -0.413 881s -0.222 0.654 0.661 -0.298 881s -0.381 0.741 0.288 0.042 881s 881s x1: [4, 3, 2, 1] 881s 0.827 -0.993 0.547 -0.053 881s 0.040 0.754 -0.933 0.483 881s 0.492 -0.461 -0.678 -0.952 881s 881s y: [3, 3, 2, 1] 881s -0.329 -0.456 -0.896 881s -0.179 -0.277 0.295 881s -0.352 -0.576 -0.764 881s testing: mul_mat, [2, 4, 2, 4] = [3, 2, 2, 4] * [3, 4, 2, 4] 881s x0: [3, 2, 2, 4] 881s -0.847 -0.779 0.809 881s -0.197 -0.705 -0.960 881s 881s x1: [3, 4, 2, 4] 881s 0.599 0.266 0.341 881s -0.577 -0.696 -0.362 881s 0.034 0.432 0.416 881s 0.653 0.244 0.070 881s 881s y: [2, 4, 2, 4] 881s -0.439 -0.633 881s 0.738 0.953 881s -0.029 -0.711 881s -0.687 -0.369 881s test-mul-mat0: iter:456/500 881s testing: mul_mat, [4, 2, 1, 1] = [1, 4, 1, 1] * [1, 2, 1, 1] 881s x0: [1, 4, 1, 1] 881s 0.097 881s 0.748 881s 0.942 881s 0.477 881s 881s x1: [1, 2, 1, 1] 881s -0.514 881s -0.137 881s 881s y: [4, 2, 1, 1] 881s -0.050 -0.384 -0.484 -0.245 881s -0.013 -0.102 -0.129 -0.065 881s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 881s x0: [1, 1, 1, 1] 881s -0.512 881s 881s x1: [1, 4, 1, 1] 881s 0.411 881s -0.895 881s 0.872 881s 0.227 881s 881s y: [1, 4, 1, 1] 881s -0.211 881s 0.458 881s -0.447 881s -0.116 881s testing: mul_mat, [1, 1, 1, 4] = [1, 1, 1, 4] * [1, 1, 1, 4] 881s x0: [1, 1, 1, 4] 881s -0.336 881s 881s x1: [1, 1, 1, 4] 881s 0.338 881s 881s y: [1, 1, 1, 4] 881s -0.113 881s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 881s x0: [1, 2, 1, 1] 881s 0.772 881s -0.291 881s 881s x1: [1, 1, 1, 1] 881s -0.080 881s 881s y: [2, 1, 1, 1] 881s -0.061 0.023 881s testing: mul_mat, [1, 1, 1, 1] = [2, 1, 1, 1] * [2, 1, 1, 1] 881s x0: [2, 1, 1, 1] 881s 0.868 0.006 881s 881s x1: [2, 1, 1, 1] 881s 0.150 -0.118 881s 881s y: [1, 1, 1, 1] 881s 0.130 881s testing: mul_mat, [1, 2, 1, 4] = [1, 1, 1, 4] * [1, 2, 1, 4] 881s x0: [1, 1, 1, 4] 881s 0.964 881s 881s x1: [1, 2, 1, 4] 881s 0.517 881s -0.268 881s 881s y: [1, 2, 1, 4] 881s 0.499 881s -0.259 881s test-mul-mat0: iter:457/500 881s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 881s x0: [3, 3, 1, 1] 881s -0.521 -0.035 0.799 881s 0.485 -0.518 -0.469 881s -0.226 0.097 0.010 881s 881s x1: [3, 3, 1, 1] 881s 0.467 -0.740 -0.695 881s 0.934 0.825 0.225 881s -0.219 0.596 0.934 881s 881s y: [3, 3, 1, 1] 881s -0.773 0.936 -0.184 881s -0.336 -0.080 -0.129 881s 0.839 -0.854 0.117 881s testing: mul_mat, [4, 3, 4, 1] = [3, 4, 4, 1] * [3, 3, 4, 1] 881s x0: [3, 4, 4, 1] 881s -0.286 -0.220 -0.148 881s -0.284 0.979 0.681 881s -0.815 0.439 -0.186 881s 0.588 -0.278 0.173 881s 881s x1: [3, 3, 4, 1] 881s -0.994 0.188 -0.098 881s 0.342 0.152 -0.980 881s 0.749 0.604 -0.524 881s 881s y: [4, 3, 4, 1] 881s 0.257 0.399 0.911 -0.654 881s 0.014 -0.615 -0.030 -0.010 881s -0.270 0.023 -0.248 0.182 881s testing: mul_mat, [3, 4, 4, 2] = [3, 3, 4, 2] * [3, 4, 4, 2] 881s x0: [3, 3, 4, 2] 881s 0.719 0.626 -0.584 881s 0.461 -0.189 -0.826 881s 0.195 0.822 -0.045 881s 881s x1: [3, 4, 4, 2] 881s 0.136 -0.328 0.782 881s 0.596 -0.084 0.204 881s -0.109 -0.386 -0.751 881s -0.790 -0.332 -0.741 881s 881s y: [3, 4, 4, 2] 881s -0.564 -0.521 -0.278 881s 0.257 0.122 0.038 881s 0.119 0.644 -0.305 881s -0.342 0.311 -0.393 881s testing: mul_mat, [4, 3, 1, 1] = [3, 4, 1, 1] * [3, 3, 1, 1] 881s x0: [3, 4, 1, 1] 881s -0.988 0.155 -0.380 881s 0.554 -0.989 -0.396 881s 0.875 -0.433 0.396 881s -0.628 0.443 0.420 881s 881s x1: [3, 3, 1, 1] 881s 0.628 0.442 0.344 881s -0.934 -0.856 0.939 881s 0.535 0.777 -0.006 881s 881s y: [4, 3, 1, 1] 881s -0.683 -0.225 0.494 -0.054 881s 0.433 -0.042 -0.075 0.602 881s -0.406 -0.470 0.130 0.006 881s testing: mul_mat, [4, 3, 4, 1] = [4, 4, 4, 1] * [4, 3, 4, 1] 881s x0: [4, 4, 4, 1] 881s 0.709 0.088 -0.566 0.576 881s -0.870 -0.337 0.732 -0.019 881s -0.664 0.307 -0.257 -0.804 881s 0.503 -0.387 0.601 0.295 881s 881s x1: [4, 3, 4, 1] 881s -0.891 0.429 0.874 0.893 881s 0.715 0.175 0.777 0.211 881s 0.341 0.405 -0.347 -0.315 881s 881s y: [4, 3, 4, 1] 881s -0.574 1.254 -0.219 0.174 881s 0.204 -0.117 -0.791 0.821 881s 0.292 -0.682 0.240 -0.287 881s testing: mul_mat, [1, 4, 4, 2] = [4, 1, 4, 2] * [4, 4, 4, 2] 881s x0: [4, 1, 4, 2] 881s -0.416 0.617 -0.610 0.259 881s 881s x1: [4, 4, 4, 2] 881s 0.307 0.493 -0.310 -0.643 881s -0.082 0.238 0.840 -0.926 881s -0.471 -0.076 -0.193 -0.200 881s -0.796 -0.165 -0.815 0.991 881s 881s y: [1, 4, 4, 2] 881s 0.199 881s -0.571 881s 0.215 881s 0.984 881s test-mul-mat0: iter:458/500 881s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 881s x0: [2, 1, 1, 1] 881s 0.171 -0.474 881s 881s x1: [2, 4, 1, 1] 881s 0.169 -0.888 881s 0.273 -0.242 881s 0.021 -0.333 881s 0.243 0.738 881s 881s y: [1, 4, 1, 1] 881s 0.450 881s 0.161 881s 0.162 881s -0.309 881s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 881s x0: [2, 3, 1, 1] 881s -0.893 0.383 881s -0.600 0.221 881s 0.160 -0.749 881s 881s x1: [2, 1, 1, 1] 881s -0.734 -0.053 881s 881s y: [3, 1, 1, 1] 881s 0.636 0.429 -0.078 881s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 881s x0: [2, 2, 1, 1] 881s 0.648 0.663 881s -0.951 0.585 881s 881s x1: [2, 3, 1, 1] 881s 0.798 0.604 881s 0.826 0.740 881s 0.817 0.550 881s 881s y: [2, 3, 1, 1] 881s 0.918 -0.405 881s 1.026 -0.352 881s 0.894 -0.455 881s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 881s x0: [2, 3, 1, 1] 881s 0.525 -0.627 881s -0.167 -0.996 881s -0.080 0.446 881s 881s x1: [2, 2, 1, 1] 881s -0.168 -0.839 881s -0.142 0.590 881s 881s y: [3, 2, 1, 1] 881s 0.438 0.864 -0.361 881s -0.445 -0.564 0.275 881s testing: mul_mat, [3, 2, 1, 1] = [3, 3, 1, 1] * [3, 2, 1, 1] 881s x0: [3, 3, 1, 1] 881s 0.494 0.098ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s -0.085 881s 0.962 0.788 0.338 881s 0.764 0.504 0.769 881s 881s x1: [3, 2, 1, 1] 881s -0.361 -0.049 0.311 881s -0.255 -0.666 0.711 881s 881s y: [3, 2, 1, 1] 881s -0.210 -0.281 -0.062 881s -0.251 -0.529 0.017 881s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 881s x0: [3, 3, 1, 1] 881s -0.743 -0.739 0.212 881s -0.918 0.527 -0.161 881s -0.096 0.543 -0.712 881s 881s x1: [3, 3, 1, 1] 881s -0.437 0.001 0.818 881s -0.852 0.834 0.979 881s 0.006 0.424 0.161 881s 881s y: [3, 3, 1, 1] 881s 0.497 0.270 -0.540 881s 0.225 1.064 -0.163 881s -0.283 0.192 0.115 881s test-mul-mat0: iter:459/500 881s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 881s x0: [4, 3, 1, 1] 881s 0.088 -0.229 0.897 -0.751 881s 0.302 -0.846 -0.670 -0.794 881s -0.584 0.858 0.748 0.628 881s 881s x1: [4, 3, 1, 1] 881s -0.488 -0.982 -0.861 -0.700 881s 0.522 0.054 0.638 0.291 881s 0.617 -0.361 0.109 0.765 881s 881s y: [3, 3, 1, 1] 881s -0.064 1.816 -1.641 881s 0.387 -0.546 0.401 881s -0.341 -0.189 -0.109 881s testing: mul_mat, [1, 3, 2, 1] = [4, 1, 2, 1] * [4, 3, 2, 1] 881s x0: [4, 1, 2, 1] 881s 0.897 0.069 0.273 0.754 881s 881s x1: [4, 3, 2, 1] 881s -0.304 -0.964 0.175 0.246 881s -0.709 -0.785 0.758 -0.691 881s -0.646 -0.942 0.831 0.408 881s 881s y: [1, 3, 2, 1] 881s -0.105 881s -1.005 881s -0.109 881s testing: mul_mat, [3, 1, 2, 3] = [4, 3, 2, 3] * [4, 1, 2, 3] 881s x0: [4, 3, 2, 3] 881s 0.185 0.170 -0.819 -0.914 881s 0.621 0.035 -0.838 0.318 881s -0.820 -0.547 -0.227 0.634 881s 881s x1: [4, 1, 2, 3] 881s -0.924 0.697 0.145 -0.709 881s 881s y: [3, 1, 2, 3] 881s 0.477 -0.895 -0.106 881s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 881s x0: [4, 1, 1, 1] 881s -0.716 0.583 0.292 -0.954 881s 881s x1: [4, 3, 1, 1] 881s -0.059 0.937 0.217 -0.047 881s 0.074 0.703 -0.541 0.534 881s -0.580 0.000 0.953 0.236 881s 881s y: [1, 3, 1, 1] 881s 0.696 881s -0.310 881s 0.468 881s testing: mul_mat, [4, 4, 2, 1] = [1, 4, 2, 1] * [1, 4, 2, 1] 881s x0: [1, 4, 2, 1] 881s -0.023 881s 0.769 881s -0.248 881s -0.305 881s 881s x1: [1, 4, 2, 1] 881s 0.755 881s -0.995 881s -0.346 881s 0.115 881s 881s y: [4, 4, 2, 1] 881s -0.017 0.581 -0.187 -0.230 881s 0.023 -0.765 0.247 0.303 881s 0.008 -0.266 0.086 0.106 881s -0.003 0.088 -0.028 -0.035 881s testing: mul_mat, [1, 1, 2, 3] = [4, 1, 2, 3] * [4, 1, 2, 3] 881s x0: [4, 1, 2, 3] 881s 0.745 -0.802 -0.874 0.439 881s 881s x1: [4, 1, 2, 3] 881s -0.353 0.706 -0.427 0.106 881s 881s y: [1, 1, 2, 3] 881s -0.411 881s test-mul-mat0: iter:460/500 881s testing: mul_mat, [4, 2, 1, 1] = [1, 4, 1, 1] * [1, 2, 1, 1] 881s x0: [1, 4, 1, 1] 881s -0.655 881s 0.078 881s -0.130 881s 0.784 881s 881s x1: [1, 2, 1, 1] 881s -0.400 881s -0.119 881s 881s y: [4, 2, 1, 1] 881s 0.262 -0.031 0.052 -0.313 881s 0.078 -0.009 0.015 -0.093 881s testing: mul_mat, [1, 4, 1, 1] = [1, 1, 1, 1] * [1, 4, 1, 1] 881s x0: [1, 1, 1, 1] 881s 0.061 881s 881s x1: [1, 4, 1, 1] 881s 0.909 881s 0.866 881s 0.870 881s -0.555 881s 881s y: [1, 4, 1, 1] 881s 0.055 881s 0.053 881s 0.053 881s -0.034 881s testing: mul_mat, [3, 1, 1, 4] = [1, 3, 1, 4] * [1, 1, 1, 4] 881s x0: [1, 3, 1, 4] 881s -0.311 881s 0.641 881s 0.394 881s 881s x1: [1, 1, 1, 4] 881s -0.778 881s 881s y: [3, 1, 1, 4] 881s 0.242 -0.499 -0.307 881s testing: mul_mat, [2, 3, 1, 1] = [1, 2, 1, 1] * [1, 3, 1, 1] 881s x0: [1, 2, 1, 1] 881s -0.681 881s 0.757 881s 881s x1: [1, 3, 1, 1] 881s -0.830 881s 0.974 881s -0.321 881s 881s y: [2, 3, 1, 1] 881s 0.565 -0.628 881s -0.663 0.737 881s 0.219 -0.243 881s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 881s x0: [2, 4, 1, 1] 881s 0.651 0.034 881s 0.973 -0.668 881s -0.889 0.086 881s 0.087 0.392 881s 881s x1: [2, 1, 1, 1] 881s 0.785 -0.897 881s 881s y: [4, 1, 1, 1] 881s 0.481 1.363 -0.775 -0.283 881s testing: mul_mat, [2, 2, 1, 4] = [4, 2, 1, 4] * [4, 2, 1, 4] 881s x0: [4, 2, 1, 4] 881s -0.201 0.495 0.070 0.979 881s -0.711 -0.776 -0.668 0.617 881s 881s x1: [4, 2, 1, 4] 881s 0.525 0.876 -0.919 0.166 881s 0.271 0.425 -0.219 -0.163 881s 881s y: [2, 2, 1, 4] 881s 0.427 -0.337 881s -0.019 -0.477 881s test-mul-mat0: iter:461/500 881s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 881s x0: [4, 4, 1, 1] 881s 0.520 -0.219 -0.610 -0.045 881s 0.156 -0.416 -0.397 0.019 881s 0.300 -0.430 -0.962 -0.809 881s -0.171 -0.676 -0.657 -0.588 881s 881s x1: [4, 1, 1, 1] 881s -0.448 0.303 0.982 -0.860 881s 881s y: [4, 1, 1, 1] 881s -0.859 -0.602 -0.512 -0.267 881s testing: mul_mat, [1, 4, 3, 1] = [4, 1, 3, 1] * [4, 4, 3, 1] 881s x0: [4, 1, 3, 1] 881s 0.151 -0.120 0.325 0.082 881s 881s x1: [4, 4, 3, 1] 881s -0.793 -0.156 -0.287 0.213 881s 0.569 0.812 -0.493 0.788 881s 0.136 -0.129 -0.660 -0.562 881s -0.147 -0.520 0.373 -0.627 881s 881s y: [1, 4, 3, 1] 881s -0.177 881s -0.108 881s -0.225 881s 0.110 881s testing: mul_mat, [1, 1, 3, 2] = [4, 1, 3, 2] * [4, 1, 3, 2] 881s x0: [4, 1, 3, 2] 881s -0.779 -0.639 0.925 0.135 881s 881s x1: [4, 1, 3,ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s 2] 881s -0.344 0.594 -0.986 0.112 881s 881s y: [1, 1, 3, 2] 881s -1.010 881s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 881s x0: [4, 1, 1, 1] 881s 0.337 0.613 0.429 0.698 881s 881s x1: [4, 1, 1, 1] 881s -0.648 -0.430 -0.232 -0.503 881s 881s y: [1, 1, 1, 1] 881s -0.933 881s testing: mul_mat, [3, 4, 3, 1] = [1, 3, 3, 1] * [1, 4, 3, 1] 881s x0: [1, 3, 3, 1] 881s -0.392 881s -0.530 881s 0.667 881s 881s x1: [1, 4, 3, 1] 881s 0.538 881s -0.436 881s 0.648 881s -0.547 881s 881s y: [3, 4, 3, 1] 881s -0.211 -0.285 0.359 881s 0.171 0.231 -0.291 881s -0.254 -0.343 0.432 881s 0.214 0.290 -0.365 881s testing: mul_mat, [1, 1, 3, 2] = [3, 1, 3, 2] * [3, 1, 3, 2] 881s x0: [3, 1, 3, 2] 881s 0.023 0.959 -0.982 881s 881s x1: [3, 1, 3, 2] 881s -0.814 -0.335 -0.910 881s 881s y: [1, 1, 3, 2] 881s 0.554 881s test-mul-mat0: iter:462/500 881s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 881s x0: [3, 4, 1, 1] 881s -0.284 0.294 -0.963 881s -0.325 0.312 0.668 881s 0.105 -0.003 0.599 881s -0.385 0.490 0.364 881s 881s x1: [3, 2, 1, 1] 881s 0.311 0.233 0.046 881s 0.533 0.200 -0.287 881s 881s y: [4, 2, 1, 1] 881s -0.065 0.003 0.060 0.011 881s 0.184 -0.303 -0.117 -0.212 881s testing: mul_mat, [4, 4, 2, 1] = [3, 4, 2, 1] * [3, 4, 2, 1] 881s x0: [3, 4, 2, 1] 881s 0.933 -0.002 0.239 881s -0.452 -0.513 -0.397 881s -0.233 0.792 0.918 881s 0.256 0.003 0.865 881s 881s x1: [3, 4, 2, 1] 881s -0.781 0.305 0.315 881s -0.511 0.211 0.947 881s -0.368 -0.456 -0.125 881s -0.923 0.003 0.636 881s 881s y: [4, 4, 2, 1] 881s -0.654 0.072 0.712 0.073 881s -0.251 -0.253 1.155 0.689 881s -0.372 0.450 -0.390 -0.204 881s -0.710 0.163 0.801 0.314 881s testing: mul_mat, [2, 4, 2, 3] = [3, 2, 2, 3] * [3, 4, 2, 3] 881s x0: [3, 2, 2, 3] 881s -0.245 0.269 -0.100 881s -0.698 -0.042 0.997 881s 881s x1: [3, 4, 2, 3] 881s -0.792 -0.623 0.143 881s -0.820 0.205 0.144 881s 0.819 0.138 -0.859 881s 0.058 0.685 -0.372 881s 881s y: [2, 4, 2, 3] 881s 0.012 0.721 881s 0.242 0.707 881s -0.078 -1.434 881s 0.208 -0.440 881s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 881s x0: [3, 1, 1, 1] 881s -0.921 -0.622 0.996 881s 881s x1: [3, 2, 1, 1] 881s -0.739 -0.808 0.695 881s -0.050 0.405 0.100 881s 881s y: [1, 2, 1, 1] 881s 1.876 881s -0.106 881s testing: mul_mat, [4, 3, 2, 1] = [1, 4, 2, 1] * [1, 3, 2, 1] 881s x0: [1, 4, 2, 1] 881s -0.419 881s -0.843 881s -0.499 881s -0.183 881s 881s x1: [1, 3, 2, 1] 881s -0.478 881s 0.147 881s -0.551 881s 881s y: [4, 3, 2, 1] 881s 0.200 0.403 0.239 0.088 881s -0.061 -0.124 -0.073 -0.027 881s 0.231 0.464 0.275 0.101 881s testing: mul_mat, [2, 1, 2, 3] = [4, 2, 2, 3] * [4, 1, 2, 3] 881s x0: [4, 2, 2, 3] 881s 0.408 -0.293 0.885 0.479 881s 0.204 0.762 -0.537 0.184 881s 881s x1: [4, 1, 2, 3] 881s 0.717 0.300 -0.597 0.981 881s 881s y: [2, 1, 2, 3] 881s 0.147 0.876 881s test-mul-mat0: iter:463/500 881s testing: mul_mat, [4, 4, 1, 1] = [4, 4, 1, 1] * [4, 4, 1, 1] 881s x0: [4, 4, 1, 1] 881s 0.058 0.873 0.376 0.825 881s 0.871 0.729 -0.590 -0.550 881s -0.024 -0.370 -0.587 -0.802 881s -0.621 -0.056 -0.524 -0.922 881s 881s x1: [4, 4, 1, 1] 881s -0.723 0.698 0.428 -0.140 881s 0.332 -0.869 0.615 0.874 881s 0.221 -0.299 0.824 0.515 881s -0.506 0.535 0.694 -0.250 881s 881s y: [4, 4, 1, 1] 881s 0.613 -0.295 -0.379 0.315 881s 0.213 -1.188 -0.748 -1.286 881s 0.487 -0.795 -0.791 -1.028 881s 0.493 -0.322 -0.393 0.151 881s testing: mul_mat, [1, 4, 1, 1] = [4, 1, 1, 1] * [4, 4, 1, 1] 881s x0: [4, 1, 1, 1] 881s -0.255 0.816 -0.360 -0.526 881s 881s x1: [4, 4, 1, 1] 881s -0.628 0.336 0.410 -0.497 881s -0.049 0.284 0.723 0.652 881s 0.108 0.239 -0.854 -0.357 881s -0.067 -0.104 0.369 0.990 881s 881s y: [1, 4, 1, 1] 881s 0.548 881s -0.359 881s 0.662 881s -0.721 881s testing: mul_mat, [1, 1, 1, 2] = [4, 1, 1, 2] * [4, 1, 1, 2] 881s x0: [4, 1, 1, 2] 881s 0.313 -0.180 0.460 -0.277 881s 881s x1: [4, 1, 1, 2] 881s -0.774 0.090 0.450 -0.144 881s 881s y: [1, 1, 1, 2] 881s -0.012 881s testing: mul_mat, [3, 1, 1, 1] = [4, 3, 1, 1] * [4, 1, 1, 1] 881s x0: [4, 3, 1, 1] 881s 0.217 0.208 0.024 0.250 881s 0.103 0.872 -0.487 0.603 881s 0.127 0.872 -0.654 0.796 881s 881s x1: [4, 1, 1, 1] 881s -0.938 0.115 -0.715 -0.792 881s 881s y: [3, 1, 1, 1] 881s -0.396 -0.126 -0.182 881s testing: mul_mat, [1, 4, 1, 1] = [3, 1, 1, 1] * [3, 4, 1, 1] 881s x0: [3, 1, 1, 1] 881s -0.258 0.500 -0.500 881s 881s x1: [3, 4, 1, 1] 881s -0.894 0.106 0.444 881s 0.342 0.554 0.568 881s -0.345 -0.626 0.028 881s 0.378 0.696 0.438 881s 881s y: [1, 4, 1, 1] 881s 0.061 881s -0.096 881s -0.238 881s 0.031 881s testing: mul_mat, [1, 3, 1, 2] = [1, 1, 1, 2] * [1, 3, 1, 2] 881s x0: [1, 1, 1, 2] 881s -0.316 881s 881s x1: [1, 3, 1, 2] 881s 0.026 881s 0.708 881s -0.742 881s 881s y: [1, 3, 1, 2] 881s -0.008 881s -0.224 881s 0.235 881s test-mul-mat0: iter:464/500 881s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 881s x0: [4, 4, 1, 1] 881s 0.174 0.459 -0.588 0.829 881s 0.832 0.440 0.207 0.528 881s -0.122 -0.408 -0.730 -0.622 881s 0.092 0.297 -0.914 0.350 881s 881s x1: [4, 1, 1, 1] 881s -0.201 0.399 0.832 0.905 881s 881s y: [4, 1, 1, 1] 881s 0.409 0.658 -1.308 -0.345 881s testing: mul_mat, [3, 4, 4, 1] = [4, 3, 4, 1] * [4, 4, 4, 1] 881s x0: [4, 3, 4, 1] 881s 0.590 0.321 -0.422 0.457 881s 0.605 0.166 -0.482 0.465 881s -0.254 -0.394 -0.196 -0.839 881s 881s x1: [4, 4, 4, 1] 881s 0.541 -0.103 -0.266 -0.008 881s 0.581 0.990 0.468 -0.223 881s 0.593 0.194 0.575 -0.008 881s 0.027 0.480 0.835 -0.799 881s 881s y: [3, 4, 4, 1] 881s 0.395 0.435 -0.038 881s 0.361 0.186 -0.443 881s 0.165 0.110 -0.333 881s -0.548 -0.678 0.311 881s testing: mul_mat, [4, 3, 4, 3] = [4, 4, 4, 3] * [4, 3, 4, 3] 881s x0: [4, 4, 4, 3] 881s 0.499 -0.984 -0.091 0.844 881s -0.704 0.607 0.296 -0.205 881s -0.969 0.373 0.659 -0.029 881s -0.689 0.400 -0.357 0.993 881s 881s x1: [4, 3, 4, 3] 881s -0.731 0.418 0.536 0.356 881s 0.573 -0.066 0.301 -0.966 881s -0.885 0.134 0.073 -0.512 881s 881s y: [4, 3, 4, 3] 881s -0.525 0.854 1.208 0.834 881s -0.492 -0.156 -0.353 -1.489 881s -1.012 0.831 0.971 0.129 881s testing: mul_mat, [3, 4, 1, 1] = [4, 3, 1, 1] * [4, 4, 1, 1] 881s x0: [4, 3, 1, 1] 881s 0.202 -0.977 0.556 0.112 881s -0.457 0.235 0.313 -0.505 881s -0.043 0.941 0.355 0.017 881s 881s x1: [4, 4, 1, 1] 881s 0.062 0.563 0.876 -0.882 881s -0.232 -0.788 0.088 0.130 881s 0.598 -0.446 -0.623 -0.676 881s -0.932 0.139 -0.965 0.237 881s 881s y: [3, 4, 1, 1] 881s -0.150 0.824 0.823 881s 0.786 -0.117 -0.698 881s 0.134 -0.231 -0.678 881s -0.833 0.037 -0.168 881s testing: mul_mat, [2, 4, 4, 1] = [3, 2, 4, 1] * [3, 4, 4, 1] 881s x0: [3, 2, 4, 1] 881s 0.207 0.589 0.229 881s -0.826 -0.657 -0.489 881s 881s x1: [3, 4, 4, 1] 881s -0.811 0.784 -0.921 881s 0.752 0.660 -0.803 881s -0.480 0.873 0.285 881s 0.650 0.470 0.839 881s 881s y: [2, 4, 4, 1] 881s 0.083 0.605 881s 0.360 -0.663 881s 0.480 -0.317 881s 0.603 -1.257 881s testing: mul_mat, [3, 3, 4, 3] = [2, 3, 4, 3] * [2, 3, 4, 3] 881s x0: [2, 3, 4, 3] 881s 0.952 0.990 881s 0.896 0.442 881s 0.796 0.700 881s 881s x1: [2, 3, 4, 3] 881s -0.584 0.022 881s 0.035 0.173 881s -0.950 -0.823 881s 881s y: [3, 3, 4, 3] 881s -0.533 -0.513 -0.449 881s 0.205 0.108 0.149 881s -1.719 -1.215 -1.332 881s test-mul-mat0: iter:465/500 881s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 881s x0: [1, 2, 1, 1] 881s -0.301 881s -0.189 881s 881s x1: [1, 4, 1, 1] 881s 0.075 881s -0.255 881s 0.534 881s -0.344 881s 881s y: [2, 4, 1, 1] 881s -0.023 -0.014 881s 0.077 0.048 881s -0.161 -0.101 881s 0.104 0.065 881s testing: mul_mat, [2, 2, 4, 1] = [1, 2, 4, 1] * [1, 2, 4, 1] 881s x0: [1, 2, 4, 1] 881s 0.705 881s -0.986 881s 881s x1: [1, 2, 4, 1] 881s 0.022 881s -0.242 881s 881s y: [2, 2, 4, 1] 881s 0.016 -0.022 881s -0.171 0.239 881s testing: mul_mat, [4, 2, 4, 2] = [1, 4, 4, 2] * [1, 2, 4, 2] 881s x0: [1, 4, 4, 2] 881s 0.671 881s -0.044 881s 0.044 881s -0.948 881s 881s x1: [1, 2, 4, 2] 881s -0.665 881s 0.441 881s 881s y: [4, 2, 4, 2] 881s -0.446 0.029 -0.029 0.631 881s 0.296 -0.019 0.019 -0.418 881s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 881s x0: [1, 2, 1, 1] 881s 0.831 881s 0.797 881s 881s x1: [1, 4, 1, 1] 881s -0.674 881s 0.713 881s -0.926 881s 0.055 881s 881s y: [2, 4, 1, 1] 881s -0.560 -0.537 881s 0.593 0.568 881s -0.770 -0.738 881s 0.046 0.044 881s testing: mul_mat, [3, 1, 4, 1] = [2, 3, 4, 1] * [2, 1, 4, 1] 881s x0: [2, 3, 4, 1] 881s 0.355 -0.916 881s -0.523 -0.838 881s 0.077 -0.408 881s 881s x1: [2, 1, 4, 1] 881s 0.997 0.099 881s 881s y: [3, 1, 4, 1] 881s 0.262 -0.604 0.036 881s testing: mul_mat, [3, 2, 4, 2] = [3, 3, 4, 2] * [3, 2, 4, 2] 881s x0: [3, 3, 4, 2] 881s -0.789 -0.210 -0.170 881s -0.042 -0.472 0.577 881s 0.386 0.599 0.677 881s 881s x1: [3, 2, 4, 2] 881s -0.466 0.987 -0.744 881s -0.853 -0.426 -0.675 881s 881s y: [3, 2, 4, 2] 881s 0.287 -0.876 -0.093 881s 0.878 -0.152 -1.041 881s test-mul-mat0: iter:466/500 881s testing: mul_mat, [2, 3, 1, 1] = [3, 2, 1, 1] * [3, 3, 1, 1] 881s x0: [3, 2, 1, 1] 881s -0.938 0.814 0.353 881s 0.268 0.022 0.658 881s 881s x1: [3, 3, 1, 1] 881s 0.484 -0.099 -0.021 881s 0.212 0.321 0.896 881s 0.418 0.018 0.111 881s 881s y: [2, 3, 1, 1] 881s -0.542 0.113 881s 0.378 0.653 881s -0.339 0.185 881s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 881s x0: [3, 4, 1, 1] 881s -0.856 -0.546 0.450 881s -0.590 0.214 0.252 881s 0.060 0.198 -0.264 881s 0.961 -0.823 0.948 881s 881s x1: [3, 2, 1, 1] 881s -0.670 -0.054 -0.584 881s -0.364 -0.623 -0.844 881s 881s y: [4, 2, 1, 1] 881s 0.340 0.236 0.103 -1.153 881s 0.271 -0.132 0.077 -0.637 881s testing: mul_mat, [2, 4, 1, 2] = [3, 2, 1, 2] * [3, 4, 1, 2] 881s x0: [3, 2, 1, 2] 881s -0.044 -0.640 0.819 881s -0.083 -0.463 0.768 881s 881s x1: [3, 4, 1, 2] 881s 0.282 -0.927 0.366 881s -0.700ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s 0.184 0.910 881s -0.638 -0.003 0.263 881s 0.630 -0.981 -0.079 881s 881s y: [2, 4, 1, 2] 881s 0.881 0.687 881s 0.659 0.672 881s 0.246 0.256 881s 0.536 0.342 881s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 881s x0: [3, 4, 1, 1] 881s 0.774 -0.939 0.260 881s -0.433 0.429 -0.503 881s 0.049 -0.398 0.171 881s 0.116 -0.991 -0.657 881s 881s x1: [3, 2, 1, 1] 881s -0.139 -0.210 -0.692 881s -0.509 -0.191 0.229 881s 881s y: [4, 2, 1, 1] 881s -0.091 0.318 -0.042 0.646 881s -0.155 0.023 0.090 -0.020 881s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 881s x0: [4, 4, 1, 1] 881s 0.262 0.646 0.923 0.563 881s -0.583 -0.125 -0.038 -0.718 881s -0.577 0.846 -0.727 -0.387 881s -0.929 -0.788 0.353 -0.939 881s 881s x1: [4, 3, 1, 1] 881s 0.454 0.531 -0.837 -0.629 881s -0.932 0.931 -0.429 -0.322 881s 0.065 0.070 -0.528 -0.891 881s 881s y: [4, 3, 1, 1] 881s -0.666 0.153 1.040 -0.544 881s -0.221 0.675 1.761 0.283 881s -0.928 0.614 0.751 0.535 881s testing: mul_mat, [1, 4, 1, 2] = [4, 1, 1, 2] * [4, 4, 1, 2] 881s x0: [4, 1, 1, 2] 881s 0.481 0.488 -0.280 0.367 881s 881s x1: [4, 4, 1, 2] 881s 0.452 -0.044 0.515 -0.017 881s 0.119 0.886 0.052 0.050 881s -0.544 0.730 -0.886 0.526 881s -0.799 -0.776 0.458 0.464 881s 881s y: [1, 4, 1, 2] 881s 0.046 881s 0.493 881s 0.536 881s -0.722 881s test-mul-mat0: iter:467/500 881s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 881s x0: [3, 3, 1, 1] 881s 0.192 -0.698 -0.845 881s 0.098 0.354 -0.201 881s 0.065 0.859 -0.601 881s 881s x1: [3, 3, 1, 1] 881s -0.193 0.935 -0.253 881s -0.312 0.470 0.033 881s 0.444 -0.150 0.531 881s 881s y: [3, 3, 1, 1] 881s -0.476 0.363 0.943 881s -0.416 0.129 0.364 881s -0.259 -0.116 -0.419 881s testing: mul_mat, [2, 3, 3, 1] = [3, 2, 3, 1] * [3, 3, 3, 1] 881s x0: [3, 2, 3, 1] 881s -0.761 0.707 0.499 881s -0.216 0.054 -0.381 881s 881s x1: [3, 3, 3, 1] 881s -0.454 0.347 0.119 881s 0.912 0.887 -0.112 881s 0.735 -0.843 -0.228 881s 881s y: [2, 3, 3, 1] 881s 0.651 0.072 881s -0.123 -0.106 881s -1.269 -0.117 881s testing: mul_mat, [1, 2, 3, 2] = [3, 1, 3, 2] * [3, 2, 3, 2] 881s x0: [3, 1, 3, 2] 881s 0.683 0.614 -0.386 881s 881s x1: [3, 2, 3, 2] 881s 0.986 -0.676 0.445 881s -0.990 0.208 0.636 881s 881s y: [1, 2, 3, 2] 881s 0.087 881s -0.794 881s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 881s x0: [3, 4, 1, 1] 881s -0.311 0.896 -0.573 881s -0.225 0.942 0.417 881s 0.093 0.734 0.840 881s -0.793 -0.774 0.042 881s 881s x1: [3, 1, 1, 1] 881s 0.070 0.158 0.977 881s 881s y: [4, 1, 1, 1] 881s -0.440 0.541 0.943 -0.137 881s testing: mul_mat, [3, 3, 3, 1] = [4, 3, 3, 1] * [4, 3, 3, 1] 881s x0: [4, 3, 3, 1] 881s -0.591 -0.156 -0.112 -0.998 881s 0.314 -0.333 0.054 -0.401 881s 0.767 0.379 -0.538 0.522 881s 881s x1: [4, 3, 3, 1] 881s 0.557 0.911 0.633 0.254 881s -0.321 0.907 -0.993 -0.073 881s -0.080 -0.566 0.353 0.612 881s 881s y: [3, 3, 3, 1] 881s -0.795 -0.195 0.564 881s 0.232 -0.428 0.594 881s -0.515 -0.063 -0.146 881s testing: mul_mat, [2, 4, 3, 2] = [3, 2, 3, 2] * [3, 4, 3, 2] 881s x0: [3, 2, 3, 2] 881s -0.283 0.671 -0.769 881s 0.966 0.509 -0.888 881s 881s x1: [3, 4, 3, 2] 881s 0.043 -0.265 0.974 881s 0.505 -0.263 -0.427 881s 0.028 0.542 0.407 881s -0.615 -0.757 -0.318 881s 881s y: [2, 4, 3, 2] 881s -0.939 -0.957 881s 0.009 0.733 881s 0.043 -0.059 881s -0.090 -0.697 881s test-mul-mat0: iter:468/500 881s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 881s x0: [1, 2, 1, 1] 881s -0.588 881s 0.451 881s 881s x1: [1, 4, 1, 1] 881s -0.036 881s 0.402 881s 0.738 881s 0.075 881s 881s y: [2, 4, 1, 1] 881s 0.021 -0.016 881s -0.236 0.181 881s -0.434 0.333 881s -0.044 0.034 881s testing: mul_mat, [4, 2, 2, 1] = [1, 4, 2, 1] * [1, 2, 2, 1] 881s x0: [1, 4, 2, 1] 881s -0.009 881s 0.583 881s -0.568 881s 0.992 881s 881s x1: [1, 2, 2, 1] 881s -0.526 881s -0.068 881s 881s y: [4, 2, 2, 1] 881s 0.005 -0.307 0.299 -0.522 881s 0.001 -0.040 0.039 -0.068 881s testing: mul_mat, [4, 4, 2, 2] = [1, 4, 2, 2] * [1, 4, 2, 2] 881s x0: [1, 4, 2, 2] 881s 0.912 881s 0.235 881s -0.434 881s -0.182 881s 881s x1: [1, 4, 2, 2] 881s -0.893 881s -0.698 881s 0.593 881s -0.979 881s 881s y: [4, 4, 2, 2] 881s -0.814 -0.210 0.388 0.163 881s -0.637 -0.164 0.303 0.127 881s 0.541 0.140 -0.258 -0.108 881s -0.893 -0.230 0.425 0.178 881s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 881s x0: [1, 2, 1, 1] 881s 0.262 881s -0.223 881s 881s x1: [1, 4, 1, 1] 881s -0.914 881s 0.936 881s -0.622 881s -0.644 881s 881s y: [2, 4, 1, 1] 881s -0.240 0.204 881s 0.246 -0.209 881s -0.163 0.139 881s -0.169 0.144 881s testing: mul_mat, [4, 1, 2, 1] = [2, 4, 2, 1] * [2, 1, 2, 1] 881s x0: [2, 4, 2, 1] 881s -0.376 0.839 881s -0.396 -0.834 881s 0.601 0.354 881s -0.464 -0.654 881s 881s x1: [2, 1, 2, 1] 881s -0.729 -0.254 881s 881s y: [4, 1, 2, 1] 881s 0.060 0.501 -0.528 0.504 881s testing: mul_mat, [2, 2, 2, 2] = [4, 2, 2, 2] * [4, ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s 2, 2, 2] 881s x0: [4, 2, 2, 2] 881s 0.865 0.172 -0.844 -0.432 881s -0.282 -0.415 0.967 -0.628 881s 881s x1: [4, 2, 2, 2] 881s -0.003 -0.243 -0.657 0.083 881s -0.307 -0.279 0.439 -0.204 881s 881s y: [2, 2, 2, 2] 881s 0.474 -0.586 881s -0.595 0.755 881s test-mul-mat0: iter:469/500 881s testing: mul_mat, [1, 4, 1, 1] = [3, 1, 1, 1] * [3, 4, 1, 1] 881s x0: [3, 1, 1, 1] 881s -0.868 -0.855 -0.441 881s 881s x1: [3, 4, 1, 1] 881s -0.185 0.803 -0.841 881s 0.850 0.100 -0.090 881s 0.793 0.092 -0.336 881s 0.248 0.381 0.958 881s 881s y: [1, 4, 1, 1] 881s -0.155 881s -0.783 881s -0.619 881s -0.963 881s testing: mul_mat, [1, 1, 4, 1] = [3, 1, 4, 1] * [3, 1, 4, 1] 881s x0: [3, 1, 4, 1] 881s -0.678 -0.196 -0.254 881s 881s x1: [3, 1, 4, 1] 881s -0.270 -0.611 0.198 881s 881s y: [1, 1, 4, 1] 881s 0.253 881s testing: mul_mat, [1, 1, 4, 1] = [3, 1, 4, 1] * [3, 1, 4, 1] 881s x0: [3, 1, 4, 1] 881s 0.117 0.311 0.257 881s 881s x1: [3, 1, 4, 1] 881s 0.698 -0.506 0.692 881s 881s y: [1, 1, 4, 1] 881s 0.102 881s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 881s x0: [3, 4, 1, 1] 881s -0.499 -0.754 -0.718 881s -0.445 0.722 -0.337 881s 0.892 0.635 0.945 881s -0.005 -0.503 0.050 881s 881s x1: [3, 1, 1, 1] 881s 0.876 0.965 -0.082 881s 881s y: [4, 1, 1, 1] 881s -1.106 0.335 1.317 -0.494 881s testing: mul_mat, [1, 3, 4, 1] = [4, 1, 4, 1] * [4, 3, 4, 1] 881s x0: [4, 1, 4, 1] 881s 0.622 -0.886 -0.382 0.425 881s 881s x1: [4, 3, 4, 1] 881s 0.344 0.869 0.224 -0.539 881s 0.179 -0.520 -0.335 0.507 881s 0.668 0.190 -0.362 0.602 881s 881s y: [1, 3, 4, 1] 881s -0.871 881s 0.915 881s 0.640 881s testing: mul_mat, [2, 4, 4, 1] = [1, 2, 4, 1] * [1, 4, 4, 1] 881s x0: [1, 2, 4, 1] 881s 0.920 881s -0.376 881s 881s x1: [1, 4, 4, 1] 881s -0.239 881s -0.385 881s -0.739 881s 0.722 881s 881s y: [2, 4, 4, 1] 881s -0.220 0.090 881s -0.354 0.145 881s -0.679 0.278 881s 0.664 -0.272 881s test-mul-mat0: iter:470/500 881s testing: mul_mat, [3, 1, 1, 1] = [3, 3, 1, 1] * [3, 1, 1, 1] 881s x0: [3, 3, 1, 1] 881s 0.910 -0.784 0.672 881s -0.599 -0.910 -0.377 881s -0.275 0.957 0.792 881s 881s x1: [3, 1, 1, 1] 881s 0.593 0.648 -0.506 881s 881s y: [3, 1, 1, 1] 881s -0.309 -0.754 0.056 881s testing: mul_mat, [3, 3, 3, 1] = [3, 3, 3, 1] * [3, 3, 3, 1] 881s x0: [3, 3, 3, 1] 881s 0.898 -0.021 -0.962 881s -0.089 -0.487 0.915 881s 0.226 0.246 0.989 881s 881s x1: [3, 3, 3, 1] 881s 0.013 0.534 0.876 881s -0.685 -0.267 -0.926 881s -0.765 0.357 -0.421 881s 881s y: [3, 3, 3, 1] 881s -0.842 0.540 1.001 881s 0.281 -0.656 -1.136 881s -0.290 -0.490 -0.501 881s testing: mul_mat, [1, 3, 3, 2] = [3, 1, 3, 2] * [3, 3, 3, 2] 881s x0: [3, 1, 3, 2] 881s -0.234 -0.065 0.944 881s 881s x1: [3, 3, 3, 2] 881s 0.384 -0.186 0.171 881s 0.631 -0.288 -0.850 881s 0.669 0.623 -0.337 881s 881s y: [1, 3, 3, 2] 881s 0.084 881s -0.931 881s -0.515 881s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 881s x0: [3, 4, 1, 1] 881s 0.721 -0.619 -0.997 881s -0.994 0.785 -0.600 881s -0.580 0.633 -0.935 881s 0.574 0.131 0.947 881s 881s x1: [3, 1, 1, 1] 881s 0.421 0.796 0.940 881s 881s y: [4, 1, 1, 1] 881s -1.126 -0.358 -0.619 1.236 881s testing: mul_mat, [4, 3, 3, 1] = [4, 4, 3, 1] * [4, 3, 3, 1] 881s x0: [4, 4, 3, 1] 881s -0.942 -0.622 0.678 -0.804 881s -0.745 0.372 -0.836 -0.906 881s 0.333 -0.970 -0.557 -0.823 881s 0.316 -0.487 0.924 -0.352 881s 881s x1: [4, 3, 3, 1] 881s 0.090 -0.055 0.258 0.852 881s 0.155 -0.249 0.858 -0.675 881s 0.137 0.258 -0.877 -0.961 881s 881s y: [4, 3, 3, 1] 881s -0.561 -1.075 -0.763 -0.007 881s 1.134 -0.314 0.371 1.202 881s -0.111 1.597 1.075 -0.554 881s testing: mul_mat, [3, 4, 3, 2] = [4, 3, 3, 2] * [4, 4, 3, 2] 881s x0: [4, 3, 3, 2] 881s 0.593 0.244 -0.290 0.411 881s 0.090 -0.507 0.337 0.145 881s -0.303 -0.145 -0.412 -0.238 881s 881s x1: [4, 4, 3, 2] 881s -0.756 0.278 -0.715 -0.182 881s -0.598 0.719 0.221 0.713 881s 0.502 -0.602 -0.531 0.484 881s 0.505 0.420 0.524 -0.133 881s 881s y: [3, 4, 3, 2] 881s -0.248 -0.476 0.527 881s 0.050 -0.241 -0.183 881s 0.504 0.242 0.039 881s 0.196 -0.011 -0.399 881s test-mul-mat0: iter:471/500 881s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 881s x0: [3, 4, 1, 1] 881s 0.072 -0.808 -0.172 881s 0.148 0.693 0.463 881s 0.256 0.236 -0.147 881s 0.584 -0.890 -0.359 881s 881s x1: [3, 1, 1, 1] 881s 0.467 -0.191 -0.882 881s 881s y: [4, 1, 1, 1] 881s 0.340 -0.472 0.204 0.760 881s testing: mul_mat, [1, 4, 1, 1] = [3, 1, 1, 1] * [3, 4, 1, 1] 881s x0: [3, 1, 1, 1] 881s -0.677 0.073 0.721 881s 881s x1: [3, 4, 1, 1] 881s -0.260 0.907 0.916 881s 0.419 0.872 0.171 881s -0.523 -0.284 0.743 881s -0.499 0.848 -0.222 881s 881s y: [1, 4, 1, 1] 881s 0.903 881s -0.096 881s 0.870 881s 0.240 881s testing: mul_mat, [4, 1, 1, 1] = [3, 4, 1, 1] * [3, 1, 1, 1] 881s x0: [3, 4, 1, 1] 881s 0.383 -0.157 0.353 881s 0.076 -0.693 -0.391 881s -0.687 0.160 -0.807 881s -0.577 0.801 -0.067 881s 881s x1ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s : [3, 1, 1, 1] 881s 0.132 0.191 -0.984 881s 881s y: [4, 1, 1, 1] 881s -0.327 0.262 0.734 0.143 881s testing: mul_mat, [2, 4, 1, 1] = [3, 2, 1, 1] * [3, 4, 1, 1] 881s x0: [3, 2, 1, 1] 881s -0.518 -0.779 0.322 881s 0.306 0.674 -0.574 881s 881s x1: [3, 4, 1, 1] 881s -0.670 0.716 -0.648 881s -0.798 -0.113 -0.171 881s -0.082 -0.369 0.330 881s -0.234 0.409 0.585 881s 881s y: [2, 4, 1, 1] 881s -0.419 0.649 881s 0.446 -0.222 881s 0.436 -0.464 881s -0.009 -0.132 881s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 881s x0: [2, 2, 1, 1] 881s 0.632 -0.132 881s -0.419 0.433 881s 881s x1: [2, 3, 1, 1] 881s 0.057 -0.834 881s 0.779 -0.867 881s -0.528 -0.612 881s 881s y: [2, 3, 1, 1] 881s 0.146 -0.385 881s 0.606 -0.701 881s -0.253 -0.044 881s testing: mul_mat, [4, 2, 1, 1] = [2, 4, 1, 1] * [2, 2, 1, 1] 881s x0: [2, 4, 1, 1] 881s -0.964 -0.974 881s -0.305 0.085 881s -0.682 -0.924 881s -0.333 0.611 881s 881s x1: [2, 2, 1, 1] 881s 0.514 0.198 881s 0.149 0.866 881s 881s y: [4, 2, 1, 1] 881s -0.689 -0.140 -0.534 -0.050 881s -0.988 0.028 -0.902 0.479 881s test-mul-mat0: iter:472/500 881s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 881s x0: [3, 4, 1, 1] 881s -0.123 -0.012 0.867 881s -0.490 0.568 -0.264 881s 0.942 0.083 0.934 881s 0.092 -0.051 0.334 881s 881s x1: [3, 2, 1, 1] 881s -0.769 -0.761 -0.179 881s 0.288 -0.595 -0.400 881s 881s y: [4, 2, 1, 1] 881s -0.051 -0.008 -0.954 -0.091 881s -0.375 -0.374 -0.151 -0.077 881s testing: mul_mat, [4, 4, 3, 1] = [3, 4, 3, 1] * [3, 4, 3, 1] 881s x0: [3, 4, 3, 1] 881s -0.944 0.910 -0.457 881s -0.515 -0.892 0.851 881s 0.145 -0.455 0.979 881s 0.093 0.387 -0.292 881s 881s x1: [3, 4, 3, 1] 881s 0.128 0.644 0.653 881s 0.795 0.670 -0.263 881s 0.871 0.280 -0.088 881s -0.572 -0.802 0.307 881s 881s y: [4, 4, 3, 1] 881s 0.167 -0.085 0.365 0.070 881s -0.021 -1.230 -0.447 0.409 881s -0.526 -0.774 -0.088 0.215 881s -0.331 1.272 0.584 -0.453 881s testing: mul_mat, [2, 4, 3, 1] = [3, 2, 3, 1] * [3, 4, 3, 1] 881s x0: [3, 2, 3, 1] 881s 0.216 0.183 0.772 881s -0.930 0.470 0.144 881s 881s x1: [3, 4, 3, 1] 881s 0.742 0.031 0.545 881s 0.722 -0.876 -0.069 881s -0.570 -0.413 -0.963 881s -0.146 -0.713 0.372 881s 881s y: [2, 4, 3, 1] 881s 0.586 -0.598 881s -0.058 -1.093 881s -0.941 0.197 881s 0.125 -0.146 881s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 881s x0: [3, 1, 1, 1] 881s 0.234 -0.899 0.376 881s 881s x1: [3, 2, 1, 1] 881s 0.056 -0.152 -0.813 881s 0.314 -0.830 0.629 881s 881s y: [1, 2, 1, 1] 881s -0.156 881s 1.056 881s testing: mul_mat, [2, 3, 3, 1] = [1, 2, 3, 1] * [1, 3, 3, 1] 881s x0: [1, 2, 3, 1] 881s -0.218 881s -0.951 881s 881s x1: [1, 3, 3, 1] 881s -0.981 881s -0.303 881s 0.110 881s 881s y: [2, 3, 3, 1] 881s 0.214 0.933 881s 0.066 0.288 881s -0.024 -0.105 881s testing: mul_mat, [3, 1, 3, 1] = [2, 3, 3, 1] * [2, 1, 3, 1] 881s x0: [2, 3, 3, 1] 881s 0.017 -0.355 881s -0.425 0.148 881s 0.530 -0.237 881s 881s x1: [2, 1, 3, 1] 881s 0.093 -0.802 881s 881s y: [3, 1, 3, 1] 881s 0.286 -0.158 0.239 881s test-mul-mat0: iter:473/500 881s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 881s x0: [2, 2, 1, 1] 881s -0.647 0.028 881s 0.707 -0.002 881s 881s x1: [2, 4, 1, 1] 881s -0.691 -0.013 881s -0.695 -0.493 881s 0.192 -0.183 881s 0.336 -0.547 881s 881s y: [2, 4, 1, 1] 881s 0.447 -0.488 881s 0.436 -0.490 881s -0.129 0.136 881s -0.233 0.239 881s testing: mul_mat, [4, 2, 4, 1] = [2, 4, 4, 1] * [2, 2, 4, 1] 881s x0: [2, 4, 4, 1] 881s 0.858 -0.714 881s 0.309 -0.836 881s -0.208 -0.499 881s -0.019 -0.872 881s 881s x1: [2, 2, 4, 1] 881s -0.825 -0.530 881s 0.745 -0.302 881s 881s y: [4, 2, 4, 1] 881s -0.330 0.188 0.436 0.478 881s 0.855 0.482 -0.004 0.249 881s testing: mul_mat, [2, 4, 4, 2] = [2, 2, 4, 2] * [2, 4, 4, 2] 881s x0: [2, 2, 4, 2] 881s 0.172 0.725 881s -0.601 -0.665 881s 881s x1: [2, 4, 4, 2] 881s 0.681 0.932 881s -0.216 -0.527 881s -0.568 0.765 881s -0.398 0.385 881s 881s y: [2, 4, 4, 2] 881s 0.792 -1.029 881s -0.419 0.480 881s 0.457 -0.167 881s 0.211 -0.017 881s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 881s x0: [2, 3, 1, 1] 881s 0.948 0.340 881s 0.892 0.933 881s 0.387 -0.216 881s 881s x1: [2, 2, 1, 1] 881s 0.015 -0.500 881s 0.367 0.427 881s 881s y: [3, 2, 1, 1] 881s -0.156 -0.453 0.114 881s 0.493 0.725 0.050 881s testing: mul_mat, [1, 2, 4, 1] = [3, 1, 4, 1] * [3, 2, 4, 1] 881s x0: [3, 1, 4, 1] 881s -0.782 0.805 0.659 881s 881s x1: [3, 2, 4, 1] 881s 0.328 0.925 -0.101 881s -0.884 0.429 0.299 881s 881s y: [1, 2, 4, 1] 881s 0.421 881s 1.234 881s testing: mul_mat, [4, 3, 4, 2] = [1, 4, 4, 2] * [1, 3, 4, 2] 881s x0: [1, 4, 4, 2] 881s 0.606 881s 0.213 881s -0.219 881s -0.392 881s 881s x1: [1, 3, 4, 2] 881s 0.834 881s 0.129 881s -0.360 881s 881s y: [4, 3, 4, 2] 881s 0.506 0.178 -0.183 -0.327 881s 0.078 0.028 -0.028 -0.051 881s -0.218 -0.077 0.079 0.141 881s test-mul-mat0: iter:474/500 881s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ] 881s x0: [4, 4, 1, 1] 881s 0.433 0.874 0.734 0.194 881s -0.968 0.413 0.048 0.256 881s -0.962 -0.091 0.817 0.827 881s -0.417 0.375 -0.664 0.808 881s 881s x1: [4, 2, 1, 1] 881s 0.200 0.165 -0.823 0.263 881s -0.837 0.502 0.281 0.508 881s 881s y: [4, 2, 1, 1] 881s -0.322 -0.098 -0.663 0.738 881s 0.380 1.161 1.409 0.762 881s testing: mul_mat, [1, 4, 4, 1] = [4, 1, 4, 1] * [4, 4, 4, 1] 881s x0: [4, 1, 4, 1] 881s 0.172 -0.666 0.787 0.407 881s 881s x1: [4, 4, 4, 1] 881s -0.629 -0.549 -0.753 -0.851 881s 0.174 -0.010 0.350 -0.661 881s 0.166 -0.387 -0.498 -0.332 881s 0.894 -0.990 0.399 0.327 881s 881s y: [1, 4, 4, 1] 881s -0.682 881s 0.043 881s -0.241 881s 1.261 881s testing: mul_mat, [3, 1, 4, 3] = [4, 3, 4, 3] * [4, 1, 4, 3] 881s x0: [4, 3, 4, 3] 881s 0.098 -0.434 -0.683 -0.233 881s -0.621 -0.444 0.432 0.672 881s 0.135 0.163 0.555 -0.474 881s 881s x1: [4, 1, 4, 3] 881s 0.612 0.832 0.813 0.809 881s 881s y: [3, 1, 4, 3] 881s -1.045 0.146 0.286 881s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 881s x0: [4, 4, 1, 1] 881s 0.914 -0.768 -0.877 0.531 881s -0.764 -0.858 0.439 0.967 881s 0.949 0.529 0.105 -0.732 881s 0.227 0.128 -0.385 -0.256 881s 881s x1: [4, 3, 1, 1] 881s -0.844 0.346 0.760 -0.357 881s 0.080 -0.951 0.743 -0.706 881s 0.044 -0.119 0.179 -0.167 881s 881s y: [4, 3, 1, 1] 881s -1.893 0.336 -0.277 -0.348 881s -0.223 0.399 0.166 -0.209 881s -0.114 -0.014 0.120 -0.031 881s testing: mul_mat, [1, 4, 4, 1] = [4, 1, 4, 1] * [4, 4, 4, 1] 881s x0: [4, 1, 4, 1] 881s 0.363 -0.639 0.333 0.444 881s 881s x1: [4, 4, 4, 1] 881s -0.454 -0.684 -0.100 0.892 881s -0.924 0.543 -0.028 -0.875 881s 0.286 0.267 0.169 -0.833 881s -0.554 -0.997 -0.431 -0.640 881s 881s y: [1, 4, 4, 1] 881s 0.635 881s -1.080 881s -0.380 881s 0.008 881s testing: mul_mat, [3, 4, 4, 3] = [1, 3, 4, 3] * [1, 4, 4, 3] 881s x0: [1, 3, 4, 3] 881s -0.089 881s 0.771 881s 0.046 881s 881s x1: [1, 4, 4, 3] 881s -0.089 881s -0.802 881s 0.338 881s -0.425 881s 881s y: [3, 4, 4, 3] 881s 0.008 -0.068 -0.004 881s 0.071 -0.618 -0.037 881s -0.030 0.260 0.016 881s 0.038 -0.328 -0.020 881s test-mul-mat0: iter:475/500 881s testing: mul_mat, [1, 2, 1, 1] = [3, 1, 1, 1] * [3, 2, 1, 1] 881s x0: [3, 1, 1, 1] 881s -0.913 -0.332 -0.679 881s 881s x1: [3, 2, 1, 1] 881s -0.222 -0.581 -0.862 881s -0.266 -0.307 -0.485 881s 881s y: [1, 2, 1, 1] 881s 0.981 881s 0.675 881s testing: mul_mat, [2, 1, 3, 1] = [3, 2, 3, 1] * [3, 1, 3, 1] 881s x0: [3, 2, 3, 1] 881s 0.961 -0.878 0.011 881s 0.254 -0.257 -0.691 881s 881s x1: [3, 1, 3, 1] 881s 0.544 -0.296 -0.044 881s 881s y: [2, 1, 3, 1] 881s 0.782 0.245 881s testing: mul_mat, [2, 2, 3, 3] = [3, 2, 3, 3] * [3, 2, 3, 3] 881s x0: [3, 2, 3, 3] 881s 0.279 -0.843 0.176 881s -0.205 -0.118 -0.415 881s 881s x1: [3, 2, 3, 3] 881s 0.743 -0.700 -0.735 881s 0.287 0.004 0.221 881s 881s y: [2, 2, 3, 3] 881s 0.668 0.234 881s 0.116 -0.151 881s testing: mul_mat, [4, 2, 1, 1] = [3, 4, 1, 1] * [3, 2, 1, 1] 881s x0: [3, 4, 1, 1] 881s -0.910 -0.128 0.446 881s 0.722 -0.186 0.941 881s 0.532 -0.067 0.659 881s -0.781 -0.724 0.183 881s 881s x1: [3, 2, 1, 1] 881s 0.739 0.585 -0.710 881s -0.108 0.138 0.366 881s 881s y: [4, 2, 1, 1] 881s -1.064 -0.245 -0.114 -1.131 881s 0.244 0.241 0.174 0.052 881s testing: mul_mat, [1, 3, 3, 1] = [4, 1, 3, 1] * [4, 3, 3, 1] 881s x0: [4, 1, 3, 1] 881s 0.482 0.484 -0.530 0.619 881s 881s x1: [4, 3, 3, 1] 881s 0.956 0.965 -0.890 0.745 881s 0.511 0.451 -0.433 -0.467 881s -0.967 -0.863 0.306 0.291 881s 881s y: [1, 3, 3, 1] 881s 1.861 881s 0.405 881s -0.866 881s testing: mul_mat, [3, 4, 3, 3] = [1, 3, 3, 3] * [1, 4, 3, 3] 881s x0: [1, 3, 3, 3] 881s 0.193 881s 0.283 881s 0.450 881s 881s x1: [1, 4, 3, 3] 881s -0.837 881s 0.830 881s -0.045 881s 0.596 881s 881s y: [3, 4, 3, 3] 881s -0.162 -0.237 -0.377 881s 0.161 0.235 0.373 881s -0.009 -0.013 -0.020 881s 0.115 0.168 0.268 881s test-mul-mat0: iter:476/500 881s testing: mul_mat, [2, 2, 1, 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 881s x0: [4, 2, 1, 1] 881s -0.830 0.712 0.383 -0.467 881s -0.576 0.681 0.197 -0.027 881s 881s x1: [4, 2, 1, 1] 881s 0.774 0.525 0.051 -0.498 881s 0.460 -0.714 0.638 0.218 881s 881s y: [2, 2, 1, 1] 881s -0.017 -0.065 881s -0.747 -0.631 881s testing: mul_mat, [4, 2, 1, 1] = [4, 4, 1, 1] * [4, 2, 1, 1] 881s x0: [4, 4, 1, 1] 881s -0.565 -0.668 -0.863 -0.324 881s 0.933 0.912 -0.799 -0.016 881s -0.587 0.661 0.270 -0.949 881s -0.121 -0.984 -0.779 -0.409 881s 881s x1: [4, 2, 1, 1] 881s -0.571 0.923 -0.147 0.310 881s -0.948 -0.673 -0.965 0.870 881s 881s y: [4, 2, 1, 1] 881s -0.268 0.421 0.611 -0.852 881s 1.537 -0.742 -0.974 1.172 881s testing: mul_mat, [4, 4, 1, 3] = [4, 4, 1, 3] * [4, 4, 1, 3] 881s x0: [4, 4, 1, 3] 881s 0.252 0.947 -0.333 0.420 881s -0.009 0.853 0.248 0.305 881s 0.090 0.956 0.635 -0.451 881s 0.254 0.364 -0.239 0.567 881s 881s x1: [4, 4, 1, 3] 881s 0.400 -0.246 ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s 0.015 0.081 881s 0.951 0.987 0.510 0.874 881s -0.159 -0.180 0.926 0.168 881s -0.144 0.796 -0.276 0.290 881s 881s y: [4, 4, 1, 3] 881s -0.103 -0.185 -0.226 0.054 881s 1.372 1.226 0.959 0.975 881s -0.448 0.129 0.326 -0.232 881s 0.931 0.700 0.441 0.484 881s testing: mul_mat, [4, 4, 1, 1] = [4, 4, 1, 1] * [4, 4, 1, 1] 881s x0: [4, 4, 1, 1] 881s -0.580 -0.938 -0.678 0.712 881s 0.250 0.921 -0.902 -0.694 881s -0.932 -0.950 -0.680 0.570 881s 0.257 -0.491 0.484 0.193 881s 881s x1: [4, 4, 1, 1] 881s -0.914 0.481 -0.520 0.048 881s -0.512 -0.885 0.973 0.805 881s 0.745 -0.344 -0.847 -0.338 881s 0.226 -0.604 -0.169 0.743 881s 881s y: [4, 4, 1, 1] 881s 0.466 0.650 0.776 -0.714 881s 1.040 -2.379 1.115 0.929 881s 0.223 0.868 0.015 -0.114 881s 1.078 -0.863 0.901 0.416 881s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 881s x0: [4, 2, 1, 1] 881s 0.409 0.529 0.867 0.652 881s 0.816 0.330 -0.962 0.965 881s 881s x1: [4, 4, 1, 1] 881s -0.214 -0.382 0.681 -0.099 881s -0.409 0.486 -0.354 0.247 881s 0.639 0.308 -0.527 -0.965 881s -0.861 -0.784 0.341 -0.441 881s 881s y: [2, 4, 1, 1] 881s 0.236 -1.052 881s -0.056 0.405 881s -0.661 0.200 881s -0.759 -1.715 881s testing: mul_mat, [1, 4, 1, 3] = [2, 1, 1, 3] * [2, 4, 1, 3] 881s x0: [2, 1, 1, 3] 881s -0.153 0.791 881s 881s x1: [2, 4, 1, 3] 881s -0.642 0.136 881s 0.676 -0.337 881s -0.293 -0.131 881s 0.449 0.325 881s 881s y: [1, 4, 1, 3] 881s 0.206 881s -0.370 881s -0.059 881s 0.189 881s test-mul-mat0: iter:477/500 881s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 881s x0: [2, 3, 1, 1] 881s 0.705 -0.472 881s 0.782 -0.299 881s 0.691 0.457 881s 881s x1: [2, 2, 1, 1] 881s 0.287 0.196 881s 0.355 -0.388 881s 881s y: [3, 2, 1, 1] 881s 0.110 0.166 0.288 881s 0.434 0.394 0.068 881s testing: mul_mat, [1, 3, 1, 1] = [2, 1, 1, 1] * [2, 3, 1, 1] 881s x0: [2, 1, 1, 1] 881s -0.502 -0.319 881s 881s x1: [2, 3, 1, 1] 881s 0.006 0.327 881s 0.168 0.449 881s 0.179 -0.781 881s 881s y: [1, 3, 1, 1] 881s -0.107 881s -0.228 881s 0.159 881s testing: mul_mat, [1, 1, 1, 1] = [2, 1, 1, 1] * [2, 1, 1, 1] 881s x0: [2, 1, 1, 1] 881s 0.956 -0.770 881s 881s x1: [2, 1, 1, 1] 881s 0.270 0.345 881s 881s y: [1, 1, 1, 1] 881s -0.007 881s testing: mul_mat, [1, 1, 1, 1] = [2, 1, 1, 1] * [2, 1, 1, 1] 881s x0: [2, 1, 1, 1] 881s -0.504 -0.068 881s 881s x1: [2, 1, 1, 1] 881s 0.916 -0.394 881s 881s y: [1, 1, 1, 1] 881s -0.435 881s testing: mul_mat, [2, 2, 1, 1] = [1, 2, 1, 1] * [1, 2, 1, 1] 881s x0: [1, 2, 1, 1] 881s -0.266 881s -0.605 881s 881s x1: [1, 2, 1, 1] 881s 0.379 881s 0.783 881s 881s y: [2, 2, 1, 1] 881s -0.101 -0.229 881s -0.208 -0.474 881s testing: mul_mat, [1, 1, 1, 1] = [2, 1, 1, 1] * [2, 1, 1, 1] 881s x0: [2, 1, 1, 1] 881s 0.656 0.141 881s 881s x1: [2, 1, 1, 1] 881s 0.874 -0.561 881s 881s y: [1, 1, 1, 1] 881s 0.495 881s test-mul-mat0: iter:478/500 881s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 881s x0: [4, 1, 1, 1] 881s 0.653 0.273 0.193 0.569 881s 881s x1: [4, 2, 1, 1] 881s -0.719 0.595 0.121 -0.499 881s -0.953 0.619 0.181 0.317 881s 881s y: [1, 2, 1, 1] 881s -0.568 881s -0.238 881s testing: mul_mat, [3, 1, 3, 1] = [4, 3, 3, 1] * [4, 1, 3, 1] 881s x0: [4, 3, 3, 1] 881s -0.641 0.781 0.583 -0.495 881s 0.722 0.864 -0.899 -0.157 881s -0.635 -0.852 -0.539 0.546 881s 881s x1: [4, 1, 3, 1] 881s 0.879 -0.334 -0.935 -0.189 881s 881s y: [3, 1, 3, 1] 881s -1.276 1.216 0.127 881s testing: mul_mat, [3, 3, 3, 2] = [4, 3, 3, 2] * [4, 3, 3, 2] 881s x0: [4, 3, 3, 2] 881s 0.367 0.730 0.267 0.161 881s 0.416 0.338 -0.491 0.594 881s -0.538 0.519 0.188 0.820 881s 881s x1: [4, 3, 3, 2] 881s 0.098 -0.710 -0.564 0.463 881s -0.563 -0.102 0.009 0.902 881s -0.678 -0.792 -0.360 -0.059 881s 881s y: [3, 3, 3, 2] 881s -0.558 0.353 -0.147 881s -0.133 0.262 0.991 881s -0.932 -0.408 -0.162 881s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 881s x0: [4, 1, 1, 1] 881s -0.890 0.211 -0.200 0.279 881s 881s x1: [4, 3, 1, 1] 881s -0.869 -0.376 0.489 0.431 881s -0.150 0.789 0.501 -0.496 881s 0.795 0.793 -0.830 -0.819 881s 881s y: [1, 3, 1, 1] 881s 0.717 881s 0.062 881s -0.602 881s testing: mul_mat, [4, 4, 3, 1] = [1, 4, 3, 1] * [1, 4, 3, 1] 881s x0: [1, 4, 3, 1] 881s 0.382 881s 0.145 881s 0.783 881s -0.129 881s 881s x1: [1, 4, 3, 1] 881s -0.364 881s -0.521 881s 0.532 881s -0.665 881s 881s y: [4, 4, 3, 1] 881s -0.139 -0.053 -0.285 0.047 881s -0.199 -0.076 -0.408 0.067 881s 0.203 0.077 0.417 -0.069 881s -0.254 -0.097 -0.521 0.086 881s testing: mul_mat, [1, 1, 3, 2] = [4, 1, 3, 2] * [4, 1, 3, 2] 881s x0: [4, 1, 3, 2] 881s -0.928 0.536 -0.196 -0.473 881s 881s x1: [4, 1, 3, 2] 881s -0.364 0.041 0.417 0.847 881s 881s y: [1, 1, 3, 2] 881s -0.123 881s test-mul-mat0: iter:479/500 881s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 881s x0: [2, 4, 1, 1] 881s -0.477 0.469 881s 0.193 0.534 881s 0.756 0.416 881s 0.027 -0.636 881s 881s x1: [2, ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s 4, 1, 1] 881s -0.238 -0.920 881s -0.250 -0.166 881s 0.616 0.554 881s 0.360 0.022 881s 881s y: [4, 4, 1, 1] 881s -0.318 -0.537 -0.563 0.579 881s 0.041 -0.137 -0.258 0.099 881s -0.033 0.414 0.696 -0.336 881s -0.162 0.081 0.282 -0.004 881s testing: mul_mat, [3, 4, 4, 1] = [2, 3, 4, 1] * [2, 4, 4, 1] 881s x0: [2, 3, 4, 1] 881s -0.318 0.569 881s -0.901 0.199 881s -0.283 -0.058 881s 881s x1: [2, 4, 4, 1] 881s 0.868 0.338 881s -0.133 0.147 881s -0.955 -0.483 881s 0.149 -0.157 881s 881s y: [3, 4, 4, 1] 881s -0.083 -0.715 -0.265 881s 0.126 0.149 0.029 881s 0.029 0.765 0.298 881s -0.137 -0.165 -0.033 881s testing: mul_mat, [1, 3, 4, 3] = [2, 1, 4, 3] * [2, 3, 4, 3] 881s x0: [2, 1, 4, 3] 881s -0.242 0.546 881s 881s x1: [2, 3, 4, 3] 881s -0.614 0.277 881s -0.545 0.102 881s 0.092 -0.123 881s 881s y: [1, 3, 4, 3] 881s 0.300 881s 0.188 881s -0.089 881s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 881s x0: [2, 4, 1, 1] 881s 0.167 0.425 881s -0.333 0.308 881s 0.762 -0.232 881s 0.437 0.749 881s 881s x1: [2, 1, 1, 1] 881s -0.541 -0.293 881s 881s y: [4, 1, 1, 1] 881s -0.215 0.090 -0.344 -0.455 881s testing: mul_mat, [2, 2, 4, 1] = [4, 2, 4, 1] * [4, 2, 4, 1] 881s x0: [4, 2, 4, 1] 881s 0.048 -0.794 -0.653 -0.993 881s -0.146 -0.632 0.778 0.285 881s 881s x1: [4, 2, 4, 1] 881s -0.538 0.859 0.520 0.066 881s -0.956 0.320 0.494 0.571 881s 881s y: [2, 2, 4, 1] 881s -1.112 -0.041 881s -1.189 0.484 881s testing: mul_mat, [2, 4, 4, 3] = [2, 2, 4, 3] * [2, 4, 4, 3] 881s x0: [2, 2, 4, 3] 881s 0.872 -0.968 881s -0.231 0.634 881s 881s x1: [2, 4, 4, 3] 881s 0.454 -0.060 881s -0.020 0.801 881s -0.282 -0.013 881s 0.087 0.723 881s 881s y: [2, 4, 4, 3] 881s 0.453 -0.143 881s -0.793 0.513 881s -0.233 0.057 881s -0.624 0.439 881s test-mul-mat0: iter:480/500 881s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 881s x0: [1, 2, 1, 1] 881s 0.296 881s 0.143 881s 881s x1: [1, 4, 1, 1] 881s 0.884 881s 0.439 881s 0.941 881s 0.364 881s 881s y: [2, 4, 1, 1] 881s 0.262 0.126 881s 0.130 0.063 881s 0.279 0.134 881s 0.108 0.052 881s testing: mul_mat, [1, 2, 3, 1] = [1, 1, 3, 1] * [1, 2, 3, 1] 881s x0: [1, 1, 3, 1] 881s 0.335 881s 881s x1: [1, 2, 3, 1] 881s -0.910 881s 0.297 881s 881s y: [1, 2, 3, 1] 881s -0.305 881s 0.100 881s testing: mul_mat, [2, 1, 3, 4] = [1, 2, 3, 4] * [1, 1, 3, 4] 881s x0: [1, 2, 3, 4] 881s -0.633 881s -0.636 881s 881s x1: [1, 1, 3, 4] 881s 0.762 881s 881s y: [2, 1, 3, 4] 881s -0.482 -0.485 881s testing: mul_mat, [1, 2, 1, 1] = [1, 1, 1, 1] * [1, 2, 1, 1] 881s x0: [1, 1, 1, 1] 881s -0.292 881s 881s x1: [1, 2, 1, 1] 881s -0.837 881s -0.930 881s 881s y: [1, 2, 1, 1] 881s 0.245 881s 0.272 881s testing: mul_mat, [4, 1, 3, 1] = [1, 4, 3, 1] * [1, 1, 3, 1] 881s x0: [1, 4, 3, 1] 881s 0.984 881s -0.723 881s -0.513 881s 0.711 881s 881s x1: [1, 1, 3, 1] 881s 0.812 881s 881s y: [4, 1, 3, 1] 881s 0.798 -0.587 -0.416 0.577 881s testing: mul_mat, [1, 1, 3, 4] = [4, 1, 3, 4] * [4, 1, 3, 4] 881s x0: [4, 1, 3, 4] 881s -0.133 0.859 0.268 -0.842 881s 881s x1: [4, 1, 3, 4] 881s 0.756 -0.683 -0.737 -0.406 881s 881s y: [1, 1, 3, 4] 881s -0.543 881s test-mul-mat0: iter:481/500 881s testing: mul_mat, [2, 3, 1, 1] = [4, 2, 1, 1] * [4, 3, 1, 1] 881s x0: [4, 2, 1, 1] 881s 0.546 0.631 0.721 -0.663 881s -0.351 -0.933 0.043 -0.028 881s 881s x1: [4, 3, 1, 1] 881s -0.374 -0.186 -0.493 0.462 881s -0.024 0.332 0.305 0.201 881s 0.495 -0.853 -0.705 0.517 881s 881s y: [2, 3, 1, 1] 881s -0.983 0.270 881s 0.283 -0.293 881s -1.119 0.576 881s testing: mul_mat, [2, 2, 1, 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 881s x0: [4, 2, 1, 1] 881s 0.034 -0.994 0.658 0.541 881s 0.468 -0.366 -0.127 -0.226 881s 881s x1: [4, 2, 1, 1] 881s 0.920 0.999 -0.244 -0.784 881s 0.803 -0.248 -0.447 0.380 881s 881s y: [2, 2, 1, 1] 881s -1.546 0.273 881s 0.186 0.438 881s testing: mul_mat, [2, 2, 1, 2] = [4, 2, 1, 2] * [4, 2, 1, 2] 881s x0: [4, 2, 1, 2] 881s 0.103 -0.141 -0.355 0.657 881s -0.761 0.489 -0.309 -0.755 881s 881s x1: [4, 2, 1, 2] 881s 0.835 -0.632 -0.079 -0.870 881s 0.885 0.179 0.676 0.516 881s 881s y: [2, 2, 1, 2] 881s -0.368 -0.264 881s 0.165 -1.185 881s testing: mul_mat, [2, 2, 1, 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 881s x0: [4, 2, 1, 1] 881s -0.546 -0.823 0.548 -0.557 881s -0.508 0.340 -0.929 -0.593 881s 881s x1: [4, 2, 1, 1] 881s 0.409 0.747 0.359 -0.413 881s 0.423 -0.125 0.487 0.436 881s 881s y: [2, 2, 1, 1] 881s -0.412 -0.042 881s -0.104 -0.968 881s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 881s x0: [2, 2, 1, 1] 881s 0.134 0.092 881s -0.872 -0.248 881s 881s x1: [2, 4, 1, 1] 881s -0.284 -0.900 881s -0.354 -0.795 881s -0.209 -0.109 881s 0.353 0.023 881s 881s y: [2, 4, 1, 1] 881s -0.121 0.471 881s -0.121 0.506 881s -0.038 0.209 881s 0.050 -0.313 881s testing: mul_mat, [4, 2, 1, 2] = [2, 4, 1, 2] * [2, 2, 1, 2] 881s x0: [2, 4, 1, 2] 881s -0.125 -0.632 881s 0.028 0.398 881s 0.850 0.123 881s 0.052 -0.188 881s 881s x1: [2, 2, 1, 2] 881s 0.601 -0.500 881s -0.501 -0.040 881s 881s y: [4, 2, 1, 2] 881s 0.240 -0.ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s 182 0.450 0.125 881s 0.088 -0.030 -0.431 -0.019 881s test-mul-mat0: iter:482/500 881s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 881s x0: [4, 3, 1, 1] 881s -0.090 0.176 -0.403 -0.693 881s -0.701 0.409 -0.888 -0.862 881s -0.679 -0.437 -0.818 -0.977 881s 881s x1: [4, 3, 1, 1] 881s 0.038 0.283 -0.545 0.538 881s 0.781 0.415 -0.374 0.703 881s 0.250 0.201 0.060 -0.876 881s 881s y: [3, 3, 1, 1] 881s -0.107 0.108 -0.229 881s -0.333 -0.652 -1.092 881s 0.595 0.609 0.549 881s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 881s x0: [4, 1, 1, 1] 881s -0.124 -0.278 -0.746 0.936 881s 881s x1: [4, 3, 1, 1] 881s 0.905 0.578 -0.251 0.851 881s -0.891 0.603 -0.111 0.392 881s -0.942 -0.572 0.173 0.473 881s 881s y: [1, 3, 1, 1] 881s 0.711 881s 0.393 881s 0.589 881s testing: mul_mat, [3, 1, 1, 4] = [4, 3, 1, 4] * [4, 1, 1, 4] 881s x0: [4, 3, 1, 4] 881s 0.584 -0.671 0.184 -0.527 881s 0.721 0.242 -0.099 -0.106 881s -0.285 0.955 0.771 0.438 881s 881s x1: [4, 1, 1, 4] 881s -0.153 -0.517 -0.154 -0.977 881s 881s y: [3, 1, 1, 4] 881s 0.744 -0.118 -0.997 881s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 881s x0: [4, 1, 1, 1] 881s -0.168 0.049 -0.665 0.499 881s 881s x1: [4, 3, 1, 1] 881s -0.345 -0.882 -0.370 0.285 881s -0.622 0.601 -0.626 0.447 881s -0.600 -0.895 0.349 -0.068 881s 881s y: [1, 3, 1, 1] 881s 0.403 881s 0.773 881s -0.208 881s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 881s x0: [1, 3, 1, 1] 881s -0.965 881s -0.425 881s 0.896 881s 881s x1: [1, 4, 1, 1] 881s 0.639 881s -0.665 881s 0.410 881s 0.129 881s 881s y: [3, 4, 1, 1] 881s -0.616 -0.272 0.572 881s 0.641 0.283 -0.596 881s -0.395 -0.174 0.367 881s -0.125 -0.055 0.116 881s testing: mul_mat, [3, 1, 1, 4] = [3, 3, 1, 4] * [3, 1, 1, 4] 881s x0: [3, 3, 1, 4] 881s -0.483 0.866 -0.302 881s 0.823 -0.245 0.804 881s -0.134 -0.078 0.256 881s 881s x1: [3, 1, 1, 4] 881s -0.901 -0.771 -0.732 881s 881s y: [3, 1, 1, 4] 881s -0.011 -1.142 -0.007 881s test-mul-mat0: iter:483/500 881s testing: mul_mat, [2, 3, 1, 1] = [2, 2, 1, 1] * [2, 3, 1, 1] 881s x0: [2, 2, 1, 1] 881s -0.030 -0.385 881s 0.983 -0.654 881s 881s x1: [2, 3, 1, 1] 881s -0.007 0.251 881s 0.074 -0.400 881s 0.823 0.897 881s 881s y: [2, 3, 1, 1] 881s -0.096 -0.171 881s 0.152 0.335 881s -0.370 0.223 881s testing: mul_mat, [1, 2, 3, 1] = [2, 1, 3, 1] * [2, 2, 3, 1] 881s x0: [2, 1, 3, 1] 881s -0.120 -0.013 881s 881s x1: [2, 2, 3, 1] 881s 0.601 -0.829 881s -0.702 0.780 881s 881s y: [1, 2, 3, 1] 881s -0.061 881s 0.074 881s testing: mul_mat, [3, 1, 3, 4] = [2, 3, 3, 4] * [2, 1, 3, 4] 881s x0: [2, 3, 3, 4] 881s -0.173 -0.571 881s 0.457 0.934 881s -0.788 0.723 881s 881s x1: [2, 1, 3, 4] 881s -0.411 -0.224 881s 881s y: [3, 1, 3, 4] 881s 0.199 -0.397 0.162 881s testing: mul_mat, [4, 3, 1, 1] = [2, 4, 1, 1] * [2, 3, 1, 1] 881s x0: [2, 4, 1, 1] 881s 0.022 -0.342 881s 0.664 0.587 881s 0.685 0.543 881s 0.085 -0.758 881s 881s x1: [2, 3, 1, 1] 881s -0.946 0.438 881s 0.878 -0.578 881s 0.457 0.923 881s 881s y: [4, 3, 1, 1] 881s -0.171 -0.371 -0.410 -0.413 881s 0.217 0.244 0.287 0.513 881s -0.306 0.845 0.815 -0.661 881s testing: mul_mat, [2, 2, 3, 1] = [4, 2, 3, 1] * [4, 2, 3, 1] 881s x0: [4, 2, 3, 1] 881s 0.219 -0.836 -0.137 0.256 881s 0.319 -0.195 -0.594 0.813 881s 881s x1: [4, 2, 3, 1] 881s -0.549 -0.050 -0.040 -0.776 881s -0.253 0.812 0.735 0.364 881s 881s y: [2, 2, 3, 1] 881s -0.272 -0.773 881s -0.741 -0.379 881s testing: mul_mat, [2, 4, 3, 4] = [2, 2, 3, 4] * [2, 4, 3, 4] 881s x0: [2, 2, 3, 4] 881s -0.616 0.902 881s 0.691 0.114 881s 881s x1: [2, 4, 3, 4] 881s -0.617 0.516 881s -0.273 -0.774 881s 0.050 -0.039 881s 0.732 -0.732 881s 881s y: [2, 4, 3, 4] 881s 0.846 -0.368 881s -0.530 -0.277 881s -0.066 0.030 881s -1.111 0.423 881s test-mul-mat0: iter:484/500 881s testing: mul_mat, [1, 4, 1, 1] = [2, 1, 1, 1] * [2, 4, 1, 1] 881s x0: [2, 1, 1, 1] 881s -0.751 -0.854 881s 881s x1: [2, 4, 1, 1] 881s 0.151 -0.508 881s -0.137 -0.223 881s -0.646 -0.363 881s 0.387 0.058 881s 881s y: [1, 4, 1, 1] 881s 0.320 881s 0.293 881s 0.795 881s -0.340 881s testing: mul_mat, [1, 1, 2, 1] = [2, 1, 2, 1] * [2, 1, 2, 1] 881s x0: [2, 1, 2, 1] 881s -0.343 -0.406 881s 881s x1: [2, 1, 2, 1] 881s 0.329 0.014 881s 881s y: [1, 1, 2, 1] 881s -0.118 881s testing: mul_mat, [2, 1, 2, 3] = [2, 2, 2, 3] * [2, 1, 2, 3] 881s x0: [2, 2, 2, 3] 881s -0.052 -0.055 881s 0.770 0.585 881s 881s x1: [2, 1, 2, 3] 881s 0.788 -0.463 881s 881s y: [2, 1, 2, 3] 881s -0.015 0.336 881s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 881s x0: [2, 2, 1, 1] 881s 0.301 0.326 881s -0.630 -0.754 881s 881s x1: [2, 2, 1, 1] 881s -0.970 -0.707 881s 0.133 -0.046 881s 881s y: [2, 2, 1, 1] 881s -0.522 1.145 881s 0.025 -0.049 881s testing: mul_mat, [2, 2, 2, 1] = [2, 2, 2, 1] * [2, 2, 2, 1] 881s x0: [2, 2, 2, 1] 881s 0.267 0.952 881s 0.876 0.248 881s 881s x1: [2, 2, 2, 1] 881s -0.860 -0.089 881s -0.423 -0.031 881s 881s y: [2, 2, 2, 1] 881s -0.315 -0.775 881s -0.143 -0.ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s 378 881s testing: mul_mat, [1, 2, 2, 3] = [2, 1, 2, 3] * [2, 2, 2, 3] 881s x0: [2, 1, 2, 3] 881s -0.891 -0.887 881s 881s x1: [2, 2, 2, 3] 881s 0.949 -0.125 881s 0.192 0.689 881s 881s y: [1, 2, 2, 3] 881s -0.735 881s -0.782 881s test-mul-mat0: iter:485/500 881s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 881s x0: [4, 2, 1, 1] 881s -0.946 0.332 -0.840 0.284 881s -0.263 0.758 -0.800 -0.107 881s 881s x1: [4, 1, 1, 1] 881s 0.415 0.820 0.458 0.085 881s 881s y: [2, 1, 1, 1] 881s -0.481 0.137 881s testing: mul_mat, [3, 2, 4, 1] = [4, 3, 4, 1] * [4, 2, 4, 1] 881s x0: [4, 3, 4, 1] 881s 0.939 -0.014 0.581 -0.267 881s 0.725 -0.288 0.267 0.340 881s 0.720 0.231 -0.177 0.829 881s 881s x1: [4, 2, 4, 1] 881s -0.377 -0.008 0.002 -0.264 881s -0.555 -0.206 -0.261 0.131 881s 881s y: [3, 2, 4, 1] 881s -0.282 -0.360 -0.492 881s -0.706 -0.368 -0.293 881s testing: mul_mat, [4, 3, 4, 3] = [4, 4, 4, 3] * [4, 3, 4, 3] 881s x0: [4, 4, 4, 3] 881s -0.556 -0.406 0.874 -0.326 881s 0.073 -0.432 -0.360 -0.579 881s -0.053 0.482 0.751 -0.360 881s -0.565 -0.562 -0.490 -0.311 881s 881s x1: [4, 3, 4, 3] 881s 0.717 0.342 0.390 0.842 881s -0.636 0.857 0.836 0.331 881s -0.632 0.457 -0.558 0.815 881s 881s y: [4, 3, 4, 3] 881s -0.471 -0.723 0.117 -1.050 881s 0.628 -0.909 0.955 -0.635 881s -0.588 -0.514 -0.459 0.120 881s testing: mul_mat, [2, 4, 1, 1] = [4, 2, 1, 1] * [4, 4, 1, 1] 881s x0: [4, 2, 1, 1] 881s 0.528 0.890 0.437 0.555 881s -0.749 0.054 -0.572 0.657 881s 881s x1: [4, 4, 1, 1] 881s -0.961 -0.091 -0.463 -0.748 881s 0.417 -0.364 -0.918 0.083 881s 0.007 -0.382 -0.629 0.793 881s 0.525 -0.961 -0.500 -0.615 881s 881s y: [2, 4, 1, 1] 881s -1.206 0.488 881s -0.459 0.247 881s -0.171 0.854 881s -1.137 -0.564 881s testing: mul_mat, [4, 4, 4, 1] = [2, 4, 4, 1] * [2, 4, 4, 1] 881s x0: [2, 4, 4, 1] 881s -0.775 0.744 881s -0.902 0.489 881s 0.847 -0.357 881s -0.834 0.640 881s 881s x1: [2, 4, 4, 1] 881s -0.256 -0.438 881s -0.627 0.606 881s -0.267 0.941 881s 0.645 0.642 881s 881s y: [4, 4, 4, 1] 881s -0.127 0.017 -0.061 -0.067 881s 0.936 0.862 -0.747 0.911 881s 0.906 0.701 -0.562 0.825 881s -0.023 -0.268 0.318 -0.127 881s testing: mul_mat, [1, 2, 4, 3] = [4, 1, 4, 3] * [4, 2, 4, 3] 881s x0: [4, 1, 4, 3] 881s -0.872 -0.073 -0.515 0.123 881s 881s x1: [4, 2, 4, 3] 881s -0.159 -0.341 -0.955 -0.416 881s -0.852 -0.312 -0.775 -0.305 881s 881s y: [1, 2, 4, 3] 881s 0.605 881s 1.128 881s test-mul-mat0: iter:486/500 881s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 881s x0: [4, 4, 1, 1] 881s -0.611 -0.212 -0.118 0.239 881s -0.773 0.578 0.016 -0.849 881s 0.397 -0.537 -0.182 -0.652 881s -0.989 0.060 -0.962 0.896 881s 881s x1: [4, 1, 1, 1] 881s 0.046 -0.495 0.088 -0.913 881s 881s y: [4, 1, 1, 1] 881s -0.152 0.455 0.863 -0.978 881s testing: mul_mat, [1, 4, 2, 1] = [4, 1, 2, 1] * [4, 4, 2, 1] 881s x0: [4, 1, 2, 1] 881s -0.290 -0.130 -0.943 0.407 881s 881s x1: [4, 4, 2, 1] 881s -0.316 -0.991 -0.526 0.715 881s 0.266 -0.303 0.255 -0.706 881s 0.721 -0.865 0.340 -0.773 881s 0.223 0.428 -0.460 0.612 881s 881s y: [1, 4, 2, 1] 881s 1.007 881s -0.566 881s -0.733 881s 0.563 881s testing: mul_mat, [2, 1, 2, 1] = [4, 2, 2, 1] * [4, 1, 2, 1] 881s x0: [4, 2, 2, 1] 881s 0.778 -0.683 0.813 -0.629 881s -0.287 -0.858 0.783 0.458 881s 881s x1: [4, 1, 2, 1] 881s 0.692 0.356 0.483 0.119 881s 881s y: [2, 1, 2, 1] 881s 0.613 -0.070 881s testing: mul_mat, [3, 2, 1, 1] = [4, 3, 1, 1] * [4, 2, 1, 1] 881s x0: [4, 3, 1, 1] 881s -0.750 0.586 0.364 0.735 881s 0.217 -0.466 -0.149 0.564 881s 0.244 0.190 0.003 0.145 881s 881s x1: [4, 2, 1, 1] 881s 0.088 -0.163 -0.595 0.439 881s 0.966 -0.335 0.131 0.321 881s 881s y: [3, 2, 1, 1] 881s -0.055 0.431 0.052 881s -0.636 0.527 0.219 881s testing: mul_mat, [3, 4, 2, 1] = [3, 3, 2, 1] * [3, 4, 2, 1] 881s x0: [3, 3, 2, 1] 881s 0.150 -0.115 -0.597 881s -0.499 -0.934 0.088 881s -0.883 0.262 -0.111 881s 881s x1: [3, 4, 2, 1] 881s 0.518 0.022 0.627 881s 0.772 0.262 -0.654 881s 0.181 -0.707 0.775 881s 0.461 0.381 -0.388 881s 881s y: [3, 4, 2, 1] 881s -0.300 -0.224 -0.521 881s 0.476 -0.687 -0.541 881s -0.355 0.638 -0.431 881s 0.257 -0.620 -0.265 881s testing: mul_mat, [3, 3, 2, 1] = [3, 3, 2, 1] * [3, 3, 2, 1] 881s x0: [3, 3, 2, 1] 881s 0.217 0.391 -0.253 881s -0.481 -0.131 0.511 881s 0.203 -0.562 0.448 881s 881s x1: [3, 3, 2, 1] 881s -0.316 -0.158 -0.800 881s 0.216 -0.207 -0.901 881s 0.896 -0.007 0.215 881s 881s y: [3, 3, 2, 1] 881s 0.072 -0.236 -0.334 881s 0.193 -0.537 -0.244 881s 0.137 -0.320 0.282 881s test-mul-mat0: iter:487/500 881s testing: mul_mat, [3, 1, 1, 1] = [2, 3, 1, 1] * [2, 1, 1, 1] 881s x0: [2, 3, 1, 1] 881s -0.130 -0.184 881s 0.168 0.684 881s -0.752 0.482 881s 881s x1: [2, 1, 1, 1] 881s -0.059 0.450 881s 881s y: [3, 1, 1, 1] 881s -0.075 0.298 0.261 881s testing: mul_mat, [1, 3, 2, 1] = [2, 1, 2, 1] ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s * [2, 3, 2, 1] 881s x0: [2, 1, 2, 1] 881s 0.056 -0.087 881s 881s x1: [2, 3, 2, 1] 881s -0.099 -0.233 881s -0.315 -0.708 881s 0.636 0.123 881s 881s y: [1, 3, 2, 1] 881s 0.015 881s 0.044 881s 0.025 881s testing: mul_mat, [1, 1, 2, 3] = [2, 1, 2, 3] * [2, 1, 2, 3] 881s x0: [2, 1, 2, 3] 881s 0.626 -0.218 881s 881s x1: [2, 1, 2, 3] 881s -0.717 0.678 881s 881s y: [1, 1, 2, 3] 881s -0.597 881s testing: mul_mat, [4, 1, 1, 1] = [2, 4, 1, 1] * [2, 1, 1, 1] 881s x0: [2, 4, 1, 1] 881s -0.441 0.566 881s 0.888 0.277 881s -0.109 0.710 881s -0.158 -0.575 881s 881s x1: [2, 1, 1, 1] 881s 0.646 0.571 881s 881s y: [4, 1, 1, 1] 881s 0.038 0.732 0.335 -0.431 881s testing: mul_mat, [3, 2, 2, 1] = [4, 3, 2, 1] * [4, 2, 2, 1] 881s x0: [4, 3, 2, 1] 881s -0.082 -0.436 0.123 0.965 881s 0.368 -0.061 -0.172 -0.606 881s 0.019 0.685 -0.424 0.853 881s 881s x1: [4, 2, 2, 1] 881s -0.397 -0.972 0.400 -0.200 881s 0.162 0.544 -0.319 -0.456 881s 881s y: [3, 2, 2, 1] 881s 0.313 -0.035 -1.014 881s -0.730 0.358 0.122 881s testing: mul_mat, [1, 4, 2, 3] = [3, 1, 2, 3] * [3, 4, 2, 3] 881s x0: [3, 1, 2, 3] 881s -0.424 -0.812 0.078 881s 881s x1: [3, 4, 2, 3] 881s -0.899 -0.587 0.935 881s -0.422 0.326 -0.415 881s 0.986 -0.756 0.953 881s 0.004 -0.193 -0.108 881s 881s y: [1, 4, 2, 3] 881s 0.931 881s -0.118 881s 0.270 881s 0.146 881s test-mul-mat0: iter:488/500 881s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 881s x0: [2, 2, 1, 1] 881s 0.049 -0.683 881s 0.473 -0.126 881s 881s x1: [2, 4, 1, 1] 881s 0.017 -0.221 881s -0.993 0.426 881s 0.846 0.883 881s -0.527 -0.871 881s 881s y: [2, 4, 1, 1] 881s 0.152 0.036 881s -0.340 -0.523 881s -0.562 0.289 881s 0.570 -0.139 881s testing: mul_mat, [2, 2, 2, 1] = [2, 2, 2, 1] * [2, 2, 2, 1] 881s x0: [2, 2, 2, 1] 881s 0.041 0.752 881s 0.763 -0.292 881s 881s x1: [2, 2, 2, 1] 881s 0.675 0.010 881s -0.938 0.323 881s 881s y: [2, 2, 2, 1] 881s 0.036 0.512 881s 0.205 -0.810 881s testing: mul_mat, [1, 2, 2, 1] = [2, 1, 2, 1] * [2, 2, 2, 1] 881s x0: [2, 1, 2, 1] 881s 0.501 0.965 881s 881s x1: [2, 2, 2, 1] 881s 0.216 0.017 881s 0.744 0.223 881s 881s y: [1, 2, 2, 1] 881s 0.124 881s 0.588 881s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 881s x0: [2, 2, 1, 1] 881s -0.099 -0.438 881s 0.034 -0.554 881s 881s x1: [2, 1, 1, 1] 881s -0.161 0.710 881s 881s y: [2, 1, 1, 1] 881s -0.295 -0.399 881s testing: mul_mat, [1, 2, 2, 1] = [2, 1, 2, 1] * [2, 2, 2, 1] 881s x0: [2, 1, 2, 1] 881s 0.618 0.768 881s 881s x1: [2, 2, 2, 1] 881s 0.869 -0.993 881s -0.080 -0.090 881s 881s y: [1, 2, 2, 1] 881s -0.225 881s -0.118 881s testing: mul_mat, [3, 2, 2, 1] = [1, 3, 2, 1] * [1, 2, 2, 1] 881s x0: [1, 3, 2, 1] 881s 0.993 881s -0.181 881s -0.013 881s 881s x1: [1, 2, 2, 1] 881s 0.511 881s -0.620 881s 881s y: [3, 2, 2, 1] 881s 0.508 -0.092 -0.006 881s -0.616 0.112 0.008 881s test-mul-mat0: iter:489/500 881s testing: mul_mat, [2, 4, 1, 1] = [1, 2, 1, 1] * [1, 4, 1, 1] 881s x0: [1, 2, 1, 1] 881s 0.065 881s -0.623 881s 881s x1: [1, 4, 1, 1] 881s 0.598 881s -0.492 881s -0.856 881s 0.467 881s 881s y: [2, 4, 1, 1] 881s 0.039 -0.372 881s -0.032 0.307 881s -0.055 0.533 881s 0.030 -0.291 881s testing: mul_mat, [2, 2, 3, 1] = [1, 2, 3, 1] * [1, 2, 3, 1] 881s x0: [1, 2, 3, 1] 881s -0.153 881s -0.753 881s 881s x1: [1, 2, 3, 1] 881s -0.725 881s 0.748 881s 881s y: [2, 2, 3, 1] 881s 0.111 0.546 881s -0.114 -0.563 881s testing: mul_mat, [1, 2, 3, 4] = [1, 1, 3, 4] * [1, 2, 3, 4] 881s x0: [1, 1, 3, 4] 881s 0.265 881s 881s x1: [1, 2, 3, 4] 881s -0.286 881s 0.318 881s 881s y: [1, 2, 3, 4] 881s -0.076 881s 0.084 881s testing: mul_mat, [2, 1, 1, 1] = [1, 2, 1, 1] * [1, 1, 1, 1] 881s x0: [1, 2, 1, 1] 881s 0.740 881s -0.823 881s 881s x1: [1, 1, 1, 1] 881s -0.431 881s 881s y: [2, 1, 1, 1] 881s -0.319 0.354 881s testing: mul_mat, [1, 1, 3, 1] = [2, 1, 3, 1] * [2, 1, 3, 1] 881s x0: [2, 1, 3, 1] 881s -0.073 0.829 881s 881s x1: [2, 1, 3, 1] 881s -0.245 0.032 881s 881s y: [1, 1, 3, 1] 881s 0.045 881s testing: mul_mat, [3, 2, 3, 4] = [1, 3, 3, 4] * [1, 2, 3, 4] 881s x0: [1, 3, 3, 4] 881s -0.086 881s -0.542 881s 0.091 881s 881s x1: [1, 2, 3, 4] 881s -0.198 881s -0.362 881s 881s y: [3, 2, 3, 4] 881s 0.017 0.107 -0.018 881s 0.031 0.196 -0.033 881s test-mul-mat0: iter:490/500 881s testing: mul_mat, [2, 1, 1, 1] = [2, 2, 1, 1] * [2, 1, 1, 1] 881s x0: [2, 2, 1, 1] 881s -0.758 0.463 881s 0.574 0.666 881s 881s x1: [2, 1, 1, 1] 881s -0.436 -0.726 881s 881s y: [2, 1, 1, 1] 881s -0.006 -0.734 881s testing: mul_mat, [2, 2, 1, 1] = [2, 2, 1, 1] * [2, 2, 1, 1] 881s x0: [2, 2, 1, 1] 881s 0.461 0.172 881s -0.757 0.570 881s 881s x1: [2, 2, 1, 1] 881s -0.970 -0.498 881s 0.744 -0.981 881s 881s y: [2, 2, 1, 1] 881s -0.533 0.451 881s 0.174 -1.123 881s testing: mul_mat, [1, 2, 1, 2] = [2, 1, 1, 2] * [2, 2, 1, 2] 881s x0: [2, 1, 1, 2] 881s 0.327 -0.553 881s 881s x1: [2, 2, 1, 2] 881s -0.479 0.525 881s -0.257 0.857 881s 881s y: [1, 2, 1, 2] 881s -0.447 881s -0.558 881s testing: mul_mat, [1, 1, 1, 1] = [2, 1, 1, 1] * [2, 1, 1, 1] 881s x0: [2, 1, 1, 1] 881s -0.732 -0.406 881s 881s x1: [2, 1, 1, 1] 881s 0.993 0.281 881s 881s y: [1, 1, 1, 1] 881s -0.841 881s teggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s sting: mul_mat, [1, 2, 1, 1] = [1, 1, 1, 1] * [1, 2, 1, 1] 881s x0: [1, 1, 1, 1] 881s -0.205 881s 881s x1: [1, 2, 1, 1] 881s 0.129 881s 0.731 881s 881s y: [1, 2, 1, 1] 881s -0.026 881s -0.150 881s testing: mul_mat, [1, 1, 1, 2] = [1, 1, 1, 2] * [1, 1, 1, 2] 881s x0: [1, 1, 1, 2] 881s 0.780 881s 881s x1: [1, 1, 1, 2] 881s 0.761 881s 881s y: [1, 1, 1, 2] 881s 0.593 881s test-mul-mat0: iter:491/500 881s testing: mul_mat, [4, 4, 1, 1] = [2, 4, 1, 1] * [2, 4, 1, 1] 881s x0: [2, 4, 1, 1] 881s -0.237 -0.204 881s 0.016 0.085 881s -0.211 -0.703 881s 0.972 0.057 881s 881s x1: [2, 4, 1, 1] 881s 0.473 0.104 881s -0.687 0.329 881s -0.458 -0.059 881s 0.435 -0.651 881s 881s y: [4, 4, 1, 1] 881s -0.133 0.017 -0.173 0.465 881s 0.096 0.017 -0.087 -0.649 881s 0.121 -0.012 0.138 -0.449 881s 0.030 -0.049 0.366 0.386 881s testing: mul_mat, [2, 4, 2, 1] = [2, 2, 2, 1] * [2, 4, 2, 1] 881s x0: [2, 2, 2, 1] 881s -0.918 -0.708 881s 0.257 0.518 881s 881s x1: [2, 4, 2, 1] 881s -0.109 0.101 881s -0.212 0.059 881s 0.895 -0.451 881s 0.730 -0.566 881s 881s y: [2, 4, 2, 1] 881s 0.029 0.024 881s 0.152 -0.024 881s -0.503 -0.004 881s -0.269 -0.106 881s testing: mul_mat, [1, 2, 2, 4] = [2, 1, 2, 4] * [2, 2, 2, 4] 881s x0: [2, 1, 2, 4] 881s -0.143 -0.485 881s 881s x1: [2, 2, 2, 4] 881s -0.180 0.366 881s -0.774 0.116 881s 881s y: [1, 2, 2, 4] 881s -0.152 881s 0.054 881s testing: mul_mat, [1, 1, 1, 1] = [2, 1, 1, 1] * [2, 1, 1, 1] 881s x0: [2, 1, 1, 1] 881s 0.019 0.031 881s 881s x1: [2, 1, 1, 1] 881s -0.241 0.512 881s 881s y: [1, 1, 1, 1] 881s 0.011 881s testing: mul_mat, [4, 2, 2, 1] = [1, 4, 2, 1] * [1, 2, 2, 1] 881s x0: [1, 4, 2, 1] 881s -0.615 881s 0.509 881s 0.395 881s 0.784 881s 881s x1: [1, 2, 2, 1] 881s -0.937 881s 0.571 881s 881s y: [4, 2, 2, 1] 881s 0.576 -0.477 -0.370 -0.734 881s -0.351 0.291 0.226 0.448 881s testing: mul_mat, [2, 1, 2, 4] = [4, 2, 2, 4] * [4, 1, 2, 4] 881s x0: [4, 2, 2, 4] 881s -0.192 0.114 0.242 0.216 881s 0.803 -0.541 0.909 -0.535 881s 881s x1: [4, 1, 2, 4] 881s -0.607 -0.179 0.311 -0.394 881s 881s y: [2, 1, 2, 4] 881s 0.086 0.103 881s test-mul-mat0: iter:492/500 881s testing: mul_mat, [4, 2, 1, 1] = [1, 4, 1, 1] * [1, 2, 1, 1] 881s x0: [1, 4, 1, 1] 881s 0.601 881s 0.369 881s 0.551 881s -0.455 881s 881s x1: [1, 2, 1, 1] 881s 0.517 881s -0.520 881s 881s y: [4, 2, 1, 1] 881s 0.311 0.191 0.285 -0.235 881s -0.312 -0.192 -0.286 0.236 881s testing: mul_mat, [2, 4, 2, 1] = [1, 2, 2, 1] * [1, 4, 2, 1] 881s x0: [1, 2, 2, 1] 881s 0.353 881s -0.961 881s 881s x1: [1, 4, 2, 1] 881s -0.806 881s 0.716 881s 0.960 881s 0.253 881s 881s y: [2, 4, 2, 1] 881s -0.285 0.774 881s 0.253 -0.688 881s 0.339 -0.922 881s 0.089 -0.243 881s testing: mul_mat, [4, 2, 2, 2] = [1, 4, 2, 2] * [1, 2, 2, 2] 881s x0: [1, 4, 2, 2] 881s -0.592 881s -0.435 881s -0.593 881s -0.112 881s 881s x1: [1, 2, 2, 2] 881s 0.453 881s 0.919 881s 881s y: [4, 2, 2, 2] 881s -0.268 -0.197 -0.269 -0.051 881s -0.544 -0.400 -0.545 -0.103 881s testing: mul_mat, [4, 4, 1, 1] = [1, 4, 1, 1] * [1, 4, 1, 1] 881s x0: [1, 4, 1, 1] 881s 0.585 881s 0.239 881s -0.556 881s 0.628 881s 881s x1: [1, 4, 1, 1] 881s 0.747 881s -0.980 881s -0.455 881s 0.786 881s 881s y: [4, 4, 1, 1] 881s 0.437 0.179 -0.415 0.469 881s -0.573 -0.235 0.545 -0.616 881s -0.266 -0.109 0.253 -0.286 881s 0.460 0.188 -0.437 0.494 881s testing: mul_mat, [1, 1, 2, 1] = [4, 1, 2, 1] * [4, 1, 2, 1] 881s x0: [4, 1, 2, 1] 881s -0.805 0.164 -0.894 -0.548 881s 881s x1: [4, 1, 2, 1] 881s 0.783 0.636 -0.198 -0.102 881s 881s y: [1, 1, 2, 1] 881s -0.295 881s testing: mul_mat, [4, 4, 2, 2] = [1, 4, 2, 2] * [1, 4, 2, 2] 881s x0: [1, 4, 2, 2] 881s 0.129 881s -0.358 881s -0.949 881s -0.565 881s 881s x1: [1, 4, 2, 2] 881s 0.172 881s -0.394 881s 0.674 881s -0.055 881s 881s y: [4, 4, 2, 2] 881s 0.022 -0.061 -0.163 -0.097 881s -0.051 0.141 0.374 0.222 881s 0.087 -0.241 -0.640 -0.381 881s -0.007 0.020 0.052 0.031 881s test-mul-mat0: iter:493/500 881s testing: mul_mat, [2, 2, 1, 1] = [4, 2, 1, 1] * [4, 2, 1, 1] 881s x0: [4, 2, 1, 1] 881s -0.011 -0.424 -0.176 0.632 881s -0.373 0.259 0.911 -0.224 881s 881s x1: [4, 2, 1, 1] 881s 0.996 -0.268 0.019 0.486 881s -0.339 0.465 0.420 -0.783 881s 881s y: [2, 2, 1, 1] 881s 0.407 -0.533 881s -0.762 0.805 881s testing: mul_mat, [4, 2, 4, 1] = [4, 4, 4, 1] * [4, 2, 4, 1] 881s x0: [4, 4, 4, 1] 881s -0.446 0.077 -0.107 -0.498 881s -0.969 -0.352 -0.464 0.575 881s -0.208 0.602 0.508 -0.252 881s -0.335 -0.827 0.723 0.213 881s 881s x1: [4, 2, 4, 1] 881s 0.889 0.354 0.032 -0.030 881s 0.038 0.947 -0.046 0.754 881s 881s y: [4, 2, 4, 1] 881s -0.357 -1.018 0.053 -0.574 881s -0.314 0.085 0.349 -0.668 881s testing: mul_mat, [3, 4, 4, 1] = [4, 3, 4, 1] * [4, 4, 4, 1] 881s x0: [4, 3, 4, 1] 881s 0.987 0.465 0.360 -0.199 881s 0.598 0.774 -0.323 -0.885 881s -0.798 0.505 0.517 -0.908 881s 881s x1: [4, 4, 4, 1] 881s -0.151 0.452 -0.979 0.476 881s -0.527 -0.712 -0.105 0.096 881s -0.936 0.062 -0.616 -0.987 881s -0.048 -0.115 0.462 0.004 881s 881s y: [3, 4, 4, 1] 881s -0.386 0ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s .155 -0.590 881s -0.908 -0.917 -0.080 881s -0.920 0.560 1.355 881s 0.064 -0.271 0.216 881s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 881s x0: [4, 1, 1, 1] 881s 0.433 0.895 0.279 -0.482 881s 881s x1: [4, 3, 1, 1] 881s -0.536 -0.637 -0.802 -0.553 881s 0.225 0.374 0.178 -0.007 881s -0.826 0.772 0.709 0.041 881s 881s y: [1, 3, 1, 1] 881s -0.759 881s 0.486 881s 0.511 881s testing: mul_mat, [3, 4, 4, 1] = [1, 3, 4, 1] * [1, 4, 4, 1] 881s x0: [1, 3, 4, 1] 881s 0.587 881s -0.828 881s 0.105 881s 881s x1: [1, 4, 4, 1] 881s 0.549 881s 0.364 881s 0.376 881s 0.881 881s 881s y: [3, 4, 4, 1] 881s 0.323 -0.455 0.058 881s 0.214 -0.301 0.038 881s 0.221 -0.311 0.040 881s 0.517 -0.729 0.093 881s testing: mul_mat, [4, 1, 4, 1] = [3, 4, 4, 1] * [3, 1, 4, 1] 881s x0: [3, 4, 4, 1] 881s -0.096 0.782 0.671 881s -0.165 -0.434 -0.602 881s -0.165 0.632 0.120 881s -0.955 0.954 -0.050 881s 881s x1: [3, 1, 4, 1] 881s -0.389 -0.175 0.764 881s 881s y: [4, 1, 4, 1] 881s 0.413 -0.320 0.045 0.167 881s test-mul-mat0: iter:494/500 881s testing: mul_mat, [4, 2, 1, 1] = [1, 4, 1, 1] * [1, 2, 1, 1] 881s x0: [1, 4, 1, 1] 881s -0.504 881s -0.088 881s 0.691 881s 0.705 881s 881s x1: [1, 2, 1, 1] 881s -0.895 881s -0.054 881s 881s y: [4, 2, 1, 1] 881s 0.451 0.079 -0.618 -0.631 881s 0.027 0.005 -0.037 -0.038 881s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 881s x0: [1, 3, 1, 1] 881s 0.564 881s 0.345 881s -0.336 881s 881s x1: [1, 4, 1, 1] 881s 0.105 881s -0.802 881s -0.783 881s -0.798 881s 881s y: [3, 4, 1, 1] 881s 0.059 0.036 -0.035 881s -0.452 -0.276 0.270 881s -0.442 -0.270 0.263 881s -0.450 -0.275 0.268 881s testing: mul_mat, [1, 3, 1, 1] = [1, 1, 1, 1] * [1, 3, 1, 1] 881s x0: [1, 1, 1, 1] 881s 0.291 881s 881s x1: [1, 3, 1, 1] 881s 0.477 881s -0.881 881s 0.742 881s 881s y: [1, 3, 1, 1] 881s 0.139 881s -0.256 881s 0.216 881s testing: mul_mat, [4, 1, 1, 1] = [1, 4, 1, 1] * [1, 1, 1, 1] 881s x0: [1, 4, 1, 1] 881s 0.398 881s -0.869 881s 0.402 881s 0.773 881s 881s x1: [1, 1, 1, 1] 881s 0.939 881s 881s y: [4, 1, 1, 1] 881s 0.374 -0.816 0.378 0.726 881s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 881s x0: [4, 1, 1, 1] 881s 0.131 0.189 -0.331 0.627 881s 881s x1: [4, 1, 1, 1] 881s -0.377 -0.307 0.026 -0.758 881s 881s y: [1, 1, 1, 1] 881s -0.591 881s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 881s x0: [1, 3, 1, 1] 881s 0.549 881s 0.408 881s -0.902 881s 881s x1: [1, 4, 1, 1] 881s -0.900 881s -0.640 881s 0.332 881s 0.205 881s 881s y: [3, 4, 1, 1] 881s -0.494 -0.367 0.812 881s -0.351 -0.261 0.578 881s 0.182 0.135 -0.300 881s 0.113 0.084 -0.185 881s test-mul-mat0: iter:495/500 881s testing: mul_mat, [4, 1, 1, 1] = [4, 4, 1, 1] * [4, 1, 1, 1] 881s x0: [4, 4, 1, 1] 881s 0.248 0.560 -0.427 -0.350 881s 0.333 0.197 0.343 -0.641 881s 0.439 -0.423 0.490 -0.372 881s 0.246 0.117 -0.272 0.606 881s 881s x1: [4, 1, 1, 1] 881s -0.129 -0.497 0.651 -0.838 881s 881s y: [4, 1, 1, 1] 881s -0.295 0.620 0.784 -0.775 881s testing: mul_mat, [2, 4, 4, 1] = [4, 2, 4, 1] * [4, 4, 4, 1] 881s x0: [4, 2, 4, 1] 881s -0.189 0.055 -0.713 0.357 881s -0.040 0.760 0.603 0.829 881s 881s x1: [4, 4, 4, 1] 881s -0.551 0.934 -0.836 0.998 881s 0.341 -0.738 0.111 -0.906 881s 0.024 0.701 -0.035 0.527 881s 0.352 0.127 0.970 -0.400 881s 881s y: [2, 4, 4, 1] 881s 1.108 1.055 881s -0.508 -1.259 881s 0.247 0.947 881s -0.894 0.335 881s testing: mul_mat, [3, 2, 4, 2] = [4, 3, 4, 2] * [4, 2, 4, 2] 881s x0: [4, 3, 4, 2] 881s 0.402 0.806 0.619 -0.421 881s 0.662 0.456 -0.390 0.268 881s -0.774 -0.998 0.866 -0.224 881s 881s x1: [4, 2, 4, 2] 881s -0.759 0.500 -0.269 0.200 881s 0.260 -0.666 0.029 -0.980 881s 881s y: [3, 2, 4, 2] 881s -0.153 -0.116 -0.190 881s -0.002 -0.405 0.709 881s testing: mul_mat, [4, 3, 1, 1] = [4, 4, 1, 1] * [4, 3, 1, 1] 881s x0: [4, 4, 1, 1] 881s -0.956 -0.632 -0.161 -0.415 881s 0.367 0.880 0.972 0.683 881s 0.790 0.262 -0.603 -0.549 881s -0.235 0.330 -0.890 -0.806 881s 881s x1: [4, 3, 1, 1] 881s -0.722 0.149 0.068 0.229 881s 0.722 -0.593 -0.538 0.012 881s -0.419 0.945 -0.030 0.579 881s 881s y: [4, 3, 1, 1] 881s 0.490 0.088 -0.697 -0.026 881s -0.234 -0.772 0.733 0.103 881s -0.432 1.044 -0.383 -0.030 881s testing: mul_mat, [3, 4, 4, 1] = [4, 3, 4, 1] * [4, 4, 4, 1] 881s x0: [4, 3, 4, 1] 881s 0.729 -0.907 0.449 -0.233 881s 0.464 -0.625 0.581 -0.340 881s 0.719 -0.161 -0.934 0.875 881s 881s x1: [4, 4, 4, 1] 881s 0.109 -0.761 -0.528 -0.742 881s 0.307 0.701 0.980 0.713 881s -0.837 -0.008 -0.706 -0.892 881s 0.962 0.874 -0.419 -0.994 881s 881s y: [3, 4, 4, 1] 881s 0.705 0.472 0.043 881s -0.138 0.031 -0.183 881s -0.712 -0.490 -0.723 881s -0.048 -0.005 0.071 881s testing: mul_mat, [1, 4, 4, 2] = [3, 1, 4, 2] * [3, 4, 4, 2] 881s x0: [3, 1, 4, 2] 881s -0.286 0.212 0.611 881s 881s x1: [3, 4, 4, 2] 881s -0.523 0.161 0.108 881s -0.515 0.607 0.079 881s 0.215 -0.225 -0.731 881s -0.681 0ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s .449 -0.197 881s 881s y: [1, 4, 4, 2] 881s 0.250 881s 0.324 881s -0.556 881s 0.169 881s test-mul-mat0: iter:496/500 881s testing: mul_mat, [3, 3, 1, 1] = [4, 3, 1, 1] * [4, 3, 1, 1] 881s x0: [4, 3, 1, 1] 881s -0.336 -0.698 -0.726 -0.180 881s -0.261 0.918 -0.675 0.124 881s -0.423 0.693 -0.584 0.629 881s 881s x1: [4, 3, 1, 1] 881s 0.209 -0.734 0.065 0.923 881s 0.478 -0.323 -0.674 -0.230 881s 0.864 -0.279 0.143 -0.169 881s 881s y: [3, 3, 1, 1] 881s 0.228 -0.658 -0.055 881s 0.595 0.005 -0.177 881s -0.169 -0.599 -0.749 881s testing: mul_mat, [1, 3, 1, 1] = [4, 1, 1, 1] * [4, 3, 1, 1] 881s x0: [4, 1, 1, 1] 881s 0.428 -0.408 0.407 -0.429 881s 881s x1: [4, 3, 1, 1] 881s 0.899 0.978 -0.532 0.228 881s -0.085 -0.014 -0.563 0.181 881s -0.948 -0.640 -0.342 -0.272 881s 881s y: [1, 3, 1, 1] 881s -0.329 881s -0.338 881s -0.168 881s testing: mul_mat, [1, 1, 1, 1] = [4, 1, 1, 1] * [4, 1, 1, 1] 881s x0: [4, 1, 1, 1] 881s -0.945 -0.536 0.895 -0.620 881s 881s x1: [4, 1, 1, 1] 881s 0.423 0.702 0.235 0.725 881s 881s y: [1, 1, 1, 1] 881s -1.015 881s testing: mul_mat, [2, 1, 1, 1] = [4, 2, 1, 1] * [4, 1, 1, 1] 881s x0: [4, 2, 1, 1] 881s 0.971 -0.431 -0.103 0.636 881s 0.982 0.199 0.555 -0.922 881s 881s x1: [4, 1, 1, 1] 881s 0.588 -0.528 -0.928 -0.996 881s 881s y: [2, 1, 1, 1] 881s 0.260 0.876 881s testing: mul_mat, [2, 4, 1, 1] = [2, 2, 1, 1] * [2, 4, 1, 1] 881s x0: [2, 2, 1, 1] 881s -0.650 0.971 881s 0.792 -0.925 881s 881s x1: [2, 4, 1, 1] 881s 0.607 0.996 881s -0.264 -0.665 881s -0.318 -0.836 881s -0.073 -0.910 881s 881s y: [2, 4, 1, 1] 881s 0.572 -0.441 881s -0.474 0.406 881s -0.605 0.522 881s -0.836 0.784 881s testing: mul_mat, [3, 2, 1, 1] = [2, 3, 1, 1] * [2, 2, 1, 1] 881s x0: [2, 3, 1, 1] 881s -0.873 -0.869 881s -0.863 -0.762 881s -0.522 -0.551 881s 881s x1: [2, 2, 1, 1] 881s 0.769 -0.974 881s -0.461 0.665 881s 881s y: [3, 2, 1, 1] 881s 0.175 0.079 0.135 881s -0.175 -0.108 -0.125 881s test-mul-mat0: iter:497/500 881s testing: mul_mat, [4, 4, 1, 1] = [4, 4, 1, 1] * [4, 4, 1, 1] 881s x0: [4, 4, 1, 1] 881s -0.280 -0.806 0.279 0.125 881s -0.680 0.416 0.603 -0.549 881s 0.654 -0.949 -0.436 -0.539 881s 0.699 0.573 -0.177 -0.017 881s 881s x1: [4, 4, 1, 1] 881s -0.638 0.284 0.087 0.969 881s 0.280 0.823 -0.696 0.963 881s 0.988 0.231 -0.948 -0.277 881s 0.580 0.845 -0.306 0.655 881s 881s y: [4, 4, 1, 1] 881s 0.095 0.072 -1.247 -0.315 881s -0.816 -0.796 -0.814 0.775 881s -0.761 -0.994 0.989 0.995 881s -0.847 -0.587 -0.642 0.933 881s testing: mul_mat, [4, 4, 2, 1] = [4, 4, 2, 1] * [4, 4, 2, 1] 881s x0: [4, 4, 2, 1] 881s -0.666 0.809 0.256 -0.440 881s -0.719 0.582 -0.161 0.476 881s 0.489 -0.038 0.366 0.050 881s -0.301 0.975 -0.195 0.715 881s 881s x1: [4, 4, 2, 1] 881s -0.339 -0.208 -0.737 -0.516 881s 0.096 -0.774 -0.528 -0.674 881s -0.722 0.195 0.907 -0.877 881s 0.889 0.562 0.737 -0.391 881s 881s y: [4, 4, 2, 1] 881s 0.096 -0.004 -0.453 -0.326 881s -0.529 -0.755 -0.151 -1.162 881s 1.256 0.069 -0.073 -0.396 881s 0.223 -0.618 0.663 -0.143 881s testing: mul_mat, [2, 4, 2, 2] = [4, 2, 2, 2] * [4, 4, 2, 2] 881s x0: [4, 2, 2, 2] 881s 0.635 -0.614 0.462 0.491 881s -0.966 0.180 -0.234 0.365 881s 881s x1: [4, 4, 2, 2] 881s -0.703 0.562 -0.514 -0.422 881s 0.144 0.325 -0.946 -0.367 881s -0.714 0.420 0.683 -0.014 881s 0.394 -0.512 -0.299 0.103 881s 881s y: [2, 4, 2, 2] 881s -1.236 0.746 881s -0.725 0.007 881s -0.402 0.599 881s 0.477 -0.365 881s testing: mul_mat, [1, 2, 1, 1] = [4, 1, 1, 1] * [4, 2, 1, 1] 881s x0: [4, 1, 1, 1] 881s 0.092 -0.687 -0.757 0.465 881s 881s x1: [4, 2, 1, 1] 881s -0.492 0.960 -0.167 -0.458 881s 0.140 0.599 0.907 0.024 881s 881s y: [1, 2, 1, 1] 881s -0.792 881s -1.075 881s testing: mul_mat, [3, 4, 2, 1] = [1, 3, 2, 1] * [1, 4, 2, 1] 881s x0: [1, 3, 2, 1] 881s -0.967 881s 0.793 881s -0.474 881s 881s x1: [1, 4, 2, 1] 881s 0.853 881s 0.373 881s 0.804 881s -0.276 881s 881s y: [3, 4, 2, 1] 881s -0.824 0.676 -0.404 881s -0.361 0.296 -0.177 881s -0.777 0.637 -0.381 881s 0.267 -0.219 0.131 881s testing: mul_mat, [2, 1, 2, 2] = [3, 2, 2, 2] * [3, 1, 2, 2] 881s x0: [3, 2, 2, 2] 881s 0.195 -0.582 -0.340 881s 0.287 0.048 -0.778 881s 881s x1: [3, 1, 2, 2] 881s -0.095 -0.242 0.576 881s 881s y: [2, 1, 2, 2] 881s -0.074 -0.487 881s test-mul-mat0: iter:498/500 881s testing: mul_mat, [1, 4, 1, 1] = [3, 1, 1, 1] * [3, 4, 1, 1] 881s x0: [3, 1, 1, 1] 881s -0.414 -0.033 -0.655 881s 881s x1: [3, 4, 1, 1] 881s 0.186 -0.942 -0.542 881s -0.619 0.345 -0.124 881s 0.429 -0.995 0.098 881s 0.201 0.671 -0.293 881s 881s y: [1, 4, 1, 1] 881s 0.309 881s 0.326 881s -0.209 881s 0.087 881s testing: mul_mat, [4, 1, 3, 1] = [3, 4, 3, 1] * [3, 1, 3, 1] 881s x0: [3, 4, 3, 1] 881s -0.623 0.326 -0.265 881s 0.391 -0.169 0.920 881s 0.450 0.289 -0.699 881s -0.205 -0.836 0.730 881s 881s x1: [3, 1, 3, 1] 881s 0.294 0.754 0.717 881s 881s y: [4, 1, 3, 1] 881s -0.128 0.647 -0.152 -0.167 881s testing: mulggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-forward.dot -o test-grad0-forward.dot.png && open test-grad0-forward.dot.png 881s ggml_graph_dump_dot: dot -Tpng test-grad0-backward.dot -o test-grad0-backward.dot.png && open test-grad0-backward.dot.png 881s _mat, [3, 4, 3, 3] = [3, 3, 3, 3] * [3, 4, 3, 3] 881s x0: [3, 3, 3, 3] 881s -0.168 0.395 -0.083 881s -0.996 -0.326 0.380 881s -0.553 0.716 -0.592 881s 881s x1: [3, 4, 3, 3] 881s 0.687 -0.947 0.872 881s 0.988 -0.152 -0.964 881s 0.718 0.648 0.299 881s -0.351 -0.881 0.269 881s 881s y: [3, 4, 3, 3] 881s -0.562 -0.045 -1.574 881s -0.147 -1.301 -0.085 881s 0.110 -0.813 -0.110 881s -0.311 0.738 -0.596 881s testing: mul_mat, [3, 3, 1, 1] = [3, 3, 1, 1] * [3, 3, 1, 1] 881s x0: [3, 3, 1, 1] 881s -0.494 -0.352 0.064 881s -0.305 0.945 0.394 881s -0.511 0.853 0.905 881s 881s x1: [3, 3, 1, 1] 881s -0.262 -0.188 -0.620 881s 0.001 0.626 0.736 881s -0.519 0.963 -0.735 881s 881s y: [3, 3, 1, 1] 881s 0.156 -0.343 -0.588 881s -0.174 0.882 1.199 881s -0.130 0.779 0.422 881s testing: mul_mat, [1, 3, 3, 1] = [3, 1, 3, 1] * [3, 3, 3, 1] 881s x0: [3, 1, 3, 1] 881s -0.603 0.406 0.957 881s 881s x1: [3, 3, 3, 1] 881s 0.469 0.451 -0.420 881s -0.207 -0.842 0.891 881s -0.644 -0.571 0.747 881s 881s y: [1, 3, 3, 1] 881s -0.502 881s 0.636 881s 0.872 881s testing: mul_mat, [1, 3, 3, 3] = [1, 1, 3, 3] * [1, 3, 3, 3] 881s x0: [1, 1, 3, 3] 881s 0.154 881s 881s x1: [1, 3, 3, 3] 881s 0.015 881s 0.445 881s -0.825 881s 881s y: [1, 3, 3, 3] 881s 0.002 881s 0.069 881s -0.127 881s test-mul-mat0: iter:499/500 881s testing: mul_mat, [1, 2, 1, 1] = [1, 1, 1, 1] * [1, 2, 1, 1] 881s x0: [1, 1, 1, 1] 881s 0.097 881s 881s x1: [1, 2, 1, 1] 881s -0.115 881s -0.391 881s 881s y: [1, 2, 1, 1] 881s -0.011 881s -0.038 881s testing: mul_mat, [4, 1, 4, 1] = [1, 4, 4, 1] * [1, 1, 4, 1] 881s x0: [1, 4, 4, 1] 881s -0.776 881s 0.796 881s 0.573 881s -0.148 881s 881s x1: [1, 1, 4, 1] 881s 0.623 881s 881s y: [4, 1, 4, 1] 881s -0.483 0.496 0.357 -0.092 881s testing: mul_mat, [4, 4, 4, 2] = [1, 4, 4, 2] * [1, 4, 4, 2] 881s x0: [1, 4, 4, 2] 881s -0.434 881s 0.259 881s 0.574 881s 0.023 881s 881s x1: [1, 4, 4, 2] 881s -0.122 881s 0.144 881s 0.287 881s 0.732 881s 881s y: [4, 4, 4, 2] 881s 0.053 -0.032 -0.070 -0.003 881s -0.062 0.037 0.082 0.003 881s -0.124 0.074 0.165 0.007 881s -0.318 0.190 0.420 0.017 881s testing: mul_mat, [3, 4, 1, 1] = [1, 3, 1, 1] * [1, 4, 1, 1] 881s x0: [1, 3, 1, 1] 881s 0.799 881s -0.840 881s -0.539 881s 881s x1: [1, 4, 1, 1] 881s -0.308 881s 0.534 881s -0.057 881s -0.431 881s 881s y: [3, 4, 1, 1] 881s -0.246 0.258 0.166 881s 0.426 -0.449 -0.288 881s -0.045 0.048 0.031 881s -0.344 0.362 0.232 881s testing: mul_mat, [4, 1, 4, 1] = [3, 4, 4, 1] * [3, 1, 4, 1] 881s x0: [3, 4, 4, 1] 881s 0.616 0.929 -0.493 881s -0.978 -0.453 -0.035 881s -0.949 0.044 -0.946 881s 0.691 0.362 -0.959 881s 881s x1: [3, 1, 4, 1] 881s 0.778 0.458 -0.860 881s 881s y: [4, 1, 4, 1] 881s 1.328 -0.937 0.095 1.527 881s testing: mul_mat, [4, 3, 4, 2] = [4, 4, 4, 2] * [4, 3, 4, 2] 881s x0: [4, 4, 4, 2] 881s -0.098 0.579 -0.110 0.879 881s -0.433 0.923 -0.015 0.200 881s 0.544 -0.770 0.666 0.494 881s -0.418 0.165 0.131 -0.611 881s 881s x1: [4, 3, 4, 2] 881s 0.187 -0.115 0.229 0.655 881s -0.346 -0.844 -0.215 0.320 881s 0.784 -0.766 0.291 -0.746 881s 881s y: [4, 3, 4, 2] 881s 0.466 -0.060 0.667 -0.468 881s -0.150 -0.562 0.476 -0.218 881s -1.208 -1.200 0.841 0.040 900s compression: 0.000000 900s convert time: 0.000000 ms / method = 0 900s -0.040188 0.785085 900s 0.405617 0.183192 900s 0.016901 0.564737 900s 0.001560 0.452003 900s -0.111647 -0.158779 900s 0.602449 0.452541 900s 0.464777 0.596264 900s 0.031770 0.389379 900s 0.522225 -0.094984 900s 0.246030 0.173205 900s 0.322603 -0.063522 900s 0.171129 0.240454 900s 0.435216 0.577889 900s 0.286599 -0.102425 900s -0.152230 0.570174 900s -0.116195 0.552043 900s main: elapsed ticks: 18549564 900s main: elapsed us: 18621791 / 18621.791000 ms 900s 135.421090 900s Testing 2 devices 900s 900s Backend 1/2: BLAS 900s Device description: BLAS 900s Device memory: 0 MB (0 MB free) 900s 900s test_dataset(shuffle=no, ndata_shard=1, ndata_batch=1): OK 900s test_dataset(shuffle=no, ndata_shard=1, ndata_batch=2): OK 900s test_dataset(shuffle=no, ndata_shard=1, ndata_batch=3): OK 900s test_dataset(shuffle=no, ndata_shard=1, ndata_batch=4): OK 900s test_dataset(shuffle=no, ndata_shard=1, ndata_batch=5): OK 900s test_dataset(shuffle=no, ndata_shard=1, ndata_batch=6): OK 900s test_dataset(shuffle=no, ndata_shard=2, ndata_batch=2): OK 900s test_dataset(shuffle=no, ndata_shard=2, ndata_batch=4): OK 900s test_dataset(shuffle=no, ndata_shard=2, ndata_batch=6): OK 900s test_dataset(shuffle=no, ndata_shard=3, ndata_batch=3): OK 900s test_dataset(shuffle=no, ndata_shard=3, ndata_batch=6): OK 900s test_dataset(shuffle=no, ndata_shard=4, ndata_batch=4): OK 900s test_dataset(shuffle=no, ndata_shard=5, ndata_batch=5): OK 900s test_dataset(shuffle=no, ndata_shard=6, ndata_batch=6): OK 900s test_dataset(shuffle=yes, ndata_shard=1, ndata_batch=1): OK 900s test_dataset(shuffle=yes, ndata_shard=1, ndata_batch=2): OK 900s test_dataset(shuffle=yes, ndata_shard=1, ndata_batch=3): OK 900s test_dataset(shuffle=yes, ndata_shard=1, ndata_batch=4): OK 900s test_dataset(shuffle=yes, ndata_shard=1, ndata_batch=5): OK 900s test_dataset(shuffle=yes, ndata_shard=1, ndata_batch=6): OK 900s test_dataset(shuffle=yes, ndata_shard=2, ndata_batch=2): OK 900s test_dataset(shuffle=yes, ndata_shard=2, ndata_batch=4): OK 900s test_dataset(shuffle=yes, ndata_shard=2, ndata_batch=6): OK 900s test_dataset(shuffle=yes, ndata_shard=3, ndata_batch=3): OK 900s test_dataset(shuffle=yes, ndata_shard=3, ndata_batch=6): OK 900s test_dataset(shuffle=yes, ndata_shard=4, ndata_batch=4): OK 900s test_dataset(shuffle=yes, ndata_shard=5, ndata_batch=5): OK 900s test_dataset(shuffle=yes, ndata_shard=6, ndata_batch=6): OK 900s test_grad(): OK 900s test_forward_backward(high_level=no, shuffle=no, subtest=results_initial): OK 900s test_forward_backward(high_level=no, shuffle=no, subtest=weights_after_forward): OK 900s test_forward_backward(high_level=no, shuffle=no, subtest=results_after_forward): OK 900s test_forward_backward(high_level=no, shuffle=no, subtest=weights_after_forward_backward): OK 900s test_forward_backward(high_level=no, shuffle=no, subtest=result_after_forward_backward): OK 900s test_forward_backward(high_level=yes, shuffle=no, subtest=results_initial): OK 900s test_forward_backward(high_level=yes, shuffle=no, subtest=weights_after_forward): OK 900s test_forward_backward(high_level=yes, shuffle=no, subtest=results_after_forward): OK 900s test_forward_backward(high_level=yes, shuffle=no, subtest=weights_after_forward_backward): OK 900s test_forward_backward(high_level=yes, shuffle=no, subtest=result_after_forward_backward): OK 900s test_forward_backward(high_level=yes, shuffle=yes, subtest=results_initial): OK 900s test_forward_backward(high_level=yes, shuffle=yes, subtest=weights_after_forward): OK 900s test_forward_backward(high_level=yes, shuffle=yes, subtest=results_after_forward): OK 900s test_forward_backward(high_level=yes, shuffle=yes, subtest=weights_after_forward_backward): OK 900s test_forward_backward(high_level=yes, shuffle=yes, subtest=result_after_forward_backward): OK 900s test_epoch_vs_fit(): OK 900s test_idata_split(high_level=no, epoch=1, subtest=weights): OK 900s test_idata_split(high_level=no, epoch=1, subtest=results_backward): OK 900s test_idata_split(high_level=no, epoch=1, subtest=results_forward): OK 900s test_idata_split(high_level=no, epoch=2, subtest=weights): OK 900s test_idata_split(high_level=no, epoch=2, subtest=results_backward): OK 900s test_idata_split(high_level=no, epoch=2, subtest=results_forward): OK 900s test_idata_split(high_level=no, epoch=3, subtest=weights): OK 900s test_idata_split(high_level=no, epoch=3, subtest=results_backward): OK 900s test_idata_split(high_level=no, epoch=3, subtest=results_forward): OK 900s test_idata_split(high_level=no, epoch=4, subtest=weights): OK 900s test_idata_split(high_level=no, epoch=4, subtest=results_backward): OK 900s test_idata_split(high_level=no, epoch=4, subtest=results_forward): OK 900s test_idata_split(high_level=yes, epoch=1, subtest=weights): OK 900s test_idata_split(high_level=yes, epoch=1, subtest=results_backward): OK 900s test_idata_split(high_level=yes, epoch=1, subtest=results_forward): OK 900s test_idata_split(high_level=yes, epoch=2, subtest=weights): OK 900s test_idata_split(high_level=yes, epoch=2, subtest=results_backward): OK 900s test_idata_split(high_level=yes, epoch=2, subtest=results_forward): OK 900s test_idata_split(high_level=yes, epoch=3, subtest=weights): OK 900s test_idata_split(high_level=yes, epoch=3, subtest=results_backward): OK 900s test_idata_split(high_level=yes, epoch=3, subtest=results_forward): OK 900s test_idata_split(high_level=yes, epoch=4, subtest=weights): OK 900s test_idata_split(high_level=yes, epoch=4, subtest=results_backward): OK 900s test_idata_split(high_level=yes, epoch=4, subtest=results_forward): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=sum, epoch=1, subtest=grads): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=sum, epoch=1, subtest=weights): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=sum, epoch=1, subtest=results): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=sum, epoch=2, subtest=grads): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=sum, epoch=2, subtest=weights): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=sum, epoch=2, subtest=results): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=sum, epoch=3, subtest=grads): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=sum, epoch=3, subtest=weights): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=sum, epoch=3, subtest=results): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=sum, epoch=4, subtest=grads): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=sum, epoch=4, subtest=weights): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=sum, epoch=4, subtest=results): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=mean, epoch=1, subtest=grads): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=mean, epoch=1, subtest=weights): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=mean, epoch=1, subtest=results): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=mean, epoch=2, subtest=grads): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=mean, epoch=2, subtest=weights): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=mean, epoch=2, subtest=results): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=mean, epoch=3, subtest=grads): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=mean, epoch=3, subtest=weights): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=mean, epoch=3, subtest=results): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=mean, epoch=4, subtest=grads): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=mean, epoch=4, subtest=weights): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=mean, epoch=4, subtest=results): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=sum, epoch=1, subtest=grads): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=sum, epoch=1, subtest=weights): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=sum, epoch=1, subtest=results): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=sum, epoch=2, subtest=grads): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=sum, epoch=2, subtest=weights): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=sum, epoch=2, subtest=results): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=sum, epoch=3, subtest=grads): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=sum, epoch=3, subtest=weights): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=sum, epoch=3, subtest=results): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=sum, epoch=4, subtest=grads): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=sum, epoch=4, subtest=weights): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=sum, epoch=4, subtest=results): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=mean, epoch=1, subtest=grads): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=mean, epoch=1, subtest=weights): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=mean, epoch=1, subtest=results): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=mean, epoch=2, subtest=grads): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=mean, epoch=2, subtest=weights): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=mean, epoch=2, subtest=results): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=mean, epoch=3, subtest=grads): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=mean, epoch=3, subtest=weights): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=mean, epoch=3, subtest=results): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=mean, epoch=4, subtest=grads): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=mean, epoch=4, subtest=weights): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=mean, epoch=4, subtest=results): OK 900s test_regression(subtest=weights): OK 900s 118/118 tests passed 900s Backend BLAS: OK 900s 900s Backend 2/2: CPU 900s Device description: CPU 900s Device memory: 3996 MB (3996 MB free) 900s 900s test_dataset(shuffle=no, ndata_shard=1, ndata_batch=1): OK 900s test_dataset(shuffle=no, ndata_shard=1, ndata_batch=2): OK 900s test_dataset(shuffle=no, ndata_shard=1, ndata_batch=3): OK 900s test_dataset(shuffle=no, ndata_shard=1, ndata_batch=4): OK 900s test_dataset(shuffle=no, ndata_shard=1, ndata_batch=5): OK 900s test_dataset(shuffle=no, ndata_shard=1, ndata_batch=6): OK 900s test_dataset(shuffle=no, ndata_shard=2, ndata_batch=2): OK 900s test_dataset(shuffle=no, ndata_shard=2, ndata_batch=4): OK 900s test_dataset(shuffle=no, ndata_shard=2, ndata_batch=6): OK 900s test_dataset(shuffle=no, ndata_shard=3, ndata_batch=3): OK 900s test_dataset(shuffle=no, ndata_shard=3, ndata_batch=6): OK 900s test_dataset(shuffle=no, ndata_shard=4, ndata_batch=4): OK 900s test_dataset(shuffle=no, ndata_shard=5, ndata_batch=5): OK 900s test_dataset(shuffle=no, ndata_shard=6, ndata_batch=6): OK 900s test_dataset(shuffle=yes, ndata_shard=1, ndata_batch=1): OK 900s test_dataset(shuffle=yes, ndata_shard=1, ndata_batch=2): OK 900s test_dataset(shuffle=yes, ndata_shard=1, ndata_batch=3): OK 900s test_dataset(shuffle=yes, ndata_shard=1, ndata_batch=4): OK 900s test_dataset(shuffle=yes, ndata_shard=1, ndata_batch=5): OK 900s test_dataset(shuffle=yes, ndata_shard=1, ndata_batch=6): OK 900s test_dataset(shuffle=yes, ndata_shard=2, ndata_batch=2): OK 900s test_dataset(shuffle=yes, ndata_shard=2, ndata_batch=4): OK 900s test_dataset(shuffle=yes, ndata_shard=2, ndata_batch=6): OK 900s test_dataset(shuffle=yes, ndata_shard=3, ndata_batch=3): OK 900s test_dataset(shuffle=yes, ndata_shard=3, ndata_batch=6): OK 900s test_dataset(shuffle=yes, ndata_shard=4, ndata_batch=4): OK 900s test_dataset(shuffle=yes, ndata_shard=5, ndata_batch=5): OK 900s test_dataset(shuffle=yes, ndata_shard=6, ndata_batch=6): OK 900s test_grad(): OK 900s test_forward_backward(high_level=no, shuffle=no, subtest=results_initial): OK 900s test_forward_backward(high_level=no, shuffle=no, subtest=weights_after_forward): OK 900s test_forward_backward(high_level=no, shuffle=no, subtest=results_after_forward): OK 900s test_forward_backward(high_level=no, shuffle=no, subtest=weights_after_forward_backward): OK 900s test_forward_backward(high_level=no, shuffle=no, subtest=result_after_forward_backward): OK 900s test_forward_backward(high_level=yes, shuffle=no, subtest=results_initial): OK 900s test_forward_backward(high_level=yes, shuffle=no, subtest=weights_after_forward): OK 900s test_forward_backward(high_level=yes, shuffle=no, subtest=results_after_forward): OK 900s test_forward_backward(high_level=yes, shuffle=no, subtest=weights_after_forward_backward): OK 900s test_forward_backward(high_level=yes, shuffle=no, subtest=result_after_forward_backward): OK 900s test_forward_backward(high_level=yes, shuffle=yes, subtest=results_initial): OK 900s test_forward_backward(high_level=yes, shuffle=yes, subtest=weights_after_forward): OK 900s test_forward_backward(high_level=yes, shuffle=yes, subtest=results_after_forward): OK 900s test_forward_backward(high_level=yes, shuffle=yes, subtest=weights_after_forward_backward): OK 900s test_forward_backward(high_level=yes, shuffle=yes, subtest=result_after_forward_backward): OK 900s test_epoch_vs_fit(): OK 900s test_idata_split(high_level=no, epoch=1, subtest=weights): OK 900s test_idata_split(high_level=no, epoch=1, subtest=results_backward): OK 900s test_idata_split(high_level=no, epoch=1, subtest=results_forward): OK 900s test_idata_split(high_level=no, epoch=2, subtest=weights): OK 900s test_idata_split(high_level=no, epoch=2, subtest=results_backward): OK 900s test_idata_split(high_level=no, epoch=2, subtest=results_forward): OK 900s test_idata_split(high_level=no, epoch=3, subtest=weights): OK 900s test_idata_split(high_level=no, epoch=3, subtest=results_backward): OK 900s test_idata_split(high_level=no, epoch=3, subtest=results_forward): OK 900s test_idata_split(high_level=no, epoch=4, subtest=weights): OK 900s test_idata_split(high_level=no, epoch=4, subtest=results_backward): OK 900s test_idata_split(high_level=no, epoch=4, subtest=results_forward): OK 900s test_idata_split(high_level=yes, epoch=1, subtest=weights): OK 900s test_idata_split(high_level=yes, epoch=1, subtest=results_backward): OK 900s test_idata_split(high_level=yes, epoch=1, subtest=results_forward): OK 900s test_idata_split(high_level=yes, epoch=2, subtest=weights): OK 900s test_idata_split(high_level=yes, epoch=2, subtest=results_backward): OK 900s test_idata_split(high_level=yes, epoch=2, subtest=results_forward): OK 900s test_idata_split(high_level=yes, epoch=3, subtest=weights): OK 900s test_idata_split(high_level=yes, epoch=3, subtest=results_backward): OK 900s test_idata_split(high_level=yes, epoch=3, subtest=results_forward): OK 900s test_idata_split(high_level=yes, epoch=4, subtest=weights): OK 900s test_idata_split(high_level=yes, epoch=4, subtest=results_backward): OK 900s test_idata_split(high_level=yes, epoch=4, subtest=results_forward): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=sum, epoch=1, subtest=grads): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=sum, epoch=1, subtest=weights): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=sum, epoch=1, subtest=results): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=sum, epoch=2, subtest=grads): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=sum, epoch=2, subtest=weights): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=sum, epoch=2, subtest=results): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=sum, epoch=3, subtest=grads): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=sum, epoch=3, subtest=weights): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=sum, epoch=3, subtest=results): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=sum, epoch=4, subtest=grads): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=sum, epoch=4, subtest=weights): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=sum, epoch=4, subtest=results): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=mean, epoch=1, subtest=grads): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=mean, epoch=1, subtest=weights): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=mean, epoch=1, subtest=results): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=mean, epoch=2, subtest=grads): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=mean, epoch=2, subtest=weights): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=mean, epoch=2, subtest=results): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=mean, epoch=3, subtest=grads): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=mean, epoch=3, subtest=weights): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=mean, epoch=3, subtest=results): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=mean, epoch=4, subtest=grads): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=mean, epoch=4, subtest=weights): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=2, loss_type=mean, epoch=4, subtest=results): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=sum, epoch=1, subtest=grads): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=sum, epoch=1, subtest=weights): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=sum, epoch=1, subtest=results): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=sum, epoch=2, subtest=grads): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=sum, epoch=2, subtest=weights): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=sum, epoch=2, subtest=results): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=sum, epoch=3, subtest=grads): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=sum, epoch=3, subtest=weights): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=sum, epoch=3, subtest=results): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=sum, epoch=4, subtest=grads): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=sum, epoch=4, subtest=weights): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=sum, epoch=4, subtest=results): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=mean, epoch=1, subtest=grads): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=mean, epoch=1, subtest=weights): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=mean, epoch=1, subtest=results): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=mean, epoch=2, subtest=grads): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=mean, epoch=2, subtest=weights): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=mean, epoch=2, subtest=results): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=mean, epoch=3, subtest=grads): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=mean, epoch=3, subtest=weights): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=mean, epoch=3, subtest=results): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=mean, epoch=4, subtest=grads): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=mean, epoch=4, subtest=weights): OK 900s test_gradient_accumulation(high_level=no, nbatch_physical=1, loss_type=mean, epoch=4, subtest=results): OK 900s test_regression(subtest=weights): OK 900s 118/118 tests passed 900s Backend CPU: OK 900s 900s 2/2 backends passed 900s OK 900s test_pad_reflect_1d: using CPU backend 1010s Testing f32 1010s Testing f16 1010s Testing q4_0 1010s Testing q4_1 1010s Testing q5_0 1010s Testing q5_1 1010s Testing q8_0 1010s q8_0 reference implementation error: FAILED (0.000175) 1010s Testing q8_1 1010s Testing q2_K 1010s Testing q3_K 1010s Testing q4_K 1010s Testing q5_K 1010s Testing q6_K 1010s Testing q8_K 1010s Testing iq2_xxs 1010s Testing iq2_xs 1010s Testing iq3_xxs 1010s Testing iq1_s 1010s Testing iq4_nl 1010s Testing iq3_s 1010s Testing iq2_s 1010s Testing iq4_xs 1010s Testing i8 1010s Testing i16 1010s Testing i32 1010s Testing i64 1010s Testing f64 1010s Testing iq1_m 1010s Testing bf16 1010s Testing tq1_0 1010s Testing tq2_0 1010s 1 tests failed 1010s f16 1010s quantize_row_q_reference 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 0.32 GB/s 1010s quantized throughput : 0.16 GB/s 1010s 1010s quantize_row_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 0.31 GB/s 1010s quantized throughput : 0.16 GB/s 1010s 1010s dequantize_row_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 6.94 GB/s 1010s quantized throughput : 3.47 GB/s 1010s 1010s quantize_row_q_dot 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 0.31 GB/s 1010s quantized throughput : 0.16 GB/s 1010s 1010s vec_dot_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 1.82 GB/s 1010s quantized throughput : 0.91 GB/s 1010s 1010s q4_0 1010s quantize_row_q_reference 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 1.08 GB/s 1010s quantized throughput : 0.15 GB/s 1010s 1010s quantize_row_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 1.07 GB/s 1010s quantized throughput : 0.15 GB/s 1010s 1010s dequantize_row_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 2.46 GB/s 1010s quantized throughput : 0.35 GB/s 1010s 1010s quantize_row_q_dot 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 1.86 GB/s 1010s quantized throughput : 0.26 GB/s 1010s 1010s vec_dot_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 30.52 GB/s 1010s quantized throughput : 4.29 GB/s 1010s 1010s q4_1 1010s quantize_row_q_reference 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 0.79 GB/s 1010s quantized throughput : 0.12 GB/s 1010s 1010s quantize_row_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 0.83 GB/s 1010s quantized throughput : 0.13 GB/s 1010s 1010s dequantize_row_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 3.63 GB/s 1010s quantized throughput : 0.57 GB/s 1010s 1010s quantize_row_q_dot 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 1.98 GB/s 1010s quantized throughput : 0.31 GB/s 1010s 1010s vec_dot_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 30.52 GB/s 1010s quantized throughput : 4.77 GB/s 1010s 1010s q5_0 1010s quantize_row_q_reference 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 0.98 GB/s 1010s quantized throughput : 0.17 GB/s 1010s 1010s quantize_row_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 1.12 GB/s 1010s quantized throughput : 0.19 GB/s 1010s 1010s dequantize_row_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 2.42 GB/s 1010s quantized throughput : 0.42 GB/s 1010s 1010s quantize_row_q_dot 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 1.84 GB/s 1010s quantized throughput : 0.32 GB/s 1010s 1010s vec_dot_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 2.01 GB/s 1010s quantized throughput : 0.35 GB/s 1010s 1010s q5_1 1010s quantize_row_q_reference 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 0.68 GB/s 1010s quantized throughput : 0.13 GB/s 1010s 1010s quantize_row_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 0.78 GB/s 1010s quantized throughput : 0.15 GB/s 1010s 1010s dequantize_row_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 3.32 GB/s 1010s quantized throughput : 0.62 GB/s 1010s 1010s quantize_row_q_dot 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 2.38 GB/s 1010s quantized throughput : 0.45 GB/s 1010s 1010s vec_dot_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 2.42 GB/s 1010s quantized throughput : 0.45 GB/s 1010s 1010s q8_0 1010s quantize_row_q_reference 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 1.15 GB/s 1010s quantized throughput : 0.30 GB/s 1010s 1010s quantize_row_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 1.86 GB/s 1010s quantized throughput : 0.49 GB/s 1010s 1010s dequantize_row_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 4.92 GB/s 1010s quantized throughput : 1.31 GB/s 1010s 1010s quantize_row_q_dot 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 1.86 GB/s 1010s quantized throughput : 0.49 GB/s 1010s 1010s vec_dot_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 38.15 GB/s 1010s quantized throughput : 10.13 GB/s 1010s 1010s q2_K 1010s quantize_row_q_reference 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 0.02 GB/s 1010s quantized throughput : 0.00 GB/s 1010s 1010s quantize_row_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 0.02 GB/s 1010s quantized throughput : 0.00 GB/s 1010s 1010s dequantize_row_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 3.11 GB/s 1010s quantized throughput : 0.26 GB/s 1010s 1010s quantize_row_q_dot 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 1.66 GB/s 1010s quantized throughput : 0.14 GB/s 1010s 1010s vec_dot_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 2.72 GB/s 1010s quantized throughput : 0.22 GB/s 1010s 1010s q3_K 1010s quantize_row_q_reference 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 0.27 GB/s 1010s quantized throughput : 0.03 GB/s 1010s 1010s quantize_row_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 0.27 GB/s 1010s quantized throughput : 0.03 GB/s 1010s 1010s dequantize_row_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 1.19 GB/s 1010s quantized throughput : 0.13 GB/s 1010s 1010s quantize_row_q_dot 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 1.66 GB/s 1010s quantized throughput : 0.18 GB/s 1010s 1010s vec_dot_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 15.26 GB/s 1010s quantized throughput : 1.64 GB/s 1010s 1010s q4_K 1010s quantize_row_q_reference 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 0.01 GB/s 1010s quantized throughput : 0.00 GB/s 1010s 1010s quantize_row_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 0.01 GB/s 1010s quantized throughput : 0.00 GB/s 1010s 1010s dequantize_row_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 3.25 GB/s 1010s quantized throughput : 0.46 GB/s 1010s 1010s quantize_row_q_dot 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 1.70 GB/s 1010s quantized throughput : 0.24 GB/s 1010s 1010s vec_dot_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 30.52 GB/s 1010s quantized throughput : 4.29 GB/s 1010s 1010s q5_K 1010s quantize_row_q_reference 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 0.02 GB/s 1010s quantized throughput : 0.00 GB/s 1010s 1010s quantize_row_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 0.02 GB/s 1010s quantized throughput : 0.00 GB/s 1010s 1010s dequantize_row_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 1.62 GB/s 1010s quantized throughput : 0.28 GB/s 1010s 1010s quantize_row_q_dot 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 1.66 GB/s 1010s quantized throughput : 0.29 GB/s 1010s 1010s vec_dot_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 21.80 GB/s 1010s quantized throughput : 3.75 GB/s 1010s 1010s q6_K 1010s quantize_row_q_reference 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 0.03 GB/s 1010s quantized throughput : 0.01 GB/s 1010s 1010s quantize_row_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 0.04 GB/s 1010s quantized throughput : 0.01 GB/s 1010s 1010s dequantize_row_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 2.54 GB/s 1010s quantized throughput : 0.52 GB/s 1010s 1010s quantize_row_q_dot 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 1.64 GB/s 1010s quantized throughput : 0.34 GB/s 1010s 1010s vec_dot_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 16.95 GB/s 1010s quantized throughput : 3.48 GB/s 1010s 1010s iq4_nl 1010s quantize_row_q_reference 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 0.10 GB/s 1010s quantized throughput : 0.01 GB/s 1010s 1010s quantize_row_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 0.10 GB/s 1010s quantized throughput : 0.01 GB/s 1010s 1010s dequantize_row_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 2.42 GB/s 1010s quantized throughput : 0.34 GB/s 1010s 1010s quantize_row_q_dot 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 1.66 GB/s 1010s quantized throughput : 0.23 GB/s 1010s 1010s vec_dot_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 25.43 GB/s 1010s quantized throughput : 3.58 GB/s 1010s 1010s iq4_xs 1010s quantize_row_q_reference 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 0.00 GB/s 1010s quantized throughput : 0.00 GB/s 1010s 1010s quantize_row_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 0.00 GB/s 1010s quantized throughput : 0.00 GB/s 1010s 1010s dequantize_row_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 2.06 GB/s 1010s quantized throughput : 0.27 GB/s 1010s 1010s quantize_row_q_dot 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 1.22 GB/s 1010s quantized throughput : 0.16 GB/s 1010s 1010s vec_dot_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 30.52 GB/s 1010s quantized throughput : 4.05 GB/s 1010s 1010s bf16 1010s quantize_row_q_reference 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 2.93 GB/s 1010s quantized throughput : 1.47 GB/s 1010s 1010s quantize_row_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 2.99 GB/s 1010s quantized throughput : 1.50 GB/s 1010s 1010s dequantize_row_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 9.54 GB/s 1010s quantized throughput : 4.77 GB/s 1010s 1010s quantize_row_q_dot 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 2.93 GB/s 1010s quantized throughput : 1.47 GB/s 1010s 1010s vec_dot_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 2.42 GB/s 1010s quantized throughput : 1.21 GB/s 1010s 1010s tq1_0 1010s quantize_row_q_reference 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 0.86 GB/s 1010s quantized throughput : 0.05 GB/s 1010s 1010s quantize_row_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 0.89 GB/s 1010s quantized throughput : 0.05 GB/s 1010s 1010s dequantize_row_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 1.26 GB/s 1010s quantized throughput : 0.07 GB/s 1010s 1010s quantize_row_q_dot 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 1.54 GB/s 1010s quantized throughput : 0.08 GB/s 1010s 1010s vec_dot_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 3.81 GB/s 1010s quantized throughput : 0.20 GB/s 1010s 1010s tq2_0 1010s quantize_row_q_reference 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 0.90 GB/s 1010s quantized throughput : 0.06 GB/s 1010s 1010s quantize_row_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 0.90 GB/s 1010s quantized throughput : 0.06 GB/s 1010s 1010s dequantize_row_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 2.35 GB/s 1010s quantized throughput : 0.15 GB/s 1010s 1010s quantize_row_q_dot 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 1.66 GB/s 1010s quantized throughput : 0.11 GB/s 1010s 1010s vec_dot_q 1010s 4096 values (0.02 MB) 1010s min cycles/32 vals : 0.00 1010s avg cycles/32 vals : 0.00 1010s float32 throughput : 15.26 GB/s 1010s quantized throughput : 0.98 GB/s 1010s 1010s ggml_roll(3(1), 7(0), 4(-1), 2(0), contiguous): PASSED 1010s ggml_roll(37(-4), 42(3), 59(-7), 2(1), contiguous): PASSED 1010s ggml_roll(37(-4), 42(3), 59(-7), 2(1), permuted): PASSED 1010s 0.8439 -0.9970 0.6497 0.9733 0.9981 0.9999 1.0000 -0.5366 -0.0776 0.7602 0.2296 0.0621 0.0167 0.0045 0.0000 0.0000 0.4242 0.9880 -0.1558 0.8946 0.9923 0.9994 1.0000 -0.9056 0.1547 0.9878 0.4470 0.1240 0.0333 0.0089 0.0000 0.0000 1010s ----------------------------------- 1010s 0.8439 -0.9970 0.6497 0.9733 0.9981 0.9999 1.0000 -0.5366 -0.0776 0.7602 0.2296 0.0621 0.0167 0.0045 0.0000 0.0000 0.4242 0.9880 -0.1558 0.8946 0.9923 0.9994 1.0000 -0.9056 0.1547 0.9878 0.4470 0.1240 0.0333 0.0089 0.0000 0.0000 1035s main: elapsed ticks: 22480721 1035s 2951358969841371840512.000000 1035s ggml_print_objects: objects in context 0x2aa0a524130: 1035s - ggml_object: type = 0, offset = 32, size = 384, next = 0x3ffa217f2a0 1035s - ggml_object: type = 0, offset = 448, size = 736, next = 0x3ffa217f5a0 1035s - ggml_object: type = 0, offset = 1216, size = 24336, next = (nil) 1035s ggml_print_objects: --- end --- 1036s autopkgtest [22:17:26]: test libggml-cpu: -----------------------] 1036s libggml-cpu FAIL non-zero exit status 1 1036s autopkgtest [22:17:26]: test libggml-cpu: - - - - - - - - - - results - - - - - - - - - - 1037s autopkgtest [22:17:27]: @@@@@@@@@@@@@@@@@@@@ summary 1037s libggml-hip SKIP Test lists explicitly supported architectures, but the current architecture s390x isn't listed. 1037s libggml-hip SKIP Test lists explicitly supported architectures, but the current architecture s390x isn't listed. 1037s libggml-cpu FAIL non-zero exit status 1 1037s libggml-cpu FAIL non-zero exit status 1