options

Executable Output


* Info: Selecting the 'perf-low-ppn' engine for node ip-172-31-87-179.ec2.internal

* Info: "ref-cycles" not supported on ip-172-31-87-179.ec2.internal: fallback to "cpu-clock"
* Info: Process launched (host ip-172-31-87-179.ec2.internal, process 42888)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 235.23940 +- 0.000001. Correct Result: 235.239400

Configuration              
Number of Threads:         1
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     379.377
Minimum kernel time:       0.00374007
Maximum kernel time:       0.004215
Arithm. Mean kernel time:  0.00379369

Performance results        
Total GFlops/s:            3.82034
Minimum GFlops/s:          3.43855
Maximum GFlops/s:          3.87519
Arithm. Mean GFlops/s:     3.82042


* Info: Process finished (host ip-172-31-87-179.ec2.internal, process 42888)
* Info: Dumping samples (host ip-172-31-87-179.ec2.internal, process 42888)
* Info: Dumping source info for callchain nodes (host ip-172-31-87-179.ec2.internal, process 42888)
* Info: Building/writing metadata (host ip-172-31-87-179.ec2.internal)
* Info: Finished collect step (host ip-172-31-87-179.ec2.internal, process 42888)

Your experiment path is /home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_0

To display your profiling results:
#########################################################################################################################################################
#    LEVEL    |     REPORT     |                                                        COMMAND                                                         #
#########################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_0      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_0  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_0  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_0  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_0      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_0  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_0  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_0  #
#########################################################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node ip-172-31-87-179.ec2.internal

* Info: "ref-cycles" not supported on ip-172-31-87-179.ec2.internal: fallback to "cpu-clock"
* Info: Process launched (host ip-172-31-87-179.ec2.internal, process 43199)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 234.17861 +- 0.000001. Correct Result: 234.178609

Configuration              
Number of Threads:         2
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     261.908
Minimum kernel time:       0.00257206
Maximum kernel time:       0.00786591
Arithm. Mean kernel time:  0.00261898

Performance results        
Total GFlops/s:            5.53382
Minimum GFlops/s:          1.84257
Maximum GFlops/s:          5.63498
Arithm. Mean GFlops/s:     5.53403


* Info: Process finished (host ip-172-31-87-179.ec2.internal, process 43199)
* Info: Dumping samples (host ip-172-31-87-179.ec2.internal, process 43199)
* Info: Dumping source info for callchain nodes (host ip-172-31-87-179.ec2.internal, process 43199)
* Info: Building/writing metadata (host ip-172-31-87-179.ec2.internal)
* Info: Finished collect step (host ip-172-31-87-179.ec2.internal, process 43199)

Your experiment path is /home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_1

To display your profiling results:
#########################################################################################################################################################
#    LEVEL    |     REPORT     |                                                        COMMAND                                                         #
#########################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_1      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_1  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_1  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_1  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_1      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_1  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_1  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_1  #
#########################################################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node ip-172-31-87-179.ec2.internal

* Info: "ref-cycles" not supported on ip-172-31-87-179.ec2.internal: fallback to "cpu-clock"
* Info: Process launched (host ip-172-31-87-179.ec2.internal, process 43411)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 234.63569 +- 0.000001. Correct Result: 234.635686

Configuration              
Number of Threads:         4
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     154.549
Minimum kernel time:       0.00152206
Maximum kernel time:       0.00674319
Arithm. Mean kernel time:  0.0015454

Performance results        
Total GFlops/s:            9.37792
Minimum GFlops/s:          2.14935
Maximum GFlops/s:          9.52226
Arithm. Mean GFlops/s:     9.3785


* Info: Process finished (host ip-172-31-87-179.ec2.internal, process 43411)
* Info: Dumping samples (host ip-172-31-87-179.ec2.internal, process 43411)
* Info: Dumping source info for callchain nodes (host ip-172-31-87-179.ec2.internal, process 43411)
* Info: Building/writing metadata (host ip-172-31-87-179.ec2.internal)
* Info: Finished collect step (host ip-172-31-87-179.ec2.internal, process 43411)

Your experiment path is /home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_2

To display your profiling results:
#########################################################################################################################################################
#    LEVEL    |     REPORT     |                                                        COMMAND                                                         #
#########################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_2      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_2  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_2  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_2  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_2      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_2  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_2  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_2  #
#########################################################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node ip-172-31-87-179.ec2.internal

* Info: "ref-cycles" not supported on ip-172-31-87-179.ec2.internal: fallback to "cpu-clock"
* Info: Process launched (host ip-172-31-87-179.ec2.internal, process 43625)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 234.08034 +- 0.000001. Correct Result: 234.080339

Configuration              
Number of Threads:         8
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     108.987
Minimum kernel time:       0.00107002
Maximum kernel time:       0.00622797
Arithm. Mean kernel time:  0.00108982

Performance results        
Total GFlops/s:            13.2983
Minimum GFlops/s:          2.32716
Maximum GFlops/s:          13.545
Arithm. Mean GFlops/s:     13.299


* Info: Process finished (host ip-172-31-87-179.ec2.internal, process 43625)
* Info: Dumping samples (host ip-172-31-87-179.ec2.internal, process 43625)
* Info: Dumping source info for callchain nodes (host ip-172-31-87-179.ec2.internal, process 43625)
* Info: Building/writing metadata (host ip-172-31-87-179.ec2.internal)
* Info: Finished collect step (host ip-172-31-87-179.ec2.internal, process 43625)

Your experiment path is /home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_3

To display your profiling results:
#########################################################################################################################################################
#    LEVEL    |     REPORT     |                                                        COMMAND                                                         #
#########################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_3      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_3  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_3  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_3  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_3      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_3  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_3  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_3  #
#########################################################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node ip-172-31-87-179.ec2.internal

* Info: "ref-cycles" not supported on ip-172-31-87-179.ec2.internal: fallback to "cpu-clock"
* Info: Process launched (host ip-172-31-87-179.ec2.internal, process 43791)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 234.12437 +- 0.000001. Correct Result: 234.124373

Configuration              
Number of Threads:         16
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     58.4768
Minimum kernel time:       0.000573874
Maximum kernel time:       0.00577593
Arithm. Mean kernel time:  0.000584684

Performance results        
Total GFlops/s:            24.785
Minimum GFlops/s:          2.50929
Maximum GFlops/s:          25.2556
Arithm. Mean GFlops/s:     24.7886


* Info: Process finished (host ip-172-31-87-179.ec2.internal, process 43791)
* Info: Dumping samples (host ip-172-31-87-179.ec2.internal, process 43791)
* Info: Dumping source info for callchain nodes (host ip-172-31-87-179.ec2.internal, process 43791)
* Info: Building/writing metadata (host ip-172-31-87-179.ec2.internal)
* Info: Finished collect step (host ip-172-31-87-179.ec2.internal, process 43791)

Your experiment path is /home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_4

To display your profiling results:
#########################################################################################################################################################
#    LEVEL    |     REPORT     |                                                        COMMAND                                                         #
#########################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_4      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_4  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_4  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_4  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_4      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_4  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_4  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_4  #
#########################################################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node ip-172-31-87-179.ec2.internal

* Info: "ref-cycles" not supported on ip-172-31-87-179.ec2.internal: fallback to "cpu-clock"
* Info: Process launched (host ip-172-31-87-179.ec2.internal, process 43967)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 232.96054 +- 0.000001. Correct Result: 232.960541

Configuration              
Number of Threads:         32
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     31.5587
Minimum kernel time:       0.000304937
Maximum kernel time:       0.00183296
Arithm. Mean kernel time:  0.000315535

Performance results        
Total GFlops/s:            45.9255
Minimum GFlops/s:          7.90714
Maximum GFlops/s:          47.5294
Arithm. Mean GFlops/s:     45.933


* Info: Process finished (host ip-172-31-87-179.ec2.internal, process 43967)
* Info: Dumping samples (host ip-172-31-87-179.ec2.internal, process 43967)
* Info: Dumping source info for callchain nodes (host ip-172-31-87-179.ec2.internal, process 43967)
* Info: Building/writing metadata (host ip-172-31-87-179.ec2.internal)
* Info: Finished collect step (host ip-172-31-87-179.ec2.internal, process 43967)

Your experiment path is /home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_5

To display your profiling results:
#########################################################################################################################################################
#    LEVEL    |     REPORT     |                                                        COMMAND                                                         #
#########################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_5      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_5  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_5  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_5  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_5      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_5  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_5  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_5  #
#########################################################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node ip-172-31-87-179.ec2.internal

* Info: "ref-cycles" not supported on ip-172-31-87-179.ec2.internal: fallback to "cpu-clock"
* Info: Process launched (host ip-172-31-87-179.ec2.internal, process 44108)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 234.58342 +- 0.000001. Correct Result: 234.583420

Configuration              
Number of Threads:         64
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     18.0294
Minimum kernel time:       0.000163794
Maximum kernel time:       0.0055151
Arithm. Mean kernel time:  0.000180255

Performance results        
Total GFlops/s:            80.388
Minimum GFlops/s:          2.62797
Maximum GFlops/s:          88.4863
Arithm. Mean GFlops/s:     80.4054


* Info: Process finished (host ip-172-31-87-179.ec2.internal, process 44108)
* Info: Dumping samples (host ip-172-31-87-179.ec2.internal, process 44108)
* Info: Dumping source info for callchain nodes (host ip-172-31-87-179.ec2.internal, process 44108)
* Info: Building/writing metadata (host ip-172-31-87-179.ec2.internal)
* Info: Finished collect step (host ip-172-31-87-179.ec2.internal, process 44108)

Your experiment path is /home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_6

To display your profiling results:
#########################################################################################################################################################
#    LEVEL    |     REPORT     |                                                        COMMAND                                                         #
#########################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_6      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_6  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_6  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_6  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_6      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_6  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_6  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_6  #
#########################################################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node ip-172-31-87-179.ec2.internal

* Info: "ref-cycles" not supported on ip-172-31-87-179.ec2.internal: fallback to "cpu-clock"
* Info: Process launched (host ip-172-31-87-179.ec2.internal, process 44281)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 235.15201 +- 0.000001. Correct Result: 235.152008

Configuration              
Number of Threads:         96
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     7.22814
Minimum kernel time:       6.58035e-05
Maximum kernel time:       0.00231791
Arithm. Mean kernel time:  7.22431e-05

Performance results        
Total GFlops/s:            200.515
Minimum GFlops/s:          6.25284
Maximum GFlops/s:          220.254
Arithm. Mean GFlops/s:     200.621


* Info: Process finished (host ip-172-31-87-179.ec2.internal, process 44281)
* Info: Dumping samples (host ip-172-31-87-179.ec2.internal, process 44281)
* Info: Dumping source info for callchain nodes (host ip-172-31-87-179.ec2.internal, process 44281)
* Info: Building/writing metadata (host ip-172-31-87-179.ec2.internal)
* Info: Finished collect step (host ip-172-31-87-179.ec2.internal, process 44281)

Your experiment path is /home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_7

To display your profiling results:
#########################################################################################################################################################
#    LEVEL    |     REPORT     |                                                        COMMAND                                                         #
#########################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_7      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_7  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_7  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_7  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_7      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_7  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_7  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/hbollore/pop3/epi-spmxv-main/spmxv_large_g4_armclang_ofast_armpl/tools/lprof_npsu_run_7  #
#########################################################################################################################################################

×