wiki:Satch

On satch for release 1479, 4 cpus for a 1283

max number of allocated scalars: 39
 ---------------------------------------------
   4 Processor run, global<128,128,128>
   for 100 time steps
main timer : (real:0:3:59,user:0:2:57,sys:0:0:51)[clk:23919]
	loop : (real:0:1:59,user:0:1:43,sys:0:0:8)[clk:11908]	(100 calls X 1.190800e+02)
	FFT timer : (real:0:3:2,user:0:2:12,sys:0:0:43)[clk:18238]
		FFT and transposition time : (real:0:3:2,user:0:2:12,sys:0:0:43)[clk:18238]	(512 calls X 3.562109e+01)
			FFT only : (real:0:0:29,user:0:0:28,sys:0:0:0)[clk:2977]	(2560 calls X 1.162891e+00)
		planification time : (real:0:1:55,user:0:1:9,sys:0:0:42)[clk:11504]
	azur::array timer root : (real:0:0:34,user:0:0:30,sys:0:0:2)[clk:3469]
		view = expr : (real:0:0:34,user:0:0:30,sys:0:0:2)[clk:3469]	(2380 calls X 1.457563e+00)
			basic3<flt> id= (basic3<int> + flt) : (real:0:0:0,user:0:0:0,sys:0:0:0)[clk:1]	(2 calls X 5.000000e-01)
			fftw4<cdbl> id= cdbl : (real:0:0:1,user:0:0:1,sys:0:0:0)[clk:177]	(106 calls X 1.669811e+00)
			fftw4<dbl> id= dbl : (real:0:0:0,user:0:0:0,sys:0:0:0)[clk:5]	(4 calls X 1.250000e+00)
			fftw3<dbl> id= fftw3<dbl> : (real:0:0:0,user:0:0:0,sys:0:0:0)[clk:6]	(9 calls X 6.666667e-01)
			fftw3<dbl> *= dbl : (real:0:0:0,user:0:0:0,sys:0:0:0)[clk:3]	(6 calls X 5.000000e-01)
			fftw3<dbl> id= dbl : (real:0:0:2,user:0:0:0,sys:0:0:2)[clk:281]	(516 calls X 5.445736e-01)
			fftw3<dbl> /= dbl : (real:0:0:3,user:0:0:3,sys:0:0:0)[clk:329]	(618 calls X 5.323625e-01)
			fftw4<cdbl> id= fftw4<cdbl> : (real:0:0:9,user:0:0:8,sys:0:0:0)[clk:912]	(406 calls X 2.246305e+00)
			fftw4<dbl> id= fftw4<dbl> : (real:0:0:4,user:0:0:4,sys:0:0:0)[clk:476]	(205 calls X 2.321951e+00)
			fftw4<cdbl> id= (s2v<basic3<dbl>> * ((fftw4<cdbl> * dbl) + fftw4<cdbl>)) : (real:0:0:0,user:0:0:0,sys:0:0:0)[clk:5]	(2 calls X 2.500000e+00)
			fftw4<cdbl> id= (s2v<basic3<dbl>> * (fftw4<cdbl> * dbl)) : (real:0:0:0,user:0:0:0,sys:0:0:0)[clk:6]	(2 calls X 3.000000e+00)
			fftw4<cdbl> *= s2v<basic3<dbl>> : (real:0:0:0,user:0:0:0,sys:0:0:0)[clk:4]	(2 calls X 2.000000e+00)
			fftw4<cdbl> += fftw4<cdbl> : (real:0:0:0,user:0:0:0,sys:0:0:0)[clk:4]	(2 calls X 2.000000e+00)
			fftw4<cdbl> id= ((fftw4<cdbl> * dbl) * s2v<basic3<dbl>>) : (real:0:0:2,user:0:0:2,sys:0:0:0)[clk:274]	(100 calls X 2.740000e+00)
			fftw4<cdbl> -= fftw4<cdbl> : (real:0:0:2,user:0:0:2,sys:0:0:0)[clk:228]	(100 calls X 2.280000e+00)
			fftw4<cdbl> -= ((fftw4<cdbl> * dbl) * s2v<basic3<dbl>>) : (real:0:0:2,user:0:0:2,sys:0:0:0)[clk:250]	(100 calls X 2.500000e+00)
			fftw4<cdbl> id= (s2v<basic3<dbl>> swp(*) (fftw4<cdbl> + (fftw4<cdbl> * dbl))) : (real:0:0:5,user:0:0:4,sys:0:0:0)[clk:508]	(200 calls X 2.540000e+00)
	cubby::field timer root : (real:0:0:17,user:0:0:16,sys:0:0:0)[clk:1705]
		vector::in_place_curl : (real:0:0:3,user:0:0:2,sys:0:0:0)[clk:319]	(204 calls X 1.563725e+00)
		vector::vec_prod : (real:0:0:4,user:0:0:4,sys:0:0:0)[clk:494]	(204 calls X 2.421569e+00)
		vector::project : (real:0:0:1,user:0:0:1,sys:0:0:0)[clk:190]	(102 calls X 1.862745e+00)
		scalar::dealias : (real:0:0:6,user:0:0:6,sys:0:0:0)[clk:664]	(612 calls X 1.084967e+00)
		scalar::local_energy : (real:0:0:0,user:0:0:0,sys:0:0:0)[clk:37]	(66 calls X 5.606061e-01)

On satch for release 1467, 4 cpus for a 1283

Not sure about the version number

****************************************************
max number of allocated scalars: 39
 ---------------------------------------------
   4 Processor run, global<128,128,128>
   for 100 time steps
main timer : (real:0:3:55,user:0:3:0,sys:0:0:52)[clk:23566]
	loop : (real:0:1:55,user:0:1:46,sys:0:0:7)[clk:11509]	(100 calls X 1.150900e+02)
	FFT timer : (real:0:2:59,user:0:2:13,sys:0:0:44)[clk:17989]
		FFT and transposition time : (real:0:2:59,user:0:2:13,sys:0:0:44)[clk:17989]	(512 calls X 3.513477e+01)
			FFT only : (real:0:0:28,user:0:0:28,sys:0:0:0)[clk:2825]	(2560 calls X 1.103516e+00)
		planification time : (real:0:1:57,user:0:1:11,sys:0:0:44)[clk:11753]
	azur::array timer root : (real:0:0:34,user:0:0:31,sys:0:0:2)[clk:3448]
		view = expr : (real:0:0:34,user:0:0:31,sys:0:0:2)[clk:3447]	(2378 calls X 1.449537e+00)
			V4 = R : (real:0:0:1,user:0:0:1,sys:0:0:0)[clk:166]	(106 calls X 1.566038e+00)
			V4 = R : (real:0:0:0,user:0:0:0,sys:0:0:0)[clk:5]	(4 calls X 1.250000e+00)
			V3 = V3 : (real:0:0:0,user:0:0:0,sys:0:0:0)[clk:8]	(9 calls X 8.888889e-01)
			V3 x= R : (real:0:0:0,user:0:0:0,sys:0:0:0)[clk:2]	(6 calls X 3.333333e-01)
			V3 = R : (real:0:0:2,user:0:0:0,sys:0:0:2)[clk:274]	(516 calls X 5.310078e-01)
			V3 x= R : (real:0:0:3,user:0:0:3,sys:0:0:0)[clk:305]	(618 calls X 4.935275e-01)
			V4 = V4 : (real:0:0:9,user:0:0:9,sys:0:0:0)[clk:936]	(406 calls X 2.305419e+00)
			V4 = V4 : (real:0:0:4,user:0:0:4,sys:0:0:0)[clk:456]	(205 calls X 2.224390e+00)
			V4 x= V4 x ( V4 x V4 ) : (real:0:0:0,user:0:0:0,sys:0:0:0)[clk:6]	(2 calls X 3.000000e+00)
			V4 x= V4 x V4 : (real:0:0:0,user:0:0:0,sys:0:0:0)[clk:7]	(2 calls X 3.500000e+00)
			V4 x= V4 : (real:0:0:0,user:0:0:0,sys:0:0:0)[clk:3]	(2 calls X 1.500000e+00)
			V4 x= V4 : (real:0:0:0,user:0:0:0,sys:0:0:0)[clk:5]	(2 calls X 2.500000e+00)
			V4 x= V4 x V4 : (real:0:0:2,user:0:0:2,sys:0:0:0)[clk:276]	(100 calls X 2.760000e+00)
			V4 x= V4 : (real:0:0:2,user:0:0:2,sys:0:0:0)[clk:231]	(100 calls X 2.310000e+00)
			V4 x= V4 x V4 : (real:0:0:2,user:0:0:2,sys:0:0:0)[clk:250]	(100 calls X 2.500000e+00)
			V4 x= V4 x ( V4 x V4 ) : (real:0:0:5,user:0:0:5,sys:0:0:0)[clk:517]	(200 calls X 2.585000e+00)
	cubby::field timer root : (real:0:0:16,user:0:0:16,sys:0:0:0)[clk:1688]
		vector::in_place_curl : (real:0:0:3,user:0:0:3,sys:0:0:0)[clk:363]	(204 calls X 1.779412e+00)
		vector::vec_prod : (real:0:0:4,user:0:0:4,sys:0:0:0)[clk:475]	(204 calls X 2.328431e+00)
		vector::project : (real:0:0:1,user:0:0:1,sys:0:0:0)[clk:184]	(102 calls X 1.803922e+00)
		scalar::dealias : (real:0:0:6,user:0:0:6,sys:0:0:0)[clk:628]	(612 calls X 1.026144e+00)
		scalar::local_energy : (real:0:0:0,user:0:0:0,sys:0:0:0)[clk:38]	(66 calls X 5.757576e-01)

On satch for release 1466, 4 cpus for a 1283

max number of allocated scalars: 39
 ---------------------------------------------
   4 Processor run, global<128,128,128>
   for 100 time steps
main timer : (real:0:4:5,user:0:3:8,sys:0:0:54)[clk:24505]
	loop : (real:0:2:0,user:0:1:51,sys:0:0:8)[clk:12063]	(100 calls X 1.206300e+02)
	FFT timer : (real:0:3:3,user:0:2:15,sys:0:0:46)[clk:18334]
		FFT and transposition time : (real:0:3:3,user:0:2:15,sys:0:0:46)[clk:18334]	(512 calls X 3.580859e+01)
			FFT only : (real:0:0:27,user:0:0:27,sys:0:0:0)[clk:2781]	(2560 calls X 1.086328e+00)
		planification time : (real:0:2:1,user:0:1:14,sys:0:0:46)[clk:12126]
	azur::array timer root : (real:0:0:34,user:0:0:31,sys:0:0:2)[clk:3456]
		view = expr : (real:0:0:34,user:0:0:31,sys:0:0:2)[clk:3455]	(2378 calls X 1.452902e+00)
			V4 = R : (real:0:0:1,user:0:0:1,sys:0:0:0)[clk:168]	(106 calls X 1.584906e+00)
			V4 = R : (real:0:0:0,user:0:0:0,sys:0:0:0)[clk:5]	(4 calls X 1.250000e+00)
			V3 = V3 : (real:0:0:0,user:0:0:0,sys:0:0:0)[clk:9]	(9 calls X 1.000000e+00)
			V3 x= R : (real:0:0:0,user:0:0:0,sys:0:0:0)[clk:2]	(6 calls X 3.333333e-01)
			V3 = R : (real:0:0:2,user:0:0:0,sys:0:0:2)[clk:284]	(516 calls X 5.503876e-01)
			V3 x= R : (real:0:0:3,user:0:0:3,sys:0:0:0)[clk:321]	(618 calls X 5.194175e-01)
			V4 = V4 : (real:0:0:9,user:0:0:9,sys:0:0:0)[clk:909]	(406 calls X 2.238916e+00)
			V4 = V4 : (real:0:0:4,user:0:0:4,sys:0:0:0)[clk:477]	(205 calls X 2.326829e+00)
			V4 x= V4 x ( V4 x V4 ) : (real:0:0:0,user:0:0:0,sys:0:0:0)[clk:6]	(2 calls X 3.000000e+00)
			V4 x= V4 x V4 : (real:0:0:0,user:0:0:0,sys:0:0:0)[clk:5]	(2 calls X 2.500000e+00)
			V4 x= V4 : (real:0:0:0,user:0:0:0,sys:0:0:0)[clk:4]	(2 calls X 2.000000e+00)
			V4 x= V4 : (real:0:0:0,user:0:0:0,sys:0:0:0)[clk:5]	(2 calls X 2.500000e+00)
			V4 x= V4 x V4 : (real:0:0:2,user:0:0:2,sys:0:0:0)[clk:269]	(100 calls X 2.690000e+00)
			V4 x= V4 : (real:0:0:2,user:0:0:2,sys:0:0:0)[clk:225]	(100 calls X 2.250000e+00)
			V4 x= V4 x V4 : (real:0:0:2,user:0:0:2,sys:0:0:0)[clk:263]	(100 calls X 2.630000e+00)
			V4 x= V4 x ( V4 x V4 ) : (real:0:0:5,user:0:0:5,sys:0:0:0)[clk:502]	(200 calls X 2.510000e+00)
	cubby::field timer root : (real:0:0:22,user:0:0:22,sys:0:0:0)[clk:2293]
		vector::in_place_curl : (real:0:0:3,user:0:0:3,sys:0:0:0)[clk:341]	(204 calls X 1.671569e+00)
		vector::vec_prod : (real:0:0:4,user:0:0:4,sys:0:0:0)[clk:482]	(204 calls X 2.362745e+00)
		vector::project : (real:0:0:2,user:0:0:2,sys:0:0:0)[clk:231]	(102 calls X 2.264706e+00)
		scalar::dealias : (real:0:0:12,user:0:0:11,sys:0:0:0)[clk:1200]	(612 calls X 1.960784e+00)
		scalar::local_energy : (real:0:0:0,user:0:0:0,sys:0:0:0)[clk:39]	(66 calls X 5.909091e-01)
Last modified 10 years ago Last modified on Aug 17, 2010 10:43:14 AM