/[gxemul]/trunk/src/cpu_alpha_instr_loadstore.c
This is repository of my old source code which isn't updated any more. Go to git.rot13.org for current projects!
ViewVC logotype

Contents of /trunk/src/cpu_alpha_instr_loadstore.c

Parent Directory Parent Directory | Revision Log Revision Log


Revision 12 - (show annotations)
Mon Oct 8 16:18:38 2007 UTC (16 years, 5 months ago) by dpavlin
File MIME type: text/plain
File size: 8637 byte(s)
++ trunk/HISTORY	(local)
$Id: HISTORY,v 1.905 2005/08/16 09:16:24 debug Exp $
20050628	Continuing the work on the ARM translation engine. end_of_page
		works. Experimenting with load/store translation caches
		(virtual -> physical -> host).
20050629	More ARM stuff (memory access translation cache, mostly). This
		might break a lot of stuff elsewhere, probably some MIPS-
		related translation things.
20050630	Many load/stores are now automatically generated and included
		into cpu_arm_instr.c; 1024 functions in total (!).
		Fixes based on feedback from Alec Voropay: only print 8 hex
		digits instead of 16 in some cases when emulating 32-bit
		machines; similar 8 vs 16 digit fix for breakpoint addresses;
		4Kc has 16 TLB entries, not 48; the MIPS config select1
		register is now printed with "reg ,0".
		Also changing many other occurances of 16 vs 8 digit output.
		Adding cache associativity fields to mips_cpu_types.h; updating
		some other cache fields; making the output of
		mips_cpu_dumpinfo() look nicer.
		Generalizing the bintrans stuff for device accesses to also
		work with the new translation system. (This might also break
		some MIPS things.)
		Adding multi-load/store instructions to the ARM disassembler
		and the translator, and some optimizations of various kinds.
20050701	Adding a simple dev_disk (it can read/write sectors from
		disk images).
20050712	Adding dev_ether (a simple ethernet send/receive device).
		Debugger command "ninstrs" for toggling show_nr_of_instructions
		during runtime.
		Removing the framebuffer logo.
20050713	Continuing on dev_ether.
		Adding a dummy cpu_alpha (again).
20050714	More work on cpu_alpha.
20050715	More work on cpu_alpha. Many instructions work, enough to run
		a simple framebuffer fill test (similar to the ARM test).
20050716	More Alpha stuff.
20050717	Minor updates (Alpha stuff).
20050718	Minor updates (Alpha stuff).
20050719	Generalizing some Alpha instructions.
20050720	More Alpha-related updates.
20050721	Continuing on cpu_alpha. Importing rpb.h from NetBSD/alpha.
20050722	Alpha-related updates: userland stuff (Hello World using
		write() compiled statically for FreeBSD/Alpha runs fine), and
		more instructions are now implemented.
20050723	Fixing ldq_u and stq_u.
		Adding more instructions (conditional moves, masks, extracts,
		shifts).
20050724	More FreeBSD/Alpha userland stuff, and adding some more
		instructions (inserts).
20050725	Continuing on the Alpha stuff. (Adding dummy ldt/stt.)
		Adding a -A command line option to turn off alignment checks
		in some cases (for translated code).
		Trying to remove the old bintrans code which updated the pc
		and nr_of_executed_instructions for every instruction.
20050726	Making another attempt att removing the pc/nr of instructions
		code. This time it worked, huge performance increase for
		artificial test code, but performance loss for real-world
		code :-( so I'm scrapping that code for now.
		Tiny performance increase on Alpha (by using ret instead of
		jmp, to play nice with the Alpha's branch prediction) for the
		old MIPS bintrans backend.
20050727	Various minor fixes and cleanups.
20050728	Switching from a 2-level virtual to host/physical translation
		system for ARM emulation, to a 1-level translation.
		Trying to switch from 2-level to 1-level for the MIPS bintrans
		system as well (Alpha only, so far), but there is at least one
		problem: caches and/or how they work with device mappings.
20050730	Doing the 2-level to 1-level conversion for the i386 backend.
		The cache/device bug is still there for R2K/3K :(
		Various other minor updates (Malta etc).
		The mc146818 clock now updates the UIP bit in a way which works
		better with Linux for at least sgimips and Malta emulation.
		Beginning the work on refactoring the dyntrans system.
20050731	Continuing the dyntrans refactoring.
		Fixing a small but serious host alignment bug in memory_rw.
		Adding support for big-endian load/stores to the i386 bintrans
		backend.
		Another minor i386 bintrans backend update: stores from the
		zero register are now one (or two) loads shorter.
		The slt and sltu instructions were incorrectly implemented for
		the i386 backend; only using them for 32-bit mode for now.
20050801	Continuing the dyntrans refactoring.
		Cleanup of the ns16550 serial controller (removing unnecessary
		code).
		Bugfix (memory corruption bug) in dev_gt, and a patch/hack from
		Alec Voropay for Linux/Malta.
20050802	More cleanup/refactoring of the dyntrans subsystem: adding
		phys_page pointers to the lookup tables, for quick jumps
		between translated pages.
		Better fix for the ns16550 device (but still no real FIFO
		functionality).
		Converting cpu_ppc to the new dyntrans system. This means that
		I will have to start from scratch with implementing each
		instruction, and figure out how to implement dual 64/32-bit
		modes etc.
		Removing the URISC CPU family, because it was useless.
20050803	When selecting a machine type, the main type can now be omitted
		if the subtype name is unique. (I.e. -E can be omitted.)
		Fixing a dyntrans/device update bug. (Writes to offset 0 of
		a device could sometimes go unnoticed.)
		Adding an experimental "instruction combination" hack for
		ARM for memset-like byte fill loops.
20050804	Minor progress on cpu_alpha and related things.
		Finally fixing the MIPS dmult/dmultu bugs.
		Fixing some minor TODOs.
20050805	Generalizing the 8259 PIC. It now also works with Cobalt
		and evbmips emulation, in addition to the x86 hack.
		Finally converting the ns16550 device to use devinit.
		Continuing the work on the dyntrans system. Thinking about
		how to add breakpoints.
20050806	More dyntrans updates. Breakpoints seem to work now.
20050807	Minor updates: cpu_alpha and related things; removing
		dev_malta (as it isn't used any more).
		Dyntrans: working on general "show trace tree" support.
		The trace tree stuff now works with both the old MIPS code and
		with newer dyntrans modes. :)
		Continuing on Alpha-related stuff (trying to get *BSD to boot
		a bit further, adding more instructions, etc).
20050808	Adding a dummy IA64 cpu family, and continuing the refactoring
		of the dyntrans system.
		Removing the regression test stuff, because it was more or
		less useless.
		Adding loadlinked/storeconditional type instructions to the
		Alpha emulation. (Needed for Linux/alpha. Not very well tested
		yet.)
20050809	The function call trace tree now prints a per-function nr of
		arguments. (Semi-meaningless, since that data isn't read yet
		from the ELFs; some hardcoded symbols such as memcpy() and
		strlen() work fine, though.)
		More dyntrans refactoring; taking out more of the things that
		are common to all cpu families.
20050810	Working on adding support for "dual mode" for PPC dyntrans
		(i.e. both 64-bit and 32-bit modes).
		(Re)adding some simple PPC instructions.
20050811	Adding a dummy M68K cpu family. The dyntrans system isn't ready
		for variable-length ISAs yet, so it's completely bogus so far.
		Re-adding more PPC instructions.
		Adding a hack to src/file.c which allows OpenBSD/mac68k a.out
		kernels to be loaded.
		Beginning to add PPC loads/stores. So far they only work in
		32-bit mode.
20050812	The configure file option "add_remote" now accepts symbolic
		host names, in addition to numeric IPv4 addresses.
		Re-adding more PPC instructions.
20050814	Continuing to port back more PPC instructions.
		Found and fixed the cache/device write-update bug for 32-bit
		MIPS bintrans. :-)
		Triggered a really weird and annoying bug in Compaq's C
		compiler; ccc sometimes outputs code which loads from an
		address _before_ checking whether the pointer was NULL or not.
		(I'm not sure how to handle this problem.)
20050815	Removing all of the old x86 instruction execution code; adding
		a new (dummy) dyntrans module for x86.
		Taking the first steps to extend the dyntrans system to support
		variable-length instructions.
		Slowly preparing for the next release.
20050816	Adding a dummy SPARC cpu module.
		Minor updates (documentation etc) for the release.

==============  RELEASE 0.3.5  ==============


1 /*
2 * Copyright (C) 2005 Anders Gavare. All rights reserved.
3 *
4 * Redistribution and use in source and binary forms, with or without
5 * modification, are permitted provided that the following conditions are met:
6 *
7 * 1. Redistributions of source code must retain the above copyright
8 * notice, this list of conditions and the following disclaimer.
9 * 2. Redistributions in binary form must reproduce the above copyright
10 * notice, this list of conditions and the following disclaimer in the
11 * documentation and/or other materials provided with the distribution.
12 * 3. The name of the author may not be used to endorse or promote products
13 * derived from this software without specific prior written permission.
14 *
15 * THIS SOFTWARE IS PROVIDED BY THE AUTHOR AND CONTRIBUTORS ``AS IS'' AND
16 * ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE
17 * IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE
18 * ARE DISCLAIMED. IN NO EVENT SHALL THE AUTHOR OR CONTRIBUTORS BE LIABLE
19 * FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL
20 * DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS
21 * OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION)
22 * HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT
23 * LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY
24 * OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF
25 * SUCH DAMAGE.
26 *
27 *
28 * $Id: cpu_alpha_instr_loadstore.c,v 1.9 2005/08/08 20:19:43 debug Exp $
29 *
30 * Alpha load/store instructions. (Included from cpu_alpha_instr_inc.c.)
31 *
32 *
33 * Load/store instructions have the following arguments:
34 *
35 * arg[0] = pointer to the register to load to or store from (uint64_t)
36 * arg[1] = pointer to the base register (uint64_t)
37 * arg[2] = offset (as an int32_t)
38 *
39 * NOTE:
40 * Alpha byte and word loads (8- and 16-bit) are unsigned, while
41 * 32-bit long words are sign-extended up to 64 bits during a load!
42 */
43
44
45 #ifndef LS_IGNORE_OFFSET
46 #ifndef LS_ALIGN_CHECK
47 static void LS_GENERIC_N(struct cpu *cpu, struct alpha_instr_call *ic)
48 {
49 #ifdef LS_B
50 unsigned char data[1];
51 #endif
52 #ifdef LS_W
53 unsigned char data[2];
54 #endif
55 #ifdef LS_L
56 unsigned char data[4];
57 #endif
58 #ifdef LS_Q
59 unsigned char data[8];
60 #endif
61 uint64_t addr = *((uint64_t *)ic->arg[1]);
62 uint64_t data_x;
63
64 addr += (int32_t)ic->arg[2];
65 #ifdef LS_UNALIGNED
66 addr &= ~7;
67 #endif
68
69 #ifdef LS_LOAD
70 /* Load: */
71 if (!cpu->memory_rw(cpu, cpu->mem, addr, data, sizeof(data),
72 MEM_READ, CACHE_DATA)) {
73 fatal("store failed: TODO\n");
74 exit(1);
75 }
76
77 data_x = data[0];
78 #ifndef LS_B
79 data_x += (data[1] << 8);
80 #ifndef LS_W
81 data_x += (data[2] << 16);
82 data_x += (data[3] << 24);
83 #ifdef LS_L
84 data_x = (int64_t)(int32_t)data_x;
85 #endif
86 #ifndef LS_L
87 data_x += ((uint64_t)data[4] << 32);
88 data_x += ((uint64_t)data[5] << 40);
89 data_x += ((uint64_t)data[6] << 48);
90 data_x += ((uint64_t)data[7] << 56);
91 #endif
92 #endif
93 #endif
94 *((uint64_t *)ic->arg[0]) = data_x;
95 #else
96 /* Store: */
97 data_x = *((uint64_t *)ic->arg[0]);
98 data[0] = data_x;
99 #ifndef LS_B
100 data[1] = data_x >> 8;
101 #ifndef LS_W
102 data[2] = data_x >> 16;
103 data[3] = data_x >> 24;
104 #ifndef LS_L
105 data[4] = data_x >> 32;
106 data[5] = data_x >> 40;
107 data[6] = data_x >> 48;
108 data[7] = data_x >> 56;
109 #endif
110 #endif
111 #endif
112
113 if (!cpu->memory_rw(cpu, cpu->mem, addr, data, sizeof(data),
114 MEM_WRITE, CACHE_DATA)) {
115 fatal("store failed: TODO\n");
116 exit(1);
117 }
118
119 #ifdef LS_LLSC
120 #ifndef LS_LOAD
121 *((uint64_t *)ic->arg[0]) = 1;
122 #endif
123 #endif
124
125 #endif
126 }
127 #endif
128 #endif
129
130
131 static void LS_N(struct cpu *cpu, struct alpha_instr_call *ic)
132 {
133 int first, a, b, c;
134 uint64_t addr;
135
136 addr = (*((uint64_t *)ic->arg[1]))
137 #ifndef LS_IGNORE_OFFSET
138 + (int32_t)ic->arg[2]
139 #endif
140 ;
141
142 #ifdef LS_UNALIGNED
143 addr &= ~7;
144 #endif
145
146 #ifdef LS_LLSC
147 #ifdef LS_LOAD
148 /* TODO: cache-line size! */
149 cpu->cd.alpha.load_linked_addr = addr & ~63;
150 cpu->cd.alpha.ll_flag = 1;
151 #else
152 /* TODO: only invalidate per cache line, not everything! */
153 if (cpu->cd.alpha.ll_flag == 1) {
154 int i;
155 for (i=0; i<cpu->machine->ncpus; i++)
156 cpu->machine->cpus[i]->cd.alpha.ll_flag = 0;
157 } else {
158 *((uint64_t *)ic->arg[0]) = 0;
159 return;
160 }
161 #endif
162 #endif
163
164 first = addr >> ALPHA_TOPSHIFT;
165 a = (addr >> ALPHA_LEVEL0_SHIFT) & (ALPHA_LEVEL0 - 1);
166 b = (addr >> ALPHA_LEVEL1_SHIFT) & (ALPHA_LEVEL1 - 1);
167 c = addr & 8191;
168
169 #ifdef LS_ALIGN_CHECK
170 #ifndef LS_B
171 if (c &
172 #ifdef LS_W
173 1
174 #endif
175 #ifdef LS_L
176 3
177 #endif
178 #ifdef LS_Q
179 7
180 #endif
181 ) {
182 LS_GENERIC_N(cpu, ic);
183 return;
184 }
185 else
186 #endif
187 #endif
188
189 if (first == 0) {
190 struct alpha_vph_page *vph_p;
191 unsigned char *page;
192 vph_p = cpu->cd.alpha.vph_table0[a];
193 page = vph_p->host_load[b];
194 if (page != NULL) {
195 #ifdef LS_LOAD
196 #ifdef HOST_BIG_ENDIAN
197 uint64_t data_x;
198 data_x = page[c];
199 #ifndef LS_B
200 data_x += (page[c+1] << 8);
201 #ifndef LS_W
202 data_x += (page[c+2] << 16);
203 data_x += (page[c+3] << 24);
204 #ifndef LS_L
205 data_x += ((uint64_t)page[c+4] << 32);
206 data_x += ((uint64_t)page[c+5] << 40);
207 data_x += ((uint64_t)page[c+6] << 48);
208 data_x += ((uint64_t)page[c+7] << 56);
209 #endif
210 #endif
211 #endif
212 #ifdef LS_L
213 *((uint64_t *)ic->arg[0]) = (int64_t)(int32_t)data_x;
214 #else
215 *((uint64_t *)ic->arg[0]) = data_x;
216 #endif
217 #else
218 #ifdef LS_B
219 *((uint64_t *)ic->arg[0]) = page[c];
220 #endif
221 #ifdef LS_W
222 uint16_t d = *((uint16_t *) (page + c));
223 *((uint64_t *)ic->arg[0]) = d;
224 #endif
225 #ifdef LS_L
226 int32_t d = *((int32_t *) (page + c));
227 *((uint64_t *)ic->arg[0]) = (int64_t)d;
228 #endif
229 #ifdef LS_Q
230 uint64_t d = *((uint64_t *) (page + c));
231 *((uint64_t *)ic->arg[0]) = d;
232 #endif
233 #endif
234 #else
235 /* Store: */
236 #ifdef HOST_BIG_ENDIAN
237 uint64_t data_x = *((uint64_t *)ic->arg[0]);
238 page[c] = data_x;
239 #ifndef LS_B
240 page[c+1] = data_x >> 8;
241 #ifndef LS_W
242 page[c+2] = data_x >> 16;
243 page[c+3] = data_x >> 24;
244 #ifndef LS_L
245 page[c+4] = data_x >> 32;
246 page[c+5] = data_x >> 40;
247 page[c+6] = data_x >> 48;
248 page[c+7] = data_x >> 56;
249 #endif
250 #endif
251 #endif
252 #else
253 /* Native byte order: */
254 #ifdef LS_B
255 page[c] = *((uint64_t *)ic->arg[0]);
256 #endif
257 #ifdef LS_W
258 uint32_t d = *((uint64_t *)ic->arg[0]);
259 *((uint16_t *) (page + c)) = d;
260 #endif
261 #ifdef LS_L
262 uint32_t d = *((uint64_t *)ic->arg[0]);
263 *((uint32_t *) (page + c)) = d;
264 #endif
265 #ifdef LS_Q
266 uint64_t d = *((uint64_t *)ic->arg[0]);
267 *((uint64_t *) (page + c)) = d;
268 #endif
269 #endif
270
271 #ifdef LS_LLSC
272 #ifndef LS_LOAD
273 *((uint64_t *)ic->arg[0]) = 1;
274 #endif
275 #endif
276
277 #endif /* !LS_LOAD */
278 } else
279 LS_GENERIC_N(cpu, ic);
280 } else if (first == ALPHA_TOP_KERNEL) {
281 struct alpha_vph_page *vph_p;
282 unsigned char *page;
283 vph_p = cpu->cd.alpha.vph_table0_kernel[a];
284 page = vph_p->host_load[b];
285 if (page != NULL) {
286 #ifdef LS_LOAD
287 #ifdef HOST_BIG_ENDIAN
288 uint64_t data_x;
289 data_x = page[c];
290 #ifndef LS_B
291 data_x += (page[c+1] << 8);
292 #ifndef LS_W
293 data_x += (page[c+2] << 16);
294 data_x += (page[c+3] << 24);
295 #ifndef LS_L
296 data_x += ((uint64_t)page[c+4] << 32);
297 data_x += ((uint64_t)page[c+5] << 40);
298 data_x += ((uint64_t)page[c+6] << 48);
299 data_x += ((uint64_t)page[c+7] << 56);
300 #endif
301 #endif
302 #endif
303 #ifdef LS_L
304 *((uint64_t *)ic->arg[0]) = (int64_t)(int32_t)data_x;
305 #else
306 *((uint64_t *)ic->arg[0]) = data_x;
307 #endif
308 #else
309 #ifdef LS_B
310 *((uint64_t *)ic->arg[0]) = page[c];
311 #endif
312 #ifdef LS_W
313 uint16_t d = *((uint16_t *) (page + c));
314 *((uint64_t *)ic->arg[0]) = d;
315 #endif
316 #ifdef LS_L
317 int32_t d = *((int32_t *) (page + c));
318 *((uint64_t *)ic->arg[0]) = (int64_t)d;
319 #endif
320 #ifdef LS_Q
321 uint64_t d = *((uint64_t *) (page + c));
322 *((uint64_t *)ic->arg[0]) = d;
323 #endif
324 #endif
325 #else
326 /* Store: */
327 #ifdef HOST_BIG_ENDIAN
328 uint64_t data_x = *((uint64_t *)ic->arg[0]);
329 page[c] = data_x;
330 #ifndef LS_B
331 page[c+1] = data_x >> 8;
332 #ifndef LS_W
333 page[c+2] = data_x >> 16;
334 page[c+3] = data_x >> 24;
335 #ifndef LS_L
336 page[c+4] = data_x >> 32;
337 page[c+5] = data_x >> 40;
338 page[c+6] = data_x >> 48;
339 page[c+7] = data_x >> 56;
340 #endif
341 #endif
342 #endif
343 #else
344 /* Native byte order: */
345 #ifdef LS_B
346 page[c] = *((uint64_t *)ic->arg[0]);
347 #endif
348 #ifdef LS_W
349 uint32_t d = *((uint64_t *)ic->arg[0]);
350 *((uint16_t *) (page + c)) = d;
351 #endif
352 #ifdef LS_L
353 uint32_t d = *((uint64_t *)ic->arg[0]);
354 *((uint32_t *) (page + c)) = d;
355 #endif
356 #ifdef LS_Q
357 uint64_t d = *((uint64_t *)ic->arg[0]);
358 *((uint64_t *) (page + c)) = d;
359 #endif
360 #endif
361
362 #ifdef LS_LLSC
363 #ifndef LS_LOAD
364 *((uint64_t *)ic->arg[0]) = 1;
365 #endif
366 #endif
367
368 #endif /* !LS_LOAD */
369 } else
370 LS_GENERIC_N(cpu, ic);
371 } else
372 LS_GENERIC_N(cpu, ic);
373 }
374

  ViewVC Help
Powered by ViewVC 1.1.26