/[gxemul]/trunk/src/cpus/generate_arm_multi.c
This is repository of my old source code which isn't updated any more. Go to git.rot13.org for current projects!
ViewVC logotype

Contents of /trunk/src/cpus/generate_arm_multi.c

Parent Directory Parent Directory | Revision Log Revision Log


Revision 34 - (show annotations)
Mon Oct 8 16:21:17 2007 UTC (16 years, 5 months ago) by dpavlin
File MIME type: text/plain
File size: 10820 byte(s)
++ trunk/HISTORY	(local)
$Id: HISTORY,v 1.1480 2007/02/19 01:34:42 debug Exp $
20061029	Changing usleep(1) calls in the debugger to usleep(10000)
20061107	Adding a new disk image option (-d o...) which sets the ISO9660
		filesystem base offset; also making some other hacks to allow
		NetBSD/dreamcast and homebrew demos/games to boot directly
		from a filesystem image.
		Moving Dreamcast-specific stuff in the documentation to its
		own page (dreamcast.html).
		Adding a border to the Dreamcast PVR framebuffer.
20061108	Adding a -T command line option (again?), for halting the
		emulator on unimplemented memory accesses.
20061109	Continuing on various SH4 and Dreamcast related things.
		The emulator should now halt on more unimplemented device
		accesses, instead of just printing a warning, forcing me to
		actually implement missing stuff :)
20061111	Continuing on SH4 and Dreamcast stuff.
		Adding a bogus Landisk (SH4) machine mode.
20061112	Implementing some parts of the Dreamcast GDROM device. With
		some ugly hacks, NetBSD can (barely) mount an ISO image.
20061113	NetBSD/dreamcast now starts booting from the Live CD image,
		but crashes randomly quite early on in the boot process.
20061122	Beginning on a skeleton interrupt.h and interrupt.c for the
		new interrupt subsystem.
20061124	Continuing on the new interrupt system; taking the first steps
		to attempt to connect CPUs (SuperH and MIPS) and devices
		(dev_cons and SH4 timer interrupts) to it. Many things will
		probably break from now on.
20061125	Converting dev_ns16550, dev_8253 to the new interrupt system.
		Attempting to begin to convert the ISA bus.
20061130	Incorporating a patch from Brian Foley for the configure
		script, which checks for X11 libs in /usr/X11R6/lib64 (which
		is used on some Linux systems).
20061227	Adding a note in the man page about booting from Dreamcast
		CDROM images (i.e. that no external kernel is needed).
20061229	Continuing on the interrupt system rewrite: beginning to
		convert more devices, adding abort() calls for legacy interrupt
		system calls so that everything now _has_ to be rewritten!
		Almost all machine modes are now completely broken.
20061230	More progress on removing old interrupt code, mostly related
		to the ISA bus + devices, the LCA bus (on AlphaBook1), and
		the Footbridge bus (for CATS). And some minor PCI stuff.
		Connecting the ARM cpu to the new interrupt system.
		The CATS, NetWinder, and QEMU_MIPS machine modes now work with
		the new interrupt system :)
20061231	Connecting PowerPC CPUs to the new interrupt system.
		Making PReP machines (IBM 6050) work again.
		Beginning to convert the GT PCI controller (for e.g. Malta
		and Cobalt emulation). Some things work, but not everything.
		Updating Copyright notices for 2007.
20070101	Converting dev_kn02 from legacy style to devinit; the 3max
		machine mode now works with the new interrupt system :-]
20070105	Beginning to convert the SGI O2 machine to the new interrupt
		system; finally converting O2 (IP32) devices to devinit, etc.
20070106	Continuing on the interrupt system redesign/rewrite; KN01
		(PMAX), KN230, and Dreamcast ASIC interrupts should work again,
		moving out stuff from machine.h and devices.h into the
		corresponding devices, beginning the rewrite of i80321
		interrupts, etc.
20070107	Beginning on the rewrite of Eagle interrupt stuff (PReP, etc).
20070117	Beginning the rewrite of Algor (V3) interrupts (finally
		changing dev_v3 into devinit style).
20070118	Removing the "bus" registry concept from machine.h, because
		it was practically meaningless.
		Continuing on the rewrite of Algor V3 ISA interrupts.
20070121	More work on Algor interrupts; they are now working again,
		well enough to run NetBSD/algor. :-)
20070122	Converting VR41xx (HPCmips) interrupts. NetBSD/hpcmips
		can be installed using the new interrupt system :-)
20070123	Making the testmips mode work with the new interrupt system.
20070127	Beginning to convert DEC5800 devices to devinit, and to the
		new interrupt system.
		Converting Playstation 2 devices to devinit, and converting
		the interrupt system. Also fixing a severe bug: the interrupt
		mask register on Playstation 2 is bitwise _toggled_ on writes.
20070128	Removing the dummy NetGear machine mode and the 8250 device
		(which was only used by the NetGear machine).
		Beginning to convert the MacPPC GC (Grand Central) interrupt
		controller to the new interrupt system.
		Converting Jazz interrupts (PICA61 etc.) to the new interrupt
		system. NetBSD/arc can be installed again :-)
		Fixing the JAZZ timer (hardcoding it at 100 Hz, works with
		NetBSD and it is better than a completely dummy timer as it
		was before).
		Converting dev_mp to the new interrupt system, although I
		haven't had time to actually test it yet.
		Completely removing src/machines/interrupts.c, cpu_interrupt
		and cpu_interrupt_ack in src/cpu.c, and
		src/include/machine_interrupts.h! Adding fatal error messages
		+ abort() in the few places that are left to fix.
		Converting dev_z8530 to the new interrupt system.
		FINALLY removing the md_int struct completely from the
		machine struct.
		SH4 fixes (adding a PADDR invalidation in the ITLB replacement
		code in memory_sh.c); the NetBSD/dreamcast LiveCD now runs
		all the way to the login prompt, and can be interacted with :-)
		Converting the CPC700 controller (PCI and interrupt controller
		for PM/PPC) to the new interrupt system.
20070129	Fixing MACE ISA interrupts (SGI IP32 emulation). Both NetBSD/
		sgimips' and OpenBSD/sgi's ramdisk kernels can now be
		interacted with again.
20070130	Moving out the MIPS multi_lw and _sw instruction combinations
		so that they are auto-generated at compile time instead.
20070131	Adding detection of amd64/x86_64 hosts in the configure script,
		for doing initial experiments (again :-) with native code
		generation.
		Adding a -k command line option to set the size of the dyntrans
		cache, and a -B command line option to disable native code
		generation, even if GXemul was compiled with support for
		native code generation for the specific host CPU architecture.
20070201	Experimenting with a skeleton for native code generation.
		Changing the default behaviour, so that native code generation
		is now disabled by default, and has to be enabled by using
		-b on the command line.
20070202	Continuing the native code generation experiments.
		Making PCI interrupts work for Footbridge again.
20070203	More native code generation experiments.
		Removing most of the native code generation experimental code,
		it does not make sense to include any quick hacks like this.
		Minor cleanup/removal of some more legacy MIPS interrupt code.
20070204	Making i80321 interrupts work again (for NetBSD/evbarm etc.),
		and fixing the timer at 100 Hz.
20070206	Experimenting with removing the wdc interrupt slowness hack.
20070207	Lowering the number of dyntrans TLB entries for MIPS from
		192 to 128, resulting in a minor speed improvement.
		Minor optimization to the code invalidation routine in
		cpu_dyntrans.c.
20070208	Increasing (experimentally) the nr of dyntrans instructions per
		loop from 60 to 120.
20070210	Commenting out (experimentally) the dyntrans_device_danger
		detection in memory_rw.c.
		Changing the testmips and baremips machines to use a revision 2
		MIPS64 CPU by default, instead of revision 1.
		Removing the dummy i960, IA64, x86, AVR32, and HP PA-RISC
		files, the PC bios emulation, and the Olivetti M700 (ARC) and
		db64360 emulation modes.
20070211	Adding an "mp" demo to the demos directory, which tests the
		SMP functionality of the testmips machine.
		Fixing PReP interrupts some more. NetBSD/prep now boots again.
20070216	Adding a "nop workaround" for booting Mach/PMAX to the
		documentation; thanks to Artur Bujdoso for the values.
		Converting more of the MacPPC interrupt stuff to the new
		system.
		Beginning to convert BeBox interrupts to the new system.
		PPC603e should NOT have the PPC_NO_DEC flag! Removing it.
		Correcting BeBox clock speed (it was set to 100 in the NetBSD
		bootinfo block, but should be 33000000/4), allowing NetBSD
		to start without using the (incorrect) PPC_NO_DEC hack.
20070217	Implementing (slow) AltiVec vector loads and stores, allowing
		NetBSD/macppc to finally boot using the GENERIC kernel :-)
		Updating the documentation with install instructions for
		NetBSD/macppc.
20070218-19	Regression testing for the release.

==============  RELEASE 0.4.4  ==============


1 /*
2 * Copyright (C) 2005-2007 Anders Gavare. All rights reserved.
3 *
4 * Redistribution and use in source and binary forms, with or without
5 * modification, are permitted provided that the following conditions are met:
6 *
7 * 1. Redistributions of source code must retain the above copyright
8 * notice, this list of conditions and the following disclaimer.
9 * 2. Redistributions in binary form must reproduce the above copyright
10 * notice, this list of conditions and the following disclaimer in the
11 * documentation and/or other materials provided with the distribution.
12 * 3. The name of the author may not be used to endorse or promote products
13 * derived from this software without specific prior written permission.
14 *
15 * THIS SOFTWARE IS PROVIDED BY THE AUTHOR AND CONTRIBUTORS ``AS IS'' AND
16 * ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE
17 * IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE
18 * ARE DISCLAIMED. IN NO EVENT SHALL THE AUTHOR OR CONTRIBUTORS BE LIABLE
19 * FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL
20 * DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS
21 * OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION)
22 * HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT
23 * LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY
24 * OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF
25 * SUCH DAMAGE.
26 *
27 *
28 * $Id: generate_arm_multi.c,v 1.15 2006/12/30 13:30:56 debug Exp $
29 *
30 * Generation of commonly used ARM load/store multiple instructions.
31 *
32 * The main idea is to first check whether a load/store would be possible
33 * without going outside a page, and if so, use the host_load or _store
34 * arrays for quick access to emulated RAM. Otherwise, fall back to using
35 * the generic bdt_load() or bdt_store().
36 */
37
38 #include <stdio.h>
39 #include <stdlib.h>
40
41 #include "misc.h"
42
43
44 /*
45 * generate_opcode():
46 *
47 * Given an ARM load/store multiple opcode, produce equivalent "hardcoded"
48 * C code which emulates the opcode.
49 *
50 * TODO:
51 *
52 * o) On 64-bit hosts, load/store two registers at a time. This
53 * feature depends both on the alignment of the base register,
54 * and the specific set of registers being loaded/stored.
55 *
56 * o) Alignment checks. (Optional?)
57 *
58 * o) For accesses that cross page boundaries, use two pages using
59 * the fast method instead of calling the generic function?
60 */
61 void generate_opcode(uint32_t opcode)
62 {
63 int p, u, s, w, load, r, n_regs, i, x;
64
65 if ((opcode & 0x0e000000) != 0x08000000) {
66 fprintf(stderr, "opcode 0x%08"PRIx32" is not an ldm/stm\n",
67 opcode);
68 exit(1);
69 }
70
71 r = (opcode >> 16) & 15;
72 p = opcode & 0x01000000? 1 : 0;
73 u = opcode & 0x00800000? 1 : 0;
74 s = opcode & 0x00400000? 1 : 0;
75 w = opcode & 0x00200000? 1 : 0;
76 load = opcode & 0x00100000? 1 : 0;
77 n_regs = 0;
78 for (i=0; i<16; i++)
79 if (opcode & (1 << i))
80 n_regs ++;
81
82 /* TODO: Check for register pairs, for 64-bit load/stores */
83
84 if (n_regs == 0) {
85 fprintf(stderr, "opcode 0x%08"PRIx32" has no registers set\n",
86 opcode);
87 exit(1);
88 }
89
90 if (s) {
91 fprintf(stderr, "opcode 0x%08"PRIx32" has s-bit set\n", opcode);
92 exit(1);
93 }
94
95 if (r == 15) {
96 fprintf(stderr, "opcode 0x%08"PRIx32" has r=15\n", opcode);
97 exit(1);
98 }
99
100 printf("\nvoid arm_instr_multi_0x%08"PRIx32"(struct cpu *cpu,"
101 " struct arm_instr_call *ic) {\n", opcode);
102
103 printf("\tunsigned char *page;\n");
104 printf("\tuint32_t addr = cpu->cd.arm.r[%i];\n", r);
105
106 if (!load && opcode & 0x8000) {
107 printf("\tuint32_t tmp_pc = ((size_t)ic - (size_t)\n\t"
108 " cpu->cd.arm.cur_ic_page) / sizeof(struct "
109 "arm_instr_call);\n"
110 "\ttmp_pc = ((cpu->pc & ~((ARM_IC_ENTRIES_PER_PAGE-1)"
111 "\n\t << ARM_INSTR_ALIGNMENT_SHIFT)))\n"
112 "\t + (tmp_pc << ARM_INSTR_ALIGNMENT_SHIFT) + 12;\n");
113 }
114
115 if (p)
116 printf("\taddr %s 4;\n", u? "+=" : "-=");
117
118 printf("\tpage = cpu->cd.arm.host_%s[addr >> 12];\n",
119 load? "load" : "store");
120
121 printf("\taddr &= 0xffc;\n");
122
123 printf("\tif (");
124 switch (p*2 + u) {
125 case 0: /* post-decrement */
126 if (n_regs > 1)
127 printf("addr >= 0x%x && ", 4*(n_regs-1));
128 break;
129 case 1: /* post-increment */
130 if (n_regs > 1)
131 printf("addr <= 0x%x && ", 0x1000 - 4*n_regs);
132 break;
133 case 2: /* pre-decrement */
134 if (n_regs > 1)
135 printf("addr >= 0x%x && ", 4*(n_regs-1));
136 break;
137 case 3: /* pre-increment */
138 if (n_regs > 1)
139 printf("addr <= 0x%x && ", 0x1000 - 4*n_regs);
140 break;
141 }
142 printf("page != NULL) {\n");
143
144 printf("\t\tuint32_t *p = (uint32_t *) (page + addr);\n");
145
146 if (u) {
147 x = 0;
148 for (i=0; i<=15; i++) {
149 if (!(opcode & (1 << i)))
150 continue;
151
152 if (load && w && i == r) {
153 /* Skip the load if we're using writeback. */
154 } else if (load) {
155 if (i == 15)
156 printf("\t\tcpu->pc = p[%i];\n", x);
157 else
158 printf("\t\tcpu->cd.arm.r[%i] = "
159 "p[%i];\n", i, x);
160 } else {
161 if (i == 15)
162 printf("\t\tp[%i] = tmp_pc;\n", x);
163 else
164 printf("\t\tp[%i] = cpu->cd.arm.r"
165 "[%i];\n", x, i);
166 }
167
168 x ++;
169 }
170 } else {
171 /* Decrementing, but do it incrementing anyway: */
172 x = -n_regs;
173 for (i=0; i<=15; i++) {
174 if (!(opcode & (1 << i)))
175 continue;
176
177 x ++;
178
179 if (load && w && i == r) {
180 /* Skip the load if we're using writeback. */
181 } else if (load) {
182 if (i == 15)
183 printf("\t\tcpu->pc = p[%i];\n", x);
184 else
185 printf("\t\tcpu->cd.arm.r[%i] = "
186 "p[%i];\n", i, x);
187 } else {
188 if (i == 15)
189 printf("\t\tp[%i] = tmp_pc;\n", x);
190 else
191 printf("\t\tp[%i] = "
192 "cpu->cd.arm.r[%i];\n", x, i);
193 }
194 }
195 }
196
197 if (w)
198 printf("\t\tcpu->cd.arm.r[%i] %s %i;\n",
199 r, u? "+=" : "-=", 4*n_regs);
200
201 if (load && opcode & 0x8000) {
202 printf("\t\tquick_pc_to_pointers(cpu);\n");
203 }
204
205 printf("\t} else\n");
206 printf("\t\tinstr(bdt_%s)(cpu, ic);\n", load? "load" : "store");
207
208 printf("}\nY(multi_0x%08"PRIx32")\n", opcode);
209 }
210
211
212 /*
213 * main():
214 *
215 * Normal ARM code seems to only use about a few hundred of the 1^24 possible
216 * load/store multiple instructions. (I'm not counting the s-bit now.)
217 * Instead of having a linear array of 100s of entries, we can select a list
218 * to scan based on a few bits (*), and those lists will be shorter.
219 *
220 * (*) By running experiment_arm_multi.c on statistics gathered from running
221 * NetBSD/cats, it seems that choosing the following 8 bits results in
222 * the shortest linear lists:
223 *
224 * xxxx100P USWLnnnn llllllll llllllll
225 * ^ ^ ^ ^ ^ ^ ^ ^ (0x00950154)
226 */
227 int main(int argc, char *argv[])
228 {
229 int i, j;
230 int n_used[256];
231
232 if (argc < 2) {
233 fprintf(stderr, "usage: %s opcode [..]\n", argv[0]);
234 exit(1);
235 }
236
237 printf("\n/* AUTOMATICALLY GENERATED! Do not edit. */\n\n"
238 "#include <stdio.h>\n"
239 "#include <stdlib.h>\n"
240 "#include \"cpu.h\"\n"
241 "#include \"misc.h\"\n"
242 "#define DYNTRANS_PC_TO_POINTERS arm_pc_to_pointers\n"
243 "#include \"quick_pc_to_pointers.h\"\n"
244 "#include \"arm_tmphead_1.h\"\n"
245 "\n#define instr(x) arm_instr_ ## x\n");
246 printf("extern void arm_pc_to_pointers(struct cpu *);\n");
247 printf("extern void arm_instr_nop(struct cpu *, "
248 "struct arm_instr_call *);\n");
249 printf("extern void arm_instr_bdt_load(struct cpu *, "
250 "struct arm_instr_call *);\n");
251 printf("extern void arm_instr_bdt_store(struct cpu *, "
252 "struct arm_instr_call *);\n");
253 printf("\n\n");
254
255 /* Generate the opcode functions: */
256 for (i=1; i<argc; i++)
257 generate_opcode(strtol(argv[i], NULL, 0));
258
259 /* Generate 256 small lookup tables: */
260 for (j=0; j<256; j++) {
261 int n = 0, zz, zz0;
262 for (i=1; i<argc; i++) {
263 zz = strtol(argv[i], NULL, 0);
264 zz = ((zz & 0x00800000) >> 16)
265 |((zz & 0x00100000) >> 14)
266 |((zz & 0x00040000) >> 13)
267 |((zz & 0x00010000) >> 12)
268 |((zz & 0x00000100) >> 5)
269 |((zz & 0x00000040) >> 4)
270 |((zz & 0x00000010) >> 3)
271 |((zz & 0x00000004) >> 2);
272 if (zz == j)
273 n++;
274 }
275 printf("\nuint32_t multi_opcode_%i[%i] = {\n", j, n+1);
276 for (i=1; i<argc; i++) {
277 zz = zz0 = strtol(argv[i], NULL, 0);
278 zz = ((zz & 0x00800000) >> 16)
279 |((zz & 0x00100000) >> 14)
280 |((zz & 0x00040000) >> 13)
281 |((zz & 0x00010000) >> 12)
282 |((zz & 0x00000100) >> 5)
283 |((zz & 0x00000040) >> 4)
284 |((zz & 0x00000010) >> 3)
285 |((zz & 0x00000004) >> 2);
286 if (zz == j)
287 printf("\t0x%08x,\n", zz0);
288 }
289 printf("0 };\n");
290 }
291
292 /* Generate 256 tables with function pointers: */
293 for (j=0; j<256; j++) {
294 int n = 0, zz, zz0;
295 for (i=1; i<argc; i++) {
296 zz = strtol(argv[i], NULL, 0);
297 zz = ((zz & 0x00800000) >> 16)
298 |((zz & 0x00100000) >> 14)
299 |((zz & 0x00040000) >> 13)
300 |((zz & 0x00010000) >> 12)
301 |((zz & 0x00000100) >> 5)
302 |((zz & 0x00000040) >> 4)
303 |((zz & 0x00000010) >> 3)
304 |((zz & 0x00000004) >> 2);
305 if (zz == j)
306 n++;
307 }
308 n_used[j] = n;
309 if (n == 0)
310 continue;
311 printf("void (*multi_opcode_f_%i[%i])(struct cpu *,"
312 " struct arm_instr_call *) = {\n", j, n*16);
313 for (i=1; i<argc; i++) {
314 zz = zz0 = strtol(argv[i], NULL, 0);
315 zz = ((zz & 0x00800000) >> 16)
316 |((zz & 0x00100000) >> 14)
317 |((zz & 0x00040000) >> 13)
318 |((zz & 0x00010000) >> 12)
319 |((zz & 0x00000100) >> 5)
320 |((zz & 0x00000040) >> 4)
321 |((zz & 0x00000010) >> 3)
322 |((zz & 0x00000004) >> 2);
323 if (zz == j) {
324 printf("\tarm_instr_multi_0x%08x__eq,\n", zz0);
325 printf("\tarm_instr_multi_0x%08x__ne,\n", zz0);
326 printf("\tarm_instr_multi_0x%08x__cs,\n", zz0);
327 printf("\tarm_instr_multi_0x%08x__cc,\n", zz0);
328 printf("\tarm_instr_multi_0x%08x__mi,\n", zz0);
329 printf("\tarm_instr_multi_0x%08x__pl,\n", zz0);
330 printf("\tarm_instr_multi_0x%08x__vs,\n", zz0);
331 printf("\tarm_instr_multi_0x%08x__vc,\n", zz0);
332 printf("\tarm_instr_multi_0x%08x__hi,\n", zz0);
333 printf("\tarm_instr_multi_0x%08x__ls,\n", zz0);
334 printf("\tarm_instr_multi_0x%08x__ge,\n", zz0);
335 printf("\tarm_instr_multi_0x%08x__lt,\n", zz0);
336 printf("\tarm_instr_multi_0x%08x__gt,\n", zz0);
337 printf("\tarm_instr_multi_0x%08x__le,\n", zz0);
338 printf("\tarm_instr_multi_0x%08x,\n", zz0);
339 printf("\tarm_instr_nop,\n");
340 }
341 }
342 printf("};\n");
343 }
344
345
346 printf("\nuint32_t *multi_opcode[256] = {\n");
347 for (i=0; i<256; i++) {
348 printf(" multi_opcode_%i,", i);
349 if ((i % 4) == 0)
350 printf("\n");
351 }
352 printf("};\n");
353
354 printf("\nvoid (**multi_opcode_f[256])(struct cpu *,"
355 " struct arm_instr_call *) = {\n");
356 for (i=0; i<256; i++) {
357 if (n_used[i] > 0)
358 printf(" multi_opcode_f_%i,", i);
359 else
360 printf(" NULL,");
361 if ((i % 4) == 0)
362 printf("\n");
363 }
364 printf("};\n");
365
366 return 0;
367 }
368

  ViewVC Help
Powered by ViewVC 1.1.26