I discovered (the hard way) yesterday that some of the error paths in
execv() in sol2 don't release the execv lock.
sigh.
Patch follows, and will also appear on the web site. (Those of you who
stuck to your own code can ignore this.)
Index: sol2/kern/userprog/runprogram.c
===================================================================
RCS file: /disk/disk0/cs161/CVSREPO/os161/sol2/kern/userprog/runprogram.c,v
retrieving revision 1.14
retrieving revision 1.15
diff -u -r1.14 -r1.15
--- runprogram.c 2002/03/07 22:59:39 1.14
+++ runprogram.c 2002/03/28 22:49:28 1.15
@@ -310,17 +310,20 @@
 	/* make up argv strings */
 	if (strlen(progname) + 1 > ARG_MAX) {
+		lock_release(argdata.lock);
 		return E2BIG;
 	}
 	/* allocate the space */
 	argdata.buffer = kmalloc(strlen(progname) + 1);
 	if (argdata.buffer == NULL) {
+		lock_release(argdata.lock);
 		return ENOMEM;
 	}
 	argdata.offsets = kmalloc(sizeof(size_t));
 	if (argdata.offsets == NULL) {
 		kfree(argdata.buffer);
+		lock_release(argdata.lock);
 		return ENOMEM;
 	}
@@ -397,11 +400,13 @@
 	/* allocate the space */
 	argdata.buffer = kmalloc(ARG_MAX);
 	if (argdata.buffer == NULL) {
+		lock_release(argdata.lock);
 		return ENOMEM;
 	}
 	argdata.offsets = kmalloc(NARG_MAX * sizeof(size_t));
 	if (argdata.offsets == NULL) {
 		kfree(argdata.buffer);
+		lock_release(argdata.lock);
 		return ENOMEM;
 	}
--
- David A. Holland / dholland(a)eecs.harvard.edu
One of the groups ran across some mysterious behavior with the `tlbp'
instruction that turned out to be the result of a hardware bug.
It wasn't clearing the "probe failed" bit on a successful probe, so if
you do multiple TLB_Probe calls without an intervening TLB_Read or
TLB_Write (which happen to clear that bit as a side effect), a probe
may falsely appear to fail even though a matching entry is present.
A new System/161 release (0.98) has been issued and deployed.
This has two implications:
(1) if a mysterious problem you've been seeing mysteriously
disappears and never comes back, and could conceivably be
related to this issue, this change may be why it disappeared.
(2) if you're working at home and/or have compiled your own
System/161 for some other reason, you should download and
install the new version.
--
- David A. Holland / dholland(a)eecs.harvard.edu
A. Student wrote:
> Kind of a last-minute discovery, but it might save someone from frantic
> bug-hunting anyway: there's a bug in malloctest.c, test 2. When it attempts
> to allocate a second block to ensure that the memory from the first has
> been correctly freed, it never frees that second block, so you end up very
> quickly leaking away most of your memory, until eventually it can't
> allocate any, at which point you have an infinite loop of "0 bytes: failed".
> If you only run test 2 once and don't run any other tests, you won't see
> the leak, but as soon as you start trying to run multiple tests it shows up.
Oops....
Everybody take note. :-|
--
- David A. Holland / dholland(a)eecs.harvard.edu
I respectfully offer up that it might be a good idea to test your code
with many different memory sizes. Some appropriate ones might include:
#31 busctl ramsize=524288
#31 busctl ramsize=1048576
#31 busctl ramsize=2097152
#31 busctl ramsize=4194304
#31 busctl ramsize=16777216
You might find yours works peachy for 512K, but is unhappy for other
sizes.
(Also, make sure your final test is with random autoseed.)
good luck,
-mike
Today's 4-5 section is cancelled. Instead, I am meeting with each group
that is sectioned for this time individually. If you wanted to attend
today's 4-5 PM section because you cannot attend the one you have signed
up for, you can arrange to meet with me. I am free today until 4 PM and
after 7 PM.
-- Sasha
Someone found a race condition in thread_exit in the base OS/161
system: if you get a timer interrupt at the wrong time, it may end up
calling as_activate on a stale address space pointer.
The quick fix is to move the splhigh() up before the call to
as_destroy, like in the enclosed patch.
(A better fix is to store the address in a temporary and set
curthread->vmspace to NULL before calling as_destroy.)
Index: src/kern/thread/thread.c
===================================================================
RCS file: /disk/disk0/cs161/CVSREPO/os161/src/kern/thread/thread.c,v
retrieving revision 1.22
diff -U6 -r1.22 thread.c
--- thread.c 2002/03/05 21:58:22 1.22
+++ thread.c 2002/03/17 18:27:27
@@ -438,23 +438,24 @@
 		assert(curthread->stack[0] == (char)0xae);
 		assert(curthread->stack[1] == (char)0x11);
 		assert(curthread->stack[2] == (char)0xda);
 		assert(curthread->stack[3] == (char)0x33);
 	}
+	splhigh();
+
 	if (curthread->vmspace) {
 		as_destroy(curthread->vmspace);
 		curthread->vmspace = NULL;
 	}
 	if (curthread->cwd) {
 		VOP_DECREF(curthread->cwd);
 		curthread->cwd = NULL;
 	}
-	splhigh();
 	assert(numthreads>0);
 	numthreads--;
 	mi_switch(S_ZOMB);
 	panic("Thread came back from the dead!\n");
 }
--
- David A. Holland / dholland(a)eecs.harvard.edu
It appears that the "sequential" argument to free_kpages, as called by
the kmalloc code, can have one of two values:
1, meaning the address being freed is a single page kmalloc was
using for small allocations; or
2, meaning the address being freed is either a large allocation
done with alloc_kpages, or a completely invalid free attempt.
In the first case the block being freed should be exactly one page
long; in the second case, however, it may also be exactly one page
long (or it may be longer).
It is thus questionable whether this argument provides any useful data
at all; you may find it simplest to ignore it.
In any event, I apologize for the undocumented and icky use of magic
signalling values.
--
- David A. Holland / dholland(a)eecs.harvard.edu
The solution set code for assignment 2 has been released; it can be
fetched (either as a complete tree or as a diff against sol1) from the
assignments page.
We apologize for the delay.
--
- David A. Holland / dholland(a)eecs.harvard.edu
> Vahalia page 23 describes UNIX as being:
> "... re-entrant, meaning that several processes may
> be involved in kernel activity concurrently."
>
> is os161 kernel designed to be similarly
> "re-entrant"?
Yes.
> if so, where is the code to handle kernel stacks, and
> is it written yet or do we (eventually) provide it?
Look at the trap code.
- M