racket

Author	SHA1	Message	Date
Matthew Flatt	c4ffe39efb	fix leak related to object counts When collecting to the maximum generation with object counts enabled, a structure type would effectively become permanently reachable. Also, add `bytes-finalized` to report how many bytes were associated with guardian-based finalization by the most recent collection. original commit: 852f5e2de95a26d3500321c4d4d732407945a57a	2020-04-16 16:16:13 -06:00
Matthew Flatt	63baf24ad5	repairs for locking Fix clearing of locked-object information and copying adjacent pairs. original commit: 53d092c50c1c24017c52b6e002e6073b81747e09	2020-04-04 16:05:20 -06:00
Matthew Flatt	afebbdd6a9	convert GC to "mkgc.ss" implementation Replace repetitive C code in "gc.c" and "vfasl.c" with an implementation using a little "Parenthe-C" language, which is a somewhat declarative description of object tracing. From that descrition, we generate different kinds of tracing functions, such as the copy function or the sweep function. The little language is still bascially C, just with parentheses and parameterization that is much better than trying to use the C preprocessor. (The "mkgc.ss" file includes the compiler from Parenthe-C to C.) Besides replacing existing code, we also generate a new traversal to implement `compute-object-sizes`. Finally, the GC can now perform a fused `collect` and `compute-object-sizes` in a single traversal. Also improve the way that locked objects are detected during GC. This can make a significant difference (on the order of 10-20% for a full collection) when locked objects are long-lived. original commit: de1f5c41d729ac75822a1f1e633ec6d042c883dc	2020-04-04 10:21:16 -06:00
Matthew Flatt	f828cb1eaa	fix emphemeron-key tracking in a segment with locked objects original commit: 9d1252b176e972f92030599dae0ce159c9d36c5b	2020-04-01 07:53:32 -06:00
Matthew Flatt	995e53ca71	Merge github.com:cisco/ChezScheme original commit: 8cf52012e2a7b5928cb2602bb17e0128ae0f2776	2020-02-22 15:18:47 -07:00
dybvig	d0b405ac8b	library-manager, numeric, and bytevector-compres improvements - added invoke-library syntax.ss, primdata.ss, 8.ms, root-experr, libraries.stex, release_notes.stex - updated the date release_notes.stex - libraries contained within a whole program or library are now marked pending before their invoke code is run so that invoke cycles are reported as such rather than as attempts to invoke while still loading. compile.ss, syntax.ss, primdata.ss, 7.ms, root-experr - the library manager now protects against unbound references from separately compiled libraries or programs to identifiers ostensibly but not actually exported by (invisible) libraries that exist only locally within a whole program. this is done by marking the invisibility of the library in the library-info and propagating it to libdesc records; the latter is checked upon library import, visit, and invoke as well as by verify-loadability. the import and visit code of each invisible no longer complains about invisibility since it shouldn't be reachable. syntax.ss, compile.ss, expand-lang.ss, 7.ms, 8.ms, root-experr, patch - documented that compile-whole-xxx's linearization of the library initialization code based on static dependencies might not work for dynamic dependencies. system.stex - optimized bignum right shifts so the code (1) doesn't look at shifted-off bigits if the bignum is positive, since it doesn't need to know in that case if any bits are set; (2) doesn't look at shifted-off bigits if the bignum is negative if it determines that at least one bit is set in the bits shifted off the low-order partially retained bigit; (3) quits looking, if it must look, for one bits as soon as it finds one; (4) looks from both ends under the assumption that set bits, if any, are most likely to be found toward the high or low end of the bignum rather than just in the middle; and (5) doesn't copy the retained bigits and then shift; rather shifts as it copies. This leads to dramatic improvements when the shift count is large and often significant improvements otherwise. number.c, 5_3.ms, release_notes.stex - threaded tc argument through to all calls to S_bignum and S_trunc_rem so they don't have to call get_thread_context() when it might already have been called. alloc.c, number.c, fasl.c, print.c, prim5.c, externs.h - added an expand-primitive handler to partially inline integer?. cpnanopass.ss - added some special cases for basic arithmetic operations (+, -, , /, quotient, remainder, and the div/div0/mod/mod0 operations) to avoid doing unnecessary work for large bignums when the result will be zero (e.g,. multiplying by 0), the same as one of the inputs (e.g., adding 0 or multiplying by 1), or the additive inverse of one of the inputs (e.g., subtracting from 0, dividing by -1). This can have a major beneficial affect when operating on large bignums in the cases handled. also converted some uses of / into integer/ where going through the former would just add overhead without the possibility of optimization. 5_3.ss, number.c, externs.h, prim5.c, 5_3.ms, root-experr, patch, release_notes.stex - added a queue to hold pending signals for which handlers have been registered via register-signal-handler so up to 63 (configurable in the source code) unhandled signals are buffered before the handler has to start dropping them. cmacros.ss, library.ss, prims.ss, primdata.ss, schsig.c, externs.h, prim5.c, thread.c, gc.c, unix.ms, system.stex, release_notes.stex - bytevector-compress now selects the level of compression based on the compress-level parameter. Prior to this it always used a default setting for compression. the compress-level parameter can now take on the new minimum in addition to low, medium, high, and maximum. minimum is presently treated the same as low except in the case of lz4 bytevector compression, where it results in the use of LZ4_compress_default rather than the slower but more effective LZ4_compress_HC. cmacros,ss, back.ss, compress_io.c, new_io.c, externs.h, bytevector.ms, mats/Mf-base, root-experr* io.stex, objects.stex, release_notes.stex original commit: 72d90e4c67849908da900d0b6249a1dedb5f8c7f	2020-02-21 13:48:47 -08:00
Matthew Flatt	26ff90e8e6	more compact return points for function calls In the general form of a function call, the return point embeds 4 words of information: offset to the start of the enclosing function, frame size, live-veriable mask, and multiple-value return address. In the common case, however, the multiple-value return address is either the same as the return address or it is a `values-error` library function, and the frame size and live-variable mask fit into a word with bits to spare. This patch implements a more compact return point for that common case, which shrinks the 4 words to 2 and also avoids a relocation (= 1 more word). Multiple-value returns are more complex with this change (i.e., require more code), since they must check whether the return point is compact or not. But multiple-value returns are far less common than function calls, so saving function-call space is a clear win. Overall, this change tends to reduce code size by about 10% on x86_64. original commit: 1f53b5eabef966db01086cb32e544bbf8deacfca	2020-01-24 19:19:32 -07:00
Matthew Flatt	81ea967aea	add stencil vectors and fxpopcount original commit: ec766fca869b5e0407c4f54230b72619af73b40b	2020-01-06 05:34:28 -07:00
Oscar Waddell	05ced37f45	fix typo original commit: 243cd029bb19ce555dac4012e6e20c5673143b64	2019-10-26 17:30:53 -04:00
Matthew Flatt	174c416f9e	repair for opportunistic 1-shot If normal 1-shot continuations are mixed with opportunistic 1-shot continuations created by `call-setting-continuation-attachment`, then promoting an opportunistic 1-shot at a GC is wrong unless the whole chain is promoted. original commit: 2dfac475666763b60935e382386af4438f3029e0	2019-09-24 11:41:50 -06:00
dybvig	7d145e37a8	Various enhancements and fixes highlighted by profiling performance and functionality improvements (including support for measuring coverage), primitive argument-checking fixes, and object-file changes resulting in reduced load times (and some backward incompatibility): - annotations are now preserved in object files for debug only, for profiling only, for both, or not at all, depending on the settings of generate-inspector-information and compile-profile. in particular, when inspector information is not enabled but profiling is, source information does not leak into error messages and inspector output, though it is still available via the profile tools. The mechanics of this involved repurposing the fasl a? parameter to hold an annotation flags value when it is not #f and remaking annotations with new flags if necessary before emitting them. compile.ss, fasl.ss, misc.ms - altered a number of mats to produce correct results even when the 's' directory is profiled. misc.ms, cp0.ms, record.ms - profile-release-counters is now generation-friendly; that is, it doesn't look for dropped code objects in generations that have not been collected since the last call to profile-release-counters. also, it no longer allocates memory when it releases counters. pdhtml.ss, gc.c, gcwrapper.c, globals.h, prim5.c - removed unused entry points S_ifile, S_ofile, and S_iofile alloc.c, externs.h - mats that test loading profile info into the compiler's database to guide optimization now weed out preexisting entries, in case the 's' directory is profiled. 4.ms, mat.ss, misc.ms, primvars.ms - counters for dropped code objects are now released at the start of each mat group. mat.ss - replaced ehc (enable-heap-check) option with hci (heap-check-interval) option that allows heap checks to be performed periodically rather than on each collection. hci=0 is equivalent to ehc=f (disabling heap checks) and hci=1 is equivalent to ehc=t (enabling heap checks every collection), while hci=100 enables heap checks only every 100th collection. allx and bullyx mats use this feature to reduce heap-checking overhead to a more reasonable level. this is particularly important when the 's' directory is profiled, since the amount of static memory to be checked is greatly increased due to the counters. mats/Mf-base, mat.ss, primvars.ms - added a mat that calls #%show-allocation, which was otherwise not being tested. misc.ms - removed a broken primvars mat and updated two others. in each case, the mat was looking for information about primitives in the wrong (i.e., old) place and silently succeeding when it didn't find any primitives to tests. the revised mats (along with a few others) now check to make sure at least one identifier has the information they look for. the removed mat was checking for library information that is now compiled in, so the mat is now unnecessary. the others were (not) doing argument-error checks. fixing these turned up a handful of problems that have also been fixed: a couple of unbound variables in the mat driver, two broken primdata declarations, a tardy argument check by profile-load-data, and a bug in char-ready?, which was requiring an argument rather than defaulting it to the current input port. primdata.ss, pdhtml.ss, io.ms, primdvars.ms, 4.ms, 6.ms, misc.ms, patch* - added initial support for recording coverage information. when the new parameter generate-covin-files is set, the compiler generates .covin files containing the universe of all source objects for which profile forms are present in the expander output. when profiling and generation of covin files are enabled in the 's' directory, the mats optionally generate .covout files for each mat file giving the subset of the universe covered by the mat file, along with an all.covout in each mat output directory aggregating the coverage for the directory and another all.covout in the top-level mat directory aggregating the coverage for all directories. back.ss, compile.ss, cprep.ss, primdata.ss, s/Mf-base, mat.ss, mats/Mf-base, mats/primvars.ms - support for generating covout files is now built in. with-coverage-output gathers and dumps coverage information, and aggregate-coverage-output combines (aggregates) covout files. pdhtml.ss, primdata.ss, compile.ss, mat.ss, mats/Mf-base, primvars.ms - profile-clear now adjusts active coverage trackers to avoid losing coverage information. pdhtml.ss, prim5.c - nested with-coverage calls are now supported. pdhtml.ss - switched to a more compact representation for covin and covout files; reduces disk space (compressed or not) by about a factor of four and read time by about a factor of two with no increase in write time. primdata.ss, pdhtml.ss, cprep.ss, compile.ss, mat.ss, mats/Mf-base - added support for determining coverage for an entire run, including coverage for expressions hit during boot time. 'all' mats now produce run.covout files in each output directory, and 'allx' mats produce an aggregate run.covout file in the mat directory. pdhtml.ss, mat.ss, mats/Mf-base - profile-release-counters now adjusts active coverage trackers to account for the counters that have been released. pdhtml.ss, prim5.c - replaced the artificial "examples" target with a real "build-examples" target so make won't think it always has to mats that depend upon the examples directory having been compiled. mats make clean now runs make clean in the examples directory. mats/Mf-base importing a library from an object file now just visits the object file rather than doing a full load so that the run-time code for the library is not retained. The run-time code is still read because the current fasl format forces the entire file to be read, but not retaining the code can lower heap size and garbage-collection cost, particularly when many object-code libraries are imported. The downside is that the file must be revisited if the run-time code turns out to be required. This change exposed several places where the code was failing to check if a revisit is needed. syntax.ss, 7.ms, 8.ms, misc.ms, root-experr* - fixed typos: was passing unquoted load rather than quoted load to $load-library along one path (where it is loading source code and therefore irrelevant), and was reporting src-path rather than obj-path in a message about failing to define a library. syntax.ss - compile-file and friends now put all recompile information in the first fasl object after the header so the library manager can find it without loading the entire fasl file. The library manager now does so. It also now checks to see if library object files need to be recreated before loading them rather than loading them and possibly recompiling them after discovering they are out of date, since the latter requires loading the full object file even if it's out of date, while the former takes advantage of the ability to extract just recompile information. as well as reducing overhead, this eliminates possibly undesirable side effects, such as creation and registration of out-of-date nongenerative record-type descriptors. because the library manager expects to find recompile information at the front of an object file, it will not find all recompile information if object files are "catted" together. also, compile-file has to hold in memory the object code for all expressions in the file so that it can emit the unified recompile information, rather than writing to the object file incrementally, which can significantly increase the memory required to compile a large file full of individual top-level forms. This does not affect top-level programs, which were already handled as a whole, or a typical library file that contains just a single library form. compile.ss, syntax.ss - the library manager now checks include files before library dependencies when compile-imported-libraries is false (as it already did when compile-imported-libraries is true) in case a source change affects the set of imported libraries. (A library change can affect the set of include files as well, but checking dependencies before include files can cause unneeded libraries to be loaded.) The include-file check is based on recompile-info rather than dependencies, but the library checks are still based on dependencies. syntax.ss - fixed check for binding of scheme-version. (the check prevents premature treatment of recompile-info records as Lexpand forms to be passed to $interpret-backend.) scheme.c - strip-fasl-file now preserves recompile-info when compile-time info is stripped. strip.ss - removed include-req* from library/ct-info and ctdesc records; it is no longer needed now that all recompile information is maintained separately. expand-lang.ss, syntax.ss, compile.ss, cprep.ss, syntax.ss - changed the fasl format and reworked a lot of code in the expander, compiler, fasl writer, and fasl reader to allow the fasl reader to skip past run-time information when it isn't needed and compile-time information when it isn't needed. Skipping past still involves reading and decoding when encrypted, but the fasl reader no longer parses or allocates code and data in the portions to be skipped. Side effects of associating record uids with rtds are also avoided, as are the side effects of interning symbols present only in the skipped data. Skipping past code objects also reduces or eliminates the need to synchronize data and instruction caches. Since the fasl reader no longer returns compile-time (visit) or run-time (revisit) code and data when not needed, the fasl reader no longer wraps these objects in a pair with a 0 or 1 visit or revisit marker. To support this change, the fasl writer generates separate top-level fasl entries (and graphs) for separate forms in the same top-level source form (e.g., begin or library). This reliably breaks eq-ness of shared structure across these forms, which was previously broken only when visit or revisit code was loaded at different times (this is an incompatible change). Because of the change, fasl "groups" are no longer needed, so they are no longer handled. 7.ss, cmacros.ss, compile.ss, expand-lang.ss, strip.ss, externs.h, fasl.c, scheme.c, hash.ms - the change above is surfaced in an optional fasl-read "situation" argument (visit, revisit, or load). The default is load. visit causes it to skip past revisit code and data; revisit causes it to skip past visit code and data; and load causes it not to skip past either. visit-revisit data produced by (eval-when (visit revisit) ---) is never skipped. 7.ss, primdata.ss, io.stex - to improve compile-time and run-time error checking, the Lexpand recompile-info, library/rt-info, library-ct-info, and program-info forms have been replaced with list-structured forms, e.g., (recompile-info ,rcinfo). expand-lang.ss, compile.ss, cprep.ss, interpret.ss, syntax.ss - added visit-compiled-from-port and revisit-compiled-from-port to complement the existing load-compiled-from-port. 7.ss, primdata.ss, 7.ms, system.stex - increased amount read when seeking an lz4-encrypted input file from 32 to 1024 bytes at a time compress-io.c - replaced the fasl a? parameter value #t with an "all" flag value so it's value is consistently a mask. cmacros.ss, fasl.ss, compile.ss - split off profile mats into a separate file misc.ms, profile.ms (new), root-experr, mats/Mf-base - added coverage percent computations to mat allx/bullyx output mat.ss, mats/Mf-base, primvars.ms - replaced coverage tables with more generic and generally useful source tables, which map source objects to arbitrary values. pdhtml.ss, compile.ss, cprep.ss, primdata.ss, mat.ss, mats/Mf-base, primvars.ms, profile.ms, syntax.stex - reduced profile counting overhead by using calls to fold-left instead of calls to apply and map and by using fixnum operations for profile counts on 64-bit machines. pdhtml.ss - used a critical section to fix a race condition in the calculations of profile counts that sometimes resulted in bogus (including negative) counts, especially when the 's' directory is profiled. pdhtml.ss - added discard flag to declaration for hashtable-size primdata.ss - redesigned the printed representation of source tables and rewrote get-source-table! to read and store incrementally to reduce memory overhead. compile.ss - added generate-covin-files to the set of parameters preserved by compile-file, etc. compile.ss, system.stex - moved covop argument before the undocumented machine and hostop arguments to compile-port and compile-to-port. removed the undocumented ofn argument from compile-to-port; using (port-name ip) instead. compile.ss, primdata.ss, 7.ms, system.stex - compile-port now tries to come up with a file position to supply to make-read, which it can do if the port's positions are character positions (presently string ports) or if the port is positioned at zero. compile.ss - audited the argument-type-error fuzz mat exceptions and fixed a host of problems this turned up (entries follow). added #f as an invalid argument for every type for which #f is indeed invalid to catch places where the maybe- prefix was missing on the argument type. the mat tries hard to determine if the condition raised (if any) as the result of an invalid argument is appropriate and redirects the remainder to the mat-output (.mo) file prefixed with 'Expected error', causing them to show up in the expected error output so developers will be encouraged to audit them in the future. primvars.ms, mat.ss - added an initial symbol? test on machine type names so we produce an invalid machine type error message rather than something confusing like "machine type #f is not supported". compile.ss - fixed declarations for many primitives that were specified as accepting arguments of more general types than they actually accept, such as number -> real for various numeric operations, symbol -> endianness for various bytevector operations, time -> time-utc for time-utc->date, and list -> list-of-string-pairs for default-library-search-handler. also replaced some of the sub-xxxx types with specific types such as sub-symbol -> endianness in utf16->string, but only where they were causing issues with the primvars argument-type-error fuzz mat. (this should be done more generally.) primdata.ss - fixed incorrect who arguments (was map instead of fold-right, current-date instead of time-utc->date); switched to using define-who/set-who! generally. 4.ss, date.ss - append! now checks all arguments before any mutation 5_2.ss - with-source-path now properly supplies itself as who for the string? argument check; callers like load now do their own checks. 7.ss - added missing integer? check to $fold-bytevector-native-ref whose lack could have resulted in a compile-time error. cp0.ss - fixed typo in output-port-buffer-mode error message io.ss - fixed who argument (was fx< rather than fx<?) library.ss - fixed declaration of first source-file-descriptor argument (was sfd, now string) primdata.ss - added missing article 'a' in a few error messages prims.ss - fixed the copy-environment argument-type error message for the list of symbols argument. syntax.ss - the environment procedure now catches exceptions that occur and reraises the exception with itself as who if the condition isn't already a who condition. syntax.ss - updated experr and allx patch files for changes to argument-count fuzz mat and fixes for problems turned up by them. root-experr, patch* - fixed a couple of issues setting port sizes: string and bytevector output port put handlers don't need room to store the character or byte, so they now set the size to the buffer length rather than one less. binary-file-port-clear-output now sets the index rather than size to zero; setting the size to zero is inappropriate for some types of ports and could result in loss of buffering and even suppression of future output. removed a couple of redundant sets of the size that occur immediately after setting the buffer. io.ss - it is now possible to return from a call to with-profile-tracker multiple times and not double-count (or worse) any counts. pdhtml.ss, profile.ms - read-token now requires a file position when it is handed a source-file descriptor (since the source-file descriptor isn't otherwise useful), and the source-file descriptor argument can no longer be #f. the input file position plays the same role as the input file position in get-datum/annotations. these extra read-token arguments are now documented. read.ss, 6.ms, io.stex - the source-file descriptor argument to get-datum/annotations can no longer be #f. it was already documented that way. read.ss - read-token and do-read now look for the character-positions port flag before asking if the port has port-position, since the latter is slightly more expensive. read.ss - rd-error now reports the current port position if it can be determined when fp isn't already set, i.e., when reading from a port without character positions (presently any non string port) and fp has not been passed in explicitly (to read-token or get-datum/annotations). the port position might not be a character position, but it should be better than nothing. read.ss - added comment noting an invariant for s_profile_release_counters. prim5.c - restored accidentally dropped fasl-write formdef and dropped duplicate fasl-read formdef io.stex - added a 'coverage' target that tests the coverage of the Scheme-code portions of Chez Scheme by the mats. Makefile.in, Makefile-workarea.in - added .PHONY declarations for all of the targets in the top-level and workarea make files, and renamed the create-bintar, create-rpm, and create-pkg targets bintar, rpm, and pkg. Makefile.in, Makefile-workarea.in - added missing --retain-static-relocation command-line argument and updated the date scheme.1.in - removed a few redundant conditional variable settings configure - fixed declaration of condition wait (timeout -> maybe-timeout) primdata.ss original commit: 88501743001393fa82e89c90da9185fc0086fbcb	2019-09-21 15:37:29 -07:00
Matthew Flatt	b842a134fd	continuation-attachment performance Add a shortcut check when refiying the continuation frame in tail position, which is significantly cheaper when the frame is already there. We pay down the check by skipping an attachment-lists check that is not needed if the frame is newly reified. Aslo, add a one-shot continuation-frame cache, which makes a shallow temporary attachment cheaper, as in (let loop ([i N]) (if (zero? i) 0 (loop (call-setting-continuation-attachment i (lambda () (f (sub1 i))))))) The cache is just one frame. Keeping a chain of allocated-by-not-GCed frames doesn't pay off. Meanwhile, remove the leftover `$shift-attachment` library entry. original commit: 1f454f536b1d7efe20fe9e793cda31e54e31e5f4	2019-09-11 09:34:42 -06:00
Matthew Flatt	502b0b5f50	repair for locked-object handling and multiply-locked values Weak pairs, ephemeron pairs, some symbols, and some ports were handled incorerctly when locked multiple times. original commit: 847fc1c84496f67cd363c8411d0023339f4d6246	2019-09-01 08:57:14 -06:00
Matthew Flatt	2f4d59de0f	remove unused binding original commit: a4732d58666d80e78af5e1cde4c796d3eeae20e7	2019-09-01 07:13:23 -06:00
Matthew Flatt	c195288251	scalable object locking The `unlock-object` operation was O(N) with N currently locked objects --- so, O(N^2) to lock N objects and then unlock them --- because locked objects were stored in and searched in a global list. Also, GC was O(N) at any generation with N locked objects across generations, since every locked object was scanned. Fix these poblems so that locking and unlocking is practically O(1) and GC is not poportional to locked objects. More precisely, locking and unlocking is now O(C) for locking an individual object C times to be balanced by C unlocks. (Since multiple locks on a single object is rare, this performance seems good enough.) The implementation replaces the global list with segment-specific lists. Backpointers are managed using the general generational support, so that unmodified, old-generation locked objects do not need to be swept duing a new-generation collection. original commit: a57d256ca73a3d507792c471facb7e35afbe88b3	2019-09-01 07:03:16 -06:00
Matthew Flatt	2cf27c4727	Merge github.com:cisco/ChezScheme original commit: 8118200e237d756f83be54e8bf3eabb4af2388ed	2019-05-22 10:46:59 -06:00
dyb	82b2cda639	compress-level parameter, improvement in lz4 compression, and various other related improvements - added compress-level parameter to select a compression level for file writing and changed the default for lz4 compression to do a better job compressing. finished splitting glz input routines apart from glz output routines and did a bit of other restructuring. removed gzxfile struct-as-bytevector wrapper and moved its fd into glzFile. moved DEACTIVATE to before glzdopen_input calls in S_new_open_input_fd and S_compress_input_fd, since glzdopen_input reads from the file and could block. the compress format and now level are now recorded directly the thread context. replaced as-gz? flag bit in compressed bytevector header word with a small number of bits recording the compression format at the bottom of the header word. flushed a couple of bytevector compression mats that depended on the old representation. (these last few changes should make adding new compression formats easier.) added s-directory build options to choose whether to compress and, if so, the format and level. compress-io.h, compress-io.c, new-io.c, equates.h, system.h, scheme.c, gc.c, io.ss, cmacros.ss, back.ss, bytevector.ss, primdata.ss, s/Mf-base, io.ms, mat.ss, bytevector.ms, root-experr, release_notes.stex, io.stex, system.stex, objects.stex - improved the effectiveness of LZ4 boot-file compression to within 15% of gzip by increasing the lz4 output-port in_buffer size to 1<<18. With the previous size (1<<14) LZ4-compressed boot files were about 50% larger. set the lz4 input-port in_buffer and out_buffer sizes to 1<<12 and 1<<14. there's no clear win at present for larger input-port buffer sizes. compress-io.c - To reduce the memory hit for the increased output-port in_buffer size and the corresponding increase in computed out_buffer size, one output-side out_buffer is now allocated (lazily) per thread and stored in the thread context. The other buffers are now directly a part of the lz4File_out and lz4File_in structures rather than allocated separately. compress-io.c, scheme.c, gc.c, cmacros.ss - split out the buffer emit code from glzwrite_lz4 into a separate glzemit_lz4 helper that is now also used by gzclose so we can avoid dealing with a NULL buffer in glzwrite_lz4. glzwrite_lz4 also uses it to writing large buffers directly and avoid the memcpy. compress-io.c - replaced lz4File_out and lz4File_in mode enumeration with the compress format and inputp boolean. using switch to check and raising exceptions for unexpected values to further simplify adding new compression formats in the future. compress-io.c - replaced the never-defined struct lz4File pointer in glzFile union with the more specific struct lz4File_in_r and Lz4File_out_r pointers. compress-io.h, compress-io.c - added free of lz4 structures to gzclose. also changed file-close logic generally so that (1) port is marked closed before anything is freed to avoid dangling pointers in the case of an interrupt or error, and (2) structures are freed even in the case of a write or close error, before the error is reported. also now mallocing glz and lz4 structures after possibility of errors have passed where possible and freeing them when not. compress-io.c, io.ss - added return-value checks to malloc calls and to a couple of other C-library calls. compress-io.c - corrected EINTR checks to look at errno rather than return codes. compress-io.c - added S_ prefixes to the glz exports externs.h, compress-io.c, new-io.c, scheme.c, fasl.c - added entries for mutex-name and mutex-thread threads.stex original commit: 722ffabef4c938bc92c0fe07f789a9ba350dc6c6	2019-04-18 05:47:19 -07:00
Matthew Flatt	60005f02d2	Merge github.com:cisco/ChezScheme original commit: 629f8b653ff46afa64bffa1fcfbb8e7c94dd7451	2019-02-17 18:22:55 -07:00
dyb	2daf225cab	committing a handful of changes, none of which should be particularly controversial, unless I damaged something in the process of integrating them with other recent changes. the user's guide and release notes have been updated as well to reflect the changes of interest to end users. - the body of load-library is now wrapped in a $pass-time with to show the time spent loading libraries separately from the time spent in expand. syntax.ss - interpret now plays the pass-time game interpret.ss - added compile-time-value? predicate and compile-time-value-value accessor syntax.ss, primdata.ss, 8.ms, primvars.ms, root-experr* - $pass-stats now returns accurrate stats for the currently timed pass. 7.ss - compile-whole-program and compile-whole-library now propagate recompile info from the named wpo file to the object file to support maybe-compile-program and maybe-compile-library in the case where compile-whole-{program,library} overwrites the original object file. compile.ss, 7.ms, mat.ss, primvars.ms - replaced the ancient and unusable bintar with one that creates a useful tarball for binary installs bintar - generated Mf-install InstallBin (InstallLib, InstallMan) now correctly indirects through InstallPrefix if the --installbin (--installlib, --installman) configure flag is not present. src/configure - removed definition of generate-procedure-source-information patch.ss - guardian tconc cells are now allocated in generation 0 in the hope that they can be released more quickly. gc.c - added ftype-guardian syntax: (ftype-guardian A) creates a new guardian for ftype pointers of type A, the first base field (or one of the first base fields in the case of unions) of which must be a word-sized integer with native endianness representing a reference count. ftype pointers are registered with and retrieved from the guardian just like objects are registered with and retrieved from any guardian. the difference is that the garbage collector decrements the reference count before resurrecting an ftype pointer and resurrects only those whose reference counts become zero, i.e., are ready for deallocation. ftype.ss, cp0.ss, cmacros.ss, cpnanopass.ss, prims.ss, primdata.ss, gc.c, 4.ms, root-experr* - fixed a bug in automatic recompilation handling of missing include files specified with absolute pathnames or pathnames starting with "./" or "..": was erroring out in file-modification-time with a file-not-found or other exception rather than recompiling. syntax.ss, 7.ms, root-experr, patch - changed inline vector-for-each and string-for-each code to put the last call to the procedure in tail position, as was already done for the library definitions and for the inline code for for-each. cp0.ss, 5_4.ms, 5_6.ms - the compiler now generates better inline code for the bytevector procedure. instead of one byte memory write for each argument, it writes up to 4 (32-bit machines) or 8 (64-bit machines) bytes at a time, which almost always results in fewer instructions and fewer writes. cpnanopass.ss, bytevector.ms - packaged unchanging implicit reader arguments into a single record to reduce the number of arguments. read.ss - recoded run-vector to handle zero-length vectors. it appears we're not presently generating empty vectors (representing empty groups), but the fasl format permits them. 7.ss original commit: 7be1d190de7171f74a1ee71e348d3e6310392686	2019-02-11 20:06:42 -08:00
Matthew Flatt	1baa0da991	use opportunistic 1-shot continuations for attachments An attachment continuation link can be a 1-shot continuation, but the existing 1-short continuation implementation tends to work less well than mutishot continuations. An opportunistic 1-shot continuation is like a multi-shot continuation, but if it is called from a stack that is adjacent to the continuation, then the stack is merged with the continuation's stack. original commit: ea1eb3c5192d644ad4c4cbf755bcb6fd438cc364	2019-02-08 13:59:28 -08:00
dyb	a1195b7f7e	addressed foreign-callable / boot file invalid memory reference: - fixed a bug in which instantiating a static foreign-callable code object fails with an invalid memory reference because the collector has discarded its relocation information. foreign-callable code objects are now flagged as "templates", and the collector now refuses to discard relocation information for code objects marked as templates when copying them to the static generation. cmacros.ss, cpnanopass.ss, gc.c, 7.ms - committing updated boot//equates.h (without the boot files, which are still usable for bootstrapping) boot//*.h - updated release notes release_notes.stex original commit: 71d3abba684e04b134720ea1bd9a8c847c38ac5f	2019-02-06 22:22:21 -08:00
Matthew Flatt	8070a7b910	Merge branch 'eqfl' of github.com:mflatt/ChezScheme original commit: 8b36396eacb139e0fff70efcd2c9dc842815324f	2019-01-22 05:57:17 -07:00
Matthew Flatt	21fc705234	adjust GC to preserve `eq?` on flonums original commit: d405416eb2ec6d5dd147afc7a2af5a6c2f0a8130	2019-01-22 05:24:05 -07:00
Matthew Flatt	6e999d02c3	add ordered guardians Also, avoid quadratic time in GC for guardian chains. original commit: 273f79a7be5c04370c399e6b1d8af799efc8b33f	2019-01-22 05:19:38 -07:00
Matthew Flatt	b27f3c0a94	Merge branch 'phantom' of github.com:mflatt/ChezScheme original commit: 743a56d8f1920620e8f6e14edca7984101425e14	2019-01-20 07:56:59 -07:00
Matthew Flatt	538def47de	add phantom bytevectors original commit: 001917fd98ac6a0f13ccab902e15b9d2169c4b9c	2019-01-20 07:41:09 -07:00
Matthew Flatt	6e00dab37f	bootfiles and fixup original commit: a6c7f8851fd3996726f62f62e151ff76f0216f72	2018-07-25 18:15:09 -06:00
Matthew Flatt	95d3146c16	Merge branch 'cm' of github.com:mflatt/ChezScheme original commit: 9d8e3e99e79c1a2fa2cd20849c99f05b91db70d9	2018-07-25 16:07:41 -06:00
Matthew Flatt	4b5daf4594	Merge branch 'arity-wrapper' of github.com:mflatt/ChezScheme original commit: 23102af98ccd2dacd3529dd37c182d00f1d12490	2018-07-25 16:05:17 -06:00
Matthew Flatt	f919bbcab6	add support for continuation attachments original commit: 32669b104ef1119aea21f8592cee09d55f696afa	2018-07-25 06:33:46 -06:00
Matthew Flatt	48228739fe	add `object-references` to reflect GC's tracing of objects The `object-references` function is intended to support debugging of memory leaks by providing a mapping from each live object to the object that retained it. original commit: 61f6602b7e6c388c529f3c5995dcf71a7c42e005	2018-07-16 18:08:48 -06:00
Matthew Flatt	28c8ebaeff	add make-arity-wrapper-procedure A program can use `make-arity-wrapper-procedure` to synthesize a function that reports a given arity mask (without calling `compile`). In addition, `set-arity-wrapper-procedure!` suports modifying the implementation of a synthesized procedure. Although similar functionality could be achieved with `(lambda args (apply (unbox proc) args))`, an arity wrapper procedure can dispatch to another procedure without allocating a list for the arguments. The interpreter now uses an internal variant of arity wrappers to cooperate with `procedure-arity-mask`. original commit: 5fede14302840b55edbeb7565e28d09350a4b2e9	2018-07-16 07:52:55 -06:00
Matthew Flatt	2ca43d6c6f	add ordered guardians Also, avoid quadratic time in GC for guardian chains. original commit: a07c7e14b61862989777909ee63a2ec120c2ea47	2018-07-15 19:12:43 -06:00
dybvig	f7c414bda3	Various updates, mostly to the compiler, including a new lambda commonizatio pass and support for specifying default record equal and hash procedures: - more staid and consistent Mf-cross main target Mf-cross - cpletrec now replaces the incoming prelexes with new ones so that it doesn't have to alter the flags on the incoming ones, since the same expander output is passed through the compiler twice while compiling a file with macro definitions or libraries. we were getting away without this just by luck. cpletrec.ss - pure? and ivory? now return #t for a primref only if the prim is declared to be a proc, since some non-proc prims are mutable, e.g., $active-threads and $collect-request-pending. cp0.ss - $error-handling-mode? and $eol-style? are now properly declared to be procs rather than system state variables. primdata.ss - the new pass $check-prelex-flags verifies that prelex referenced, multiply-referenced, and assigned flags are set when they should be. (it doesn't, however, complain if a flag is set when it need not be.) when the new system parameter $enable-check-prelex-flags is set, $check-prelex-flags is called after each major pass that produces Lsrc forms to verify that the flags are set correctly in the output of the pass. this parameter is unset by default but set when running the mats. cprep.ss, back.ss, compile.ss, primdata.ss, mats/Mf-base - removed the unnecessary set of prelex referenced flag from the build-ref routines when we've just established that it is set. syntax.ss, compile.ss - equivalent-expansion? now prints differences to the current output port to aid in debugging. mat.ss - the nanopass that patches calls to library globals into calls to their local counterparts during whole-program optimization now creates new prelexes and sets the prelex referenced, multiply referenced, and assigned flags on the new prelexes rather than destructively setting flags on the incoming prelexes. The only known problems this fixes are (1) the multiply referenced flag was not previously being set for cross-library calls when it should have been, resulting in overly aggressive inlining of library exports during whole-program optimization, and (2) the referenced flag could sometimes be set for library exports that aren't actually used in the final program, which could prevent some unreachable code from being eliminated. compile.ss - added support for specifying default record-equal and record-hash procedures. primdata.ss, cmacros.ss, cpnanopass.ss, prims.ss, newhash.ss, gc.c, record.ms - added missing call to relocate for subset-mode tc field, which wasn't burning us because the only valid non-false value, the symbol system, is in the static generation after the initial heap compaction. gc.c - added a lambda-commonization pass that runs after the other source optimizations, particularly inlining, and a new parameter that controls how hard it works. the value of commonization-level ranges from 0 through 9, with 0 disabling commonization and 9 maximizing it. The default value is 0 (disabled). At present, for non-zero level n, the commonizer attempts to commonize lambda expressions consisting of 2^(10-n) or more nodes. commonization of one or more lambda expressions requires that they have identical structure down to the leaf nodes for quote expressions, references to unassigned variables, and primitives. So that various downstream optimizations aren't disabled, there are some additional restrictions, the most important of which being that call-position expressions must be identical. The commonizer works by abstracting the code into a helper that takes the values of the differing leaf nodes as arguments. the name of the helper is formed by concatenating the names of the original procedures, separated by '&', and this is the name that will show up in a stack trace. The source location will be that of one of the original procedures. Profiling inhibits commonization, because commonization requires profile source locations to be identical. cpcommonize.ss (new), compile.ss, interpret.ss, cprep.ss, primdata.ss, s/Mf-base, mats/Mf-base - cpletrec now always produces a letrec rather than a let for single immutable lambda bindings, even when not recursive, for consistent expand/optimize output whether the commonizer is run or not. cpletrec.ss, record.ms - trans-make-ftype-pointer no longer generates a call to $verify-ftype-address if the address expression is a call to ftype-pointer-address. ftype.ss original commit: b6a3dcc814b64faacc9310fec4a4531fb3f18dcd	2018-01-29 09:20:07 -05:00
Matthew Flatt	9d8cc87758	add locate-source cache and line+column components to source objects Add optional beginning-line and beginning-column components to a source object, so that line and column information can be recorded independent of the file. Add `locate-source-object-source` to use the recorded information. Add a cache for `locate-source` as enabled by the `use-cache?` optional argument, which can avoid compilation times that are quadratic in the number of `let-values` or `define-values` forms. original commit: b36fab81d5041a54ce01a422395eee79d2f930bc	2017-08-01 05:23:56 -06:00
Matthew Flatt	59c772ba48	add make-ephemeron-eq-hashtable, etc. Revert the use of ephemeron pairs in weak hashtables, since the difference is visible via guardians. Add hashtable based on ephemerons (to avoid key-in-value problems) as an explicit variant. original commit: 31ac6d78592e1a9ba6bfbe802260e3d56d4cf772	2017-07-06 16:27:23 -06:00
dyb	2bc65b5d6d	check_dirty_ephemeron now puts ephemerons whose keys haven't yet been seen on the pending list rather than the trigger lists. gc.c removed scan of space_ephemeron from check_heap because check_heap as written can't handle the two link fields properly. gcwrapper.c in the ephemerons mat that checks interaction between mutation and collection, added generation arguments to the first two collect calls so they always collect into the intended generation. 4.ms updated allx and bullyx patches patch* original commit: 43b54f64949cf992e52cf18bacc2a09f4a199227	2017-05-29 20:21:01 -04:00
Matthew Flatt	da7a81e8cd	improve some function names, comments, and declarations original commit: 795c391b8417d6aec3d7888e292efbac415029f7	2017-05-24 09:38:59 -06:00
Matthew Flatt	28f98ebc0b	fix typo in comment original commit: 001603fdf9c171e36d620999d5e4760ab333f119	2017-05-24 09:38:59 -06:00
Matthew Flatt	0d5340c061	fix interaction of ephemerons and generations; use for weak hashtables original commit: 6f7147e505aae5c2b9139eea6df8a9c25a35289d	2017-05-24 09:38:24 -06:00
Matthew Flatt	18cdcd977e	add ephemerons original commit: 8a09c2c3f032e6e30b1ef393d2334963aa70507e	2017-05-24 09:38:24 -06:00
Bob Burger	831ea8ad18	changed copyright year to 2017 7.ss, scheme.1.in, comments of many files original commit: 06f858f9a505b9d6fb6ca1ac97234927cb2dc641	2017-04-06 11:41:33 -04:00
Kent Dybvig	c503362914	- various tweaks to the immutable object support; also taught cp0 to simplify ($fxu< (most-positive-fixnum) e) => (fx< e 0) so we don't have any incentive in special casing length checks where the maximum length happens to be (most-positive-fixnum). 5_4.ss, 5_6.ss, bytevector.ss, cmacros.ss, cp0.ss, cpnanopass.ss, mkheader.ss, primdata.ss, prims.ss, fasl.c, gc.c, types.h root-experr, patch original commit: 9eb63deda025fd4560b54746b21a881c01af46d6	2017-03-15 14:49:58 -04:00
Kent Dybvig	9cd0199a39	merge @mflatt immutable-vector, immutable-string, immutable-bytevector, immutable-fxvector, and immutable-box support original commit: 547fce9b99c6566d5cb3f7af9ca84654e798486e	2017-03-15 11:09:57 -04:00
Kent Dybvig	9a16156574	eliminated some direct assumptions that a vector's type/length field is a fixnum and added meta-asserts to verify that it is in a couple of others, to facilitate future changes to vector typing. vectors are now treated essentially like fxvectors, strings, and bytevectors. cmacros.ss, cpnanopass.ss, prims.ss, mkheader.ss, alloc.c, gc.c, scheme.c original commit: 564542d32bbae6b33cef808613238d5a4a2a8ee2	2017-03-12 23:54:38 -04:00
Matthew Flatt	21fe925d06	add procedure-arity-mask original commit: 4bd061000ab903feb3fe8e3b96ecbcb10c59dba9	2017-02-22 07:16:53 -07:00
Bob Burger	0d0e876fb7	fixed a couple typos in comments original commit: 9e2347eeb2bd57b35f96f0f1938ef84d624ed6a4	2016-06-23 16:43:39 -04:00
dyb	1356af91b3	initial upload of open-source release original commit: 47a210c15c63ba9677852269447bd2f2598b51fe	2016-04-26 10:04:54 -04:00

48 Commits