cosmopolitan

mirror of https://github.com/jart/cosmopolitan.git synced 2025-10-04 05:31:02 +00:00

Author	SHA1	Message	Date
Justine Tunney	67b19ae733	Release Cosmopolitan v3.5.0	2024-06-23 22:45:14 -07:00
Justine Tunney	c4c812c154	Introduce ctl::set and ctl::map We now have a C++ red-black tree implementation that implements standard template library compatible APIs while compiling 10x faster than libcxx. It's not as beautiful as the red-black tree implementation in Plinko but this will get the job done and the test proves it upholds all invariants This change also restores CheckForMemoryLeaks() support and fixes a real actual bug I discovered with Doug Lea's dlmalloc_inspect_all() function.	2024-06-23 22:27:11 -07:00
Justine Tunney	388e236360	Revert misguided dlmalloc optimization	2024-06-22 09:55:02 -07:00
Justine Tunney	f2c8ddbbe3	Fix --strace use-after-free in pthread_join()	2024-06-22 06:05:52 -07:00
Justine Tunney	d1d4388201	Delete ASAN It hasn't been helpful enough to be justify the maintenance burden. What actually does help is mprotect(), kprintf(), --ftrace and --strace which can always be counted upon to work correctly. We aren't losing much with this change. Support for ASAN on AARCH64 was never implemented. Applying ASAN to the core libc runtimes was disabled many months ago. If there is some way to have an ASAN runtime for user programs that is less invasive we can potentially consider reintroducing support. But now is premature.	2024-06-22 05:45:49 -07:00
Justine Tunney	6ffed14b9c	Rewrite memory manager Actually Portable Executable now supports Android. Cosmo's old mmap code required a 47 bit address space. The new implementation is very agnostic and supports both smaller address spaces (e.g. embedded) and even modern 56-bit PML5T paging for x86 which finally came true on Zen4 Threadripper Cosmopolitan no longer requires UNIX systems to observe the Windows 64kb granularity; i.e. sysconf(_SC_PAGE_SIZE) will now report the host native page size. This fixes a longstanding POSIX conformance issue, concerning file mappings that overlap the end of file. Other aspects of conformance have been improved too, such as the subtleties of address assignment and and the various subtleties surrounding MAP_FIXED and MAP_FIXED_NOREPLACE On Windows, mappings larger than 100 megabytes won't be broken down into thousands of independent 64kb mappings. Support for MAP_STACK is removed by this change; please use NewCosmoStack() instead. Stack overflow avoidance is now being implemented using the POSIX thread APIs. Please use GetStackBottom() and GetStackAddr(), instead of the old error-prone GetStackAddr() and HaveStackMemory() APIs which are removed.	2024-06-22 05:45:11 -07:00
Justine Tunney	7f6d0b8709	Support -x LANG flag in cosmocc	2024-06-20 23:55:46 -07:00
Steven Dee (Jōshin)	d7b1919b29	ctl::unique_ptr improvements and cleanup (#1221 ) Explicitly value-initializes the deleter, even though I have not found a way to get the deleter to act like it’s been default-initialized in unit tests so far. Uses auto in reset. The static cast is apparently not needed (unless I’m missing some case I didn’t think of.) Implements the general move constructor - turns out that the reason this didn’t work before was that default_delete<U> was not move constructible from default_delete<T>. Drop inline specifiers from functions defined entirely inside the struct definition since they are implicitly inline. * Cleans up reset to match spec Remove the variants from the T[] specialization. Also follow the spec on the order of operations in reset, which may matter if we are deleting an object that has a reference to the unique_ptr that is being reset. (?) * Tests Base/Derived reset. * Adds some constexpr declarations. * Adds default_delete specialization for T[]. * Makes parameters const.	2024-06-20 18:44:31 -04:00
Steven Dee (Jōshin)	f86e6f8eb0	Make new.cc definitions weak (#1233 ) The STL says that these should be replaceable by user code. new.cc now defines only a few direct functions (including a free wrapper that perplexingly is needed since g++ didn’t want to alias "free".) Now, all of the operators are weak references to those functions.	2024-06-20 15:20:54 -04:00
Steven Dee (Jōshin)	7e780e57d4	More ctl::string optimization (#1232 ) Moves some isbig checks into string.h, enabling smarter optimizations to be made on small strings. Also we no longer zero out our string prior to calling the various constructors, buying back the performance we lost on big strings when we made the small-string optimization. We further add a little optimization to the big_string copy constructor: if the string is using half or more of its capacity, then we don’t recompute capacity and just take the old string’s. As well, the copy constructor always makes a small string when it will fit, even if copied from a big string that got truncated. This also reworks the test to follow the idiom adopted elsewhere re stl, and adds a helper function to tell if a string is small based on data().	2024-06-20 14:52:12 -04:00
Steven Dee (Jōshin)	9a5a13854d	CTL: utility.h, use ctl::swap in string (#1227 ) * Add ctl utility.h Implements forward, move, swap, and declval. This commit also adds a def for nullptr_t to cxx.inc. We need it now because the CTL headers stopped including anything from libc++, so we no longer get their basic types. * Use ctl::swap in string The STL spec says that swap is located in the string_view header anyawy. Performance-wise this is a noop, but it’s slightly cleaner.	2024-06-19 01:00:59 -04:00
Steven Dee (Jōshin)	a795017416	Fix c.inc _Atomic define for C++ (#1231 ) c.inc (AFAICT erroneously) defined _Atomic(t) as `volatile t *`, when it should have just said `volatile t`, when __STDC_VERSION__ was too small. This happens when we’re compiling C++, but in C++11, _Atomic is a define supplied by the STL rather than a keyword supplied by the compiler. Wait though, it gets better: in C++11, _Atomic hooks you into the morass that is stdatomic.h, and ultimately refers everything back to std::atomic<T>. The gory, horrifying details are in libcxx's __atomic/cxx_atomic_impl.h. The tldr is that for our purposes it’s fine to just say volatile and use the normal libc/intrin/atomic.h functions.	2024-06-17 21:12:02 -07:00
Brian	8e3b361aeb	Remove testing label from labeler.yml (#1220 ) #1219 had an issue with noisy testing label. Investigated if there is a syntax to make it more exclusive, but turns out there isn't one yet. So let's remove an manually add it in as needed. Fixes #1219.	2024-06-16 11:45:57 -04:00
Steven Dee (Jōshin)	f9dd5683a4	Implement ctl::unique_ptr (#1216 ) The way unique_ptr is supposed to work is as a purely compile-time check that your raw pointers are getting deleted when they go out of scope. It should ideally emit the same exact machine code as if you were using raw pointers with manual deletes. Part of what this means is that under normal circumstances, a unique_ptr shouldn’t take up more space than a raw pointer - in other words, sizeof unique_ptr<T> should == sizeof(T*). The present PR doesn’t bother with the specialization for array types. I also left a couple other parts of the STL API unimplemented. I’d love to see someone else implement these, or I’ll get to them at some point.	2024-06-15 22:54:52 -04:00
Steven Dee (Jōshin)	e38a6e7996	ctl string const/value tweaks (#1218 ) The mangled name of a C++ function will typically not vary by const-ness of a by-value parameter; in other words, there is no meaning to a const- qualified by-value parameter in a function prototype. However, the const keyword _does_ matter at function _definition_ time, like it does with a variable declared in the body. So for prototypes, we strip out const for by-value parameters; but for definitions, we leave them alone. At function definition (as opposed to prototype), we add const to values in parameters by default, unless we’re going to mutate them. This commit also changes a couple of const string_view& to be simply by- value string_view. A string_view is only two words; it rarely ever makes sense to pass one by reference if it’s not going to be mutated.	2024-06-15 18:09:30 -07:00
Steven Dee (Jōshin)	ddfaf3fee0	Explain the clang-format off in new.h	2024-06-15 17:17:04 -07:00
Michael Lenaghan	0dbf01bf1d	Bring Lua to 5.4.6. (#1214 ) This essentially re-does the work of #875 on top of master. This is what I did to check that Cosmo's Lua extensions still worked: ``` $ build/bootstrap/make MODE=aarch64 o/aarch64/third_party/lua/lua $ ape o/aarch64/third_party/lua/lua >: 10 10 >: 010 8 >: 0b10 2 >: string.byte("\e") 27 >: "Hello, %s" % {"world"} Hello, world >: "" 3 * ``` `luaL_traceback2` was used to show the stack trace with parameter values; it's used in `LuaCallWithTrace`, which is used in Redbean to run Lua code. You should be able to see the extended stack trace by running something like this: `redbean -e "function a(b)c()end a(2)"` (with "params" indicating the extended stack trace): ``` stack traceback: [string "function a(b)c()end a(2)"]:1: in function 'a', params: b = 2; [string "function a(b)c()end a(2)"]:1: in main chunk ``` @pkulchenko confirmed that I get the expected result with the updated code. This is what I did to check that Lua itself still worked: ``` $ cd third_party/lua/test/ $ ape ../../../o/aarch64/third_party/lua/lua all.lua ``` There's one test failure, in `files.lua`: ``` * FILE 'files.lua'*** testing i/o ../../../o/aarch64/third_party/lua/lua: files.lua:84: assertion failed! stack traceback: [C]: in function 'assert' files.lua:84: in main chunk (...tail calls...) all.lua:195: in main chunk [C]: in ? .>>> closing state <<< ``` That isn't a result of these changes; the same test is failing in master. The failure is here: ```lua if not _port then -- invalid seek local status, msg, code = io.stdin:seek("set", 1000) assert(not status and type(msg) == "string" and type(code) == "number") end ``` The test expects a seek to offset 1,000 on stdin to fail — but it doesn't. `status` ends up being the new offset rather than `nil`. If I comment out that one test, the remaining tests succeed.	2024-06-15 20:13:08 -04:00
Jōshin	3a599bfbe1	Update .git-blame-ignore-revs	2024-06-15 13:36:07 -07:00
Jōshin	89fc95fefd	Rerun clang-format on the repo (#1217 ) 🚨 clang-format changes output per version! This is with version 19.0.0. The modifications seem to be fixing the old version’s errors - mainly involving omitted whitespace around binary ops and inserted whitespace between goto labels and colons (if followed by a curly brace.) Also fixes a few mistakes made by e.g. someone (ahem) forgetting to pass his ctl/string.h modifications through it. We should add this to .git-blame-ignore-revs once we have its final hash on master.	2024-06-15 16:34:48 -04:00
Jōshin	8e37ee2598	ctl::string cleanup (#1215 ) We now fully initialize a ctl::string’s memory, so that it is always set to a well-defined value, thus making it always safe to memcpy out of it. This incidentally makes our string::swap function legal, which it wasn’t before. This also saves us a store in string::reserve. Now that we have made both big_string and small_string POD, I believe it is safe to elide the launder calls, and have done so, thus cleaning up a lot of the blob-related code. I also got rid of set_big_capacity and replaced it with a set_big_string that leaves us in a well-defined state afterwards. This function also is able to be somewhat simpler; rather than delicate bit-twiddling, it just reaches straight into blob and rewrites it wholesale. Overall, this shaves about 1–2ns off of most benchmarks, and adds 1ns to only one of them - creating a string from a char *.	2024-06-15 10:49:52 -07:00
Jōshin	d9b4f647d8	Uncomment swap test (#1210 )	2024-06-10 21:51:19 -07:00
Jōshin	0dde3a0e70	Get rid of preprocessor stuff in test (#1202 ) Also it's a bit more idiomatic to say s.npos rather than string::npos.	2024-06-10 07:00:37 -07:00
Jōshin	32643e9fa7	Decouple swap from std (#1211 ) This allows you to implement your own swap function without it having to be part of the std namespace. std::swap is still used if it's available.	2024-06-10 03:40:17 -07:00
Jōshin	0a92c78035	Remove erroneous ctl:: prefixes	2024-06-08 15:07:39 -07:00
Jōshin	118db71121	Provide a minimal new.h for CTL (#1205 ) This replaces the STL <new> header. Mainly, it defines a global operator new and operator delete, as well as the placement versions of these. The placement versions are required to not get compile errors when trying to write a placement new statement. Each of these operators is defined with many, many different variants. A glance at new.cc is recommended followed by a chaser of the Alexandrescu talk "std::allocator is to Allocation as std::vector is to Vexation". We must provide a global-namespace source-level definition of each operator and it is illegal for any of them to be marked inline, so here we are. The upshot is that we no longer need to include <new>, and our optional/ vector headers are self-contained.	2024-06-08 15:05:38 -07:00
Alkis Evlogimenos	a0410f0170	Make big_string pod (#1204 ) `big_string` is not pod which means it needs to be properly constructed and destroyed. Instead make it POD and destroy it manually in `string` destructor.	2024-06-08 10:02:33 -07:00
Alkis Evlogimenos	d44a7dc603	Fix bugs in in ctl::optional (#1203 ) Manually manage the lifetime of `value_` by using an anonymous `union`. This fixes a bunch of double-frees and double-constructs. Additionally move the `present_` flag last. When `T` has padding `present_` will be placed there saving `alignof(T)` bytes from `sizeof(optional<T>)`.	2024-06-07 20:47:24 -04:00
Jōshin	2ba6b0158f	Fix some memory issues with ctl::string (#1201 ) There were a few errors in how capacity and memory was being handled for small strings. The capacity errors meant that small strings would become big strings too soon, and the memory error introduced undefined behavior that was caught by CheckMemoryLeaks in our test file but only sometimes. The crucial change is in reserve: we only copy n bytes into p2, and then we manually set the null terminator instead of expecting it to have been there already. (E.g. it might not be there for an empty small string.) We also fix one other doozy in append when we were exactly at the small- to-big string boundary: we set the last byte (i.e., the remainder field) to 0, then decremented it, giving us size_t max. Whoops. We boneheadedly fix this by setting the 0 byte after we've fixed up the remainder, so it is at worst a no-op. Otherwise, capacity now works the same for small strings as it does with big strings: it's the amount of space available including the null byte. We test all of this with a new test that only gets included if our class under test is not std::string (presumably meaning it's ctl::string.) The test manually verifies that the small string optimization behaves how we expect. Since this test checks against std::string, we go ahead and include that other header from the STL. Also modifies the new test we introduced to also run on std::string, but it just does the append without expecting anything about how its data is stored. We also check that the string has the right value afterwards.	2024-06-07 01:15:37 -04:00
Jōshin	f3effcb703	One more SSO erratum from #1199 Making a string_view from a string appears to take about 1.3ns no matter what. 100% definitely no point deviating from the STL API over that.	2024-06-06 18:01:26 -07:00
Jōshin	03b476f943	Minor small-string errata from #1199 These commits were sitting on a local branch that I neglected to push before merging. :( * Use memcpy for string::reserve * Remove fence comments	2024-06-06 17:56:30 -07:00
Jōshin	8b3e368e9a	ctl::string small-string optimization (#1199 ) A small-string optimization is a way of reusing inline storage space for sufficiently small strings, rather than allocating them on the heap. The current approach takes after an old Facebook string class: it reuses the highest-order byte for flags and small-string size, in such a way that a maximally-sized small string will have its last byte zeroed, making it a null terminator for the C string. The only flag we have is in the highest-order bit, that says whether the string is big (set) or small (cleared.) Most of the logic switches based on the value of this bit; e.g. data() returns big()->p if it's set, else small()->buf if it's cleared. For a small string, the capacity is always fixed at sizeof(string) - 1 bytes; we store the length in the last byte, but we store it as the number of remaining bytes of capacity, so that at max size, the last byte will read zero and serve as our null terminator. Morally speaking, our class's storage is a union over two POD C structs. For now I gravitated towards a slightly more obtuse approach: the string class itself contains a blob of the right size, and we alias that blob's pointer for the two structs, taking some care not to run afoul of object lifetime rules in C++. If anyone wants to improve on this, contributions are welcome. This commit also introduces the `ctl::__` namespace. It can't be legally spelled by library users, and serves as our version of boost's "detail". We introduced a string::swap function, and we now use that in operator=. operator= now takes its argument by value, so we never need to check for the case where the pointers are equal and can just swap the entire store of the argument with our own, leaving the C++ destructor to free our old storage afterwards. There are probably still a few places where our capacity is slightly off and we grow too fast, although there don't appear to be any where we are too slow. I will leave these to be fixed in future changes.	2024-06-06 20:50:51 -04:00
Brian	df6b384e31	github: add labeler action (#1196 ) Bit easier to do this as everything seems to be sorted into logical folders. You may need to add new labels to support this however.	2024-06-06 05:51:36 -07:00
Brian	280bdec817	github: add issue template (#1195 ) copy paste and adapted from llamafile but added a research template as well for the more technical research tickets	2024-06-06 05:51:06 -07:00
Jōshin	2c5e7ec547	Add terminating :vi on some modelines Noticed because the settings they specified weren't getting picked up by editor sessions in those files.	2024-06-05 20:36:55 -07:00
Jōshin	fdcb8b2f7e	add formatting commit	2024-06-05 16:36:34 -07:00
Jōshin	04c6bc478e	vim C++ filetype is still spelled "cpp"	2024-06-05 16:34:47 -07:00
Justine Tunney	cc2c1893c5	Fix some nits	2024-06-05 04:05:49 -07:00
Justine Tunney	3093f0e467	Release Cosmopolitan v3.4.0	2024-06-05 03:07:03 -07:00
Justine Tunney	3609f65de3	Make malloc() go 200x faster If pthread_create() is linked into the binary, then the cosmo runtime will create an independent dlmalloc arena for each core. Whenever the malloc() function is used it will index `g_heaps[sched_getcpu() / 2]` to find the arena with the greatest hyperthread / numa locality. This may be configured via an environment variable. For example if you say `export COSMOPOLITAN_HEAP_COUNT=1` then you can restore the old ways. Your process may be configured to have anywhere between 1 - 128 heaps We need this revision because it makes multithreaded C++ applications faster. For example, an HTTP server I'm working on that makes extreme use of the STL went from 16k to 2000k requests per second, after this change was made. To understand why, try out the malloc_test benchmark which calls malloc() + realloc() in a loop across many threads, which sees a a 250x improvement in process clock time and 200x on wall time The tradeoff is this adds ~25ns of latency to individual malloc calls compared to MODE=tiny, once the cosmo runtime has transitioned into a fully multi-threaded state. If you don't need malloc() to be scalable then cosmo provides many options for you. For starters the heap count variable above can be set to put the process back in single heap mode plus you can go even faster still, if you include tinymalloc.inc like many of the programs in tool/build/.. are already doing since that'll shave tens of kb off your binary footprint too. Theres also MODE=tiny which is configured to use just 1 plain old dlmalloc arena by default Another tradeoff is we need more memory now (except in MODE=tiny), to track the provenance of memory allocation. This is so allocations can be freely shared across threads, and because OSes can reschedule code to different CPUs at any time.	2024-06-05 02:02:14 -07:00
Justine Tunney	9906f299bb	Refactor and improve CTL and other code	2024-06-04 05:45:48 -07:00
Justine Tunney	1d8f37a2f0	Fix the MODE=tiny builds	2024-06-03 10:36:38 -07:00
Justine Tunney	e677460d14	Delete .vscode folder It hasn't been maintained in years. I'm tired of the root level of our project having an advertisement for Microsoft Visual Studio Code. Your preferred editor should be Emacs or Vim.	2024-06-03 09:40:45 -07:00
Justine Tunney	4937843f70	Introduce Cosmopolitan Templates Library (CTL)	2024-06-03 09:21:59 -07:00
Justine Tunney	b003888696	Make __demangle() heap 10% more compact	2024-06-02 16:18:55 -07:00
Justine Tunney	2ca491dc56	Write more __demangle() tests	2024-06-02 07:37:15 -07:00
Justine Tunney	9aa353d88b	Document __demangle() and fix a const func ptr bug	2024-06-02 04:15:48 -07:00
Justine Tunney	c67faf61df	Delete some unintentional code	2024-06-01 20:36:58 -07:00
Justine Tunney	165c6b37e2	Add C++ demangling to privileged runtime Cosmo will now print C++ symbols correctly in --ftrace logs and backtraces. Doing this required reducing the memory requirement of the __demangle() function by 3x. This was accomplished using 16-bit indices and 16-bit malloc granularity. That puts a limit on the longest symbol we can successfully decode, which I think would be around 6553 characters long, given a 65536-byte buffer	2024-06-01 20:10:58 -07:00
Jōshin	dcd626edf8	Add .git-blame-ignore-revs If you follow the directions in that file then git blame will ignore the listed commits. A commit should only go in that file if its only changes were to formatting, particularly on a large part of the codebase (like a change to .clang-format getting applied to the repo.) Cribbed from here: https://www.stefanjudis.com/today-i-learned/how-to-exclude-commits-from-git-blame/	2024-06-01 20:10:48 -07:00
Jōshin	f032b5570b	Run clang-format (#1197 )	2024-06-01 16:30:43 -04:00

1 2 3 4 5 ...

2332 commits