cosmopolitan

mirror of https://github.com/jart/cosmopolitan.git synced 2025-01-31 03:27:39 +00:00

Author	SHA1	Message	Date
Justine Tunney	41cc053419	Add more content to APE specification	2024-07-21 20:47:24 -07:00
Justine Tunney	d3f87f4c64	Upgrade to cosmocc v3.5.7	2024-07-20 11:21:26 -07:00
Justine Tunney	29ce25c767	Start writing formal specification for APE	2024-07-20 10:04:22 -07:00
Justine Tunney	78d3b86ec7	Fix Android support Thanks to @aj47 (techfren.net) the new Cosmo memory manager is confirmed to be working on Android!! The only issue turned out to be forgetting to update the program address in the linker script. We now know w/ absolute certainty that APE binaries as complex as llamafile, now work correctly.	2024-07-01 01:06:47 -07:00
Justine Tunney	d461c6f47d	Do more quality assurance work	2024-06-24 06:53:49 -07:00
Justine Tunney	d1d4388201	Delete ASAN It hasn't been helpful enough to be justify the maintenance burden. What actually does help is mprotect(), kprintf(), --ftrace and --strace which can always be counted upon to work correctly. We aren't losing much with this change. Support for ASAN on AARCH64 was never implemented. Applying ASAN to the core libc runtimes was disabled many months ago. If there is some way to have an ASAN runtime for user programs that is less invasive we can potentially consider reintroducing support. But now is premature.	2024-06-22 05:45:49 -07:00
Justine Tunney	6ffed14b9c	Rewrite memory manager Actually Portable Executable now supports Android. Cosmo's old mmap code required a 47 bit address space. The new implementation is very agnostic and supports both smaller address spaces (e.g. embedded) and even modern 56-bit PML5T paging for x86 which finally came true on Zen4 Threadripper Cosmopolitan no longer requires UNIX systems to observe the Windows 64kb granularity; i.e. sysconf(_SC_PAGE_SIZE) will now report the host native page size. This fixes a longstanding POSIX conformance issue, concerning file mappings that overlap the end of file. Other aspects of conformance have been improved too, such as the subtleties of address assignment and and the various subtleties surrounding MAP_FIXED and MAP_FIXED_NOREPLACE On Windows, mappings larger than 100 megabytes won't be broken down into thousands of independent 64kb mappings. Support for MAP_STACK is removed by this change; please use NewCosmoStack() instead. Stack overflow avoidance is now being implemented using the POSIX thread APIs. Please use GetStackBottom() and GetStackAddr(), instead of the old error-prone GetStackAddr() and HaveStackMemory() APIs which are removed.	2024-06-22 05:45:11 -07:00
Jōshin	89fc95fefd	Rerun clang-format on the repo (#1217 ) 🚨 clang-format changes output per version! This is with version 19.0.0. The modifications seem to be fixing the old version’s errors - mainly involving omitted whitespace around binary ops and inserted whitespace between goto labels and colons (if followed by a curly brace.) Also fixes a few mistakes made by e.g. someone (ahem) forgetting to pass his ctl/string.h modifications through it. We should add this to .git-blame-ignore-revs once we have its final hash on master.	2024-06-15 16:34:48 -04:00
Jōshin	f032b5570b	Run clang-format (#1197 )	2024-06-01 16:30:43 -04:00
Justine Tunney	e4d25d68e4	Drop support for Windows 8 Microsoft caused some very gentle breakages for Cosmopolitan. They removed the version information from the PEB which caused uname to report WINDOWS 0.0.0. We should have called GetVersionExW but that doesn't really exist anymore either. Windows policy is now to give whatever version we used in ape/ape.S. Windows8 has been EOL since 2023-01-10 so lets avoid our modern executables being relegated to legacy infrastructure. Requiring Windows 10+ going forward lets us remove runtime compatibility bloat from the codebase. Further note Cosmopolitan maintains a Windows Vista branch on GitHub, so anyone preferring the older versions, can still have a future with Cosmo. Another neat thing this fixes is UTF-8 support in the console. The changes Microsoft made broke the if statement that enabled UTF8 in terminals. This explains why bug reports had broken arrows. In the future this should be less of an issue, since the PEB code is gone which means we more strictly conform to only Microsoft's WIN32 API	2024-05-29 19:37:47 -07:00
Justine Tunney	a52792c59f	Add some missing build dependencies	2024-05-20 17:23:25 -07:00
Justine Tunney	ae2a7ac844	Fix thread-local storage bugs on aarch64 This change fixes an issue where .tbss memory might not be initialized.	2024-05-08 04:20:22 -07:00
Jōshin	8f18b3ad65	Reapply apeinstall.sh binfmt flags (#1171 ) `7d31fc3` made it safe to register ape with the binfmt P flag. Older files will still get their paths passed via argv[0] and everything should just work. This also restores the F flag that was rolled back alongside the P flag; this seems like a good idea to me now. It makes it so /usr/bin/ape is kind of part of the kernel; simply replacing the file will not change the loader used. The binfmt will have to be reregistered as well. ape/apeinstall.sh will already nag you if there's an existing binfmt for ape. It might be nice to make it actually replace the registration. This reverts commit `df648fb174`.	2024-05-08 01:48:48 -04:00
Jōshin	7d31fc311a	Loaders rewrite argv[0] for old binaries (#1170 ) For this to work, a loader has to be able to tell the difference between an ‘old’ and a ‘new’ binary. This is achieved via a repurposing of ELF’s e_flags field. We previously tried to use the padding in e_ident for it, but binutils was resetting it to zero in e.g. strip. This introduces one new ELF flag for cosmopolitan binaries. It is called `EF_APE_MODERN`. We choose 0x101ca75, "lol cat 5". It should now be safe to install the ape loader binfmt registration with the `P` flag.	2024-05-07 20:42:18 -04:00
Jōshin	2a1f352622	Tell vim not to mess with ape.S	2024-05-06 15:36:35 -07:00
Justine Tunney	181cd4cbe8	Add sysctlbyname() for MacOS	2024-05-02 23:21:43 -07:00
Matt Colyer	3bcd40be12	Fix regression in apeinstall.sh (#1161 ) This should have been a part of `a6baba1`.	2024-04-29 20:40:38 -07:00
Jōshin	6e6fc38935	Apply clang-format update to repo (#1154 ) Commit `bc6c183` introduced a bunch of discrepancies between what files look like in the repo and what clang-format says they should look like. However, there were already a few discrepancies prior to that. Most of these discrepancies seemed to be unintentional, but a few of them were load-bearing (e.g., a #include that violated header ordering needing something to have been #defined by a 'later' #include.) I opted to take what I hope is a relatively smooth-brained approach: I reverted the .clang-format change, ran clang-format on the whole repo, reapplied the .clang-format change, reran clang-format again, and then reverted the commit that contained the first run. Thus the full effect of this PR should only be to apply the changed formatting rules to the repo, and from skimming the results, this seems to be the case. My work can be checked by applying the short, manual commits, and then rerunning the command listed in the autogenerated commits (those whose messages I have prefixed auto:) and seeing if your results agree. It might be that the other diffs should be fixed at some point but I'm leaving that aside for now. fd '\.c(c\|pp)?$' --print0\| xargs -0 clang-format -i	2024-04-25 10:38:00 -07:00
Justine Tunney	a6baba1b07	Stop using .com extension in monorepo The WIN32 CreateProcess() function does not require an .exe or .com suffix in order to spawn an executable. Now that we have Cosmo bash we're no longer so dependent on the cmd.exe prompt.	2024-03-03 03:12:19 -08:00
Justine Tunney	af8f2bd19f	Shave 4kb off each binary	2024-02-25 11:11:34 -08:00
Justine Tunney	957c61cbbf	Release Cosmopolitan v3.3 This change upgrades to GCC 12.3 and GNU binutils 2.42. The GNU linker appears to have changed things so that only a single de-duplicated str table is present in the binary, and it gets placed wherever the linker wants, regardless of what the linker script says. To cope with that we need to stop using .ident to embed licenses. As such, this change does significant work to revamp how third party licenses are defined in the codebase, using `.section .notice,"aR",@progbits`. This new GCC 12.3 toolchain has support for GNU indirect functions. It lets us support __target_clones__ for the first time. This is used for optimizing the performance of libc string functions such as strlen and friends so far on x86, by ensuring AVX systems favor a second codepath that uses VEX encoding. It shaves some latency off certain operations. It's a useful feature to have for scientific computing for the reasons explained by the test/libcxx/openmp_test.cc example which compiles for fifteen different microarchitectures. Thanks to the upgrades, it's now also possible to use newer instruction sets, such as AVX512FP16, VNNI. Cosmo now uses the %gs register on x86 by default for TLS. Doing it is helpful for any program that links `cosmo_dlopen()`. Such programs had to recompile their binaries at startup to change the TLS instructions. That's not great, since it means every page in the executable needs to be faulted. The work of rewriting TLS-related x86 opcodes, is moved to fixupobj.com instead. This is great news for MacOS x86 users, since we previously needed to morph the binary every time for that platform but now that's no longer necessary. The only platforms where we need fixup of TLS x86 opcodes at runtime are now Windows, OpenBSD, and NetBSD. On Windows we morph TLS to point deeper into the TIB, based on a TlsAlloc assignment, and on OpenBSD/NetBSD we morph %gs back into %fs since the kernels do not allow us to specify a value for the %gs register. OpenBSD users are now required to use APE Loader to run Cosmo binaries and assimilation is no longer possible. OpenBSD kernel needs to change to allow programs to specify a value for the %gs register, or it needs to stop marking executable pages loaded by the kernel as mimmutable(). This release fixes __constructor__, .ctor, .init_array, and lastly the .preinit_array so they behave the exact same way as glibc. We no longer use hex constants to define math.h symbols like M_PI.	2024-02-20 13:27:59 -08:00
Justine Tunney	2ab9e9f7fd	Make improvements - Introduce portable sched_getcpu() api - Support GCC's __target_clones__ feature - Make fma() go faster on x86 in default mode - Remove some asan checks from core libraries - WinMain() now ensures $HOME and $USER are defined	2024-02-12 10:23:00 -08:00
Justine Tunney	f27808c4d2	Remove feature for embedding blink in ape scripts Embedding Blink builds in Cosmo executables was a failed experiment. It turned out to be easier than expected to let the mono repo have support for multiple architectures. Blink still works great; it's supported and recommended; just please use it as a separate program. For example, you can use Blink to run Cosmo binaries on architectures like i486 / s390x.	2024-01-26 22:30:56 -08:00
Jōshin	df648fb174	Revert apeinstall.sh binfmt flags (#1072 ) The P flag breaks backwards compatibility with older binaries. The idea is to revert this commit after that break has been resolved.	2024-01-08 14:21:21 -08:00
Jōshin	8d9fcb5e5a	Fix ape-m1.c usage It's not about `$0` anymore.	2024-01-07 10:35:50 -05:00
Jōshin	aa37a327ea	Make $prog.ape more reliable on Apple Silicon (#1071 ) Now it doesn't matter what argv `$prog.ape` is invoked with. We just get our executable path the Apple way.	2024-01-07 07:13:20 -08:00
Jōshin	d27a47b0e2	Bugfix: ape --help should exit 0 (#1060 )	2024-01-06 12:07:32 -08:00
Jōshin	390335eb45	apeinstall/uninstall.sh can use doas (#1062 )	2024-01-06 12:06:21 -08:00
Jōshin	636bc4007b	Enable argv[0] tests in more places (#1061 ) Now we do them for assimilated binaries (except on OpenBSD or XNU non-Silicon), for XnuSilicon, and for binaries with the preserve- argv[0] auxv flag set. We check whether to pass the argv[0] value at the test site rather than the Child site. We move a lot of the test initialization into Child in the non-child case, in order to get at the pre-init value of `__program_executable_name`. Finally, we print out info about what we are skipping.	2024-01-06 11:42:03 -08:00
Jōshin	80ec6c9283	Try to detect kernel version for P flag (#1059 )	2024-01-05 15:19:46 -08:00
Jōshin	15548b523c	Cleanup apeinstall.sh (#1057 )	2024-01-05 13:45:14 -08:00
Jōshin	412a200ae4	Support binfmt_misc P flag in APE loader (#1058 ) This allows ape to automatically preserve `argv[0]` [as of Linux kernel 5.12][0] if the [binfmt_misc][1] registration contains the P flag. This also removes may_path_search, which was identical to the literally flag in usage. As a result, FindCommand is subsumed into Commandv. [0]: https://patchew.org/QEMU/20210222105004.1642234-1-laurent@vivier.eu/ [1]: https://www.kernel.org/doc/html/latest/admin-guide/binfmt-misc.html	2024-01-05 12:35:01 -08:00
Justine Tunney	a3deef70c2	Release Cosmopolitan v3.2	2024-01-04 09:39:48 -08:00
Justine Tunney	796148790f	Remove hard coded paths from APE bootloader This increases risk of fork bomb but is needed to support the NixOS. Upstream dependencies of APE (uname, mkdir, dd, chmod, gzip, and mv) will be removed from releases, and deleted from the cosmo.zip server See #12	2024-01-03 17:55:19 -08:00
Justine Tunney	2f89c2482a	Delete some dead code	2024-01-01 00:13:16 -08:00
Justine Tunney	81949f038e	Mint APE Loader v1.10	2023-12-31 11:43:13 -08:00
Jōshin	c9550afe5e	Fix loader usage, shave off a few bytes (#1016 ) * Remove -f from loader usage -f was removed in 1.5. As there is now only one flag, a couple more bytes can be shaved off as well. * Further loader golf Shaves off a few bytes, paying for the cost of `RealPath` and then some on x86_64 and offsetting some of the cost to aarch64. * Shave off a few more bytes Removes `-h` and flags from usage. Keeps flag-parsing logic the same, i.e. still accepts `-h` / `--help`. Only difference is what fd and rc the usage uses. Still over 1k north of 8192.	2023-12-31 11:33:42 -08:00
Jōshin	14fe83facd	aarch64 loader passes os (#1042 ) * Reorder Launch arguments, pass aarch64 os Third and fourth arguments are now identical between cosmo and Launch. By passing sp as argument 4, we save a bit of register juggling. Fourth argument (os) is now always passed by the loader on aarch64. It is not yet processed by cosmo. Pushing this change separately, as the cosmo side turns out to be somewhat more involved. * cosmo2 receives os from loader FreeBSD aarch64 now traps early rather than pretending to be Linux. o/aarch64/examples/env.com still works on Linux and Xnu.	2023-12-31 06:42:36 -08:00
Justine Tunney	83107f78ed	Introduce FreeBSD ARM64 support It's 100% passing test fleet. Solid as a rock.	2023-12-29 20:14:02 -08:00
Jōshin	2a11a09d98	Remove realpath/getcwd from loaders (#1024 ) This implements proposals 1 and 2a from this gist: https://gist.github.com/mrdomino/2222cab61715fd527e82e036ba4156b1 The only reason to use realpath from the loader was to try to prevent a TOCTOU between the loader and the binary. But this is only a real issue in set-id contexts, and in those cases there is already a canonical way to do it: `/dev/fd`, passed by the kernel to the loader, so all we have to do is pass that along to the binary. Aside from realpath, there is no reason to absolutize the path we supply to the binary, since it can call `getcwd` as well as we can, and on non- M1 the binary is in a much better position to make that call. Since we no longer absolutize the path, the binary does need to do this, so we make its argv-parsing code generic and apply that to the different possible places the path could come from. This means that `_` is finally usable as a relative path, as a nice side benefit. The M1 realpath code had a significant bug - it uses the wrong offset to truncate the `.ape` in the `$prog.ape` case. This PR also fixes a regression in `ape $progname` out of `$PATH` on the two BSDs (Free and Net) that did not implement `RealPath`.	2023-12-18 15:01:16 -05:00
Jōshin	3a8e01a77a	more modeline errata (#1019 ) Somehow or another, I previously had missed `BUILD.mk` files. In the process I found a few straggler cases where the modeline was different from the file, including one very involved manual fix where a file had been treated like it was ts=2 and ts=8 on separate occasions. The commit history in the PR shows the gory details; the BUILD.mk was automated, everything else was mostly manual.	2023-12-16 23:07:10 -05:00
Jōshin	f94c11d978	Loader path security (#1012 ) The ape loader now passes the program executable name directly as a register. `x2` is used on aarch64, `%rdx` on x86_64. This is passed as the third argument to `cosmo()` (M1) or `Launch` (non-M1) and is assigned to the global `__program_executable_name`. `GetProgramExecutableName` now returns this global's value, setting it if it is initially null. `InitProgramExecutableName` first tries exotic, secure methods: `KERN_PROC_PATHNAME` on FreeBSD/NetBSD, and `/proc` on Linux. If those produce a reasonable response (i.e., not `"/usr/bin/ape"`, which happens with the loader before this change), that is used. Otherwise, if `issetugid()`, the empty string is used. Otherwise, the old argv/envp parsing code is run. The value returned from the loader is always the full absolute path of the binary to be executed, having passed through `realpath`. For the non-M1 loader, this necessitated writing `RealPath`, which uses `readlinkat` of `"/proc/self/fd/[progfd]"` on Linux, `F_GETPATH` on Xnu, and the `__realpath` syscall on OpenBSD. On FreeBSD/NetBSD, it punts to `GetProgramExecutableName`, which is secure on those OSes. With the loader, all platforms now have a secure program executable name. With no loader or an old loader, everything still works as it did, but setuid/setgid is not supported if the insecure pathfinding code would have been needed. Fixes #991.	2023-12-15 12:23:58 -05:00
Jōshin	2fc507c98f	Fix more vi modelines (#1006 ) * modelines: tw -> sw shiftwidth, not textwidth. * space-surround modelines * fix irregular modelines * Fix modeline in titlegen.c	2023-12-13 02:28:11 -05:00
Justine Tunney	8874a37abc	Add <link.h> for absl	2023-12-08 20:04:10 -08:00
Jōshin	e16a7d8f3b	flip et / noet in modelines `et` means `expandtab`. ```sh rg 'vi: .* :vi' -l -0 \| \ xargs -0 sed -i '' 's/vi: $.$ et$.$ :vi/vi: \1 xoet\2:vi/' rg 'vi: .* :vi' -l -0 \| \ xargs -0 sed -i '' 's/vi: $.$noet$.$:vi/vi: \1et\2 :vi/' rg 'vi: .* :vi' -l -0 \| \ xargs -0 sed -i '' 's/vi: $.$xoet$.$:vi/vi: \1noet\2:vi/' ```	2023-12-07 22:17:11 -05:00
Jōshin	394d998315	Fix vi modelines (#989 ) At least in neovim, `│vi:` is not recognized as a modeline because it has no preceding whitespace. After fixing this, opening a file yields an error because `net` is not an option. (`noet`, however, is.)	2023-12-05 14:37:54 -08:00
Jōshin	da8baf2aa5	ape-m1 minor formatting cleanup (#986 )	2023-12-04 19:58:32 -08:00
Jōshin	ed8fadea37	Keep argv[0], add COSMOPOLITAN_PROGRAM_EXECUTABLE (#980 ) * Introduce env.com Handy tool for debugging environment issues. * Inject path as COSMOPOLITAN_PROGRAM_EXECUTABLE `argv[0]` was previously being used as a communication channel between the loader and the binary, giving the binary its full path for use e.g. in `GetProgramExecutableName`. But `argv[0]` is not a good channel for this; much of what made `2a3813c6` so gross is due to that. This change fixes the issue by preserving `argv[0]` and establishing a new communication channel: `COSMOPOLITAN_PROGRAM_EXECUTABLE`. The M1 loader will always set this as the first variable. Linux should soon follow. On the other side, `GetProgramExecutableName` checks that variable first. If it sees it, it trusts it as-is. A lot of the churn in `ape/ape-m1.c` in this change is actually backing out hacks introduced in 2a3813c6; the best comparison is: git diff 2a3813c6^..	2023-12-04 12:45:46 -08:00
Jōshin	2a3813c6cf	$prog.ape support (#977 ) * ape loader: $prog.ape + login shell support If the ape loader is invoked with `$0 = $prog.ape`, then it searches for a `$prog` in the same directory as it and loads that. In particular, the loader searches the `PATH` for an executable named `$prog.ape`, then for an executable named `$prog` in the same directory. If the former but not the latter is found, the search terminates with an error. It also handles the special case of getting started as `-$SHELL`, which getty uses to indicate that the shell is a login shell. The path is not searched in this case, and the program location is read straight out of the `SHELL` variable. It is now possible to have `/usr/local/bin/zsh.ape` act as a login shell for a `/usr/local/bin/zsh` αpε, insofar as the program will get started with the 'correct' args. Unfortunately, many things break if `$0` is not the actual full path of the executable being run; for example, backspace does not update the display properly. To work around the brokenness introduced by not having `$0` be the full path of the binary, we cut the leading `-` out of `argv[0]` if present. This gets the loader's behavior with `$prog.ape` up to par, but doesn't tell login shells that they are login shells. So we introduce a hack to accomplish that: if ape is run as `-$prog.ape` and the shell is `$prog`, the binary that is loaded has a `-l` flag put into its first argument. As of this commit, αpε binaries can be used as login shells on OSX. * if islogin, execfn = shell Prior to this, execfn was not being properly set for login shells that did not receive `$_`, which was the case for iTerm2 on Mac. There were no observable consequences of this, but fixing it seems good anyway. * Fix auxv location calculation In the non-login-shell case, it was leaving a word of uninitialized memory at `envp[i] + 1`. This reuses the previous calculation based on `envp`.	2023-12-03 19:39:32 -08:00
Justine Tunney	fa20edc44d	Reduce header complexity - Remove most __ASSEMBLER__ __LINKER__ ifdefs - Rename libc/intrin/bits.h to libc/serialize.h - Block pthread cancelation in fchmodat() polyfill - Remove `clang-format off` statements in third_party	2023-11-28 14:39:42 -08:00

1 2 3 4 5 ...

259 commits