Undiamond Python headers

This change brings the Python codebase into conformance with the conventions of this codebase. It's now possible to include Python headers without worrying about ordering. Python has traditionally solved that problem by "diamonding" everything in Python.h, but that's problematic, since it means any change to any Python header invalidates all the build artifacts. It also breaks tooling: it's hard to explain to Emacs that when I press C-c C-h to add an import line, it shouldn't add the header that actually defines the symbol and should instead follow the nonstandard Python convention.

Progress has been made on letting Python load source code from the zip executable structure via the standard C library APIs. System calls now recognize zip!FILENAME alternative URIs as equivalent to zip:FILENAME, since Python uses colon as its delimiter.

Some progress has been made on embedding the notice license terms into the Python object code. This is easier said than done, since Python has an extremely complicated ownership story.

- Some termios APIs have been added
- Implement rewinddir() dirstream API
- GetCpuCount() API added to Cosmopolitan Libc
- More bugs in Cosmopolitan Libc have been fixed
- zipobj.com now has flags for mangling the path
- Fixed bug a priori with sendfile() on certain BSDs
- Polyfill F_DUPFD and F_DUPFD_CLOEXEC across platforms
- FIOCLEX / FIONCLEX now polyfilled for fast O_CLOEXEC changes
- APE now supports a hybrid solution to no-self-modify for builds
- Many BSD-only magnums added, e.g. O_SEARCH, O_SHLOCK, SF_NODISKIO
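
To illustrate why the zip!FILENAME spelling helps, here is a minimal Python sketch. It is not the Cosmopolitan implementation, which lives in the C system call layer; the helper names `normalize_zip_path` and `split_search_path` and the `.python` directory are hypothetical and exist only for this example. It shows that a colon-delimited search path cannot carry a `zip:` entry intact, while a `zip!` entry survives the split and can then be mapped back to the canonical form.

```python
# Hypothetical sketch, assuming search paths are split on ":" as Python
# does on POSIX. Not the actual Cosmopolitan Libc logic.

def normalize_zip_path(path):
    """Map the alternative zip!NAME spelling onto zip:NAME."""
    prefix = "zip!"
    if path.startswith(prefix):
        return "zip:" + path[len(prefix):]
    return path  # anything else passes through unchanged


def split_search_path(value):
    """Split a colon-delimited search path into its entries."""
    return value.split(":")


# A zip: entry is torn apart by the colon delimiter.
broken = split_search_path("zip:.python:/usr/lib")

# A zip! entry survives the split and can be normalized afterwards.
intact = [normalize_zip_path(p) for p in split_search_path("zip!.python:/usr/lib")]

print(broken)  # ['zip', '.python', '/usr/lib']
print(intact)  # ['zip:.python', '/usr/lib']
```

The point of the sketch is the design choice: accepting a second prefix that contains no colon lets zip entries sit in a colon-delimited list without any escaping rules.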
2025-09-10 18:53:48 +00:00 · 2021-08-12 00:42:14 -07:00
commit b420ed8248
parent 20bb8db9f8
762 changed files with 18410 additions and 53772 deletions
--- a/third_party/python/Tools/unicode/genwincodec.py
+++ b/third_party/python/Tools/unicode/genwincodec.py
@@ -1,61 +0,0 @@
-"""This script generates a Python codec module from a Windows Code Page.
-
-It uses the function MultiByteToWideChar to generate a decoding table.
-"""
-
-import ctypes
-from ctypes import wintypes
-from gencodec import codegen
-import unicodedata
-
-def genwinmap(codepage):
-    MultiByteToWideChar = ctypes.windll.kernel32.MultiByteToWideChar
-    MultiByteToWideChar.argtypes = [wintypes.UINT, wintypes.DWORD,
-                                    wintypes.LPCSTR, ctypes.c_int,
-                                    wintypes.LPWSTR, ctypes.c_int]
-    MultiByteToWideChar.restype = ctypes.c_int
-
-    enc2uni = {}
-
-    for i in list(range(32)) + [127]:
-        enc2uni[i] = (i, 'CONTROL CHARACTER')
-
-    for i in range(256):
-        buf = ctypes.create_unicode_buffer(2)
-        ret = MultiByteToWideChar(
-            codepage, 0,
-            bytes([i]), 1,
-            buf, 2)
-        assert ret == 1, "invalid code page"
-        assert buf[1] == '\x00'
-        try:
-            name = unicodedata.name(buf[0])
-        except ValueError:
-            try:
-                name = enc2uni[i][1]
-            except KeyError:
-                name = ''
-
-        enc2uni[i] = (ord(buf[0]), name)
-
-    return enc2uni
-
-def genwincodec(codepage):
-    import platform
-    map = genwinmap(codepage)
-    encodingname = 'cp%d' % codepage
-    code = codegen("", map, encodingname)
-    # Replace first lines with our own docstring
-    code = '''\
-"""Python Character Mapping Codec %s generated on Windows:
-%s with the command:
-  python Tools/unicode/genwincodec.py %s
-"""#"
-''' % (encodingname, ' '.join(platform.win32_ver()), codepage
-      ) + code.split('"""#"', 1)[1]
-
-    print(code)
-
-if __name__ == '__main__':
-    import sys
-    genwincodec(int(sys.argv[1]))
--- a/third_party/python/Tools/unicode/makeunicodedata.py
+++ b/third_party/python/Tools/unicode/makeunicodedata.py
@@ -595,7 +595,7 @@ def makeunicodetype(unicode, trace):

 def makeunicodename(unicode, trace):

-    FILE = "Modules/unicodename_db.h"
+    FILE = "third_party/python/Modules/unicodename_db.h"

    print("--- Preparing", FILE, "...")