cri-o

Author	SHA1	Message	Date
Tobias Klauser	822172a892	all: Switch from package syscall to golang.org/x/sys/unix The syscall package is locked down and the comment in [1] advises to switch code to use the corresponding package from golang.org/x/sys. Do so and replace usage of package syscall where possible (leave syscall.SysProcAttr and syscall.Stat_t). [1] https://github.com/golang/go/blob/master/src/syscall/syscall.go#L21-L24 This will also allow to get updates and fixes just by re-vendoring golang.org/x/sys/unix instead of having to update to a new go version. Signed-off-by: Tobias Klauser <tklauser@distanz.ch>	2017-07-12 08:18:55 +02:00
Antonio Murdaca	f3f8b67b76	Merge pull request #626 from mrunalp/pod_infra_oom sandbox: Adjust OOM score of infra container to a low value	2017-06-26 18:38:50 +02:00
Mrunal Patel	cb4c566fac	sandbox: Adjust OOM score of infra container to a low value This matches the current kube behavior. This will probably be provided over the CRI at which point we won't have to define a constant in cri-o code. Signed-off-by: Mrunal Patel <mpatel@redhat.com>	2017-06-23 09:24:53 -07:00
Andrew Pilloud	28cd8bde49	server: Hookup kubelet hostport Signed-off-by: Andrew Pilloud <andrewpilloud@igneoussystems.com>	2017-06-22 08:51:50 -07:00
Antonio Murdaca	6035cff9e4	server: standardize on naming Signed-off-by: Antonio Murdaca <runcom@redhat.com>	2017-06-22 11:55:03 +02:00
Antonio Murdaca	94a457d46a	sandbox_run: need to stop sandbox before removing it on conflict Signed-off-by: Antonio Murdaca <runcom@redhat.com>	2017-06-18 11:42:07 +02:00
Samuel Ortiz	4462480e54	sandbox: Check for trusted annotations If we get a kubelet annotation about the sandbox trust level, we use it to toggle our sandbox trust flag. Signed-off-by: Samuel Ortiz <sameo@linux.intel.com>	2017-06-15 10:04:41 +02:00
Samuel Ortiz	0e51bbb778	oci: Support mixing trusted and untrusted workloads Container runtimes provide different levels of isolation, from kernel namespaces to hardware virtualization. When starting a specific container, one may want to decide which level of isolation to use depending on how much we trust the container workload. Fully verified and signed containers may not need the hardware isolation layer but e.g. CI jobs pulling packages from many untrusted sources should probably not run only on a kernel namespace isolation layer. Here we allow CRI-O users to define a container runtime for trusted containers and another one for untrusted containers, and also to define a general, default trust level. This anticipates future kubelet implementations that would be able to tag containers as trusted or untrusted. When missing a kubelet hint, containers are trusted by default. A container becomes untrusted if we get a hint in that direction from kubelet or if the default trust level is set to "untrusted" and the container is not privileged. In both cases CRI-O will try to use the untrusted container runtime. For any other cases, it will switch to the trusted one. Signed-off-by: Samuel Ortiz <sameo@linux.intel.com>	2017-06-15 10:04:36 +02:00
Alexander Larsson	7bb957bf75	Implement non-terminal attach We use a SOCK_SEQPACKET socket for the attach unix domain socket, which means the kernel will ensure that the reading side only ever get the data from one write operation. We use this for frameing, where the first byte is the pipe that the next bytes are for. We have to make sure that all reads from the socket are using at least the same size of buffer as the write side, because otherwise the extra data in the message will be dropped. This also adds a stdin pipe for the container, similar to the ones we use for stdout/err, because we need a way for an attached client to write to stdin, even if not using a tty. This fixes https://github.com/kubernetes-incubator/cri-o/issues/569 Signed-off-by: Alexander Larsson <alexl@redhat.com>	2017-06-14 22:59:50 +02:00
Antonio Murdaca	cfec2c4cf4	sandbox_run: correct a defer Signed-off-by: Antonio Murdaca <runcom@redhat.com>	2017-06-09 13:57:45 +02:00
Samuel Ortiz	f15859c79f	pkg/annotations: Export CRI-O annotations namespace Some runtimes like Clear Containers need to interpret the CRI-O annotations, to distinguish the infra container from the regular one. Here we export those annotations and use a more standard dotted namespace for them. Signed-off-by: Samuel Ortiz <sameo@linux.intel.com>	2017-06-01 23:45:44 +02:00
Antonio Murdaca	a28ed75e12	sandbox_run: fix name releasing on error Signed-off-by: Antonio Murdaca <runcom@redhat.com>	2017-06-01 17:37:20 +02:00
Antonio Murdaca	a37dd46654	*: stability fixes Signed-off-by: Antonio Murdaca <runcom@redhat.com>	2017-06-01 15:42:01 +02:00
Antonio Murdaca	b4f1cee2a2	server: store and use image's stop signal to stop containers Signed-off-by: Antonio Murdaca <runcom@redhat.com>	2017-05-27 10:21:04 +02:00
Antonio Murdaca	da0b8a6157	server: store containers state on disk Signed-off-by: Antonio Murdaca <runcom@redhat.com>	2017-05-18 21:19:50 +02:00
Antonio Murdaca	790c6d891a	server: store creation in containers Signed-off-by: Antonio Murdaca <runcom@redhat.com>	2017-05-18 18:49:54 +02:00
Antonio Murdaca	1f4a4742cb	oci: add container directory to Container struct Signed-off-by: Antonio Murdaca <runcom@redhat.com>	2017-05-18 18:49:54 +02:00
Antonio Murdaca	3bd4811b3b	server: restore sandbox created time from disk Signed-off-by: Antonio Murdaca <runcom@redhat.com>	2017-05-18 18:49:54 +02:00
Antonio Murdaca	80a789bce3	server: store sandbox creation time Signed-off-by: Antonio Murdaca <runcom@redhat.com>	2017-05-18 18:49:54 +02:00
Mrunal Patel	3fefcaa1dd	Convert pod cgroupPath to runc format for systemd cgroup Signed-off-by: Mrunal Patel <mpatel@redhat.com>	2017-05-17 17:46:53 -07:00
Mrunal Patel	d3bc6ab693	Add function to convert kube pod cgroup format to runc format This is a slightly modified version of the function in k8s. Signed-off-by: Mrunal Patel <mpatel@redhat.com>	2017-05-17 17:45:57 -07:00
Antonio Murdaca	ecd0006e80	vendor: upgrade containers/storage Signed-off-by: Antonio Murdaca <runcom@redhat.com>	2017-05-17 22:18:07 +02:00
Dan Walsh	4493b6f176	Rename ocid to crio. The ocid project was renamed to CRI-O, months ago, it is time that we moved all of the code to the new name. We want to elminate the name ocid from use. Move fully to crio. Also cric is being renamed to crioctl for the time being. Signed-off-by: Dan Walsh <dwalsh@redhat.com>	2017-05-12 09:56:06 -04:00
Antonio Murdaca	b7ba9d058b	server: store kubeName in annotations Signed-off-by: Antonio Murdaca <runcom@redhat.com>	2017-05-08 09:15:00 +02:00
Dan Williams	13f6e95685	sandbox: pass correct pod Namespace/Name to network plugins and fix id/name ordering Two issues: 1) pod Namespace was always set to "", which prevents plugins from figuring out what the actual pod is, and from getting more info about that pod from the runtime via out-of-band mechanisms 2) the pod Name and ID arguments were switched, further preventing #1 Signed-off-by: Dan Williams <dcbw@redhat.com>	2017-05-05 23:55:37 -05:00
Vincent Batts	f401adffa9	server: readable fields `git grep -w images` or `git grep -w storage` needs to be more useful. Signed-off-by: Vincent Batts <vbatts@hashbangbash.com>	2017-04-20 08:22:50 -04:00
Samuel Ortiz	ea1f6517c1	server: Fix RunPodSandbox error path When RunPodSandbox fails after calling s.addSandbox(sb), we're left with a sandbox in s.state.sandboxes while the sandbox is not created. We fix that by adding removeSandbox() to the deferred cleanup call Signed-off-by: Samuel Ortiz <sameo@linux.intel.com>	2017-04-06 17:36:26 +02:00
Antonio Murdaca	3c7f3ab2ec	Merge pull request #409 from sameo/topic/fat-lock Serialize Update and Sandbox/Container creation operations	2017-04-04 23:23:19 +02:00
Aleksa Sarai	7679a84c6d	server: issues.k8s.io/44043 workaround Because kubelet will create broken symlinks for logPath it is necessary to remove those symlinks before we attempt to write to them. This is a temporary workaround while the issue is fixed upstream. Ref: https://issues.k8s.io/44043 Signed-off-by: Aleksa Sarai <asarai@suse.de>	2017-04-05 02:45:58 +10:00
Aleksa Sarai	c290c0d9c3	conmon: implement logging to logPath This adds a very simple implementation of logging within conmon, where every buffer read from the masterfd of the container is also written to the log file (with errors during writing to the log file ignored). Signed-off-by: Aleksa Sarai <asarai@suse.de>	2017-04-05 02:45:57 +10:00
Samuel Ortiz	be5084387c	server: Serialize container/pod creation with updates Interleaving asynchronous updates with pod or container creations can lead to unrecoverable races and corruptions of the pod or container hash tables. This is fixed by serializing update against pod or container creation operations, while pod and container creation operations can run in parallel. Signed-off-by: Samuel Ortiz <sameo@linux.intel.com>	2017-04-04 18:43:21 +02:00
Samuel Ortiz	d1006fdfbc	server: Add new sandboxes to the sandbox hash table first We want new sandboxes to be added to the sandbox hash table before adding their ID to the pod Index registrar, in order to avoid potential Update() races. Signed-off-by: Samuel Ortiz <sameo@linux.intel.com>	2017-04-04 17:22:34 +02:00
Mrunal Patel	4ccc5bbe7c	Set the container hostnames same as pod hostname Signed-off-by: Mrunal Patel <mpatel@redhat.com>	2017-03-29 16:11:57 -07:00
Mrunal Patel	d69ad9b5a3	Fix lint issues Signed-off-by: Mrunal Patel <mrunalp@gmail.com>	2017-03-27 10:21:30 -07:00
Samuel Ortiz	72129ee3fb	sandbox: Track and store the pod resolv.conf path When we get a pod with DNS settings, we need to build a resolv.conf file and mount it in all pod containers. In order to do that, we have to track the built resolv.conf file and store/load it. Signed-off-by: Samuel Ortiz <sameo@linux.intel.com>	2017-03-24 15:28:14 +01:00
Daniel J Walsh	19620f3d1e	Switch to using opencontainers/selinux We have moved selinux support out of opencontainers/runc into its own package. This patch moves to using the new selinux go bindings. Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>	2017-03-23 15:53:09 -04:00
Daniel J Walsh	ff950a8e37	Set SELinux mount label for pod sandbox The pause container is creating an AVC since the /dev/null device is not labeled correctly. Looks like we are only setting the label of the process not the label of the content inside of the container. This change will label content in the pause container correctly and eliminate the AVC. Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>	2017-03-16 14:09:38 -04:00
Mrunal Patel	8c0ff7d904	Run conmon under cgroups (systemd) Signed-off-by: Mrunal Patel <mrunalp@gmail.com>	2017-03-06 15:08:46 -08:00
Pengfei Ni	3195f45904	Merge pull request #367 from sameo/topic/host-privileged-runtime Support alternate runtime for host privileged operations	2017-03-05 07:53:20 +08:00
Mrunal Patel	38f497a701	Fix cgroup parent We were using a variable before it was set. Signed-off-by: Mrunal Patel <mrunalp@gmail.com>	2017-03-03 16:38:46 -08:00
Samuel Ortiz	2ec696be41	server: Set sandbox and container privileged flags The sandbox privileged flag is set to true only if either the pod configuration privileged flag is set to true or when any of the pod namespaces are the host ones. A container inherit its privileged flag from its sandbox, and will be run by the privileged runtime only if it's set to true. In other words, the privileged runtime (when defined) will be when one of the below conditions is true: - The sandbox will be asked to run at least one privileged container. - The sandbox requires access to either the host IPC or networking namespaces. Signed-off-by: Samuel Ortiz <sameo@linux.intel.com>	2017-03-03 19:06:04 +01:00
Andrew Pilloud	44e7e88ff3	Run without seccomp support Signed-off-by: Andrew Pilloud <andrewpilloud@igneoussystems.com>	2017-02-21 16:47:03 -08:00
Michał Żyłowski	5c81217e09	Applying k8s.io v3 API for ocic and ocid Signed-off-by: Michał Żyłowski <michal.zylowski@intel.com>	2017-02-06 13:05:10 +01:00
Nalin Dahyabhai	c0333b102b	Integrate containers/storage Use containers/storage to store images, pod sandboxes, and containers. A pod sandbox's infrastructure container has the same ID as the pod to which it belongs, and all containers also keep track of their pod's ID. The container configuration that we build using the data in a CreateContainerRequest is stored in the container's ContainerDirectory and ContainerRunDirectory. We catch SIGTERM and SIGINT, and when we receive either, we gracefully exit the grpc loop. If we also think that there aren't any container filesystems in use, we attempt to do a clean shutdown of the storage driver. The test harness now waits for ocid to exit before attempting to delete the storage root directory. Signed-off-by: Nalin Dahyabhai <nalin@redhat.com>	2017-01-18 10:23:30 -05:00
Jacek J. Łakis	b034072d6a	sandbox_run: Do not run net plugin in host namespace Signed-off-by: Jacek J. Łakis <jacek.lakis@intel.com>	2017-01-16 16:53:29 +01:00
Mrunal Patel	6df58df215	Add support for systemd cgroups Signed-off-by: Mrunal Patel <mpatel@redhat.com>	2016-12-19 16:31:29 -08:00
Harry Zhang	02dfe877e4	Add container to pod qos cgroup Signed-off-by: Harry Zhang <harryz@hyper.sh>	2016-12-15 14:42:59 +08:00
Samuel Ortiz	a9724c2c9c	sandbox: Fix gocyclo complexity With the networking namespace code added, we were reaching a gocyclo complexitiy of 52. By moving the container creation and starting code path out, we're back to reasonable levels. Signed-off-by: Samuel Ortiz <sameo@linux.intel.com>	2016-12-12 19:48:23 +01:00
Samuel Ortiz	482eb460d6	sandbox: Setup networking namespace before sandbox creation In order for hypervisor based container runtimes to be able to fully prepare their pod virtual machines networking interfaces, this patch sets the pod networking namespace before creating the sandbox container. Once the sandbox networking namespace is prepared, the runtime can scan the networking namespace interfaces and build the pod VM matching interfaces (typically TAP interfaces) at pod sandbox creation time. Not doing so means those runtimes would have to rely on all hypervisors to support networking interfaces hotplug. Signed-off-by: Samuel Ortiz <sameo@linux.intel.com>	2016-12-12 19:48:23 +01:00
Samuel Ortiz	4cab8ed06a	sandbox: Use persistent networking namespace Because they need to prepare the hypervisor networking interfaces and have them match the ones created in the pod networking namespace (typically to bridge TAP and veth interfaces), hypervisor based container runtimes need the sandbox pod networking namespace to be set up before it's created. They can then prepare and start the hypervisor interfaces when creating the pod virtual machine. In order to do so, we need to create per pod persitent networking namespaces that we pass to the CNI plugin. This patch leverages the CNI ns package to create such namespaces under /var/run/netns, and assign them to all pod containers. The persitent namespace is removed when either the pod is stopped or removed. Since the StopPodSandbox() API can be called multiple times from kubelet, we track the pod networking namespace state (closed or not) so that we don't get a containernetworking/ns package error when calling its Close() routine multiple times as well. Signed-off-by: Samuel Ortiz <sameo@linux.intel.com>	2016-12-12 19:48:23 +01:00

1 2

58 commits