containerd

Author	SHA1	Message	Date
Stephen J Day	e53539c58f	cmd/dist, cmd/ctr: end to end image pull With this changeset, we now have a proof of concept of end to end pull. Up to this point, the relationship between subsystems has been somewhat theoretical. We now leverage fetching, the snapshot drivers, the rootfs service, image metadata and the execution service, validating the proposed model for containerd. There are a few caveats, including the need to move some of the access into GRPC services, but the basic components are there. The first command we will cover here is `dist pull`. This is the analog of `docker pull` and `git pull`. It performs a full resource fetch for an image and unpacks the root filesystem into the snapshot drivers. An example follows: ``` console $ sudo ./bin/dist pull docker.io/library/redis:latest docker.io/library/redis:latest: resolved \|++++++++++++++++++++++++++++++++++++++\| manifest-sha256:4c8fb09e8d634ab823b1c125e64f0e1ceaf216025aa38283ea1b42997f1e8059: done \|++++++++++++++++++++++++++++++++++++++\| layer-sha256:3b281f2bcae3b25c701d53a219924fffe79bdb74385340b73a539ed4020999c4: done \|++++++++++++++++++++++++++++++++++++++\| config-sha256:e4a35914679d05d25e2fccfd310fde1aa59ffbbf1b0b9d36f7b03db5ca0311b0: done \|++++++++++++++++++++++++++++++++++++++\| layer-sha256:4b7726832aec75f0a742266c7190c4d2217492722dfd603406208eaa902648d8: done \|++++++++++++++++++++++++++++++++++++++\| layer-sha256:338a7133395941c85087522582af182d2f6477dbf54ba769cb24ec4fd91d728f: done \|++++++++++++++++++++++++++++++++++++++\| layer-sha256:83f12ff60ff1132d1e59845e26c41968406b4176c1a85a50506c954696b21570: done \|++++++++++++++++++++++++++++++++++++++\| layer-sha256:693502eb7dfbc6b94964ae66ebc72d3e32facd981c72995b09794f1e87bac184: done \|++++++++++++++++++++++++++++++++++++++\| layer-sha256:622732cddc347afc9360b4b04b46c6f758191a1dc73d007f95548658847ee67e: done \|++++++++++++++++++++++++++++++++++++++\| layer-sha256:19a7e34366a6f558336c364693df538c38307484b729a36fede76432789f084f: done \|++++++++++++++++++++++++++++++++++++++\| elapsed: 1.6 s total: 0.0 B (0.0 B/s) INFO[0001] unpacking rootfs ``` Note that we haven't integrated rootfs unpacking into the status output, but we pretty much have what is in docker today (:P). We can see the result of our pull with the following: ```console $ sudo ./bin/dist images REF TYPE DIGEST SIZE docker.io/library/redis:latest application/vnd.docker.distribution.manifest.v2+json sha256:4c8fb09e8d634ab823b1c125e64f0e1ceaf216025aa38283ea1b42997f1e8059 1.8 kB ``` The above shows that we have an image called "docker.io/library/redis:latest" mapped to the given digest marked with a specific format. We get the size of the manifest right now, not the full image, but we can add more as we need it. For the most part, this is all that is needed, but a few tweaks to the model for naming may need to be added. Specifically, we may want to index under a few different names, including those qualified by hash or matched by tag versions. We can do more work in this area as we develop the metadata store. The name shown above can then be used to run the actual container image. We can do this with the following command: ```console $ sudo ./bin/ctr run --id foo docker.io/library/redis:latest /usr/local/bin/redis-server 1:C 17 Mar 17:20:25.316 # Warning: no config file specified, using the default config. In order to specify a config file use /usr/local/bin/redis-server /path/to/redis.conf 1:M 17 Mar 17:20:25.317 * Increased maximum number of open files to 10032 (it was originally set to 1024). _._ _.-``__ ''-._ _.-`` `. `_. ''-._ Redis 3.2.8 (00000000/0) 64 bit .-`` .-```. ```\/ _.,_ ''-._ ( ' , .-` \| `, ) Running in standalone mode \|`-._`-...-` __...-.``-._\|'` _.-'\| Port: 6379 \| `-._ `._ / _.-' \| PID: 1 `-._ `-._ `-./ _.-' _.-' \|`-._`-._ `-.__.-' _.-'_.-'\| \| `-._`-._ _.-'_.-' \| http://redis.io `-._ `-._`-.__.-'_.-' _.-' \|`-._`-._ `-.__.-' _.-'_.-'\| \| `-._`-._ _.-'_.-' \| `-._ `-._`-.__.-'_.-' _.-' `-._ `-.__.-' _.-' `-._ _.-' `-.__.-' 1:M 17 Mar 17:20:25.326 # WARNING: The TCP backlog setting of 511 cannot be enforced because /proc/sys/net/core/somaxconn is set to the lower value of 128. 1:M 17 Mar 17:20:25.326 # Server started, Redis version 3.2.8 1:M 17 Mar 17:20:25.326 # WARNING overcommit_memory is set to 0! Background save may fail under low memory condition. To fix this issue add 'vm.overcommit_memory = 1' to /etc/sysctl.conf and then reboot or run the command 'sysctl vm.overcommit_memory=1' for this to take effect. 1:M 17 Mar 17:20:25.326 # WARNING you have Transparent Huge Pages (THP) support enabled in your kernel. This will create latency and memory usage issues with Redis. To fix this issue run the command 'echo never > /sys/kernel/mm/transparent_hugepage/enabled' as root, and add it to your /etc/rc.local in order to retain the setting after a reboot. Redis must be restarted after THP is disabled. 1:M 17 Mar 17:20:25.326 * The server is now ready to accept connections on port 6379 ``` Wow! So, now we are running `redis`! There are still a few things to work out. Notice that we have to specify the command as part of the arguments to `ctr run`. This is because are not yet reading the image config and converting it to an OCI runtime config. With the base laid in this PR, adding such functionality should be straightforward. While this is a _little_ messy, this is great progress. It should be easy sailing from here. Signed-off-by: Stephen J Day <stephen.day@docker.com>	2017-03-21 13:08:23 -07:00
fate-grand-order	680e191c91	fix misspell "resources" in content/store.go Signed-off-by: fate-grand-order <chenjg@harmonycloud.cn>	2017-03-17 10:41:24 +08:00
chchliang	4465e493fc	Closed stream in store.go Active() Signed-off-by: chchliang <chen.chuanliang@zte.com.cn>	2017-03-15 16:53:30 +08:00
Stephen J Day	5da4e1d0d2	services/content: move service client into package Signed-off-by: Stephen J Day <stephen.day@docker.com>	2017-02-28 17:12:24 -08:00
Stephen Day	317b884110	Merge pull request #576 from stevvooe/seek-reset-handling content: allow reset via Truncate	2017-02-28 10:42:21 -08:00
Stephen J Day	d99756a8a2	content: allow reset via Truncate To make restarting after failed pull less racy, we define `Truncate(size int64) error` on `content.Writer` for the zero offset. Truncating a writer will dump any existing data and digest state and start from the beginning. All subsequent writes will start from the zero offset. For the service, we support this by defining the behavior for a write that changes the offset. To keep this narrow, we only support writes out of order at the offset 0, which causes the writer to dump existing data and reset the local hash. This makes restarting failed pulls much smoother when there was a previously encountered error and the source doesn't support arbitrary seeks or reads at arbitrary offsets. By allowing this to be done while holding the write lock on a ref, we can restart the full download without causing a race condition. Once we implement seeking on the `io.Reader` returned by the fetcher, this will be less useful, but it is good to ensure that our protocol properly supports this use case for when streaming is the only option. Signed-off-by: Stephen J Day <stephen.day@docker.com>	2017-02-28 10:40:02 -08:00
Stephen J Day	706c629354	api/services/content: define delete method Allow deletion of content over the GRPC interface. For now, we are going with a model that conducts reference management outside of the content store, in the metadata store but this design is valid either way. Signed-off-by: Stephen J Day <stephen.day@docker.com>	2017-02-27 20:06:29 -08:00
Stephen J Day	850f8addc2	content: close writer after opening Signed-off-by: Stephen J Day <stephen.day@docker.com>	2017-02-24 16:08:31 -08:00
Stephen J Day	c062a85782	content: cleanup service and interfaces After implementing pull, a few changes are required to the content store interface to make sure that the implementation works smoothly. Specifically, we work to make sure the predeclaration path for digests works the same between remote and local writers. Before, we were hesitent to require the the size and digest up front, but it became clear that having this provided significant benefit. There are also several cleanups related to naming. We now call the expected digest `Expected` consistently across the board and `Total` is used to mark the expected size. This whole effort comes together to provide a very smooth status reporting workflow for image pull and push. This will be more obvious when the bulk of pull code lands. There are a few other changes to make `content.WriteBlob` more broadly useful. In accordance with addition for predeclaring expected size when getting a `Writer`, `WriteBlob` now supports this fully. It will also resume downloads if provided an `io.Seeker` or `io.ReaderAt`. Coupled with the `httpReadSeeker` from `docker/distribution`, we should only be a lines of code away from resumable downloads. Signed-off-by: Stephen J Day <stephen.day@docker.com>	2017-02-22 13:30:01 -08:00
Michael Crosby	3101be93bc	Load runtimes dynamically via go1.8 plugins Signed-off-by: Michael Crosby <crosbymichael@gmail.com> Add registration for more subsystems via plugins Signed-off-by: Michael Crosby <crosbymichael@gmail.com> Move content service to separate package Signed-off-by: Michael Crosby <crosbymichael@gmail.com>	2017-02-21 16:29:46 -08:00
Stephen J Day	e6efb397cf	cmd/dist: port commands over to use GRPC content store Following from the rest of the work in this branch, we now are porting the dist command to work directly against the containerd content API. Signed-off-by: Stephen J Day <stephen.day@docker.com>	2017-02-21 13:10:31 -08:00
Stephen J Day	621164bc84	content: refactor content store for API After iterating on the GRPC API, the changes required for the actual implementation are now included in the content store. The begin change is the move to a single, atomic `Ingester.Writer` method for locking content ingestion on a key. From this, comes several new interface definitions. The main benefit here is the clarification between `Status` and `Info` that came out of the GPRC API. `Status` tells the status of a write, whereas `Info` is for querying metadata about various blobs. Signed-off-by: Stephen J Day <stephen.day@docker.com>	2017-02-21 13:10:22 -08:00
kevinetc123	08f8270e2d	remove redundant code Signed-off-by: kevinetc123 <kaiwentan@harmonycloud.cn>	2017-02-15 20:43:17 +08:00
fate-grand-order	d44c8525d7	fix misspell "scrutiny" in content/content.go Signed-off-by: fate-grand-order <chenjg@harmonycloud.cn>	2017-02-07 15:06:40 +08:00
Akihiro Suda	e9510eb69c	content: fix panic during filepath.Walk() Signed-off-by: Akihiro Suda <suda.akihiro@lab.ntt.co.jp>	2017-02-06 05:16:21 +00:00
Kenfe-Mickaël Laventure	c6446b93da	Merge pull request #484 from stevvooe/snapshot-drivers [WIP] snapshot: define the snapshot driver interface	2017-02-03 07:56:12 -08:00
Stephen J Day	68fd25221d	snapshot: define the snapshot driver interface We now define the `snapshot.Driver` interface based on earlier work. Many details of the model are worked out, such as snapshot lifecycle and parentage of commits against "Active" snapshots. The impetus of this change is to provide a snapshot POC that does a complete push/pull workflow. The beginnings of a test suite for snapshot drivers is included that we can use to verify the assumptions of drivers. The intent is to port the existing tests over to this test suite and start scaling contributions and test to the snapshot driver subsystem. There are still some details that need to be worked out, such as listing and metadata access. We can do this activity as we further integrate with tooling. Signed-off-by: Stephen J Day <stephen.day@docker.com>	2017-02-02 18:06:02 -08:00
Stephen J Day	5ff837e2e7	content: use buffer pool in hash copy Signed-off-by: Stephen J Day <stephen.day@docker.com>	2017-02-02 14:27:52 -08:00
Stephen J Day	f9cd9be61a	dist: expand functionality of the dist tool With this change, we add the following commands to the dist tool: - `ingest`: verify and accept content into storage - `active`: display active ingest processes - `list`: list content in storage - `path`: provide a path to a blob by digest - `delete`: remove a piece of content from storage We demonstrate the utility with the following shell pipeline: ``` $ ./dist fetch docker.io/library/redis latest mediatype:application/vnd.docker.distribution.manifest.v2+json \| \ jq -r '.layers[] \| "./dist fetch docker.io/library/redis "+.digest + "\| ./dist ingest --expected-digest "+.digest+" --expected-size "+(.size \| tostring) +" docker.io/library/redis@"+.digest' \| xargs -I{} -P10 -n1 sh -c "{}" ``` The above fetches a manifest, pipes it to jq, which assembles a shell pipeline to ingest each layer into the content store. Because the transactions are keyed by their digest, concurrent downloads and downloads of repeated content are ignored. Each process is then executed parallel using xargs. Put shortly, this is a parallel layer download. In a separate shell session, could monitor the active downloads with the following: ``` $ watch -n0.2 ./dist active ``` For now, the content is downloaded into `.content` in the current working directory. To watch the contents of this directory, you can use the following: ``` $ watch -n0.2 tree .content ``` This will help to understand what is going on internally. To get access to the layers, you can use the path command: ``` $./dist path sha256:010c454d55e53059beaba4044116ea4636f8dd8181e975d893931c7e7204fffa sha256:010c454d55e53059beaba4044116ea4636f8dd8181e975d893931c7e7204fffa /home/sjd/go/src/github.com/docker/containerd/.content/blobs/sha256/010c454d55e53059beaba4044116ea4636f8dd8181e975d893931c7e7204fffa ``` When you are done, you can clear out the content with the classic xargs pipeline: ``` $ ./dist list -q \| xargs ./dist delete ``` Note that this is mostly a POC. Things like failed downloads and abandoned download cleanup aren't quite handled. We'll probably make adjustments around how content store transactions are handled to address this. From here, we'll build out full image pull and create tooling to get runtime bundles from the fetched content. Signed-off-by: Stephen J Day <stephen.day@docker.com>	2017-01-27 10:29:10 -08:00
Stephen J Day	f27d43285a	content: cleanup package exports The package exports are now cleaned up to remove a lot of stuttering in the API. We also remove direct mapping of refs to the filesystem, opting for a hash-based approach. This does affect insertion performance, since it requires more individual file ios. A benchmark baseline has been added and we can fix this later. Signed-off-by: Stephen J Day <stephen.day@docker.com>	2017-01-23 20:11:35 -08:00
Michael Crosby	c26e2bde5a	Merge pull request #447 from fate-grand-order/master Correct spelling errors in annotations in locks.go	2017-01-19 10:59:03 -08:00
Aaron.L.Xu	08bcbddb32	fix typo I found in this repo Signed-off-by: Aaron.L.Xu <likexu@harmonycloud.cn>	2017-01-20 01:18:26 +08:00
fate-grand-order	3400a450f2	Correct spelling errors in annotations in locks.go Signed-off-by: fate-grand-order <chenjg@harmonycloud.cn>	2017-01-19 12:27:51 +08:00
Gábor Lipták	d8aee18f6c	Point digest import to opencontainers/go-digest Signed-off-by: Gábor Lipták <gliptak@gmail.com>	2017-01-09 18:10:52 -05:00
Qiang Huang	3b8eee7522	Fix some typos Signed-off-by: Qiang Huang <h.huangqiang@huawei.com>	2016-12-16 09:31:19 +08:00
Stephen J Day	ab4bfc79e9	content: provide more context for lockfile error Signed-off-by: Stephen J Day <stephen.day@docker.com>	2016-12-07 17:46:01 -08:00
Stephen J Day	47f8b25d25	content: add Walk method to content store Signed-off-by: Stephen J Day <stephen.day@docker.com>	2016-12-01 21:37:58 -08:00
Stephen J Day	9dea9654bd	content: query active downloads and status Signed-off-by: Stephen J Day <stephen.day@docker.com>	2016-11-15 20:29:43 -08:00
Stephen J Day	b6e446e7be	content: add cross-process ingest locking Allow content stores to ingest content without coordination of a daemon to manage locks. Supports coordinated ingest and cross-process ingest status. Signed-off-by: Stephen J Day <stephen.day@docker.com>	2016-11-15 20:29:10 -08:00
Stephen J Day	3469905bbb	content: break up into multiple files Break up the content store prototype into a few logical files. We have a file for the store, the writer and helpers. Also, the writer has been modified to remove write and exec permissions on blobs in the store. Signed-off-by: Stephen J Day <stephen.day@docker.com>	2016-11-03 17:18:45 -07:00
Stephen J Day	9008471a84	content: initial implementation of content store After experimenting with pull, we've defined a transactional content store for verified storage of fetched items. A base component of containerkit, this will interact with both the runtime and distribution sides of the system, avoiding coupling. Blob access if provided through direct access to readonly files. Signed-off-by: Stephen J Day <stephen.day@docker.com>	2016-11-01 18:43:15 -07:00

31 commits