Joseph Schorr 
								
							 
						 
						
							
							
							
							
								
							
							
								3112388004 
								
							 
						 
						
							
							
								
								Fix multiple reporting of incomplete  
							
							
							
						 
						
							2016-08-17 16:01:28 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Joseph Schorr 
								
							 
						 
						
							
							
							
							
								
							
							
								5e1a117ff3 
								
							 
						 
						
							
							
								
								Delete the job first to prevent Kubernetes from starting another pod  
							
							
							
						 
						
							2016-08-16 16:33:43 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Joseph Schorr 
								
							 
						 
						
							
							
							
							
								
							
							
								742e153133 
								
							 
						 
						
							
							
								
								Fix watch of the jobs key in the build manager  
							
							
							
						 
						
							2016-08-16 15:43:09 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Joseph Schorr 
								
							 
						 
						
							
							
							
							
								
							
							
								313d65a6a4 
								
							 
						 
						
							
							
								
								Make sure the etcd watch coroutines get called  
							
							
							
						 
						
							2016-08-16 13:02:27 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Joseph Schorr 
								
							 
						 
						
							
							
							
							
								
							
							
								d78361b041 
								
							 
						 
						
							
							
								
								Cleanup old executions that never start  
							
							
							
							
							Fixes  #1727  
						
							2016-08-15 16:54:02 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Joseph Schorr 
								
							 
						 
						
							
							
							
							
								
							
							
								c29f9ccc7f 
								
							 
						 
						
							
							
								
								Fix TTL on heartbeat in etcd  
							
							
							
							
							Previously, once the heartbeat had expired, we issued a negative TTL, which causes etcd to either raise an exception or simply ignore the expiration (depending on the version of etcd). This change ensures that once the key has expired, it is removed immediately by setting a TTL of 0. It also adds tests for this case and for the normal expiration case.
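As a rough illustration of the fix described in this commit message, the TTL can be clamped so that it never goes negative. This is a minimal sketch, not the actual build manager code: the function name `heartbeat_ttl` and its parameters are hypothetical, and the call that writes the key to etcd is omitted.

```python
import time


def heartbeat_ttl(expiration_ts, now=None):
    """Return a non-negative TTL in whole seconds (hypothetical helper).

    expiration_ts: absolute expiration time as a Unix timestamp.
    now: current Unix timestamp; defaults to time.time().
    """
    if now is None:
        now = time.time()
    remaining = int(expiration_ts - now)
    # An already-expired heartbeat yields 0 rather than a negative value.
    return max(0, remaining)
```

The caller would pass the result as the TTL when setting the heartbeat key, so an expired key gets a TTL of 0 instead of a negative number.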
							
						 
						
							2016-08-03 11:15:03 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Joseph Schorr 
								
							 
						 
						
							
							
							
							
								
							
							
								428a7cb435 
								
							 
						 
						
							
							
								
								Fix decreased setup timeout on ephemeral build manager  
							
							
							
						 
						
							2016-07-22 13:35:38 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Joseph Schorr 
								
							 
						 
						
							
							
							
							
								
							
							
								392242d20b 
								
							 
						 
						
							
							
								
								Another fix for the record keeping in buildman  
							
							
							
							
							Adds some more mocked tests as well 
							
						 
						
							2016-07-22 12:01:30 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Joseph Schorr 
								
							 
						 
						
							
							
							
							
								
							
							
								68baa51d55 
								
							 
						 
						
							
							
								
								Fix cross-manager handling of realm components  
							
							
							
						 
						
							2016-07-21 15:47:25 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Joseph Schorr 
								
							 
						 
						
							
							
							
							
								
							
							
								4420b1bac9 
								
							 
						 
						
							
							
								
								Add temporary back-compat shims for the build manager  
							
							
							
						 
						
							2016-07-20 13:41:01 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Joseph Schorr 
								
							 
						 
						
							
							
							
							
								
							
							
								2c1880b944 
								
							 
						 
						
							
							
								
								Bug fixes, refactoring and "new" tests for the build manager  
							
							
							
							
							- Fixes various bugs introduced in the most recent build system commit
- Refactors state management in the build manager to be cleaner and more contained
- Adds back in the mock-based tests, fixed to not use threads and adjusted for the refactoring
- Adds some more simplified unit tests around non-etcd-related flows
							
						 
						
							2016-07-18 13:46:48 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Joseph Schorr 
								
							 
						 
						
							
							
							
							
								
							
							
								74b87fa813 
								
							 
						 
						
							
							
								
								Build manager cleanup and more logging  
							
							
							
						 
						
							2016-07-14 14:33:14 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Joseph Schorr 
								
							 
						 
						
							
							
							
							
								
							
							
								d8b72e8503 
								
							 
						 
						
							
							
								
								Switch to using a defined branch and not always pulling the VM image  
							
							
							
						 
						
							2016-07-08 17:53:25 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Joseph Schorr 
								
							 
						 
						
							
							
							
							
								
							
							
								3d4af78f01 
								
							 
						 
						
							
							
								
								Fix label to never allow a space (which breaks Kubernetes)  
							
							
							
						 
						
							2016-07-08 17:09:06 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Joseph Schorr 
								
							 
						 
						
							
							
							
							
								
							
							
								811413fe9c 
								
							 
						 
						
							
							
								
								Add multiple executor and whitelist support to build manager  
							
							
							
						 
						
							2016-07-08 15:50:51 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Joseph Schorr 
								
							 
						 
						
							
							
							
							
								
							
							
								7471d0e35f 
								
							 
						 
						
							
							
								
								Small code cleanup before whitelist addition  
							
							
							
						 
						
							2016-07-08 15:50:51 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Colin Hom 
								
							 
						 
						
							
							
							
							
								
							
							
								1e3351f3f4 
								
							 
						 
						
							
							
								
								local-docker.sh now accepts env vars  
							
							
							
						 
						
							2016-07-08 15:50:51 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Colin Hom 
								
							 
						 
						
							
							
							
							
								
							
							
								bc13333f20 
								
							 
						 
						
							
							
								
								Kubernetes build worker  
							
							
							
						 
						
							2016-07-08 15:50:51 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Joseph Schorr 
								
							 
						 
						
							
							
							
							
								
							
							
								713ba3abaf 
								
							 
						 
						
							
							
								
								Further updates to the Prometheus client code  
							
							
							
						 
						
							2016-07-01 14:16:51 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Joseph Schorr 
								
							 
						 
						
							
							
							
							
								
							
							
								1173192739 
								
							 
						 
						
							
							
								
								Move channel back, as it is referenced by generate_cloud_config  
							
							
							
						 
						
							2016-06-22 17:25:06 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Joseph Schorr 
								
							 
						 
						
							
							
							
							
								
							
							
								61695eb439 
								
							 
						 
						
							
							
								
								Allow the build node AMI to be overridden in config  
							
							
							
						 
						
							2016-06-22 15:13:54 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Joseph Schorr 
								
							 
						 
						
							
							
							
							
								
							
							
								7292524d69 
								
							 
						 
						
							
							
								
								Add a CloudWatch metric when we fail to start a build via EC2
							
							
							
							
							Fixes  #1555  
						
							2016-06-17 16:19:57 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Joseph Schorr 
								
							 
						 
						
							
							
							
							
								
							
							
								f9469a84b3 
								
							 
						 
						
							
							
								
								Make the size of the build node HDD configurable  
							
							
							
							
							Fixes  #1520  
						
							2016-06-06 11:35:10 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Joseph Schorr 
								
							 
						 
						
							
							
							
							
								
							
							
								5262535945 
								
							 
						 
						
							
							
								
								Boto error_code is a string, not the HTTP status code  
							
							
							
						 
						
							2015-12-23 15:12:01 -05:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Joseph Schorr 
								
							 
						 
						
							
							
							
							
								
							
							
								773e73861f 
								
							 
						 
						
							
							
								
								Change error into info in build manager  
							
							
							
							
							Fixes  #1046  
						
							2015-12-09 14:30:14 -05:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									josephschorr 
								
							 
						 
						
							
							
							
							
								
							
							
								c06e5cc9c7 
								
							 
						 
						
							
							
								
								Merge pull request  #1002  from coreos-inc/buildertagexc  
							
							
							
							
							Add timeout and failure if an EC2 instance could not be found when ta… 
							
						 
						
							2015-12-09 14:28:31 -05:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Joseph Schorr 
								
							 
						 
						
							
							
							
							
								
							
							
								946e5fabc0 
								
							 
						 
						
							
							
								
								Add timeout and failure if an EC2 instance could not be found when tagging  
							
							
							
							
							Fixes  #994  
						
							2015-12-09 14:28:19 -05:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Joseph Schorr 
								
							 
						 
						
							
							
							
							
								
							
							
								edd9a03af5 
								
							 
						 
						
							
							
								
								Catch additional key not found exception  
							
							
							
							
							Fixes  #806  
						
							2015-12-01 12:29:58 -05:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Jimmy Zelinskie 
								
							 
						 
						
							
							
							
							
								
							
							
								46b2f10d7f 
								
							 
						 
						
							
							
								
								check for VPC subnet ID before using builder VPC  
							
							
							
							
							This means you can use legacy networking machines by simply changing the
instance type and removing the specified 'EC2_VPC_SUBNET_ID' from the
executor config. 
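The check described in this commit message could look roughly like the sketch below. Only the `EC2_VPC_SUBNET_ID` key comes from the message; the function name, the `EC2_INSTANCE_TYPE` key, and the option names in the returned dict are assumptions made for illustration.

```python
def launch_options(executor_config):
    """Build launch options from an executor config dict (hypothetical helper)."""
    # 'EC2_INSTANCE_TYPE' is an assumed key name, not taken from the source.
    options = {"instance_type": executor_config["EC2_INSTANCE_TYPE"]}

    # Only use the builder VPC when a subnet ID is actually configured.
    subnet_id = executor_config.get("EC2_VPC_SUBNET_ID")
    if subnet_id is not None:
        options["subnet_id"] = subnet_id

    return options
```

With this shape, removing `EC2_VPC_SUBNET_ID` from the config is enough to fall back to a launch without a VPC subnet.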
							
						 
						
							2015-10-22 14:50:54 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Joseph Schorr 
								
							 
						 
						
							
							
							
							
								
							
							
								0f37e66cc8 
								
							 
						 
						
							
							
								
								Better error handling for the build manager  
							
							
							
							
							Fixes  #604  
						
							2015-10-13 11:40:07 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Matt Jibson 
								
							 
						 
						
							
							
							
							
								
							
							
								bba1557437 
								
							 
						 
						
							
							
								
								Monitor queue adds and EC2 node starts  
							
							
							
							
							fixes  #157 
see #304  
						
							2015-09-18 16:21:16 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Joseph Schorr 
								
							 
						 
						
							
							
							
							
								
							
							
								04cc471585 
								
							 
						 
						
							
							
								
								Increase the HD size on the build nodes  
							
							
							
							
							Fixes  #228  
						
							2015-07-14 15:20:17 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Joseph Schorr 
								
							 
						 
						
							
							
							
							
								
							
							
								6655c7f745 
								
							 
						 
						
							
							
								
								Add exception handling that doesn't log the read-timeout exception  
							
							
							
							
							Note: This is a *hack* and needs to be replaced with proper code ASAP 
							
						 
						
							2015-06-25 23:35:29 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Joseph Schorr 
								
							 
						 
						
							
							
							
							
								
							
							
								6e6610f31a 
								
							 
						 
						
							
							
								
								Switch to a 30s maximum timeout  
							
							
							
						 
						
							2015-06-25 23:08:49 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Joseph Schorr 
								
							 
						 
						
							
							
							
							
								
							
							
								ecebc06343 
								
							 
						 
						
							
							
								
								Update comment now that restarter is abstracted  
							
							
							
						 
						
							2015-06-25 21:53:42 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Joseph Schorr 
								
							 
						 
						
							
							
							
							
								
							
							
								9f5f71398c 
								
							 
						 
						
							
							
								
								Abstract out the concept of a restart function  
							
							
							
						 
						
							2015-06-25 21:40:50 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Joseph Schorr 
								
							 
						 
						
							
							
							
							
								
							
							
								52fa9aad5b 
								
							 
						 
						
							
							
								
								Fix etcd watching  
							
							
							
							
							Etcd can miss events on watches if they occur fast enough, so if we get an exception indicating that we've missed an index, we reset the state of our local tracking structures by re-reading the *full* list and starting a new watch at HEAD.
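The recovery strategy in this commit message can be sketched as follows. This is an illustrative outline under stated assumptions, not the build manager's implementation: `IndexCleared`, `JobTracker`, and the client methods `read_all` and `watch` are hypothetical stand-ins for whatever the real etcd client exposes.

```python
class IndexCleared(Exception):
    """Hypothetical error: the requested watch index is no longer available."""


class JobTracker:
    """Tracks keys under a prefix and resyncs when watch events are missed."""

    def __init__(self, client, prefix):
        self.client = client    # assumed to provide read_all() and watch()
        self.prefix = prefix
        self.jobs = {}          # local tracking structure
        self.next_index = None  # next index to watch from

    def resync(self):
        # Re-read the full list and restart the watch just past HEAD.
        snapshot, head_index = self.client.read_all(self.prefix)
        self.jobs = dict(snapshot)
        self.next_index = head_index + 1

    def step(self):
        """Process one watch event, or resync if the index was missed."""
        if self.next_index is None:
            self.resync()
            return
        try:
            index, key, value = self.client.watch(self.prefix, self.next_index)
        except IndexCleared:
            self.resync()
            return
        if value is None:
            self.jobs.pop(key, None)  # treat a None value as a deletion
        else:
            self.jobs[key] = value
        self.next_index = index + 1
```

Discarding the local state on a missed index trades a full re-read for correctness: the tracker never tries to guess which events it did not see.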
							
						 
						
							2015-06-25 21:22:39 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Joseph Schorr 
								
							 
						 
						
							
							
							
							
								
							
							
								b4c39e8ec0 
								
							 
						 
						
							
							
								
								Fix ephemeral build manager to ask for watches in index order with no gaps  
							
							
							
						 
						
							2015-06-23 17:11:46 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Joseph Schorr 
								
							 
						 
						
							
							
							
							
								
							
							
								c2dc1c9b75 
								
							 
						 
						
							
							
								
								Handle case where etcd key is already removed on job complete  
							
							
							
						 
						
							2015-06-17 15:02:58 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Jake Moshenko 
								
							 
						 
						
							
							
							
							
								
							
							
								c435f5c127 
								
							 
						 
						
							
							
								
								Add a comment about why we are taking a lock when terminating a builder machine.  
							
							
							
						 
						
							2015-06-10 16:19:51 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Jake Moshenko 
								
							 
						 
						
							
							
							
							
								
							
							
								f767fc4d03 
								
							 
						 
						
							
							
								
								Track whether builders ever came online in etcd. Mark builds which never successfully heartbeated as incomplete.  
							
							
							
						 
						
							2015-06-10 16:19:51 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Jake Moshenko 
								
							 
						 
						
							
							
							
							
								
							
							
								884fedd229 
								
							 
						 
						
							
							
								
								Improve the log messages in the buildman.  
							
							
							
						 
						
							2015-06-10 16:19:51 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Jake Moshenko 
								
							 
						 
						
							
							
							
							
								
							
							
								d31e25d5cd 
								
							 
						 
						
							
							
								
								Allow the individual build manager types to specify how long the queue should wait before retrying a job that fails to schedule.
							
							
							
						 
						
							2015-06-10 16:19:50 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Joseph Schorr 
								
							 
						 
						
							
							
							
							
								
							
							
								f82831bff6 
								
							 
						 
						
							
							
								
								Log the etcd exception so we can debug this issue  
							
							
							
						 
						
							2015-06-09 20:33:55 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Jake Moshenko 
								
							 
						 
						
							
							
							
							
								
							
							
								6eead7c860 
								
							 
						 
						
							
							
								
								Add logentries reporting to the ephemeral builders.  
							
							
							
						 
						
							2015-03-27 15:28:08 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Jake Moshenko 
								
							 
						 
						
							
							
							
							
								
							
							
								0349f3f1a3 
								
							 
						 
						
							
							
								
								Handle the case where YAML config returns a list not a tuple.  
							
							
							
						 
						
							2015-03-26 14:53:56 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Jimmy Zelinskie 
								
							 
						 
						
							
							
							
							
								
							
							
								8589871f43 
								
							 
						 
						
							
							
								
								buildman: rm unused imports  
							
							
							
						 
						
							2015-03-09 13:04:16 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Jake Moshenko 
								
							 
						 
						
							
							
							
							
								
							
							
								5c68e52fce 
								
							 
						 
						
							
							
								
								Really really fix the exception handling.  
							
							
							
						 
						
							2015-02-27 17:33:46 -05:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Jake Moshenko 
								
							 
						 
						
							
							
							
							
								
							
							
								cf5bc6f0be 
								
							 
						 
						
							
							
								
								Properly catch multiple exceptions.  
							
							
							
						 
						
							2015-02-27 17:32:10 -05:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Jake Moshenko 
								
							 
						 
						
							
							
							
							
								
							
							
								857c3e2959 
								
							 
						 
						
							
							
								
								Start catching etcd key errors as well.  
							
							
							
						 
						
							2015-02-27 17:10:15 -05:00