Commit graph

19 commits

Harshavardhana cfc9cfd84a
fix: various optimizations, idiomatic changes (#9179)
- acquire a single leader lock for all background operations
  - healing, crawling and applying lifecycle policies
    (see the sketch after this list).

- simplify lifecycle to avoid network calls, which was a
  bug in the implementation - we should hold a leader
  lock and do everything from there, since we have
  access to the entire namespace.

- make listing and walking avoid interfering with other
  operations by slowing themselves down, like the crawler does.

- effectively use the global context everywhere to ensure
  proper shutdown in cache, lifecycle and healing.

- don't read `format.json` for Prometheus metrics in the
  StorageInfo() call.
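
A minimal sketch of the leader-lock pattern, assuming a
hypothetical tryAcquireLeaderLock; MinIO's actual locker API
differs from this:

```go
package main

import (
	"context"
	"fmt"
	"time"
)

// tryAcquireLeaderLock is a hypothetical stand-in for acquiring a
// cluster-wide leader lock; assume it returns false while another
// node holds the lock.
func tryAcquireLeaderLock(ctx context.Context) bool { return true }

// runBackgroundOps runs healing, crawling and lifecycle only on the
// node that holds the leader lock, and stops when the global context
// is cancelled.
func runBackgroundOps(ctx context.Context) {
	for {
		select {
		case <-ctx.Done():
			return // global shutdown
		default:
		}
		if !tryAcquireLeaderLock(ctx) {
			time.Sleep(10 * time.Second) // not the leader; retry later
			continue
		}
		fmt.Println("leader: healing, crawling, applying lifecycle...")
		time.Sleep(time.Minute)
	}
}

func main() {
	ctx, cancel := context.WithCancel(context.Background())
	go runBackgroundOps(ctx)
	time.Sleep(2 * time.Second)
	cancel() // simulate shutdown via the global context
}
```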
2020-03-22 12:16:36 -07:00
Harshavardhana c9212819af
fix: lock maintenance should honor quorum (#9138)
The staleness of a lock should be determined by
a quorum number of entries returning stale; this
handles situations where locks are held while
nodes are down, so we do not accidentally clear
locks that are still valid and correct.

Also, lock maintenance should be run by all servers,
not just one; stale-lock cleanup needs to run without
requiring distributed locks to be held.
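
A minimal sketch of the quorum rule, assuming the per-locker
stale replies have already been collected (a hypothetical
shape, not MinIO's dsync types):

```go
package main

import "fmt"

// lockIsStale treats a lock as stale only when a quorum of lockers
// agree, so locks held while some nodes are down are not cleared by
// mistake.
func lockIsStale(staleReplies []bool, quorum int) bool {
	stale := 0
	for _, isStale := range staleReplies {
		if isStale {
			stale++
		}
	}
	return stale >= quorum
}

func main() {
	// 4-node cluster with quorum 3: two nodes are down and report
	// stale, two still hold the lock - it must NOT be cleared.
	replies := []bool{true, true, false, false}
	fmt.Println(lockIsStale(replies, 3)) // false
}
```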

Thanks @klauspost for reproducing this issue
2020-03-15 11:55:52 -07:00
kannappanr c7ca791c58
fix: lock expiry on zoned setups (#9084)
Lock ownership is limited to endpoints in the first zone,
as we do not hold locks on other zones in an expanded
setup. The current code unintentionally expired active locks
when it could not see ownership from the secondary zone,
which led to unexpected bugs as locking failed to work
as expected.
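
A minimal sketch of the corrected check, with hypothetical
zone and endpoint types: expiry consults only the first zone,
where lock ownership actually lives:

```go
package main

import "fmt"

type endpoint struct {
	name     string
	ownsLock bool
}

// lockIsOwned checks ownership against the first zone only; scanning
// the expansion zones (which never hold locks) would wrongly report
// active locks as unowned and expire them.
func lockIsOwned(zones [][]endpoint) bool {
	for _, ep := range zones[0] {
		if ep.ownsLock {
			return true
		}
	}
	return false
}

func main() {
	zones := [][]endpoint{
		{{"zone1-node1", true}, {"zone1-node2", false}},
		{{"zone2-node1", false}}, // expansion zone: holds no locks
	}
	fmt.Println(lockIsOwned(zones)) // true: do not expire the lock
}
```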
2020-03-04 16:06:17 -08:00
Harshavardhana ab7d3cd508
fix: Speed up multi-object delete by taking bulk locks (#8974)
Change distributed locking to allow taking bulk locks
across objects, reducing what is usually 1000 calls to 1.

Also allows for situations where multiple clients send
delete requests for objects with the following names

```
{1,2,3,4,5}
```

```
{5,4,3,2,1}
```

so that the requests block on each other and we do not
fail one because of the other.
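
A minimal sketch of the idea with a hypothetical in-process
locker (MinIO's distributed locking is built on dsync and
differs from this): each batch sorts its names and takes the
per-object locks in one deterministic pass, so overlapping
deletes serialize instead of failing each other.

```go
package main

import (
	"fmt"
	"sort"
	"sync"
)

// hypothetical in-process registry of per-object locks
var (
	registryMu sync.Mutex
	objectLock = map[string]*sync.Mutex{}
)

func lockFor(name string) *sync.Mutex {
	registryMu.Lock()
	defer registryMu.Unlock()
	if m, ok := objectLock[name]; ok {
		return m
	}
	m := &sync.Mutex{}
	objectLock[name] = m
	return m
}

// lockBulk takes all object locks in sorted order, so batches like
// {1,2,3,4,5} and {5,4,3,2,1} block on each other instead of
// deadlocking or failing mid-request.
func lockBulk(names []string) (unlock func()) {
	sorted := append([]string(nil), names...)
	sort.Strings(sorted)
	for _, n := range sorted {
		lockFor(n).Lock()
	}
	return func() {
		for i := len(sorted) - 1; i >= 0; i-- {
			lockFor(sorted[i]).Unlock()
		}
	}
}

func main() {
	var wg sync.WaitGroup
	for _, batch := range [][]string{{"1", "2", "3", "4", "5"}, {"5", "4", "3", "2", "1"}} {
		wg.Add(1)
		go func(b []string) {
			defer wg.Done()
			unlock := lockBulk(b)
			defer unlock()
			fmt.Println("deleting", b)
		}(batch)
	}
	wg.Wait()
}
```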
2020-02-21 11:29:57 +05:30
Harshavardhana b00cda8ad4 Avoid running lock maintenance from all nodes (#8737)
Co-Authored-By: Krishnan Parthasarathi <krisis@users.noreply.github.com>
2020-01-03 23:11:07 +05:30
Harshavardhana 10b2f15f6f Add randomized sleep times for lock checkers (#8628) 2019-12-11 10:57:05 -08:00
Harshavardhana 720442b1a2
Add lock expiry handler to expire state locks (#8562) 2019-11-25 16:39:43 -08:00
Harshavardhana c3771df641
Add bootstrap REST handler for verifying server config (#8550) 2019-11-22 12:45:13 -08:00
Harshavardhana 347b29d059 Implement bucket expansion (#8509) 2019-11-19 17:42:27 -08:00
Harshavardhana e9b2bf00ad Support MinIO to be deployed on more than 32 nodes (#8492)
This PR moves locking from a global entity to a
more localized, set-level entity, allowing locks
to be held only on the resources that are writing
to a collection of disks rather than at a global level.

In the process, this PR also removes the top-level
limit of 32 nodes, allowing an unlimited number of nodes.
This is a precursor change before bringing in bucket expansion.
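
A minimal sketch of set-level locking; the crc32-based
mapping here is illustrative, not MinIO's actual
object-to-set distribution:

```go
package main

import (
	"fmt"
	"hash/crc32"
	"sync"
)

const numSets = 4

// one lock per erasure set instead of one global lock
var setLocks [numSets]sync.Mutex

// lockForObject maps an object to its erasure set, so writers to
// different collections of disks never contend with each other.
func lockForObject(object string) *sync.Mutex {
	set := crc32.ChecksumIEEE([]byte(object)) % numSets
	return &setLocks[set]
}

func main() {
	mu := lockForObject("bucket/object-1")
	mu.Lock()
	fmt.Println("writing only to this object's set of disks")
	mu.Unlock()
}
```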
2019-11-13 12:17:45 -08:00
Harshavardhana 4e63e0e372 Return appropriate errors for API version changes across REST APIs (#8480)
This PR adds code to appropriately handle versioning issues
that come up regularly across our API changes. We were
also routing our requests incorrectly, which made it
harder to write consistent error-handling code to appropriately
reject or honor requests.

This PR potentially fixes these issues (a sketch of a
version guard follows the list):

 - an old mc used against a new, incompatible MinIO release
   now returns an appropriate error for client action
 - older servers talking to each other report an appropriate error
 - incompatible peer servers report an error and reject the calls
   with an appropriate error
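
A hedged sketch of such a version guard; the path prefix,
version string, and status code are illustrative, not MinIO's
actual routes:

```go
package main

import (
	"fmt"
	"net/http"
	"strings"
)

const apiVersionPrefix = "/minio/lock/v2/" // assumed current version

// versionGuard rejects requests that do not speak the current API
// version with a clear, actionable error instead of a confusing 404.
func versionGuard(next http.Handler) http.Handler {
	return http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		if !strings.HasPrefix(r.URL.Path, apiVersionPrefix) {
			// older client or peer server on an older version
			http.Error(w, "server expects API "+apiVersionPrefix+
				"; please upgrade the client/peer", http.StatusUpgradeRequired)
			return
		}
		next.ServeHTTP(w, r)
	})
}

func main() {
	mux := http.NewServeMux()
	mux.HandleFunc(apiVersionPrefix+"lock", func(w http.ResponseWriter, r *http.Request) {
		fmt.Fprintln(w, "ok")
	})
	http.ListenAndServe(":9000", versionGuard(mux))
}
```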
2019-11-04 09:30:59 -08:00
Harshavardhana 6a4ef2e48e Initialize configs correctly, move notification config (#8367)
This PR also removes deprecated tests and adds checks
to avoid races reproduced on CI/CD.
2019-10-09 11:41:15 +05:30
Praveen raj Mani e48005ddc7 Add more context to rpc version mismatch errors (#8271)
Fixes #5665
2019-10-03 00:08:12 -07:00
Krishna Srinivas 2ab0681c0c Do not ignore Lock()'s return value (#8142) 2019-08-28 16:12:57 -07:00
Harshavardhana e6d8e272ce
Use const slashSeparator instead of "/" everywhere (#8028) 2019-08-06 12:08:58 -07:00
Harshavardhana b52b90412b Avoid data-transfer in distributed locking (#8004) 2019-08-05 11:45:30 -07:00
Krishna Srinivas 338e9a9be9 Put object client disconnect (#7824)
Fail putObject and postpolicy when the client prematurely disconnects.
Use the request's context to cancel lock requests on client disconnect.
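
A minimal sketch, assuming a hypothetical context-aware
acquireLock: the request's context is cancelled when the
client disconnects, so the handler stops waiting for the lock:

```go
package main

import (
	"context"
	"errors"
	"fmt"
	"net/http"
	"time"
)

// acquireLock is a stand-in for a context-aware distributed lock call.
func acquireLock(ctx context.Context, resource string) error {
	select {
	case <-time.After(2 * time.Second): // pretend lock contention
		return nil
	case <-ctx.Done():
		return ctx.Err() // client went away: give up on the lock
	}
}

func putObjectHandler(w http.ResponseWriter, r *http.Request) {
	// r.Context() is cancelled if the client prematurely disconnects.
	if err := acquireLock(r.Context(), r.URL.Path); err != nil {
		if errors.Is(err, context.Canceled) {
			return // client disconnected; nothing to respond to
		}
		http.Error(w, err.Error(), http.StatusInternalServerError)
		return
	}
	fmt.Fprintln(w, "object written")
}

func main() {
	http.HandleFunc("/", putObjectHandler)
	http.ListenAndServe(":9000", nil)
}
```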
2019-06-28 22:09:17 -07:00
Harshavardhana 59e1d94770 Remove stale entry spurious logging (#7663)
The problem in the current code was that we were removing
an entry from the lockerMap without considering
the fact that a different entry for the same resource is
possible, due to the nature of locks, which can be
acquired in parallel before we decide whether the lock
is stale.

A sequence of events is as follows:

 - Lock("resource")
 - lockMaintenance finds a long-lived lock on this "resource"
 - The owner node rebooted, and now returns Expired() as true for
   this "resource"
 - Unlock("resource") succeeded in quorum
 - By this time the application has retried and acquired a new
   Lock() on the same "resource"
 - Now that we have Expired() as true from the previous call,
   we proceed to purge the entry from the local lockMap;
   the local lockMap reports a different entry for the expired
   UID, which results in a spurious log entry.

This PR removes this logging as this situation is an
expected scenario.
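
A minimal sketch of the race, with a hypothetical lockMap
keyed by resource: by the time Expired() comes back true, the
map may already hold a newer entry (different UID) for the
same resource, which is expected and not worth logging:

```go
package main

import "fmt"

type lockEntry struct{ uid string }

var lockMap = map[string]lockEntry{}

// purgeIfSameUID removes the entry only when it still belongs to the
// expired lock; a mismatched UID means the resource was re-locked in
// parallel, an expected scenario.
func purgeIfSameUID(resource, expiredUID string) bool {
	entry, ok := lockMap[resource]
	if !ok || entry.uid != expiredUID {
		return false // newer lock present: expected, nothing to log
	}
	delete(lockMap, resource)
	return true
}

func main() {
	lockMap["resource"] = lockEntry{uid: "new-uid"} // app re-acquired
	fmt.Println(purgeIfSameUID("resource", "old-uid")) // false
}
```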
2019-05-22 12:21:36 -07:00
kannappanr d2f42d830f
Lock: Use REST API instead of RPC (#7469)
In distributed mode, use the REST API to acquire and manage locks instead
of RPC.

RPC has been completely removed from the MinIO source.

Since we are moving from RPC to REST, we cannot use rolling upgrades, as the
nodes that have not yet been upgraded cannot talk to the ones that have
been upgraded.

We expect all MinIO processes on all nodes to be stopped and then the
upgrade process to be completed.

Also force HTTP/1.1 for inter-node communication.
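
A minimal sketch of forcing HTTP/1.1 on a Go client transport:
a non-nil, empty TLSNextProto disables HTTP/2 negotiation
(MinIO's actual inter-node transport carries more tuning than
shown here):

```go
package main

import (
	"crypto/tls"
	"fmt"
	"net/http"
)

func main() {
	transport := &http.Transport{
		// A non-nil, empty TLSNextProto map switches off the HTTP/2
		// upgrade, so inter-node calls stay on HTTP/1.1.
		TLSNextProto: map[string]func(string, *tls.Conn) http.RoundTripper{},
	}
	client := &http.Client{Transport: transport}
	fmt.Printf("%T\n", client.Transport)
}
```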
2019-04-17 23:16:27 -07:00