minio

Author	SHA1	Message	Date
Bala FA	2c3e34f001	add force delete option of non-empty bucket (#9166 ) passing HTTP header `x-minio-force-delete: true` would allow standard S3 API DeleteBucket to delete a non-empty bucket forcefully.	2020-03-27 21:52:59 -07:00
Anis Elleuch	7f8f1ad4e3	fix: cleanup lifecycle unused code (#9219 )	2020-03-27 18:57:50 -07:00
Harshavardhana	6f992134a2	fix: startup load time by reusing storageDisks (#9210 )	2020-03-27 14:48:30 -07:00
Sidhartha Mani	0c80bf45d0	Implement onboard diagnostics admin API (#9024 ) - Implement a graph algorithm to test network bandwidth from every node to every other node - Saturate any network bandwidth adaptively, accounting for slow and fast network capacity - Implement parallel drive OBD tests - Implement a paging mechanism for OBD test to provide periodic updates to client - Implement Sys, Process, Host, Mem OBD Infos	2020-03-26 21:07:39 -07:00
Anis Elleuch	b207520d98	Fix lifecycle GET: AWS SDK complaints on empty config (#9201 )	2020-03-25 21:06:03 -07:00
Krishna Srinivas	ef6304c5c2	Improve connectDisks() performance (#9203 )	2020-03-24 23:26:13 -07:00
Nitish Tiwari	6b984410d5	Add support for self-healing related metrics in Prometheus (#9079 ) Fixes #8988 Co-authored-by: Anis Elleuch <vadmeste@users.noreply.github.com> Co-authored-by: Harshavardhana <harsha@minio.io>	2020-03-24 22:40:45 -07:00
Harshavardhana	813e0fc1a8	fix: optimize isConnected to avoid url.String() conversions (#9202 ) Stringifying in a loop can tax the system, avoid this and convert the endpoints to strings early on and remember them for the lifetime of the server.	2020-03-24 18:53:24 -07:00
Harshavardhana	6f6a2214fc	Add rate limiter for S3 API layer (#9196 ) - total number of S3 API calls per server - maximum wait duration for any S3 API call This implementation is primarily meant for situations where HDDs are not capable enough to handle the incoming workload and there is no way to throttle the client. This feature allows MinIO server to throttle itself such that we do not overwhelm the HDDs.	2020-03-24 12:43:40 -07:00
Anis Elleuch	791821d590	sa: Allow empty policy to indicate parent user's policy is inherited (#9185 )	2020-03-23 14:17:18 -07:00
Harshavardhana	9a951da881	honor the credentials of user admin for encrypt/decrypt (#9194 ) Fixes #9193	2020-03-23 14:06:00 -07:00
Harshavardhana	ff932ca2a0	fix: log only catastrophic errors in prepare storage (#9189 )	2020-03-23 07:32:18 -07:00
poornas	818d3bcaf5	fix: deprecate TestDiskCache test from unit tests (#9187 )	2020-03-22 23:46:36 -07:00
Krishna Srinivas	45b1c66195	fix: implement splunk specific listObjects when delimiter=guidSplunk (#9186 )	2020-03-22 19:23:47 -07:00
Harshavardhana	da04cb91ce	optimize listObjects to list only from 3 random disks (#9184 )	2020-03-22 16:33:49 -07:00
Harshavardhana	cfc9cfd84a	fix: various optimizations, idiomatic changes (#9179 ) - acquire a single leader lock for all background operations - healing, crawling and applying lifecycle policies. - simplify lifecycle to avoid network calls, which was a bug in implementation - we should hold a leader and do everything from there, we have access to the entire namespace. - make listing, walking not interfere by slowing itself down like the crawler. - effectively use global context everywhere to ensure proper shutdown, in cache, lifecycle, healing - don't read `format.json` for Prometheus metrics in StorageInfo() call.	2020-03-22 12:16:36 -07:00
Harshavardhana	ea18e51f4d	Support multiple LDAP OU's, sAMAccountName support (#9139 ) Fixes #8532	2020-03-21 22:47:26 -07:00
Harshavardhana	3d3beb6a9d	Add response header timeouts (#9170 ) - Add conservative timeouts up to 3 minutes for internode communication - Add aggressive timeouts of 30 seconds for gateway communication Fixes #9105 Fixes #8732 Fixes #8881 Fixes #8376 Fixes #9028	2020-03-21 22:10:13 -07:00
poornas	27b8f18cce	Fix storage info message on startup (#9177 )	2020-03-21 10:02:20 -07:00
Harshavardhana	b4bfdc92cc	fix: admin console logger changes to log.Info	2020-03-20 15:14:14 -07:00
Harshavardhana	ae654831aa	Add madmin package context support (#9172 ) This is to improve responsiveness for all admin API operations and allowing callers to cancel any on-going admin operations, if they happen to be waiting too long.	2020-03-20 15:00:44 -07:00
Stephen N	1ffa983a9d	added support for SASL/SCRAM on Kafka bucket notifications. (#9168 ) fixes #9167	2020-03-20 11:10:27 -07:00
Nitish Tiwari	ecf1566266	Add an option to allow plaintext connection to LDAP/AD Server (#9151 )	2020-03-19 19:20:51 -07:00
Harshavardhana	b1a2169dcc	fix: data usage crawler env handling, usage-cache.bin location (#9163 ) canonicalize the ENVs such that we can bring these ENVs as part of the config values, as a subsequent change. - fix location of per bucket usage to `.minio.sys/buckets/<bucket_name>/usage-cache.bin` - fix location of the overall usage in `json` at `.minio.sys/buckets/.usage.json` (avoid conflicts with a bucket named `usage.json`) - fix location of the overall usage in `msgp` at `.minio.sys/buckets/.usage.bin` (avoid conflicts with a bucket named `usage.bin`)	2020-03-19 09:47:47 -07:00
Harshavardhana	d45a1808f2	fix: Walk() should require quorum number of disks only (#9164 )	2020-03-18 20:56:07 -07:00
Anis Elleuch	db2155551a	heal: Pass scan mode to HealObjects to deep scan full quorum objects (#9159 ) As an optimization of the healing, HealObjects() avoids sending an object to the background healing subsystem when the object is present on all disks. However, HealObjects() should also check the scan type and, if it is a deep scan, always pass the object to the healing subsystem.	2020-03-18 17:50:00 -07:00
Harshavardhana	09d35d3b4c	fix: sts to return appropriate errors (#9161 )	2020-03-18 17:25:45 -07:00
Anis Elleuch	5b9342d35c	xl: Tree walking should not quit when one disk returns empty (#9160 ) Currently, tree walking, which is needed to list objects in a specific set, quits listing as soon as it finds no entries on a disk, which is wrong. This affected background healing, because the latter uses tree walk directly. If an object does not exist on the first disk, for example, it will seem as though the object does not exist at all and no healing work is needed. This commit fixes the behavior.	2020-03-18 16:58:05 -07:00
Klaus Post	8d98662633	re-implement data usage crawler to be more efficient (#9075 ) Implementation overview: https://gist.github.com/klauspost/1801c858d5e0df391114436fdad6987b	2020-03-18 16:19:29 -07:00
Anis Elleuch	7fdeb44372	info: Initialize boot time early so uptime will always be correct (#9154 )	2020-03-17 16:37:28 -07:00
poornas	59dced8237	Print node status even in --quiet mode (#9149 )	2020-03-17 15:25:00 -07:00
Anis Elleuch	496f4a7dc7	Add service account type in IAM (#9029 )	2020-03-17 10:36:13 -07:00
kannappanr	8b880a246a	fix: deleteObjectTagging should 204 on success (#9150 )	2020-03-16 23:21:24 -07:00
Klaus Post	eeb5942b6b	fix: remote profile names and extension (#9145 ) Remote profiles are not formatted correctly: ``` profile-172.31.91.126_9000-cpu.pprof profile-172.31.91.126_9000-goroutines-before.txt profile-172.31.91.126_9000-goroutines.txt profiling-172.31.80.49_9000-cpu.pprof.pprof profiling-172.31.80.49_9000-goroutines-before.txt.pprof profiling-172.31.80.49_9000-goroutines.txt.pprof profiling-172.31.86.101_9000-cpu.pprof.pprof profiling-172.31.86.101_9000-goroutines-before.txt.pprof profiling-172.31.86.101_9000-goroutines.txt.pprof profiling-172.31.91.191_9000-cpu.pprof.pprof profiling-172.31.91.191_9000-goroutines-before.txt.pprof profiling-172.31.91.191_9000-goroutines.txt.pprof ``` `profiling` -> `profile`, remove extra extension.	2020-03-16 11:39:53 -07:00
Harshavardhana	c9212819af	fix: lock maintenance should honor quorum (#9138 ) The staleness of a lock should be determined by a quorum number of entries returning stale; this allows for situations where locks are held while nodes are down, so that we don't accidentally clear locks that are valid and correct. Also, lock maintenance should be run by all servers, not one server; stale lock cleanup needs to run outside the requirement for holding distributed locks. Thanks @klauspost for reproducing this issue	2020-03-15 11:55:52 -07:00
poornas	10fd53d6bb	Fix: admin config set API for notifications (#9085 ) Filter out targets set via env when validating incoming config change against configured notification targets Fixes #9066	2020-03-14 00:01:15 -07:00
Krishna Srinivas	2e9fed1a14	non-empty dirs should not be listed as objects (#9129 )	2020-03-13 17:43:00 -07:00
Kody A Kantor	06e30b5aa1	Skip building directio on platforms that don't support Direct IO (#9059 )	2020-03-12 18:57:41 -07:00
Harshavardhana	a54cdb9587	fix: Send x-amz-mp-parts-count for multiparted objects (#9116 ) Some AWS SDKs latently rely on this value sometimes to calculate the right number of parts during a parallel GetObject request; this is a feature used along with content-range, and we should support it as well.	2020-03-12 12:37:27 -07:00
Harshavardhana	cfd12914e1	fix: crash in serverInfo handler when ldap is configured (#9123 )	2020-03-11 23:13:32 -07:00
Anis Elleuch	fdf65aa9b9	heal: Add info about the next background healing round (#9122 ) - avoid setting last heal activity when starting self-healing This can be confusing to users thinking that the self healing cycle was already performed. - add info about the next background healing round	2020-03-11 23:00:31 -07:00
Harshavardhana	69b2aacf5a	fix return proper error for OperationTimedout (#9117 ) The OperationTimedout error occurs when a timeout is hit while trying to acquire a lock. This error should be returned appropriately to the client with HTTP status "408" (request timed out). This translation was broken; fix it.	2020-03-11 14:11:04 -07:00
Anis Elleuch	0af62d35a0	xl: Implement posix.DeletePrefixes to enhance delete perf (#9100 ) Bulk delete API was using cleanupObjectsBulk() which calls posix listing and delete API to remove objects internal files in the backend (xl.json and parts) one by one. Add DeletePrefixes in the storage API to remove the content of a directory in a single call. Also use a remove goroutine for each disk to accelerate removal.	2020-03-11 08:56:36 -07:00
Nitish Tiwari	7c32f3f554	Fix the URL for MinIO update when using custom download server (#9111 ) Co-authored-by: Nitish Tiwari <nitish@minio.io> Co-authored-by: Harshavardhana <harsha@minio.io>	2020-03-11 20:09:20 +05:30
Harshavardhana	5ab9cc029d	fix: crash observed for anonymous deletes from UI (#9107 )	2020-03-09 21:21:35 -07:00
Harshavardhana	6a00eb10bf	fix: allow set drive count of proper divisible values (#9101 ) Currently the code assumed some orthogonal requirements, which led to situations where, for a setup with for example 168 drives, the final set_drive_count chosen was 14. 168 drives are indeed divisible by 12, but this wasn't allowed due to an unexpected requirement for 12 to be a perfect modulo of 14, which is not possible. This assumption was incorrect. This PR fixes this old assumption properly, and also adds a few tests and some negative tests. Error messages are improved as well.	2020-03-08 13:30:25 -07:00
Harshavardhana	792ee48d2c	add additional logging during server formatting (#9102 )	2020-03-08 12:12:07 -07:00
Harshavardhana	88ae0f1196	Improve delete performance by reducing the number of calls (#9092 ) - Remove the requirement to honor storage class for deletes - Improve `posix.DeleteFileBulk` code to Stat the volumeDir only once per call, rather than for all object paths.	2020-03-06 13:44:24 -08:00
Anis Elleuch	23a0415eb7	profiling: Fix crash when enabling goroutines profiling (#9097 ) This commit replaces 'goroutines' with 'goroutine' when passing it to pprof library when activating goroutine type profiling	2020-03-06 13:22:47 -08:00
Anis Elleuch	75a0661213	data-usage: Fix the calculation of the next crawling round (#9096 ) This commit fixes a simple typo that miscalculated the waiting time until the next round of data crawling to compute the data usage.	2020-03-06 11:34:12 -08:00
kannappanr	07a7f329e7	xl: Fix counting offline disks in StorageInfo (#9082 ) Recent modification in the code led to incorrect calculation of offline disks. This commit saves the endpoint list in a xlObjects then we know the name of each disk.	2020-03-04 16:18:32 -08:00
kannappanr	c7ca791c58	fix: lock expiry on zoned setups (#9084 ) lock ownership is limited to endpoints on first zone, as we do not hold locks on other zones in an expanded setup. current code unintentionally expired active locks when it couldn't see ownership from the secondary zone which leads to unexpected bugs as locking fails to work as expected.	2020-03-04 16:06:17 -08:00
kannappanr	d9be8bc693	Add env. variable to disable data usage crawling (#9086 )	2020-03-04 15:51:03 -08:00
poornas	9fc7537f2a	Enforce md5sum checks for object retention APIs (#9030 ) this PR enforces md5sum verification for following API's to be compatible with AWS S3 spec - PutObjectRetention - PutObjectLegalHold Co-authored-by: Harshavardhana <harsha@minio.io>	2020-03-04 07:04:12 -08:00
Klaus Post	f1b2462193	Add goroutine profiles (#9078 ) Allow downloading goroutine dump to help detect leaks or overuse of goroutines. Extensions are now type dependent. Change `profiling` -> `profile` prefix, since that is what they are, not the abstract concept.	2020-03-04 06:58:12 -08:00
poornas	c93157019f	Allow gc to run in parallel on cache drives (#9051 )	2020-03-03 06:42:26 +03:00
Harshavardhana	e3b44c3829	Remove partName, partETag requirement (#9044 ) This is a precursor change before versioning, removes/deprecates the requirement of remembering partName and partETag which are not useful after a multipart transaction has finished. This PR reduces the overall size of the backend JSON for large file uploads.	2020-03-03 03:29:30 +03:00
poornas	978bd4e2c4	check cacheControl not nil before access (#9055 ) Fixes: #9053	2020-02-27 10:57:00 -08:00
poornas	5d25b10f72	Fix panic in StorageInfo call (#9050 )	2020-02-26 15:29:50 -08:00
poornas	eac02c04f7	Fix sporadic failure in TestDiskCacheMaxUse (#9049 )	2020-02-26 13:31:15 -08:00
Harshavardhana	1330e59307	accessKeyId missing should return appropriate error in AssumeRole (#9048 ) For a non-existent user server would return STS not initialized ``` aws --profile harsha --endpoint-url http://localhost:9000 \ sts assume-role \ --role-arn arn:xxx:xxx:xxx:xxxx \ --role-session-name anything ``` instead return an appropriate error as expected by STS API Additionally also format the `trace` output for STS APIs	2020-02-26 12:26:47 -08:00
Harshavardhana	2dd14c0b89	print version with proper indentation (#9047 ) currently version is printed as > VERSION: > DEVELOPMENT.2020-02-26T14-30-02Z this is what we want > VERSION: > DEVELOPMENT.2020-02-26T14-30-02Z >	2020-02-26 23:09:08 +05:30
Harshavardhana	6f66f1a910	close channel upon error in Walk()'er (#9042 )	2020-02-25 19:58:58 -08:00
Harshavardhana	23a8411732	Add a generic Walk()'er to list a bucket, optionally prefix (#9026 ) This generic Walk() is used by the likes of Lifecycle, or KMS to rotate keys, or any other functionality which relies on this functionality.	2020-02-25 21:22:28 +05:30
Harshavardhana	ece0d4ac53	simplify recordAPIStats wrapper for ResponseWriters (#9034 )	2020-02-24 09:45:32 -08:00
Harshavardhana	4c92bec619	allow rolling upgrades, remove same MinIO version requirement (#9033 ) Upgrades between releases are failing due to a strict rule that avoids rolling upgrades; it is enough to bump up APIs between versions to allow for quorum failure and wait times. Authentication failures are catastrophic in nature, which leads to the server not being able to upgrade properly. Fixes #9021 Fixes #8968	2020-02-24 10:32:30 +05:30
Harshavardhana	dcd63b4146	fix: avoid double ListBuckets() loading object lock (#9031 )	2020-02-24 06:39:11 +05:30
poornas	224b4f13b8	Add cache eviction low and high watermarks (#8958 ) To allow better control of the cache eviction process, introduce MINIO_CACHE_WATERMARK_LOW and MINIO_CACHE_WATERMARK_HIGH env. variables to specify when to stop/start the cache eviction process. Deprecate MINIO_CACHE_EXPIRY environment variable. Cache gc sweeps at 30 minute intervals whenever high watermark is reached to clear least recently accessed entries in the cache until sufficient space is cleared to reach the low watermark. Garbage collection uses an adaptive file scoring approach based on last access time, with greater weights assigned to larger objects and those with more hits to find the candidates for eviction. Thanks to @klauspost for this file scoring algorithm Co-authored-by: Klaus Post <klauspost@minio.io>	2020-02-23 19:03:39 +05:30
Harshavardhana	51a9d1bdb7	Avoid unnecessary allocations for XML parsing (#9017 )	2020-02-23 09:06:46 +05:30
Klaus Post	b2db1e96e2	Remove crawler concurrency (#9023 ) Only have one crawler per disk. Removes locking, but keep fastwalk itself able to run concurrently.	2020-02-21 20:50:16 +05:30
Harshavardhana	ab7d3cd508	fix: Speed up multi-object delete by taking bulk locks (#8974 ) Change distributed locking to allow taking bulk locks across objects, usually reducing 1000 calls to 1. Also handles situations where multiple clients send delete requests for objects with the following names ``` {1,2,3,4,5} ``` ``` {5,4,3,2,1} ``` so that the requests block and we ensure that they do not fail on each other.	2020-02-21 11:29:57 +05:30
Anis Elleuch	d4dcf1d722	metrics: Use StorageInfo() instead to have consistent info (#9006 ) Metrics used to have its own code to calculate offline disks. StorageInfo() was avoided because it is an expensive operation by sending calls to all nodes. To make metrics & server info share the same code, a new argument `local` is added to StorageInfo() so it will only query local disks when needed. Metrics now calls StorageInfo() as server info handler does but with the local flag set to false. Co-authored-by: Praveen raj Mani <praveen@minio.io> Co-authored-by: Harshavardhana <harsha@minio.io>	2020-02-20 09:21:33 +05:30
poornas	02a59a04d1	Fix error messages returned by (Put)GetObjectLegalHold (#9013 ) fixing some minor discrepancies between AWS S3 responses and MinIO server	2020-02-19 08:15:48 +05:30
Harshavardhana	16a6e68d7b	fix: indicate PutBucketEncryption as a valid policy action (#9009 )	2020-02-18 10:32:53 -08:00
Praveen raj Mani	1b427ddb69	Support for Kafka version in the config (#9001 ) Add a field for the Kafka version in the config. The user can explicitly set the version of the Kafka cluster. Fixes #8768	2020-02-17 07:56:33 +05:30
Harshavardhana	712e82344c	acl: Support PUT calls with success for 'private' ACL's (#9000 ) Add dummy calls which respond with success when ACLs are set to private and fail if the user tries to change them from their default 'private'. Some applications such as nuxeo may have an unnecessary requirement for this operation; we support this anyway so that we don't have to fully implement the functionality and can simply respond with success for default ACLs	2020-02-16 11:37:52 +05:30
poornas	716a52f261	Fix hang in cache copyobject call (#8993 ) Avoid GetObjectNInfo call from cache in CopyObjectHandler - in the case of server side copy with metadata replacement, the reader returned from cache is never consumed, but the net effect of GetObjectNInfo from cache layer, is cache holding a write lock to fill the cache. Subsequent stat operation on cache in CopyObject is not able to acquire a read lock, thus causing the hang. Fixes #8991	2020-02-13 15:32:26 -08:00
Harshavardhana	d1144c2c7e	reference format obtained doesn't need further validation (#8964 ) we don't need to validateFormats again once we have obtained reference format, because it is possible that at this stage another server is doing a disk heal during startup, once in a while due to delays we get false positives and our server doesn't start. Format in quorum as reference format can be assumed as valid and we proceed further, until and unless HealFormat re-inits the disks after a successful heal. Also use separate port for healing tests to avoid any conflicts with regular build testing. Fixes #8884	2020-02-13 14:01:41 -08:00
Harshavardhana	9ecd66007f	fix: reduce the load on CPU when loading users/policies (#8984 ) Trying to be conservative by slowing ourselves down on a regular basis.	2020-02-13 06:36:23 -08:00
Anis Elleuch	6b9805e891	fix: Avoid crash when there is an error testing a target notif (#8986 ) RegisterNotificationTargets() cleans up all connections that it makes to notification targets when an error occurs during its execution. However, there is a typo in the code that makes the function always try to access a nil pointer in the defer code, since the function in question will always return nil in the case of any error. This commit fixes the typo in the code.	2020-02-13 11:26:23 +05:30
poornas	013773065c	Save metadata correctly in cache.json on PUT (#8985 ) fixes #8979	2020-02-13 08:49:32 +05:30
Anis Elleuch	7d6766adc6	fix: erroneous high value for gateway received bytes metrics (#8978 ) http.Request.ContentLength can be negative, which affects the gateway_s3_bytes_received value in Prometheus output. The commit only increases the value of the total received bytes in gateway mode when r.ContentLength is greater than zero.	2020-02-12 10:15:00 +05:30
Harshavardhana	c56c2f5fd3	fix routing issue for esoteric characters in gorilla/mux (#8967 ) First step is to ensure that Path component is not decoded by gorilla/mux to avoid routing issues while handling certain characters while uploading through PutObject() Delay the decoding and use PathUnescape() to escape the `object` path component. Thanks to @buengese and @ncw for neat test cases for us to test with. Fixes #8950 Fixes #8647	2020-02-12 09:08:02 +05:30
Nitish Tiwari	7e819d00ea	Fix Error Code for ObjectTagging Parsing (#8971 ) Also add Mint tests	2020-02-11 17:42:28 -08:00
Nitish Tiwari	63be4709b7	Add metrics support for Azure & GCS Gateway (#8954 ) We added support for caching and S3 related metrics in #8591. As a continuation, it would be helpful to add support for Azure & GCS gateway related metrics as well.	2020-02-11 21:08:01 +05:30
astorath	6b1f2fc133	fix: using correct response on get_bucket_lifecycle_configuration (#8962 )	2020-02-08 16:46:59 +05:30
poornas	9b4d46a6ed	evict cached entry for server side copy (#8947 ) Fixes #8942	2020-02-07 14:36:46 -08:00
Anis Elleuch	502e652b7a	fix: Avoid closing target in RegisterNotificationTargets (#8966 ) This will prevent a double target Close() call when fetchLambdaInfo() is executed (mc admin info) This fixes a crash when mc admin info is called.	2020-02-07 14:35:56 -08:00
Nitish Tiwari	15e2ea2c96	Fix an issue where MinIO was logging every error twice (#8953 ) The logging subsystem was initialized under init() method in both gateway-main.go and server-main.go which are part of same package. This created two logging targets and hence errors were logged twice. This PR moves the init() method to common-main.go	2020-02-07 13:48:07 +05:30
Klaus Post	d0cea7adea	Fix stream read IO count (#8961 ) Streams are returning a readcloser and returning would decrement io count instantly, fix it. change maxActiveIOCount to 3, meaning it will pause crawling if 3 operations are running.	2020-02-07 09:43:55 +05:30
Klaus Post	2165d45d3f	Time getSize and use to estimate latency (#8959 ) Remove the random sleep. This is running in 4 goroutines, so mostly doing nothing. We use the getSize latency to estimate system load, meaning when there is little load on the system and we get the result fast we sleep a little. If it took a long time we have high load and release ourselves longer. We are sleeping inside the mutex so this affects all goroutines doing IO.	2020-02-07 09:05:55 +05:30
Anis Elleuch	6d5d77f62c	usage typo: Fix creating .minio.sys/background-ops bucket (#8957 ) Due to a typo in the code, a cluster was not correctly creating `background-ops` in all disks and nodes print the following error: minio3_1 \| API: SYSTEM() minio3_1 \| Time: 19:32:45 UTC 02/06/2020 minio3_1 \| DeploymentID: d67c20fa-4a1e-41f5-b319-7e3e90f425d8 minio3_1 \| Error: Bucket not found: .minio.sys/background-ops minio3_1 \| 2: cmd/data-usage.go:109:cmd.runDataUsageInfo() minio3_1 \| 1: cmd/data-usage.go:56:cmd.runDataUsageInfoUpdateRoutine() This commit fixes the typo.	2020-02-06 13:12:36 -08:00
Harshavardhana	49df290270	Add metadata parsing to be inside mutex to slow down (#8952 ) Adding mutex slows down the crawler to avoid large spikes in CPU, also add millisecond interval jitter in calculation of disk usage to slow down the spikes further.	2020-02-06 00:22:11 -08:00
Nitish Tiwari	e5951e30d0	Add support for Object Tagging in LifeCycle configuration (#8880 ) Fixes #8870 Co-Authored-By: Krishnan Parthasarathi <krisis@users.noreply.github.com>	2020-02-06 13:20:10 +05:30
Harshavardhana	c2c5b09bb1	Avoid object names with '//' to avoid hash inconsistencies (#8946 ) This is to fix a situation where an object name is incorrectly sent with '//' in its path hierarchy. We should reject such object names because they may be hashed to a set where the object might not originally belong, which can cause situations where once the object is uploaded we cannot delete it anymore. Fixes #8873	2020-02-06 08:29:38 +05:30
Andreas Auernhammer	086fbb745e	fix and improve KMS server info (#8944 ) This commit fixes typos in the displayed server info w.r.t. the KMS and removes the update status. For more information about why the update status is removed see: PR #8943	2020-02-06 06:18:34 +05:30
Andreas Auernhammer	4f37c8ccf2	refine the KMS admin API (#8943 ) This commit removes the `Update` functionality from the admin API. While this is technically a breaking change I think this will not cause any harm because: - The KMS admin API is not complete, yet. At the moment only the status can be fetched. - The `mc` integration hasn't been merged yet. So no `mc` client could have used this API in the past. The `Update`/`Rewrap` status is not useful anymore. It provided a way to migrate from one master key version to another. However, KES does not support the concept of key versions. Instead, key migration should be implemented as migration from one master key to another. Basically, the `Update` functionality has been implemented just for Vault.	2020-02-05 22:47:35 +05:30
Krishnan Parthasarathi	026265f8f7	Add support for bucket encryption feature (#8890 ) - pkg/bucket/encryption provides support for handling bucket encryption configuration - changes under cmd/ provide support for AES256 algorithm only Co-Authored-By: Poorna <poornas@users.noreply.github.com> Co-authored-by: Harshavardhana <harsha@minio.io>	2020-02-05 15:12:34 +05:30
Anis Elleuch	52bdbcd046	Add new admin API to return Accounting Usage (#8689 )	2020-02-04 18:20:39 -08:00
poornas	301c50b721	Add canned `diagnostics` policy for admin users (#8937 )	2020-02-04 17:58:38 -08:00
Harshavardhana	e9c111c8d0	Avoid unnecessary statPart() calls in PutObjectPart (#8905 ) Assume `xl.json` as the source of truth for all operations.	2020-02-04 10:04:37 +05:30
poornas	278a165674	Allow caching based on a configurable number of hits. (#8891 ) Co-authored-by: Harshavardhana <harsha@minio.io>	2020-02-04 09:10:01 +05:30
Anis Elleuch	e934c3e2a2	usage: Fix buckets count calculation when no object is present (#8929 ) XL crawling wrongly returns a zero bucket count when there are no objects uploaded to the server yet. The reason is that the posix crawler returns an invalid result when all disks have zero objects. A simple fix is to always pick the crawling result of the first disk by default, but prefer over it the result of the disk which has the most objects in it.	2020-02-04 06:57:47 +05:30
Harshavardhana	2d295a31de	Avoid select inside a recursive function to avoid CPU spikes (#8923 ) Additionally also allow configurable go-routines	2020-02-03 16:45:59 -08:00
Harshavardhana	9bbf5cb74f	fix: Avoid re-reading bucket names from etcd (#8924 ) This helps improve performance when there are 1000+ bucket entries on etcd, improves the startup time significantly.	2020-02-03 13:54:20 +05:30
Harshavardhana	680e493065	fix a crash in base64 buffer pool (#8925 ) looks like 1024 buffer size is not enough in all situations, use 8192 instead which can satisfy all the rare situations that may arise in base64 decoding.	2020-02-03 08:42:32 +05:30
poornas	1ea2449269	NAS gateway: fix notification initialization (#8920 ) Co-authored-by: Harshavardhana <harsha@minio.io>	2020-02-02 15:22:07 +05:30
Harshavardhana	7ce63b3078	fix: multi-delete API write quorum failures (#8926 ) multi-delete API failed with write quorum errors under the following situations - the list of files requested for delete doesn't exist anymore, which can lead to quorum errors and failure - due to usage of query param for paths, for really long paths MinIO server rejects these requests as malformed, which is unexpected. This was reproduced with warp	2020-02-01 18:11:29 -08:00
Anis Elleuch	7432b5c9b2	Use user CAs in checkEndpoint() call (#8911 ) The server info handler makes an HTTP connection to other nodes to check if they are up but does not load the custom CAs in ~/.minio/certs/CAs. This commit fixes it. Co-authored-by: Harshavardhana <harsha@minio.io>	2020-02-02 07:15:29 +05:30
Harshavardhana	d76160c245	Initialize only one retry timer for all sub-systems (#8913 ) Also make sure that we create buckets on all zones successfully, do not run quick heal buckets if not running with expansion.	2020-02-02 06:37:43 +05:30
poornas	5d838edcef	Fix panic in ServerInfoHandler when (#8915 ) Co-authored-by: Harshavardhana <harsha@minio.io>	2020-02-01 17:50:04 +05:30
poornas	c9116e6bd7	trace - log request body (#8917 )	2020-02-01 02:39:49 -08:00
Harshavardhana	d7dc9aaf52	fix: remove response header timeout (#8919 ) Adding a response header timeout seems to have premature-timeout-like consequences, which leads to potential disconnections.	2020-02-01 08:31:55 +05:30
Harshavardhana	bfe8a9bccc	jwt: Simplify JWT parsing (#8802 ) JWT parsing is simplified by using a custom claim data structure such as MapClaims{}, also writes a custom Unmarshaller for faster unmarshalling. - Avoid as much reflections as possible - Provide the right types for functions as much as possible - Avoid strings.Join, strings.Split to reduce allocations, rely on indexes directly.	2020-01-31 08:29:22 +05:30
Klaus Post	9990464cd5	Fix recursive deep scan of buckets (#8900 )	2020-01-30 17:20:07 +05:30
poornas	881e983ed9	Fix Retention, ObjectLock, LegalHold struct namespaces correctly. (#8909 ) Reverts #8903 to allow structs to be unmarshalled even if the namespace is missing.	2020-01-30 09:58:05 +05:30
Harshavardhana	f98616dce7	heal: Optimize heal listing by avoiding batches (#8901 ) Also limit the heal per object if there are incoming requests by suspending heal for longer periods of time.	2020-01-29 12:05:44 +05:30
Ashish Kumar Sinha	5bd0e95eef	Set default namespace for necessary structs (#8903 )	2020-01-29 10:19:38 +05:30
Harshavardhana	0cbebf0f57	Rename pkg/{tagging,lifecycle} to pkg/bucket sub-directory (#8892 ) Rename to allow for more such features to come in a more proper hierarchical manner.	2020-01-27 14:12:34 -08:00
poornas	2232e095d5	Make admin permissions more granular for admin handlers. (#8888 )	2020-01-26 20:47:52 -06:00
poornas	a78e5d4763	Add missing error check in cache GetObjectNInfo (#8889 )	2020-01-24 15:49:16 -08:00
Harshavardhana	cf37c7997e	Heal bucket only on missing drives in quorum (#8883 ) MakeVol shouldn't be called in heal bucket when bucket doesn't really exist in quorum.	2020-01-24 15:38:07 -08:00
Harshavardhana	1ffbb5c24c	fix racy tests when editing xl.getDisks (#8879 )	2020-01-23 11:50:09 -08:00
Harshavardhana	b9c48e0ab0	fix return appropriate error for MakeBucket in federation (#8878 )	2020-01-22 08:25:28 -08:00
Harshavardhana	fe5d599802	fix: STS creds without "aud" should be honored with STS checks (#8868 ) Fixes #8865	2020-01-22 15:09:46 +05:30
Aditya Manthramurthy	55063906b5	Fix group add/remove membership bug (#8877 )	2020-01-21 19:00:41 -08:00
Klaus Post	c7178d2066	Profiling: Add base, fix memory profiling (#8850 ) For 'snapshot' type profiles, record a 'before' profile that can be used as `go tool pprof -base=before ...` to compare before and after. "Before" profiles are included in the zipped package. [`runtime.MemProfileRate`](https://golang.org/pkg/runtime/#pkg-variables) should not be updated while the application is running, so we set it at startup. Co-authored-by: Harshavardhana <harsha@minio.io>	2020-01-21 15:49:25 -08:00
Harshavardhana	f14f60a487	fix: Avoid double usage calculation on every restart (#8856 ) On every restart of the server, usage was being calculated, which is not useful; instead, wait for sufficient time before starting the crawling routine. This PR also avoids lots of double allocations through strings, optimizes usage of string builders and also avoids crawling through symbolic links. Fixes #8844	2020-01-21 14:07:49 -08:00
Harshavardhana	e2b3c083aa	fix: close and drain the response body always (#8847 )	2020-01-21 02:46:58 -08:00
Harshavardhana	86252ec7e1	fix: document _ENABLE for all notification targets (#8864 ) Fixes #8863	2020-01-20 16:48:19 -08:00
Nitish Tiwari	61c17c8933	Add ObjectTagging Support (#8754 ) This PR adds support for AWS S3 ObjectTagging API as explained here https://docs.aws.amazon.com/AmazonS3/latest/dev/object-tagging.html	2020-01-20 08:45:59 -08:00
Forest Lovewood	dd93eee1e3	Implement bucket caching for b2 gateway (#8820 ) fixes #8739 #6806	2020-01-20 22:13:38 +05:30
Harshavardhana	88286cf8d0	fix: support pre-sign signature for STS tokens (#8826 ) Fixes #8391	2020-01-18 17:04:50 -08:00
Klaus Post	8cb6184f1d	Fix erasure block allocation (#8851 ) Small blocks are undersized when the file size isn't divisible by the shard, which could lead to allocation in reedsolomon.Split()	2020-01-18 14:21:58 -08:00
Harshavardhana	09ee145e9c	gw/hdfs: indicate hdfs gateway is production ready (#8848 )	2020-01-18 07:25:03 -08:00
Harshavardhana	23e46f9dba	log formatting only the first time (#8846 )	2020-01-17 15:39:07 -08:00
Harshavardhana	fc5213258e	posix: Do not take disk offline on I/O errors (#8836 ) Choosing maxAllowedIOError is arbitrary and error-prone, since a drive may be perfectly capable of taking I/O with only a few locations returning I/O errors. This is a hindrance where backend filesystems like ZFS can automatically fix and handle these scenarios. An added problem with the current approach is that we take the drive offline, making it virtually impossible to bring it back online without restarting the server, which is not desirable on a busy cluster. Remove this state so that the backend returns the error to the caller and the caller decides what to do with it.	2020-01-17 13:34:43 -08:00
Anis Elleuch	017067e11f	data-usage: Avoid crawling duplicated call (#8843 ) This fix also picks 3 disks, not 4, from a single erasure set.	2020-01-17 09:59:37 -08:00
Harshavardhana	2bb69033e5	http: fail appropriately and return standard Go error (#8837 ) return http.ErrServerClosed with proper body when server is shutting down, allowing more context instead of just returning '503' which doesn't mean the same thing.	2020-01-17 05:48:39 -08:00
Harshavardhana	fca4ee84c9	gw/hdfs: listing should list directories properly (#8827 ) Fixes #8822	2020-01-16 17:11:25 -08:00
poornas	60e60f68dd	Add support for object locking with legal hold. (#8634 )	2020-01-16 15:41:56 -08:00
Harshavardhana	c6b218e5df	fix: readiness should return 200 OK with first zone online (#8834 )	2020-01-16 13:49:25 -08:00
Anis Elleuch	c18fbdb29a	posix: Remove a non needed nil check in DiskInfo() (#8830 ) posix.DiskInfo() returns errFaultyDisk when posix is nil, but this can never happen, so the unneeded code is removed.	2020-01-16 11:27:50 -08:00
Harshavardhana	b1ad99edbf	fix: avoid crash copy map before reading (#8825 ) Code of this form is always racy when the map itself is being written to as well: ``` func (r Map) retMap() map[string]string { .. lock .. return r.internalMap } func (r Map) addMap(k, v string) { .. lock .. r.internalMap[k] = v } ``` Anyone reading from the map returned by `retMap()` is not protected by the lock, so code of this form must be avoided. It is always safe to copy the map and return the copy.	2020-01-16 01:35:30 -08:00
Anis Elleuch	935546d5ca	xl: Implement MRF healing (#8470 )	2020-01-15 18:30:32 -08:00
Harshavardhana	64fde1ab95	xl/zones: return errNoHealRequired when no heal is required (#8821 ) Zone abstraction of object layer was returning `nil` incorrectly under situations where disk healing is not required. Returning `nil` is considered as healing successful, which leads to unexpected ReloadFormat() peer notification calls during startup. This PR fixes this behavior properly for zones.	2020-01-15 17:19:13 -08:00
Anis Elleuch	069876e262	xl: All nodes create meta volumes in its local disks (#8786 ) Meta volume directories (tmp/, background-ops/, etc.) under .minio.sys are created when disks are formatted but also when the cluster is started. However, using MakeVolBulk() is not appropriate in the case of a user migrating from a version which does not have .minio.sys/background-ops/. The reason is that MakeVolBulk() exits early when an error occurs: errVolumeExists in this case, which is expected since some directories such as tmp/ already exist. This commit avoids MakeVolBulk and uses MakeVol instead. The PR also makes each node create meta volumes in its local disks and stops relying on the first disk, since the first node could be offline.	2020-01-15 12:36:52 -08:00
Harshavardhana	442e1698cb	heal: Avoid spinning up object healing during startup (#8819 ) auto-heal disks, metadata and buckets in background but not objects, let the auto heal kick in for objects after the cluster has been up for a while.	2020-01-15 01:08:39 -08:00
poornas	d76518eeb9	Remove TestPutObjectPartDiskNotFound unit test (#8815 )	2020-01-14 18:46:33 -08:00
Harshavardhana	0879a4f743	rest/storage: Remove racy LastError usage (#8817 ) Instead, perform a liveness check call to verify whether the server is online and print relevant errors. Also introduce a StorageErr string error type instead of errors.New() and deprecate usage of VerifyFileError and DeleteFileError for gob; the change in data structure also requires a bump in the storage REST version to v13. Fixes #8811	2020-01-14 18:45:17 -08:00

2499 commits
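
The message of commit b1ad99edbf describes returning a copy of a lock-guarded map rather than the map itself. A minimal sketch of that idea follows; it is not the code from the commit. The `Map` type and the `retMap`/`addMap` names are taken from the commit message, while `NewMap`, the `mu` field, and the pointer receivers are assumptions made for illustration.

```go
package main

import (
	"fmt"
	"sync"
)

// Map is a hypothetical mutex-guarded string map, modelled on the
// type named in the commit message (not MinIO's actual type).
type Map struct {
	mu          sync.Mutex
	internalMap map[string]string
}

// NewMap is an assumed constructor, added so the example is self-contained.
func NewMap() *Map {
	return &Map{internalMap: make(map[string]string)}
}

// addMap writes a key under the lock.
func (r *Map) addMap(k, v string) {
	r.mu.Lock()
	defer r.mu.Unlock()
	r.internalMap[k] = v
}

// retMap returns a copy made while holding the lock, so callers never
// read the shared map after the lock has been released.
func (r *Map) retMap() map[string]string {
	r.mu.Lock()
	defer r.mu.Unlock()
	out := make(map[string]string, len(r.internalMap))
	for k, v := range r.internalMap {
		out[k] = v
	}
	return out
}

func main() {
	m := NewMap()
	m.addMap("a", "1")
	snapshot := m.retMap()
	m.addMap("b", "2") // does not affect the earlier snapshot
	fmt.Println(len(snapshot), len(m.retMap())) // prints "1 2"
}
```

The copy costs an allocation per call, which is the trade-off for letting readers iterate over the result without holding the lock.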