minio

Author	SHA1	Message	Date
Harshavardhana	44dff36ff7	listing with prefix prefixed with '/' should be ignored (#11268 ) fixes #11265	2021-01-13 09:44:11 -08:00
Poorna Krishnamoorthy	b97d53b29c	fix remote target healthcheck (#11267 )	2021-01-12 20:48:04 -08:00
Harshavardhana	1a5775e2e8	enable small and large file optimization (#11260 ) - for large objects we found that a 1MiB block works best for r/w. - for small objects we found that a 128KiB block works best for r/w.	2021-01-12 10:20:39 -08:00
Anis Elleuch	e2579b1f5a	azure: Use default upload parameters to avoid consuming too much memory (#11251 ) A lot of memory is consumed when uploading small files in parallel, use the default upload parameters and add MINIO_AZURE_UPLOAD_CONCURRENCY for users to tweak.	2021-01-11 22:48:09 -08:00
Poorna Krishnamoorthy	7824e19d20	Allow synchronous replication if enabled. (#11165 ) Synchronous replication can be enabled by setting the --sync flag while adding a remote replication target. This PR also adds proxying on GET/HEAD to another node in an active-active replication setup in the event of a 404 on the current node.	2021-01-11 22:36:51 -08:00
Harshavardhana	317305d5f9	fix: regression in adding new replication targets (#11257 )	2021-01-11 09:08:42 -08:00
Harshavardhana	e4e117faab	fix: enable xl.json to xl.meta only if legacy drive is found (#11255 ) another optimization is renameLegacyMetadata() never needs to validate bucket with os.Stat() again, leading to reduction in one extra syscall.	2021-01-11 02:27:04 -08:00
Klaus Post	51dad1d130	Fix missing GetObjectNInfo Closure (#11243 ) Review for missing Close of returned value from `GetObjectNInfo`. This was often obscured by the stuff that auto-unlocks when reaching EOF.	2021-01-08 10:12:26 -08:00
Harshavardhana	4593b146be	fix: print errors only when metacache status has errors (#11248 )	2021-01-08 16:52:19 +05:30
Harshavardhana	f21d650ed4	fix: readData in bulk call using messagepack byte wrappers (#11228 ) This PR refactors the way we use buffers for O_DIRECT and re-uses those buffers for the messagepack reader and writer. Extensive benchmarking showed that not all objects benefit: only objects smaller than 64KiB see an overall gain, and almost all objects from 1KiB to 32KiB benefit. Beyond this, no objects benefit from the bulk call approach, as the latency of bytes sent over the wire versus streaming content directly from disk negate each other with no remarkable benefits. Other optimizations include reuse of msgp.Reader and msgp.Writer via sync.Pool for all internode calls.	2021-01-07 19:27:31 -08:00
Harshavardhana	a4f6705874	expire stale locks when owner is down (#11247 ) fixes #11246	2021-01-07 19:16:18 -08:00
Poorna Krishnamoorthy	b35b537e3f	Pass versionID to checkReplicateDelete in web handler (#11244 )	2021-01-07 15:28:27 -08:00
Harshavardhana	5c52d5ffc7	fix: treat errVolumeNotFound as EOF error in listPathRaw (#11238 )	2021-01-07 09:52:53 -08:00
Harshavardhana	f0808bb2e5	fix: getObject fd leaks in transition and replication code (#11237 )	2021-01-06 16:13:10 -08:00
Harshavardhana	a6dee21092	initialize IAM store before Init() to avoid any crash (#11236 )	2021-01-06 13:40:20 -08:00
Anis Elleuch	6f781c5e7a	heal: Reduce whitespace ticker to 5 seconds (#11234 ) A 30-second whitespace interval is too long for some setups, which time out when there is no read activity for a short time; reduce the subnet health whitespace ticker to 5 seconds, since it has no cost at all.	2021-01-06 13:29:50 -08:00
Harshavardhana	f8ca859790	fix: server/gateway banner formatting (#11230 )	2021-01-06 10:38:07 -08:00
Harshavardhana	76e2713ffe	fix: use buffers only when necessary for io.Copy() (#11229 )
Use separate sync.Pool for writes/reads.
Avoid passing buffers to io.CopyBuffer(): if the reader or writer implements io.WriterTo or io.ReaderFrom respectively, then it is useless for sync.Pool to allocate buffers on its own, since they will be completely ignored by the io.CopyBuffer Go implementation.
Improve this wherever we see this to be optimal. This allows us to be more efficient on memory usage.
```
// copyBuffer is the actual implementation of Copy and CopyBuffer.
// if buf is nil, one is allocated.
func copyBuffer(dst Writer, src Reader, buf []byte) (written int64, err error) {
	// If the reader has a WriteTo method, use it to do the copy.
	// Avoids an allocation and a copy.
	if wt, ok := src.(WriterTo); ok {
		return wt.WriteTo(dst)
	}
	// Similarly, if the writer has a ReadFrom method, use it to do the copy.
	if rt, ok := dst.(ReaderFrom); ok {
		return rt.ReadFrom(src)
	}
```
From readahead package
```
// WriteTo writes data to w until there's no more data to write or when an error occurs.
// The return value n is the number of bytes written.
// Any error encountered during the write is also returned.
func (a *reader) WriteTo(w io.Writer) (n int64, err error) {
	if a.err != nil {
		return 0, a.err
	}
	n = 0
	for {
		err = a.fill()
		if err != nil {
			return n, err
		}
		n2, err := w.Write(a.cur.buffer())
		a.cur.inc(n2)
		n += int64(n2)
		if err != nil {
			return n, err
		}
```
	2021-01-06 09:36:55 -08:00
Harshavardhana	b5d291ea88	fix: rename remaining zone -> pool (#11231 )	2021-01-06 09:35:47 -08:00
Klaus Post	eb9172eecb	Allow Compression + encryption (#11103 )	2021-01-05 20:08:35 -08:00
Poorna Krishnamoorthy	64bddf47d8	Pass deletemarker correctly to replicate opts (#11227 ) fixes: #11180	2021-01-05 14:12:37 -08:00
Harshavardhana	4ed45ce543	fix: healing buckets during pool expansion (#11224 ) fixes #11209	2021-01-05 13:24:22 -08:00
Klaus Post	ad511b0eb8	tests: Fix occasional data race (#11223 )
CI tests could trigger a data race. Servers are generally not expected to reinitialize, so tests could trigger data races when reinitializing and async operations are running. We add the option to safely reset global vars instead of overwriting.
Fixes races like:
```
WARNING: DATA RACE
Read at 0x00000477ab18 by goroutine 1159:
  github.com/minio/minio/cmd.FileInfo.ToObjectInfo()
      /home/runner/work/minio/minio/cmd/erasure-metadata.go:105 +0x16d
  github.com/minio/minio/cmd.erasureObjects.putObject()
      /home/runner/work/minio/minio/cmd/erasure-object.go:748 +0x13f8
  github.com/minio/minio/cmd.(*erasureObjects).listPath.func3.2()
      /home/runner/work/minio/minio/cmd/metacache-set.go:682 +0x7d3
  github.com/minio/minio/cmd.newMetacacheBlockWriter.func1.2()
      /home/runner/work/minio/minio/cmd/metacache-stream.go:777 +0x1c4
  github.com/minio/minio/cmd.newMetacacheBlockWriter.func1()
      /home/runner/work/minio/minio/cmd/metacache-stream.go:806 +0x614

Previous write at 0x00000477ab18 by goroutine 1269:
  [failed to restore the stack]

Goroutine 1159 (running) created at:
  github.com/minio/minio/cmd.newMetacacheBlockWriter()
      /home/runner/work/minio/minio/cmd/metacache-stream.go:760 +0x112
  github.com/minio/minio/cmd.(*erasureObjects).listPath.func3()
      /home/runner/work/minio/minio/cmd/metacache-set.go:672 +0xe22

Goroutine 1269 (running) created at:
  testing.(*T).Run()
      /opt/hostedtoolcache/go/1.14.13/x64/src/testing/testing.go:1095 +0x537
  testing.runTests.func1()
      /opt/hostedtoolcache/go/1.14.13/x64/src/testing/testing.go:1339 +0xa6
  testing.tRunner()
      /opt/hostedtoolcache/go/1.14.13/x64/src/testing/testing.go:1050 +0x1eb
  testing.runTests()
      /opt/hostedtoolcache/go/1.14.13/x64/src/testing/testing.go:1337 +0x594
  testing.(*M).Run()
      /opt/hostedtoolcache/go/1.14.13/x64/src/testing/testing.go:1252 +0x2ff
  github.com/minio/minio/cmd.TestMain()
      /home/runner/work/minio/minio/cmd/test-utils_test.go:120 +0x44e
  main.main()
      _testmain.go:1408 +0x223
==================
==================
WARNING: DATA RACE
Read at 0x00000477aae8 by goroutine 1159:
  github.com/minio/minio/cmd.(*BucketVersioningSys).Enabled()
      /home/runner/work/minio/minio/cmd/bucket-versioning.go:26 +0x52
  github.com/minio/minio/cmd.FileInfo.ToObjectInfo()
      /home/runner/work/minio/minio/cmd/erasure-metadata.go:105 +0x197
  github.com/minio/minio/cmd.erasureObjects.putObject()
      /home/runner/work/minio/minio/cmd/erasure-object.go:748 +0x13f8
  github.com/minio/minio/cmd.(*erasureObjects).listPath.func3.2()
      /home/runner/work/minio/minio/cmd/metacache-set.go:682 +0x7d3
  github.com/minio/minio/cmd.newMetacacheBlockWriter.func1.2()
      /home/runner/work/minio/minio/cmd/metacache-stream.go:777 +0x1c4
  github.com/minio/minio/cmd.newMetacacheBlockWriter.func1()
      /home/runner/work/minio/minio/cmd/metacache-stream.go:806 +0x614

Previous write at 0x00000477aae8 by goroutine 1269:
  [failed to restore the stack]

Goroutine 1159 (running) created at:
  github.com/minio/minio/cmd.newMetacacheBlockWriter()
      /home/runner/work/minio/minio/cmd/metacache-stream.go:760 +0x112
  github.com/minio/minio/cmd.(*erasureObjects).listPath.func3()
      /home/runner/work/minio/minio/cmd/metacache-set.go:672 +0xe22

Goroutine 1269 (running) created at:
  testing.(*T).Run()
      /opt/hostedtoolcache/go/1.14.13/x64/src/testing/testing.go:1095 +0x537
  testing.runTests.func1()
      /opt/hostedtoolcache/go/1.14.13/x64/src/testing/testing.go:1339 +0xa6
  testing.tRunner()
      /opt/hostedtoolcache/go/1.14.13/x64/src/testing/testing.go:1050 +0x1eb
  testing.runTests()
      /opt/hostedtoolcache/go/1.14.13/x64/src/testing/testing.go:1337 +0x594
  testing.(*M).Run()
      /opt/hostedtoolcache/go/1.14.13/x64/src/testing/testing.go:1252 +0x2ff
  github.com/minio/minio/cmd.TestMain()
      /home/runner/work/minio/minio/cmd/test-utils_test.go:120 +0x44e
  main.main()
      _testmain.go:1408 +0x223
==================
```
	2021-01-05 10:45:26 -08:00
Harshavardhana	cb0eaeaad8	feat: migrate to ROOT_USER/PASSWORD from ACCESS/SECRET_KEY (#11185 )	2021-01-05 10:22:57 -08:00
Harshavardhana	d0027c3c41	do not use large buffers if not necessary (#11220 ) without this change, there is a performance regression for small object GETs; this brings the overall speed back to pre-'59d363' commit days.	2021-01-04 18:51:52 -08:00
Anis Elleuch	cb7fc99368	handlers: Avoid initializing a struct in each handler call (#11217 )	2021-01-04 09:54:22 -08:00
Harshavardhana	a4383051d9	remove/deprecate crawler disable environment (#11214 ) with changes present to automatically throttle crawler at runtime, there is no need to have an environment value to disable crawling. crawling is a fundamental piece for healing, lifecycle and many other features there is no good reason anyone would need to disable this on a production system. * Apply suggestions from code review	2021-01-04 09:43:31 -08:00
Harshavardhana	e7ae49f9c9	fix: calculate prometheus disks_offline/disks_total correctly (#11215 ) fixes #11196	2021-01-04 09:42:09 -08:00
Anis Elleuch	153d4be032	tracing: NumSubscribers() to use atomic instead of mutex (#11219 ) globalSubscribers.NumSubscribers() is heavily used in S3 requests and it uses mutex, use atomic.Load instead since it is faster Co-authored-by: Anis Elleuch <anis@min.io>	2021-01-04 09:40:30 -08:00
Anis Elleuch	dfd99b6d8f	handlers: Little bit more optimizations (#11211 )	2021-01-04 00:01:06 -08:00
Harshavardhana	c4b1d394d6	erasure: avoid io.Copy in hotpaths to reduce allocation (#11213 )	2021-01-03 16:27:34 -08:00
Harshavardhana	c4131c2798	feat: Small object optimization read data in single bulk call (#11207 )	2021-01-03 11:27:57 -08:00
Anis Elleuch	c9d502e6fa	parentDirIsObject() to return quickly with nonexistent parent (#11204 ) Rewrite parentIsObject() function. Currently if a client uploads a/b/c/d, we always check if c, b, a are actual objects or not. The new code checks in the reverse order and quits quickly if a segment doesn't exist. So if a, b, c in 'a/b/c' do not exist in the first place, the function returns false quickly.	2021-01-02 12:01:29 -08:00
Anis Elleuch	677e80c0f8	xl: Remove check-dir in ReadVersion (#11200 ) The only purpose of the check-dir flag in ReadVersion is to return 404 when an object has xl.meta but no data. This causes an extra call to the disk, which can be costly on a busy system where disks receive many concurrent accesses.	2021-01-02 10:35:57 -08:00
Harshavardhana	aa85af4d1a	fix: missing CopyObjectPart maxClients reorder	2021-01-01 23:07:37 -08:00
Anis Elleuch	ae731d232f	trace: Reorder http/trace maxClients wrapping for correct tracing (#11202 ) mc admin trace does not show the correct handler name in the output: it prints `maxClients` for all handlers. The reason is the wrong order of handler wrapping.	2021-01-01 23:06:07 -08:00
Anis Elleuch	a317d220ed	xl-storage: Do not stat bucket assuming the object exists (#11201 ) In HEAD/GET, only STAT the bucket if the object does not exist to return the correct error response.	2021-01-01 09:44:36 -08:00
Harshavardhana	3e1221a01c	fix: log once updating dataUsageCache versions (#11190 ) also reduce usage of *bytes.Buffer for reading `usage-cache.bin`	2020-12-31 09:45:09 -08:00
Ritesh H Shukla	36fc2f98ed	fix: admin trace throttled requests (#11192 )	2020-12-30 21:04:55 -08:00
Ritesh H Shukla	556524c715	Reduce logging when peer is offline (#11184 )	2020-12-30 14:38:54 -08:00
Harshavardhana	cc457f1798	fix: enhance logging in crawler use console.Debug instead of logger.Info (#11179 )	2020-12-29 01:57:28 -08:00
Harshavardhana	ca0d31b09a	fix: re-arrange handlers to handle requests on /minio (#11177 ) fixes #11175	2020-12-28 17:10:33 -08:00
Harshavardhana	445a9bd827	fix: heal optimizations in crawler to avoid multiple healing attempts (#11173 ) Fixes two problems - Double healing when bitrot is enabled, instead heal attempt once in applyActions() before lifecycle is applied. - If applyActions() is successful and getSize() returns proper value, then object is accounted for and should be removed from the oldCache namespace map to avoid double heal attempts.	2020-12-28 10:31:00 -08:00
Harshavardhana	d8d25a308f	fix: use HealObject for cleaning up dangling objects (#11171 ) The main reason is that HealObjects starts a recursive listing for each object, which can take a really long time on large namespaces; avoid the recursive listing and just perform HealObject() at the prefix instead. Deletes already handle purging dangling content, so we don't need to achieve this with a recursive listing, which in turn can delay crawling significantly.	2020-12-27 15:42:20 -08:00
Harshavardhana	c19e6ce773	avoid a crash in crawler when lifecycle is not initialized (#11170 ) Bonus for static buffers use bytes.NewReader instead of bytes.NewBuffer, to use a more reader friendly implementation	2020-12-26 22:58:06 -08:00
Harshavardhana	59d3639396	fix: inherit heal opts globally, including bitrot settings (#11166 ) Bonus re-use ReadFileStream internal io.Copy buffers, fixes lots of chatty allocations when reading metacache readers with many sustained concurrent listing operations ``` 17.30GB 1.27% 84.80% 35.26GB 2.58% io.copyBuffer ```	2020-12-24 23:04:03 -08:00
Harshavardhana	027e17468a	fix: discarding results do not attempt in-memory metacache writer (#11163 ) Optimizations include - do not write the metacache block if the size of the block is '0' and it is the first block - where listing is attempted for a transient prefix, this helps to avoid creating lots of empty metacache entries for `minioMetaBucket` - avoid the entire initialization sequence of cacheCh , metacacheBlockWriter if we are simply going to skip them when discardResults is set to true. - No need to hold write locks while writing metacache blocks - each block is unique, per bucket, per prefix and also is written by a single node.	2020-12-24 15:02:02 -08:00
Harshavardhana	45ea161f8d	webUI: change listing to 1000 keys from browser UI (#11159 ) gateway implementations do not handle maxKeys being `-1` properly unlike MinIO implementation, handle it by setting an appropriate value. fixes #11158	2020-12-23 19:58:15 -08:00
Harshavardhana	6a66f142d4	fix: strict quorum in list should list on all drives (#11157 ) The current implementation was incorrect: it in fact assumed only a read quorum number of disks, whereas that value is only meant for read quorum good entries from all online disks. This PR fixes this behavior properly.	2020-12-23 09:26:40 -08:00
Harshavardhana	5982965839	fix: re-use bytes.Buffer using sync.Pool (#11156 )	2020-12-22 23:22:37 -08:00
Harshavardhana	8565cefe4e	fix: allow HTTP2.0 to be always configured	2020-12-22 16:32:58 -08:00
Andreas Auernhammer	8cdf2106b0	refactor cmd/crypto code for SSE handling and parsing (#11045 ) This commit refactors the code in `cmd/crypto` and separates SSE-S3, SSE-C and SSE-KMS. This commit should not cause any behavior change except for: - `IsRequested(http.Header)` which now returns the requested type {SSE-C, SSE-S3, SSE-KMS} and does not consider SSE-C copy headers. However, SSE-C copy headers alone are anyway not valid.	2020-12-22 09:19:32 -08:00
Harshavardhana	35fafb837b	fix: issues with handling delete markers in metacache (#11150 ) Additional cases handled - fix address situations where healing is not triggered on failed writes and deletes. - consider object exists during listing when metadata can be successfully decoded.	2020-12-22 09:16:43 -08:00
Harshavardhana	274bbad5cb	fix: select always online peers for remote listing (#11153 ) always find the right set of online peers for remote listing; this may have an effect on listing if the server is down. We should do this to avoid always performing transient operations on a bucket->peerClient that is permanently down or down for a long period.	2020-12-22 09:16:07 -08:00
Harshavardhana	5c451d1690	update x/net/http2 to address few bugs (#11144 ) additionally also configure http2 healthcheck values to quickly detect unstable connections and let them timeout. also use single transport for proxying requests	2020-12-21 21:42:38 -08:00
Poorna Krishnamoorthy	c987313431	Encrypt remote target if kms is configured (#11034 ) Co-authored-by: Poorna Krishnamoorthy <poorna@minio.io>	2020-12-21 16:21:33 -08:00
Anis Elleuch	2ecaab55a6	admin: ServerInfo returns info without object layer initialized (#11142 )	2020-12-21 09:35:19 -08:00
Harshavardhana	3e792ae2a2	fix: change defaults for DNS cache dialer (#11145 )	2020-12-21 09:33:29 -08:00
Harshavardhana	4cc500a041	normalize users with double // in accessKeys (#11143 ) Bonus fix, use constant time compare for secret keys in web-handlers.go:SetAuth()	2020-12-20 10:09:51 -08:00
Harshavardhana	d8e28830cf	fix: allow STS creds for admin accounts to add users (#11138 ) Allow rotating creds with privileges to add users fixes https://github.com/minio/console/issues/529	2020-12-19 13:24:21 -08:00
Harshavardhana	3e16ec457a	fix: support user/groups with '/' character (#11127 ) NOTE: user/groups with `//` shall be normalized to `/` fixes #11126	2020-12-19 09:36:37 -08:00
Harshavardhana	e5d378931d	fix: delimiter based listing was broken without marker (#11136 )
with a missing nextMarker in delimiter based listing, top level prefixes beyond 4500 or the max-keys value wouldn't be sent back for the client to ask for the next batch.
reproduced at a customer deployment, create prefixes as shown below
```
for year in $(seq 2017 2020)
do
    for month in {01..12}
    do
        for day in {01..31}
        do
            mc -q cp file myminio/testbucket/dir/day_id=$year-$month-$day/;
        done
    done
done
```
Then perform
```
aws s3api --profile minio --endpoint-url http://localhost:9000 list-objects \
    --bucket testbucket --prefix dir/ --delimiter / --max-keys 1000
```
You shall see a missing NextMarker. This disallows listing beyond the max-keys requested and also disallows listing more than 4500 (maxKeyObjectList) prefixes, because the client wouldn't know the NextMarker available.
This PR addresses this situation properly by making the implementation more spec compatible, i.e. NextMarker can in fact be either an object or a prefix with delimiter, depending on the input operation.
This issue was introduced after the list caching changes and has been present for a while.	2020-12-19 09:36:04 -08:00
Anis Elleuch	e63a10e505	Profiling does not require object layer to be initialized (#11133 )	2020-12-18 11:51:15 -08:00
Anis Elleuch	5434088c51	replication: Ensure to always use nano precision source modtime (#11135 )	2020-12-18 11:37:28 -08:00
Harshavardhana	a773cf48d8	fix: overlapping object and prefix rejected (#11130 ) fixes #11129	2020-12-18 08:51:09 -08:00
Harshavardhana	f714840da7	add _MINIO_SERVER_DEBUG env for enabling debug messages (#11128 )	2020-12-17 16:52:47 -08:00
Harshavardhana	7c9ef76f66	fix: timer deadlock on expired timers (#11124 )
The issue was introduced in #11106; the following pattern

    <-t.C // timer fired
    if !t.Stop() {
        <-t.C // timer hangs
    }

seems to hang at the last `t.C` line. This happens because a fired timer cannot be stopped anymore and t.Stop() returns `false`, leading to a confusing state of usage. Refactor the code so that timers are used appropriately with exact requirements in place.	2020-12-17 12:35:02 -08:00
Anis Elleuch	cffdb01279	azure/s3 gateways: Pass ETag during GET call to avoid data corruption (#11024 ) Both Azure & S3 gateways call for object information before returning the stream of the object, however, the object content/length could be modified meanwhile, which means it can return a corrupted object. Use ETag to ensure that the object was not modified during the GET call	2020-12-17 09:11:14 -08:00
Harshavardhana	b390a2a0b9	fix: reuse timers in erasure set hotpaths (#11106 ) reuse timers in - connectDisks() monitoring - healMRFRoutine() channel timeouts	2020-12-16 14:33:05 -08:00
Harshavardhana	90158f1e33	fix: avoid logging for Heal APIs in FS mode (#11121 ) fixes #11120	2020-12-16 09:46:13 -08:00
Harshavardhana	c606c76323	fix: prioritized latest buckets for crawler to finish the scans faster (#11115 ) The crawler should call ListBuckets only once, not for each serverPool: buckets are the same across all pools and sets, and ListBuckets always returns a unified view. Once ListBuckets returns, sort the buckets by creation time to scan the latest buckets first, on the assumption that the latest buckets hold less content than older ones, so they can be scanned faster and provide a view closer to the latest state.	2020-12-15 17:34:54 -08:00
Klaus Post	e7d3b49a20	metacache: Make very small requests transient (#11109 )	2020-12-15 11:25:36 -08:00
Harshavardhana	5df61ab96b	fix: remove gorilla/rpc/ deps fully after our fork (#11108 )	2020-12-15 11:18:06 -08:00
Poorna Krishnamoorthy	3456b03b12	Ignore ObjectNotFound errors in delete api while enforcing locking (#11114 ) AWS does not report this or version not found as errors in the response.	2020-12-15 11:15:49 -08:00
Klaus Post	f6fb27e8f0	Don't copy interesting ids, clean up logging (#11102 )
When searching the caches don't copy the ids, instead inline the loop.
```
Benchmark_bucketMetacache_findCache-32    19200    63490 ns/op    8303 B/op    5 allocs/op
Benchmark_bucketMetacache_findCache-32    20338    58609 ns/op     111 B/op    4 allocs/op
```
Add a reasonable, but still simplistic, benchmark. Bonus - make nicer zero alloc logging	2020-12-14 13:13:33 -08:00
Harshavardhana	8368ab76aa	fix: remove the requirement for healing buckets in ListBucketsHeal (#11098 ) With the new refactor of bucket healing, buckets are healed automatically, including their metadata; there is no need to redundantly heal buckets in ListBucketsHeal as well, so remove it.	2020-12-14 12:07:07 -08:00
Harshavardhana	3e83643320	lifecycle improvements and additional debug logging (#11096 ) Bonus change fix browser assets	2020-12-13 12:05:54 -08:00
Harshavardhana	2eb52ca5f4	fix: heal bucket metadata right before healing bucket (#11097 ) optimization mainly to avoid listing the entire `.minio.sys/buckets/.minio.sys` directory, this can get really huge and comes in the way of startup routines, contents inside `.minio.sys/buckets/.minio.sys` are rather transient and not necessary to be healed.	2020-12-13 11:57:08 -08:00
Anis Elleuch	f164085227	xl: Always set root disk to true in test environment (#11094 ) Test environments (go test or manual testing) should always consider the passed disks to be root disks and should not rely on the disk.IsRootDisk() function. The reason is that the latter can return a false negative when called on a busy system. However, returning a false negative will only occur in a testing environment and not in production, so we can accept this trade-off for now.	2020-12-12 16:10:07 -08:00
Harshavardhana	48191dd748	return NoSuchVersion if invalid version-id is specified (#11091 )	2020-12-11 20:44:08 -08:00
Anis Elleuch	c4f29d24da	metacache: Ask all disks when drive count is 4 (#11087 )	2020-12-11 17:54:31 -08:00
Harshavardhana	db7890660e	fix: a crash when disk is nil, safe access on erasureDisks (#11089 ) fixes #11088	2020-12-11 16:58:36 -08:00
Poorna Krishnamoorthy	9adc33efbb	Return version-id header in DeleteObject response (#11090 ) even when the object version is non-existent, to make this consistent with AWS behavior. Co-authored-by: Poorna Krishnamoorthy <poorna@minio.io>	2020-12-11 16:58:15 -08:00
Poorna Krishnamoorthy	8f65aba04b	ignore NoSuchVersion error in DeleteObjects API (#11086 ) Currently, the error response reports NoSuchVersion for a non-existent version-id, whereas AWS ignores it.	2020-12-11 12:39:09 -08:00
Harshavardhana	3a0082f0f1	fix: TTFB prometheus metrics calculation (#11082 ) until now metrics was reporting entire call duration instead of ttfb's this PR fixes it	2020-12-10 23:02:25 -08:00
Klaus Post	4bca62a0bd	crawler: Stream bucket usage cache data (#11068 ) Stream bucket caches to storage and through RPC calls.	2020-12-10 13:03:22 -08:00
Klaus Post	82e2be4239	metacache: Speed up cleanup operation (#11078 ) Perform cleanup operations on copied data. Avoids read locking data while determining which caches to keep. Also, reduce the log(N*N) operation to log(N*M), where M is the number of caches with the same root or below, when checking potential replacements.	2020-12-10 12:30:28 -08:00
Harshavardhana	4550ac6fff	fix: refactor locks to apply them uniquely per node (#11052 ) This refactor is done for the few reasons below - to avoid deadlocks in scenarios where the number of nodes is smaller than the actual erasure stripe count, wherein N participating local lockers can lead to deadlocks across systems. - avoids expiry routines running 1000s of separate network operations and routes per disk whereas each of them is still accessing one single local entity. - it is ideal to have a single globalLockServer per instance. - In a 32-node deployment however, each server group is still concentrated towards the same set of lockers that participate during the write/read phase, unlike the previous minio/dsync implementation - this potentially avoids sending 32 requests; instead we will send at most as many requests as there are unique nodes participating in a write/read phase. - reduces overall chattiness on smaller setups.	2020-12-10 07:28:37 -08:00
Klaus Post	e65ed2e44f	listcache: Add path index (#11063 )
Add a root path index.
```
Before:
Benchmark_bucketMetacache_findCache-32    10000    730737 ns/op

With excluded prints:
Benchmark_bucketMetacache_findCache-32    10000    207100 ns/op

With the root path:
Benchmark_bucketMetacache_findCache-32    705765    1943 ns/op
```
Benchmark used (not linear):
```Go
func Benchmark_bucketMetacache_findCache(b *testing.B) {
	bm := newBucketMetacache("", false)
	for i := 0; i < b.N; i++ {
		bm.findCache(listPathOptions{
			ID:           mustGetUUID(),
			Bucket:       "",
			BaseDir:      "prefix/" + mustGetUUID(),
			Prefix:       "",
			FilterPrefix: "",
			Marker:       "",
			Limit:        0,
			AskDisks:     0,
			Recursive:    false,
			Separator:    slashSeparator,
			Create:       true,
			CurrentCycle: 0,
			OldestCycle:  0,
		})
	}
}
```
Replaces #11058	2020-12-09 08:37:43 -08:00
Anis Elleuch	d90044b847	federation: Redirect Lifecycle PUT request by bucket name (#11062 ) The bucket forwarder handler considers MakeBucket to be always local but it mistakenly thinks that PUT bucket lifecycle to be a MakeBucket call. Fix the check of the MakeBucket call by ensuring that the query is empty in the PUT url.	2020-12-09 07:25:26 -08:00
Harshavardhana	d8c1f93de6	reject mixed drive situations with drives on root disks (#11057 ) Until now we matched the inode number of the root drive against the drive path minio would use; if they matched, we knew that it's a root disk. This may not be true in all situations, such as running inside a container environment where the container might be mounted from a different partition altogether, so root disk detection might fail.	2020-12-09 00:27:02 -08:00
Anis Elleuch	a51488cbaa	s3: Fix reading GET with partNumber specified (#11032 ) partNumber was miscalculating the start and end of parts when the partNumber query is specified in the GET request. This commit fixes it and also fixes the ContentRange header in that case.	2020-12-08 13:12:42 -08:00
Harshavardhana	dc819afa44	fix: auto update crawler meta version PR `038bcd9079` introduced version '3', we need to make sure that we do not print an unexpected error instead log a message to indicate we will auto update the version.	2020-12-08 10:40:51 -08:00
Harshavardhana	4a564336fe	Revert "Add metrics for nodes online and offline (#11050 )" This reverts commit `f60bbdf86b`.	2020-12-08 09:23:35 -08:00
Ritesh H Shukla	f60bbdf86b	Add metrics for nodes online and offline (#11050 )	2020-12-08 01:06:27 -08:00
Poorna Krishnamoorthy	f3beb1236a	Add cache usage, total capacity to prometheus metrics (#11026 )	2020-12-07 16:35:11 -08:00
Poorna Krishnamoorthy	934bed47fa	Add transition event notification (#11047 ) This is a MinIO specific extension to allow monitoring of transition events.	2020-12-07 13:53:28 -08:00
Ritesh H Shukla	038bcd9079	Add replication capacity metrics support in crawler (#10786 )	2020-12-07 13:47:48 -08:00
Harshavardhana	ce93b2681b	fix: re-use er.getDisks() properly in certain calls (#11043 )	2020-12-07 10:04:07 -08:00
Harshavardhana	8d036ed6d8	fix: allow sub-admin to modify password for other users (#11039 ) fixes #11037	2020-12-06 20:36:34 -08:00
Harshavardhana	9c53cc1b83	fix: heal multiple buckets in bulk (#11029 ) makes server startup, orders of magnitude faster with large number of buckets	2020-12-05 13:00:44 -08:00
Harshavardhana	3514e89eb3	support envs as well for new crawler sub-system (#11033 )	2020-12-04 21:54:24 -08:00
Klaus Post	a896125490	Add crawler delay config + dynamic config values (#11018 )	2020-12-04 09:32:35 -08:00
Harshavardhana	e083471ec4	use argon2 with sync.Pool for better memory management (#11019 )	2020-12-03 19:23:19 -08:00
Harshavardhana	80d31113e5	fix: etcd import paths again depend on v3.4.14 release (#11020 ) Due to botched upstream renames of project repositories and incomplete migration to go.mod support, our current dependency version of `go.mod` had bugs i.e it was using commits from master branch which didn't have the required fixes present in release-3.4 branches which leads to some rare bugs https://github.com/etcd-io/etcd/pull/11477 provides a workaround for now and we should migrate to this. release-3.5 eventually claims to fix all of this properly until then we cannot use /v3 import right now	2020-12-03 11:35:18 -08:00
Ritesh H Shukla	7e2b79984e	Stream bucket bandwidth measurements (#11014 )	2020-12-03 11:34:42 -08:00
Harshavardhana	951b6b203b	skip metacache entries healing to speed up startup	2020-12-02 21:30:54 -08:00
Harshavardhana	44e23b7f4f	fix: startup being slow - wait only if IOCount > 0	2020-12-02 21:06:17 -08:00
Harshavardhana	96c0ce1f0c	add support for tuning healing to make healing more aggressive (#11003 ) supports `mc admin config set <alias> heal sleep=100ms` to enable more aggressive healing at certain times. also optimize some areas that were doing more checks than necessary when bitrotscan was enabled, and avoid double sleeps to make healing more predictable. fixes #10497	2020-12-02 11:12:00 -08:00
ebozduman	303be1866d	Adds "x-amz-user-agent" and "x-id" params to be used in authentication of presignedURL (#10792 )	2020-12-02 02:02:49 -08:00
Harshavardhana	4ec45753e6	rename server sets to server pools	2020-12-01 13:50:33 -08:00
Klaus Post	e6ea5c2703	crawler: Missing folder heal check per set (#10876 )	2020-12-01 12:07:39 -08:00
Harshavardhana	790833f3b2	Revert "Support variable server sets (#10314 )" This reverts commit `aabf053d2f`.	2020-12-01 12:02:29 -08:00
Harshavardhana	7cbca43eb1	fix: allow admins to create users (#11005 ) PR #10978 introduced a regression, root credential should be allowed to create users	2020-11-30 21:53:23 -08:00
Poorna Krishnamoorthy	2f564437ae	Disallow writeback caching with cache_after (#11002 ) fixes #10974	2020-11-30 20:53:27 -08:00
Harshavardhana	bdd094bc39	fix: avoid sending errors on missing objects on locked buckets (#10994 ) make sure multi-object delete returned errors that are AWS S3 compatible	2020-11-28 21:15:45 -08:00
Harshavardhana	e6fa410778	fix: allow accountInfo, addUser and getUserInfo implicit (#10978 ) - accountInfo API that returns information about user, access to buckets and the size per bucket - addUser - user is allowed to change their secretKey - getUserInfo - returns user info if the incoming is the same user requesting their information	2020-11-27 17:23:57 -08:00
Harshavardhana	aabf053d2f	Support variable server sets (#10314 )	2020-11-25 16:28:47 -08:00
Anis Elleuch	91130e884b	Avoid sending errors in gob in storage requests (#10977 )	2020-11-25 12:42:48 -08:00
Poorna Krishnamoorthy	2ff655a745	Refactor replication, ILM handling in DELETE API (#10945 )	2020-11-25 11:24:50 -08:00
Klaus Post	0422eda6a2	metacache: Always close block writer (#10973 ) In some cases a writer could be left behind unclosed, leaking compression blocks. Always close and set compression concurrency to 2 which should be fine to keep up.	2020-11-25 09:37:30 -08:00
Harshavardhana	31e6f60847	fix: improve error handling in metacache (#10965 )	2020-11-25 01:11:22 -08:00
Poorna Krishnamoorthy	3ad41fe89d	Add admin API to edit remote bucket target credentials (#10848 )	2020-11-24 19:09:05 -08:00
Klaus Post	a75fafdbe2	Remove msgp workaround (#10964 ) The error in `github.com/philhofer/fwd` was quickly fixed through https://github.com/philhofer/fwd/pull/22 - update the dependency and remove the workaround.	2020-11-24 11:58:10 -08:00
Klaus Post	a58b7874ef	Temporary workaround for msgp skipping (#10960 ) Due to https://github.com/philhofer/fwd/issues/20 when skipping a metadata entry that is >2048 bytes and the buffer is full (2048 bytes) the skip will fail with `io.ErrNoProgress`. Enlarge the buffer so we temporarily make this much more unlikely. If it still happens we will have to rewrite the skips to reads. Fixes #10959	2020-11-23 18:51:59 -08:00
Harshavardhana	6990de9c94	fix: dangling object delete shall return object doesn't exist (#10961 ) dangling object when deleted means object doesn't exist anymore, so we should return appropriate errors, this allows crawler heal to ensure that it removes the tracker for dangling objects.	2020-11-23 18:50:53 -08:00
Anis Elleuch	75a8e81f8f	azure: Specify different Azure storage in the shell env (#10943 ) AZURE_STORAGE_ACCOUNT and AZURE_STORAGE_KEY are used in azure CLI to specify the azure blob storage access & secret keys. With this commit, it is possible to set them if you want the gateway's own credentials to be different from the Azure blob credentials. Co-authored-by: Harshavardhana <harsha@minio.io>	2020-11-23 16:45:56 -08:00
Harshavardhana	519c0077a9	fix: do not return an error for successfully deleted dangling objects (#10938 ) dangling objects when removed by `mc admin heal -r` or crawler auto heal would incorrectly return an error - this can interfere with usage calculation as the entry size for this would be returned as `0`. Instead, upon success use the resultant object size to calculate the final size for the object and avoid reporting this in the log messages. Also do not set ObjectSize in healResultItem to '-1', as this causes crawler metrics to calculate 1 byte less for objects which seem to be missing their `xl.meta`	2020-11-23 09:12:17 -08:00
Harshavardhana	734d07a532	fix: all hosts local and port same should be local erasure setup (#10951 ) this is needed to avoid initializing notification peers that can lead to races in many sub-systems fixes #10950	2020-11-23 09:07:50 -08:00
Harshavardhana	df93102235	fix: unwrapping issues with os.Is* functions (#10949 ) reduces 3 stat calls, reducing the overall startup time significantly.	2020-11-23 08:36:49 -08:00
Poorna Krishnamoorthy	39f3d5493b	Show Delete replication status header (#10946 ) X-Minio-Replication-Delete-Status header shows the status of the replication of a permanent delete of a version. All GETs are disallowed and return 405 on this object version. In the case of replicating delete markers, X-Minio-Replication-DeleteMarker-Status shows the status of replication, and would similarly return 405. Additionally, this PR adds reporting of delete marker event completion and updates documentation	2020-11-21 23:48:50 -08:00
Klaus Post	692ff41ef7	Unwrap network errors (#10934 ) Alternative to #10927 Instead of having an upstream fix, do unwrap when checking network errors. 'As' will also work when destination is an interface as checked by the tests.	2020-11-20 22:55:35 -08:00
Harshavardhana	86409fa93d	add audit/admin trace support for browser requests (#10947 ) To support this functionality we had to fork the gorilla/rpc package with relevant changes	2020-11-20 22:52:17 -08:00
Shireesh Anjal	7bc47a14cc	Rename OBD to Health (#10842 ) Also, Remove thread stats and openfds from the health report as we already have process stats and numfds	2020-11-20 12:52:53 -08:00
Harshavardhana	73e308079a	fix: handle errors appropriately as they are wrapped (#10917 )	2020-11-20 10:43:07 -08:00
Poorna Krishnamoorthy	08b24620c0	Display storage-class of transitioned object in HEAD	2020-11-20 09:17:31 -08:00
Harshavardhana	95675b0c9a	fix: do not crash PutObjectTags when node is down (#10940 ) fixes #10939	2020-11-20 09:10:48 -08:00
Poorna Krishnamoorthy	251c1ef6da	Add support for replication of object tags, retention metadata (#10880 )	2020-11-19 18:56:09 -08:00
Poorna Krishnamoorthy	0fa430c1da	validate service type of target in replication/ilm transition config (#10928 )	2020-11-19 18:47:33 -08:00
Poorna Krishnamoorthy	f60b6eb82e	fix validation for deletemarker replication on object locked bucket (#10892 )	2020-11-19 18:47:19 -08:00
Poorna Krishnamoorthy	1ebf6f146a	Add support for ILM transition (#10565 ) This PR adds transition support for ILM to transition data to another MinIO target represented by a storage class ARN. Subsequent GET or HEAD for that object will be streamed from the transition tier. If PostRestoreObject API is invoked, the transitioned object can be restored to the source cluster for the duration specified.	2020-11-19 18:47:17 -08:00
Harshavardhana	8f7fe0405e	fix: delete marker replication should support directories (#10878 ) allow directories to be replicated as well, along with their delete markers in replication. Bonus fix to fix bloom filter updates for directories to be preserved.	2020-11-19 18:47:12 -08:00
Harshavardhana	9a34fd5c4a	Revert "Revert "Add delete marker replication support (#10396 )"" This reverts commit `267d7bf0a9`.	2020-11-19 18:43:58 -08:00
Harshavardhana	f794fe79e3	fix: network shutdown was not handled properly (#10927 ) fixes a regression introduced in #10859, due to the error returned by rest.Client being typed i.e *rest.NetworkError - IsNetworkOrHostDown function didn't work as expected to detect network issues. This in turn aggravated situations where nodes are disconnected, leading to performance loss.	2020-11-19 13:53:49 -08:00
Harshavardhana	0f9e125cf3	fix: check for gateway backend online without http request (#10924 ) fixes #10921	2020-11-19 10:38:02 -08:00
Harshavardhana	d778d9493f	remove MinIO release tag as part of HTTP Server string (#10929 )	2020-11-19 09:16:02 -08:00
Harshavardhana	70d2c2ccc9	skip files that are not erasure objects or directories (#10926 ) without this change WalkDir reports errors while trying to read `format.json/xl.meta` which is a replicated file	2020-11-19 09:15:09 -08:00
Harshavardhana	9dea7020f0	allow prefix filtering for WalkDir to be optional (#10923 )	2020-11-18 12:03:16 -08:00
Klaus Post	990d074f7d	metacache: Allow prefix filtering (#10920 ) Do listings with prefix filter when bloom filter is dirty. This will forward the prefix filter to the lister which will make it only scan the folders/objects with the specified prefix. If we have a clean bloom filter we try to build a more generally useful cache so in that case, we will list all objects/folders.	2020-11-18 10:44:18 -08:00
Klaus Post	e413f05397	Save listing error async (#10922 ) Since the RPC call may have to time out save an error state async to not hold up the listing returning. Fixes #10919	2020-11-18 10:28:22 -08:00
Harshavardhana	d1b1fee080	fix: save healing tracker right before healing (#10915 ) this change avoids a situation where the user accidentally deleted the healing tracker or drives were replaced again within the 10sec window.	2020-11-18 09:34:46 -08:00
Harshavardhana	9738d605e4	increase readdir per block memory to facilitate faster WalkDir (#10908 )	2020-11-18 09:21:02 -08:00
Klaus Post	10099357b6	listcache: Wrap returned errors (#10882 ) To give an indication of where they happen	2020-11-17 09:11:59 -08:00
Harshavardhana	80b8ce89a4	remove context deadline from Delete calls (#10901 )	2020-11-17 09:09:45 -08:00
Poorna Krishnamoorthy	0b766288ef	fix: send replication completed event notification (#10902 )	2020-11-15 22:16:41 -08:00
Rafael Bodill	598ca0569c	fix: global in-place update boolean check (#10900 )	2020-11-15 13:34:12 -08:00
Poorna Krishnamoorthy	d295ce5708	Fix disk cache usage percent for prometheus (#10898 ) Fixes: #10895 Co-authored-by: Poorna Krishnamoorthy <poorna@minio.io>	2020-11-14 19:18:00 -08:00
Klaus Post	b5a3d79bce	listobjectversions: Add shortcut for Veeam blocks (#10893 ) Add shortcut for `APN/1.0 Veeam/1.0 Backup/10.0` It requests unique blocks with a specific prefix. We skip scanning the parent directory for more objects matching the prefix.	2020-11-13 16:58:20 -08:00
Harshavardhana	17a5ff51ff	fix: move context timeout closer to network for Delete calls (#10897 ) allowing for disconnects to be limited to the drive themselves instead of disconnecting all drives.	2020-11-13 16:56:45 -08:00
Harshavardhana	0bcb1b679d	fix: disallow update if dates are same (#10890 ) fixes #10889	2020-11-12 14:18:59 -08:00
Klaus Post	a3017c724e	Sort directory objects correctly (#10886 ) Decode dir objects when listing and sort them correctly.	2020-11-12 13:09:34 -08:00
Harshavardhana	267d7bf0a9	Revert "Add delete marker replication support (#10396 )" This reverts commit `50c10a5087`. PR is moved to origin/dev branch	2020-11-12 11:43:14 -08:00
cksac	be83dfc52a	fix: HDFS list bucket when subpath is provided (#10884 )	2020-11-12 11:26:51 -08:00
Harshavardhana	ca88ca753c	ignore typed errors correctly in list cache layer (#10879 ) bonus write bucket metadata cache with enough quorum Possible fix for #10868	2020-11-12 09:28:56 -08:00
Klaus Post	f86d3538f6	Allow deeper sleep (#10883 ) Allow each crawler operation to sleep up to 10 seconds on very heavily loaded systems. This will of course make minimum crawler speed less, but should be more effective at stopping.	2020-11-12 09:17:56 -08:00
Klaus Post	1c3590078d	Skip 0 byte stream writes (#10875 ) Don't send a packet when receiving 0 bytes or there is an error recorded	2020-11-11 18:07:40 -08:00
Harshavardhana	aa158228f9	fix: simplify healing metadata objects per set (#10867 )	2020-11-11 10:58:16 -08:00
Klaus Post	8747834c69	DeletedObjects: Return objects on lock failure (#10874 ) Return objects when locking fails. <details> <summary>Panic</summary> ``` : 2020/11/10 04:15:55 http: panic serving 10.10.62.153:44858: runtime error: index out of range [0] with length 0 : goroutine 363537270 [running]: : net/http.(conn).serve.func1(0xc019232780) : net/http/server.go:1801 +0x147 : panic(0x1cadd60, 0xc001719260) : runtime/panic.go:975 +0x47a : github.com/minio/minio/cmd.criticalErrorHandler.ServeHTTP.func1(0xc0121d1200, 0x210cda0, 0xc0141940e0) : github.com/minio/minio/cmd/generic-handlers.go:781 +0x1a8 : panic(0x1cadd60, 0xc001719260) : runtime/panic.go:969 +0x1b9 : github.com/minio/minio/cmd.objectAPIHandlers.DeleteMultipleObjectsHandler(0x1e71ce8, 0x1e71cc8, 0x2108420, 0xc0192328c0, 0xc0121d1400) : github.com/minio/minio/cmd/bucket-handlers.go:465 +0x2490 : net/http.HandlerFunc.ServeHTTP(...) : net/http/server.go:2042 : github.com/minio/minio/cmd.httpTraceAll.func1(0x2108420, 0xc0192328c0, 0xc0121d1400) : github.com/minio/minio/cmd/handler-utils.go:353 +0x158 : net/http.HandlerFunc.ServeHTTP(...) : net/http/server.go:2042 : github.com/minio/minio/cmd.collectAPIStats.func1(0x2108420, 0xc019232820, 0xc0121d1400) : github.com/minio/minio/cmd/handler-utils.go:380 +0xed : net/http.HandlerFunc.ServeHTTP(...) 
: net/http/server.go:2042 : github.com/minio/minio/cmd.maxClients.func1(0x2108420, 0xc019232820, 0xc0121d1400) : github.com/minio/minio/cmd/handler-api.go:132 +0x33b : net/http.HandlerFunc.ServeHTTP(0xc00271d590, 0x2108420, 0xc019232820, 0xc0121d1400) : net/http/server.go:2042 +0x44 : github.com/minio/minio/cmd.redirectHandler.ServeHTTP(0x20e2180, 0xc00271d590, 0x2108420, 0xc019232820, 0xc0121d1400) : github.com/minio/minio/cmd/generic-handlers.go:192 +0x156 : github.com/minio/minio/cmd.customHeaderHandler.ServeHTTP(0x20e1060, 0xc0141a22b0, 0x21083e0, 0xc01814d2e0, 0xc0121d1400) : github.com/minio/minio/cmd/generic-handlers.go:751 +0x162 : github.com/minio/minio/cmd.securityHeaderHandler.ServeHTTP(0x20e0fc0, 0xc0141a22c0, 0x21083e0, 0xc01814d2e0, 0xc0121d1400) : github.com/minio/minio/cmd/generic-handlers.go:766 +0x1d6 : github.com/minio/minio/cmd.bucketForwardingHandler.ServeHTTP(0xc0121c7a40, 0x20e1120, 0xc0141a22d0, 0x21083e0, 0xc01814d2e0, 0xc0121d1400) : github.com/minio/minio/cmd/generic-handlers.go:624 +0xbf : github.com/minio/minio/cmd.requestValidityHandler.ServeHTTP(0x20e0f20, 0xc01814d280, 0x21083e0, 0xc01814d2e0, 0xc0121d1400) : github.com/minio/minio/cmd/generic-handlers.go:608 +0x42a : github.com/minio/minio/cmd.httpStatsHandler.ServeHTTP(0x20e10c0, 0xc0141a2300, 0x210cda0, 0xc0141940e0, 0xc0121d1400) : github.com/minio/minio/cmd/generic-handlers.go:536 +0xe4 : github.com/minio/minio/cmd.requestSizeLimitHandler.ServeHTTP(0x20e0fe0, 0xc0141a2310, 0x50004000000, 0x210cda0, 0xc0141940e0, 0xc0121d1400) : github.com/minio/minio/cmd/generic-handlers.go:68 +0xd4 : github.com/minio/minio/cmd.requestHeaderSizeLimitHandler.ServeHTTP(0x20e10a0, 0xc01814d2a0, 0x210cda0, 0xc0141940e0, 0xc0121d1400) : github.com/minio/minio/cmd/generic-handlers.go:93 +0x1b7 : github.com/minio/minio/cmd.crossDomainPolicy.ServeHTTP(0x20e1080, 0xc0141a2320, 0x210cda0, 0xc0141940e0, 0xc0121d1400) : github.com/minio/minio/cmd/crossdomain-xml-handler.go:51 +0x82 : 
github.com/minio/minio/cmd.browserRedirectHandler.ServeHTTP(0x20e0fa0, 0xc0141a2330, 0x210cda0, 0xc0141940e0, 0xc0121d1400) : github.com/minio/minio/cmd/generic-handlers.go:276 +0x68 : github.com/minio/minio/cmd.minioReservedBucketHandler.ServeHTTP(0x20e0f00, 0xc0141a2340, 0x210cda0, 0xc0141940e0, 0xc0121d1400) : github.com/minio/minio/cmd/generic-handlers.go:344 +0xb8 : github.com/minio/minio/cmd.cacheControlHandler.ServeHTTP(0x20e1020, 0xc0141a2350, 0x210cda0, 0xc0141940e0, 0xc0121d1400) : github.com/minio/minio/cmd/generic-handlers.go:303 +0x1ce : github.com/minio/minio/cmd.timeValidityHandler.ServeHTTP(0x20e0f40, 0xc0141a2360, 0x210cda0, 0xc0141940e0, 0xc0121d1400) : github.com/minio/minio/cmd/generic-handlers.go:414 +0x3ca : github.com/minio/minio/cmd.resourceHandler.ServeHTTP(0x20e1160, 0xc0141a2370, 0x210cda0, 0xc0141940e0, 0xc0121d1400) : github.com/minio/minio/cmd/generic-handlers.go:516 +0xab : github.com/minio/minio/cmd.authHandler.ServeHTTP(0x20e1100, 0xc0141a2380, 0x210cda0, 0xc0141940e0, 0xc0121d1400) : github.com/minio/minio/cmd/auth-handler.go:502 +0x2e7 : github.com/minio/minio/cmd.sseTLSHandler.ServeHTTP(0x20e0ee0, 0xc0141a2390, 0x210cda0, 0xc0141940e0, 0xc0121d1400) : github.com/minio/minio/cmd/generic-handlers.go:802 +0x79 : github.com/minio/minio/cmd.reservedMetadataHandler.ServeHTTP(0x20e1140, 0xc0141a23a0, 0x210cda0, 0xc0141940e0, 0xc0121d1400) : github.com/minio/minio/cmd/generic-handlers.go:139 +0x1b7 : github.com/gorilla/mux.(Router).ServeHTTP(0xc00073fb00, 0x210cda0, 0xc0141940e0, 0xc0121d1200) : github.com/gorilla/mux@v1.8.0/mux.go:210 +0xd3 : github.com/rs/cors.(Cors).Handler.func1(0x210cda0, 0xc0141940e0, 0xc0121d1200) : github.com/rs/cors@v1.7.0/cors.go:219 +0x1b9 : net/http.HandlerFunc.ServeHTTP(0xc0009aece0, 0x210cda0, 0xc0141940e0, 0xc0121d1200) : net/http/server.go:2042 +0x44 : github.com/minio/minio/cmd.criticalErrorHandler.ServeHTTP(0x20e2180, 0xc0009aece0, 0x210cda0, 0xc0141940e0, 0xc0121d1200) : 
github.com/minio/minio/cmd/generic-handlers.go:784 +0x85 : github.com/minio/minio/cmd/http.(Server).Start.func1(0x210cda0, 0xc0141940e0, 0xc0121d1200) : github.com/minio/minio/cmd/http/server.go:101 +0x258 : net/http.HandlerFunc.ServeHTTP(0xc000dc4080, 0x210cda0, 0xc0141940e0, 0xc0121d1200) : net/http/server.go:2042 +0x44 : net/http.serverHandler.ServeHTTP(0xc000764c60, 0x210cda0, 0xc0141940e0, 0xc0121d1200) : net/http/server.go:2843 +0xa3 : net/http.(conn).serve(0xc019232780, 0x2114720, 0xc03381f6c0) : net/http/server.go:1925 +0x8ad : created by net/http.(Server).Serve : net/http/server.go:2969 +0x36c ``` </details>	2020-11-11 09:14:32 -08:00
Poorna Krishnamoorthy	50c10a5087	Add delete marker replication support (#10396 ) Delete marker replication is implemented for V2 configuration specified in AWS spec (though AWS allows it only in the V1 configuration). This PR also brings in a MinIO only extension of replicating permanent deletes, i.e. deletes specifying version id are replicated to target cluster.	2020-11-10 15:24:14 -08:00
Steven Reitsma	4683a623dc	fix: negative STS IAM token TTL value (#10866 )	2020-11-10 12:24:01 -08:00
Klaus Post	06899210a7	Reduce health check output (#10859 ) This will make the health check clients 'silent'. Use `IsNetworkOrHostDown` determine if network is ok so it mimics the functionality in the actual client.	2020-11-10 09:28:23 -08:00
Harshavardhana	cbdab62c1e	fix: heal user/metadata right away upon server startup (#10863 ) this is needed such that we make sure to heal the users, policies and bucket metadata right away as we do listing based on list cache which only lists '3' sufficiently good drives, to avoid possibly losing access to these users upon upgrade make sure to heal them.	2020-11-10 09:02:06 -08:00
Harshavardhana	8df6112204	fix: avoid divide by zero error single node distributed setup (#10862 )	2020-11-09 20:40:39 -08:00
Harshavardhana	97692bc772	re-route requests if IAM is not initialized (#10850 )	2020-11-07 21:03:06 -08:00
Steven Reitsma	54120107ce	fix: infinite loop in cleanupStaleUploads of encrypted MPUs (#10845 ) fixes #10588	2020-11-06 11:53:42 -08:00
Klaus Post	9bf5990ea9	metadata: Invalidate cache if unreadable and not updating (#10844 ) If a scanning server shuts down unexpectedly we may have "successful" caches that are incomplete on a set. In this case mark the cache with an error so it will no longer be handed out.	2020-11-06 08:54:09 -08:00
Steven Reitsma	74f7cf24ae	fix: s3 gateway SSE pagination (#10840 ) Fixes #10838	2020-11-05 15:04:03 -08:00
Harshavardhana	fb28aa847b	fix: add missing deleted key element in multiObjectDelete (#10839 ) fixes #10832	2020-11-05 12:47:46 -08:00
Klaus Post	0724205f35	metacache: Add option for life extension (#10837 ) Add `MINIO_API_EXTEND_LIST_CACHE_LIFE` that will extend the life of generated caches for a while. This changes caches to remain valid until no updates have been received for the specified time plus a fixed margin. This also changes the caches from being invalidated when the first set finishes until the last set has finished plus the specified time has passed.	2020-11-05 11:49:56 -08:00
Harshavardhana	b72cac4cf3	fix: dangling objects on actual namespace (#10822 )	2020-11-05 11:48:55 -08:00
Klaus Post	bd77f29fc4	Don't replace caches that are receiving updates (#10834 ) Keep caches while they are receiving updates. Move update code to separate function.	2020-11-05 07:34:08 -08:00
Klaus Post	d1e1205036	metacache: Always close the s2 writer (#10836 ) The s2 writer could be leaked if there was an error. Make sure it is always closed.	2020-11-05 07:30:14 -08:00
Harshavardhana	71753e21e0	add missing TTL for STS credentials on etcd (#10828 )	2020-11-04 13:06:05 -08:00
Harshavardhana	fde3299bf3	re-use optimized readdir for isDirEmpty() (#10829 ) reduces effective memory usage by an order of magnitude, also increases performance for small objects	2020-11-04 13:05:21 -08:00
Harshavardhana	1a1f00fa15	fix: use internode data for DisksInfo, VolsInfo in message pack (#10821 ) Similar to #10775 for fewer memory allocations, since we use getOnlineDisks() extensively for listing we should optimize it further. Additionally, remove all unused walkers from the storage layer	2020-11-04 10:10:54 -08:00
Bill Thorp	4a1efabda4	Context based AccessKey passing (#10615 ) A new field called AccessKey is added to the ReqInfo struct and populated. Because ReqInfo is added to the context, this allows the AccessKey to be accessed from 3rd-party code, such as a custom ObjectLayer. Co-authored-by: Harshavardhana <harsha@minio.io> Co-authored-by: Kaloyan Raev <kaloyan@storj.io>	2020-11-04 09:13:34 -08:00
Klaus Post	3b88a646ec	Add remote online/offline information (#10825 ) Log information about remote clients being marked offline. This will help to identify root causes of failures.	2020-11-04 08:27:32 -08:00
Klaus Post	2294e53a0b	Don't retain context in locker (#10515 ) Use the context for internal timeouts, but disconnect it from outgoing calls so we always receive the results and cancel it remotely.	2020-11-04 08:25:42 -08:00
Klaus Post	f0819cce75	Keep transient lists while they are updating (#10826 ) On extremely long running listings keep the transient list 15 minutes after last update instead of using start time. Also don't do overlap checks on transient lists.	2020-11-04 08:01:33 -08:00
Klaus Post	1e11b4629f	Add remote Diskinfo caching (#10824 ) Add 1 second remote disk info cache. Should decrease need for remote calls a great deal due to how actively it is used now.	2020-11-04 08:00:18 -08:00
Harshavardhana	5c72a34fa8	fix: honor delimiter as per AWS S3 spec (#10823 )	2020-11-04 07:56:58 -08:00
Klaus Post	b9277c8030	metacache: Add trashcan (#10820 ) Add trashcan that keeps recently updated lists after bucket deletion. All caches were deleted once a bucket was deleted, so caches still running would report errors. Now they are canceled. Fix `.minio.sys` not being transient.	2020-11-03 12:47:52 -08:00
Harshavardhana	8c76e1353e	initialize IAM after etcd has initialized (#10819 )	2020-11-03 12:12:30 -08:00
Harshavardhana	ad382799b1	use list cache for Walk() with webUI and quota (#10814 ) bring list cache optimizations for web UI object listing, also FIFO quota enforcement through list cache as well.	2020-11-03 08:53:48 -08:00
Harshavardhana	68de5a6f6a	fix: IAM store fallback to list users and policies from disk (#10787 ) Bonus fixes: remove package retry as it is harder to get right, also remove its context management so that we don't have to rely on it anymore; instead use a simple Jitter retry.	2020-11-02 17:52:13 -08:00
Harshavardhana	4ea31da889	fix: move list quorum ENV to config (#10804 )	2020-11-02 17:21:56 -08:00
Klaus Post	0a796505c1	metacache: Check only one disk for updates (#10809 ) Check only one disk for updates. This will reduce IO while waiting for lists to finish.	2020-11-02 17:20:27 -08:00
Klaus Post	37749f4623	Optimize FileInfo(Version) transfer (#10775 ) File Info decoding, in particular, is showing up as a major allocator and time consumer for internode data transfers Switch to message pack for cross-server transfers: ``` MSGP: Size: 945 bytes BenchmarkEncodeFileInfoMsgp-32 1558444 866 ns/op 1.16 MB/s 0 B/op 0 allocs/op BenchmarkDecodeFileInfoMsgp-32 479968 2487 ns/op 0.40 MB/s 848 B/op 18 allocs/op GOB: Size: 1409 bytes BenchmarkEncodeFileInfoGOB-32 333339 3237 ns/op 0.31 MB/s 576 B/op 19 allocs/op BenchmarkDecodeFileInfoGOB-32 20869 57837 ns/op 0.02 MB/s 16439 B/op 428 allocs/op ```	2020-11-02 17:07:52 -08:00
Klaus Post	86e0d272f3	Reduce WriteAll allocs (#10810 ) WriteAll saw 127GB allocs in a 5 minute timeframe for 4MiB buffers used by `io.CopyBuffer` even if they are pooled. Since all writers appear to write byte buffers, just send those instead and write directly. The files are opened through the `os` package so they have no special properties anyway. This removes the alloc and copy for each operation. REST sends content length so a precise alloc can be made.	2020-11-02 16:14:31 -08:00
Harshavardhana	8527f22df1	optimize request URL encoding for internode (#10811 ) this reduces allocations by an order of magnitude. Also, revert "erasure: delete dangling objects automatically (#10765)": it affects list caching and should be investigated.	2020-11-02 15:15:12 -08:00
Anis Elleuch	b456292295	erasure: delete dangling objects automatically (#10765 )	2020-11-02 10:49:30 -08:00
Poorna Krishnamoorthy	03fdbc3ec2	Add async caching commit option in diskcache (#10742 ) Add a store-and-forward option for single-part uploads when async mode is enabled with env MINIO_CACHE_COMMIT=writeback. It defaults to `writethrough` if unspecified.	2020-11-02 10:00:45 -08:00
Harshavardhana	4c773f7068	re-use remote transports in Peer,Storage,Locker clients (#10788 ) use one transport for internode communication	2020-11-02 07:43:11 -08:00
Harshavardhana	5412d730c1	simplify monitoring doesn't need to be canceled (#10803 ) connect disks monitoring doesn't need to be canceled upon drive replacement, since we only need to replace the newly replaced drive.	2020-10-31 14:10:12 -07:00
Klaus Post	fe9f23e632	Recreate bucket metacache if corrupted (#10800 ) If bucket metadata cannot be read, clean up existing and create a new.	2020-10-31 10:26:16 -07:00
Klaus Post	422898d9b3	Clean up metadata cache when deleting bucket (#10802 ) Metadata caches were left behind when deleting a bucket.	2020-10-31 09:46:18 -07:00
Harshavardhana	b686bb9c83	fix: replaced drive properly by healing the entire drive (#10799 ) Bonus fixes, we do not need reload format anymore as the replaced drive is healed locally we only need to ensure that drive heal reloads the drive properly. We preserve the UUID of the original order, this means that the replacement in `format.json` doesn't mean that the drive needs to be reloaded into memory anymore. fixes #10791	2020-10-31 01:34:48 -07:00
Harshavardhana	5e5cdc581d	remove unnecessary logging and move to log once (#10798 ) the current master logs way too much when a node is down, instead log once and move on.	2020-10-30 14:55:50 -07:00
Harshavardhana	02cfa774be	allow requests to be proxied when server is booting up (#10790 ) when the server is booting up there is a possibility that users might see '503' because the object layer is not yet initialized; in that case the request is proxied to the first neighboring peer which is online.	2020-10-30 12:20:28 -07:00
Krishna Srinivas	3a2f89b3c0	fix: add support for O_DIRECT reads for erasure backends (#10718 )	2020-10-30 11:04:29 -07:00
Klaus Post	6135f072d2	Fix invalidated metacaches (#10784 ) * Fix caches having EOF marked as a failure. * Simplify cache updates. * Provide context for checkMetacacheState failures. * Log 499 when the client disconnects.	2020-10-30 09:33:16 -07:00
Klaus Post	e63a44b734	rest client: Expect context timeouts for locks (#10782 ) Add option for rest clients to not mark a remote offline for context timeouts. This can be used if context timeouts are expected on the call.	2020-10-29 09:52:11 -07:00
Klaus Post	6b14c4ab1e	Optimize decryptObjectInfo (#10726 ) `decryptObjectInfo` is a significant bottleneck when listing objects. Reduce the allocations for a significant speedup. https://github.com/minio/sio/pull/40 ``` λ benchcmp before.txt after.txt benchmark old ns/op new ns/op delta Benchmark_decryptObjectInfo-32 24260928 808656 -96.67% benchmark old MB/s new MB/s speedup Benchmark_decryptObjectInfo-32 0.04 1.24 31.00x benchmark old allocs new allocs delta Benchmark_decryptObjectInfo-32 75112 48996 -34.77% benchmark old bytes new bytes delta Benchmark_decryptObjectInfo-32 287694772 4228076 -98.53% ```	2020-10-29 09:34:20 -07:00
Harshavardhana	4bf90ca67f	fix: handle a crash when AskDisks is set to -1 (#10777 )	2020-10-29 09:25:43 -07:00
Harshavardhana	e0655e24f2	fix: A possible crash when fi.Erasure.Distribution is empty (#10779 )	2020-10-28 19:24:01 -07:00
Klaus Post	bfc36aed89	Add update retry limit and compare error by string instead (#10776 )	2020-10-28 13:19:53 -07:00
Kaloyan Raev	be7f67268d	fix: Do not cleanup range files in cache SaveMetadata when total hits are false (#10728 )	2020-10-28 09:23:17 -07:00
Klaus Post	a982baff27	ListObjects Metadata Caching (#10648 ) Design: https://gist.github.com/klauspost/025c09b48ed4a1293c917cecfabdf21c Gist of improvements: * Cross-server caching and listing will use the same data across servers and requests. * Lists can be arbitrarily resumed at a constant speed. * Metadata for all files scanned is stored for streaming retrieval. * The existing bloom filters controlled by the crawler is used for validating caches. * Concurrent requests for the same data (or parts of it) will not spawn additional walkers. * Listing a subdirectory of an existing recursive cache will use the cache. * All listing operations are fully streamable so the number of objects in a bucket no longer dictates the amount of memory. * Listings can be handled by any server within the cluster. * Caches are cleaned up when out of date or superseded by a more recent one.	2020-10-28 09:18:35 -07:00
Krishna Srinivas	f53c5a020e	fix: heal object shards with ec.index and ec.distribution mismatches (#10773 ) Co-authored-by: Harshavardhana <harsha@minio.io>	2020-10-28 00:10:20 -07:00
Harshavardhana	5b30bbda92	fix: add more protection distribution to match EcIndex (#10772 ) allows for stricter validation in picking up the right set of disks for reconstruction.	2020-10-28 00:09:15 -07:00
Shireesh Anjal	858e2a43df	Remove logging info from OBDInfoHandler (#10727 ) A lot of logging data is counterproductive. A better implementation with precise useful log data can be introduced later.	2020-10-27 17:41:48 -07:00
Kaloyan Raev	df9894e275	avoid caching http ranges in background goroutine (#10724 )	2020-10-26 23:04:48 -07:00
Krishna Srinivas	592f2f23a3	fix: heal rejects objects with disk re-ordering issue (#10766 )	2020-10-26 18:48:47 -07:00
Krishna Srinivas	c49a80db41	fix: use meta.Erasure.Index for GetObject() to reconstruct object (#10764 )	2020-10-26 16:19:42 -07:00
Poorna Krishnamoorthy	46275c6547	cache: rename function declarations (#10763 )	2020-10-26 15:41:24 -07:00
Poorna Krishnamoorthy	0994ed9783	cache: fix call in GetObjectNInfo (#10762 ) Fixes: #10751	2020-10-26 12:30:40 -07:00
Anis Elleuch	eb95353cb1	fix: Get/HeadObject return 404 on non quorum objects (#10753 )	2020-10-26 10:30:46 -07:00
Harshavardhana	029758cb20	fix: retain the previous UUID for newly replaced drives (#10759 ) only newly replaced drives get the new `format.json`, this avoids disks reloading their in-memory reference format, ensures that drives are online without reloading the in-memory reference format. keeping reference format intact means UUIDs never change once they are formatted.	2020-10-26 10:29:29 -07:00
Harshavardhana	646d6917ed	turn-off checking for updates completely if MINIO_UPDATE=off (#10752 )	2020-10-24 22:39:44 -07:00
Harshavardhana	d9db7f3308	expire lockers if lockers are offline (#10749 ) lockers currently might leave stale locks, in unknown ways, waiting for downed lockers. locker check interval is high enough to safely clean up stale locks.	2020-10-24 13:23:16 -07:00
Harshavardhana	6a8c62f9fd	make sure to preserve UUID from reference format (#10748 ) reference format should be source of truth for inconsistent drives which reconnect, add them back to their original position remove automatic fix for existing offline disk uuids	2020-10-24 13:23:08 -07:00
Anis Elleuch	00124c56d9	erasure: Commit data before xl.meta in RenameData() (#10734 ) This will reduce the chance to have updated xl.meta without data.	2020-10-23 21:54:58 -07:00
Anis Elleuch	2c32c2149e	tests: Avoid running TestNSRace in short test mode (#10735 )	2020-10-23 21:23:12 -07:00
Harshavardhana	734f258878	fix: slow down auto healing more aggressively (#10730 ) Bonus fixes - logging improvements to ensure that we don't use `go logger.LogIf` to avoid runtime.Caller missing the function name. log where necessary. - remove unused code at erasure sets	2020-10-22 13:36:24 -07:00
Anis Elleuch	0e0c53bba4	tests: Lower expectation in addr selection in rand cache dialer (#10739 ) Test TestDialContextWithDNSCacheRand was failing sometimes because it depends on a random selection of addresses when testing random DNS resolution from cache. Lower addr selection expectation to 10%	2020-10-22 09:35:32 -07:00
Poorna Krishnamoorthy	5cc23ae052	validate if iam store is initialized (#10719 ) Fixes panic - regression from `d6d770c1b1`	2020-10-20 21:28:24 -07:00
Harshavardhana	d6d770c1b1	initialize object layer right after config has loaded	2020-10-19 22:04:59 -07:00
Harshavardhana	b07df5cae1	initialize IAM as soon as object layer is initialized (#10700 ) Allow requests to come in for users as soon as object layer and config are initialized, this allows users to be authenticated sooner and would succeed automatically on servers which are yet to fully initialize.	2020-10-19 09:54:40 -07:00
Harshavardhana	c107728676	fix: s3 gateway DNS cache initialization (#10706 ) fixes #10705	2020-10-19 01:34:23 -07:00
Anis Elleuch	284a2b9021	ilm: Send delete marker creation event when appropriate (#10696 ) Before this commit, the crawler ILM will always send object delete event notification though this is wrong.	2020-10-16 21:22:12 -07:00
Ritesh H Shukla	0b53e30ecb	Clean up monitor on delete bucket (#10698 )	2020-10-16 17:59:31 -07:00
Harshavardhana	bd2131ba34	add DNS cache support to avoid DNS flooding (#10693 ) Go stdlib resolver doesn't support caching DNS resolutions, since we compile with CGO disabled we are more prone to DNS flooding for all network calls to resolve for DNS from the DNS server. Under various containerized environments such as VMWare this becomes a problem because there are no DNS caches available and we may end up overloading the kube-dns resolver under concurrent I/O. To circumvent this issue implement a DNSCache resolver which resolves DNS and caches them for around 10secs with every 3sec invalidation attempted.	2020-10-16 14:49:05 -07:00
ebozduman	1aec168c84	fix: azure gateway should reject bucket names with "." (#10635 )	2020-10-16 09:30:18 -07:00
Klaus Post	21a549a83b	fix: keep MRF channel open to avoid random CI crash (#10686 ) There doesn't seem to be any benefit to closing the channel, so just keep it open and let it die with the server.	2020-10-16 09:08:51 -07:00
Ritesh H Shukla	8a16a1a1a9	fix: misc fixes for bandwidth reporting and monitoring (#10683 ) * Set peer for fetch bandwidth * Fix the limit for bandwidth that is reported. * Reduce CPU burn from bandwidth management.	2020-10-16 09:07:50 -07:00
Harshavardhana	ad726b49b4	rename zones to serverSets to avoid terminology conflict (#10679 ) we are bringing in availability zones, we should avoid zones as per server expansion concept.	2020-10-15 14:28:50 -07:00
Anis Elleuch	db2241066b	heal: Enable removing dangling delete markers (#10688 )	2020-10-15 13:06:40 -07:00
Harshavardhana	f1cc16e788	fix: background heal rely on getOnlineDisks() (#10687 )	2020-10-15 13:06:23 -07:00
Klaus Post	3820a905e0	in getOnlineDisks wait for disks to be populated (#10685 )	2020-10-15 06:37:10 -07:00
Harshavardhana	2042d4873c	rename crawler config option to heal (#10678 )	2020-10-14 13:51:51 -07:00
Harshavardhana	f9be783f3e	fix: allow crawler to crawl on disks without usage constraints (#10677 ) additionally also change the resolution usage wise return of disks, allows small byte-level differences to be masked.	2020-10-14 12:12:10 -07:00
Harshavardhana	71b97fd3ac	fix: connect disks pre-emptively during startup (#10669 ) connect disks pre-emptively upon startup, to ensure enough disks are connected at startup rather than wait for them. we need to do this to avoid long wait times for server to be online when we have servers come up in rolling upgrade fashion	2020-10-13 18:28:42 -07:00
Klaus Post	03991c5d41	crawler: Remove waitForLowActiveIO (#10667 ) Only use dynamic delays for the crawler. Even though the max wait was 1 second the number of waits could severely impact crawler speed. Instead of relying on a global metric, we use the stateless local delays to keep the crawler running at a speed more adjusted to current conditions. The only case we keep it is before bitrot checks when enabled.	2020-10-13 13:45:08 -07:00
飞雪无情	614060764d	fix: use the correct Action type for policy.Args and iampolicy.Args (#10650 )	2020-10-12 15:18:22 -07:00
Harshavardhana	a3ba8188d7	fix: allow locker to be niladic	2020-10-12 14:23:44 -07:00
Harshavardhana	2760fc86af	Bump default idleConnsPerHost to control conns in time_wait (#10653 ) This PR fixes a hang which occurs quite commonly at higher concurrency by allowing following changes - allowing lower connections in time_wait allows faster socket open's - lower idle connection timeout to ensure that we let kernel reclaim the time_wait connections quickly - increase somaxconn to 4096 instead of 2048 to allow larger tcp syn backlogs. fixes #10413	2020-10-12 14:19:46 -07:00
Ritesh H Shukla	8ceb2a93fd	fix: peer replication bandwidth monitoring in distributed setup (#10652 )	2020-10-12 09:04:55 -07:00
Ritesh H Shukla	c2f16ee846	Add basic bandwidth monitoring for replication. (#10501 ) This change tracks bandwidth for a bucket and object - [x] Add Admin API - [x] Add Peer API - [x] Add BW throttling - [x] Admin APIs to set replication limit - [x] Admin APIs for fetch bandwidth	2020-10-09 20:36:00 -07:00
Harshavardhana	6484453fc6	optionally allow strict quorum listing (#10649 ) ``` export MINIO_API_LIST_STRICT_QUORUM=on ``` would enable listing in quorum if necessary	2020-10-09 15:40:46 -07:00
Harshavardhana	a0d0645128	remove safeMode behavior in startup (#10645 ) In almost all scenarios MinIO now is mostly ready for all sub-systems independently, safe-mode is not useful anymore and does not serve its original intended purpose. allow server to be fully functional even with config partially configured, this is to cater for availability of actual I/O v/s manually fixing the server. In k8s like environments it will never make sense to take pod into safe-mode state, because there is no real access to perform any remote operation on them.	2020-10-09 09:59:52 -07:00
Harshavardhana	253194e491	do not hold write locks - if objects don't exist (#10644 )	2020-10-08 17:47:21 -07:00
Harshavardhana	736e58dd68	fix: handle concurrent lockers with multiple optimizations (#10640 ) - select lockers which are non-local and online to have affinity towards remote servers for lock contention - optimize lock retry interval to avoid sending too many messages during lock contention, reduces average CPU usage as well - if bucket is not set, when deleteObject fails make sure setPutObjHeaders() honors lifecycle only if bucket name is set. - fix top locks to list out always the oldest lockers always, avoid getting bogged down into map's unordered nature.	2020-10-08 12:32:32 -07:00
Poorna Krishnamoorthy	907a171edd	Generalize error messages for remote targets (#10638 ) This is to allow remote targets to be generalized for replication/ILM transition Also adding a field in BucketTarget to identify a remote target with a label.	2020-10-08 10:54:11 -07:00
Andreas Auernhammer	ed6d2a100f	logger: avoid writing audit log response header twice (#10642 ) This commit fixes a misuse of the `http.ResponseWriter.WriteHeader`. A caller should either call `WriteHeader` exactly once or write to the response writer and causing an implicit 200 OK. Writing the response headers more than once causes a `http: superfluous response.WriteHeader call` log message. This commit fixes this by preventing a 2nd `WriteHeader` call being forwarded to the underlying `ResponseWriter`. Updates #10587	2020-10-08 09:29:10 -07:00
Harshavardhana	effe131090	fix: allow read unlocks to be defensive about split brains (#10637 )	2020-10-07 09:15:01 -07:00
Harshavardhana	18063bf25c	fix: cleanup old directory handling code (#10633 ) we don't need them anymore, remove legacy code.	2020-10-06 12:03:57 -07:00
Poorna Krishnamoorthy	dbbed6f7f0	update minio-go dependency (#10634 )	2020-10-06 08:37:09 -07:00
Poorna Krishnamoorthy	7fbfdceba3	Fix replication slowness (#10632 ) - Increase channel buffer length - Avoid blocking wait on replicaCh	2020-10-05 14:45:42 -07:00
Shireesh Anjal	f1418a50f0	add NVMe drive info [model num, serial num, drive temp. etc.] (#10613 ) * add NVMe drive info [model num, serial num, drive temp. etc.] * Ignore fuse partitions * Add the nvme logic only for linux * Move smart/nvme structs to a separate file Co-authored-by: wlan0 <sidharthamn@gmail.com>	2020-10-04 10:18:46 -07:00
Krishna Srinivas	045e30f2c1	Set LastModified time from source for bucket replication (#10627 )	2020-10-02 18:32:22 -07:00
Harshavardhana	c6a9a94f94	fix: optimize ServerInfo() handler to avoid reading config (#10626 ) fixes #10620	2020-10-02 16:19:44 -07:00
Harshavardhana	8e7c00f3d4	add missing request-id from DeleteObject events (#10623 ) fixes #10621	2020-10-02 13:36:13 -07:00
Harshavardhana	23e8390997	fix: Allow Walk to honor load balanced drives (#10610 )	2020-10-01 20:24:34 -07:00
Anis Elleuch	71403be912	fix: consider partNumber in GET/HEAD requests (#10618 )	2020-10-01 15:41:12 -07:00
Harshavardhana	f28d02b7f2	fix: simplify obd how we calculate transferred bytes (#10617 )	2020-10-01 14:34:51 -07:00
Harshavardhana	e0cb814f3f	fail if port is not accessible (#10616 ) throw proper error when port is not accessible for the regular user, this is possibly a regression. ``` ERROR Unable to start the server: Insufficient permissions to use specified port > Please ensure MinIO binary has 'cap_net_bind_service=+ep' permissions HINT: Use 'sudo setcap cap_net_bind_service=+ep /path/to/minio' to provide sufficient permissions ```	2020-10-01 13:23:31 -07:00
Harshavardhana	98a08e1644	fix: protect updating latencies/throughput slices in obd (#10611 ) Additionally close the transferChan upon function exit.	2020-10-01 09:50:08 -07:00
Klaus Post	3047121255	dataupdate: Bump to force rescan (#10609 ) After #10594 let's invalidate the bloom filters to force the next cycles to go through all data. There is a small chance that the linked PR could have caused missing bloom filter data. This will invalidate the current bloom filters and make the crawler go through everything.	2020-09-30 16:10:40 -07:00
Ritesh H Shukla	5a7f92481e	fix: client errors for DNS service creation errors (#10584 )	2020-09-30 14:09:41 -07:00
Anis Elleuch	0d45c38782	List v1/versions routes based on source IP if found (#10603 ) Route based on source IP if found. This should distribute the listing load for V1 and versioning on multiple nodes evenly between different clients. If source IP is not found from the http request header, then falls back to bucket name instead.	2020-09-30 13:38:27 -07:00
Poorna Krishnamoorthy	56d1b227cf	Handle changes to versioning config for replication (#10598 ) Disallow versioning suspension on a bucket with pre-existing replication configuration If versioning is suspended on the target,replication should fail.	2020-09-30 13:36:37 -07:00
Lenin Alevski	bea87a5a20	fix: reading multiple TLS certificates when deployed in K8S (#10601 ) Ignore all regular files, CAs directory and any directory that starts with `..` inside the `.minio/certs` folder	2020-09-30 08:21:30 -07:00
Harshavardhana	2b4eb87d77	pick disks which are common maximally used (#10600 ) further optimization to ensure that good disks are always used for listing, other than healing we only use disks that are maximally used.	2020-09-29 22:54:02 -07:00
Harshavardhana	1f9abbee4d	make sure to release locks upon timeout (#10596 ) fixes #10418	2020-09-29 15:18:34 -07:00
Klaus Post	fdf0ae9167	exit data update tracker only upon context completion (#10594 ) The data update tracker saver would exit if data wasn't updated between cycles.	2020-09-29 13:23:53 -07:00
Harshavardhana	00eb6f6bc9	cache DiskInfo at storage layer for performance (#10586 ) `mc admin info` on busy setups will not move HDD heads unnecessarily for repeated calls, provides a better responsiveness for the call overall. Bonus change allow listTolerancePerSet be N-1 for good entries, to avoid skipping entries for some reason one of the disk went offline.	2020-09-29 09:54:41 -07:00
Harshavardhana	66174692a2	add '.healing.bin' for tracking currently healing disk (#10573 ) add a hint on the disk to allow for tracking fresh disk being healed, to allow for restartable heals, and also use this as a way to track and remove disks. There are more pending changes where we should move all the disk formatting logic to backend drives, this PR doesn't deal with this refactor instead makes it easier to track healing in the future.	2020-09-28 19:39:32 -07:00
飞雪无情	209680e89f	Remove redundant http.HandlerFunc type conversion. (#10576 )	2020-09-28 13:33:49 -07:00
飞雪无情	27d9bd04e5	Handling unhandled errors in the InfoCannedPolicy method. (#10575 )	2020-09-27 10:24:04 -07:00
Harshavardhana	bebcf4f004	unlock() only if locking was successful	2020-09-25 19:36:47 -07:00
Harshavardhana	eafa775952	fix: add lock ownership to expire locks (#10571 ) - Add owner information for expiry, locking, unlocking a resource - TopLocks returns now locks in quorum by default, provides a way to capture stale locks as well with `?stale=true` - Simplify the quorum handling for locks to avoid from storage class, because there were challenges to make it consistent across all situations. - And other tiny simplifications to reset locks.	2020-09-25 19:21:52 -07:00
Harshavardhana	66b4a862e0	fix: network failure err check should ignore context canceled errors (#10567 ) context canceled errors bubbling up from the network layer has the potential to be misconstrued as network errors, taking prematurely a server offline and triggering a health check routine avoid this potential occurrence.	2020-09-25 14:35:47 -07:00
Anis Elleuch	9603489dd3	federation: Honor range with UploadObjectPart to a different cluster (#10570 ) Use gr & length instead of srcInfo.Reader & srcInfo.Size because they don't honor range header	2020-09-25 12:06:42 -07:00
Anis Elleuch	b302c8a5f4	heal: Fix periodic healing cleanup (#10569 ) isEnded() was incorrectly calculating if the current healing sequence is ended or not. h.currentStatus.Items could be empty if healing is very slow and mc admin heal consumed all items.	2020-09-25 10:29:00 -07:00
Praveen raj Mani	b880796aef	Set the maximum open connections limit in PG and MySQL target configs (#10558 ) As the bulk/recursive delete will require multiple connections to open at an instance, The default open connections limit will be reached which results in the following error ```FATAL: sorry, too many clients already``` By setting the open connections to a reasonable value - `2`, We ensure that the max open connections will not be exhausted and lie under bounds. The queries are simple inserts/updates/deletes, for which a maximum open connection limit of 2 is sufficient. Fixes #10553 Allow user configuration for MaxOpenConnections	2020-09-24 22:20:30 -07:00
Harshavardhana	37a5d5d7a0	reduce timeouts between servers for faster disconnects (#10562 )	2020-09-24 20:10:07 -07:00
Harshavardhana	3cac262dd1	report heal drives properly, also from global state (#10561 ) It is possible the heal drives are not reported from the maintenance check because the background heal state simply relied on the `format.json` for capturing unformatted drives. It is possible that drives might be still healing - make sure that applications which rely on cluster health check respond back this detail.	2020-09-24 15:36:47 -07:00
poornas	e6ab4db6b8	Fix minimum replication workers started (#10560 ) This PR also fixes GetReplicationConfiguration permission in web-handlers.go to use bucket as resource	2020-09-24 12:25:41 -07:00
Harshavardhana	ca989eb0b3	avoid ListBuckets returning quorum errors when node is down (#10555 ) Also, revamp the way ListBuckets work make few portions of the healing logic parallel - walk objects for healing disks in parallel - collect the list of buckets in parallel across drives - provide consistent view for listBuckets()	2020-09-24 09:53:38 -07:00
飞雪无情	d778d034e7	Remove redundant mgmtQueryKey type. (#10557 ) Remove redundant type conversion.	2020-09-24 08:40:21 -07:00
Harshavardhana	f7f9517b6a	fix: host extraction without port	2020-09-23 12:10:14 -07:00
Harshavardhana	90cff10e2b	avoid crash if disks are not initialized	2020-09-23 12:00:29 -07:00
Harshavardhana	81caf35926	fix: reduce healthcheck interval for storage rest client (#10544 )	2020-09-23 10:43:42 -07:00
poornas	5726cef3ca	validate bucket exists in ListRemoteTargets api (#10552 )	2020-09-23 10:37:54 -07:00
Harshavardhana	8b74a72b21	fix: rename READY deadline to CLUSTER deadline ENV (#10535 )	2020-09-23 09:14:33 -07:00
Klaus Post	eec69d6796	Fix stale context for bucket retrieval (#10551 ) The provided context gets captured by the closure making all subsequent calls fail.	2020-09-23 08:30:31 -07:00
Harshavardhana	0537a21b79	avoid concurrent use of rand.NewSource (#10543 )	2020-09-22 15:34:27 -07:00
poornas	4c54ed8748	Close replica channel only once (#10542 ) Also enforce s3:GetReplicationConfiguration permission check as a bucket level resource.	2020-09-22 12:47:24 -07:00
Anis Elleuch	4c81201f95	fix: healing delete marker on versioned buckets (#10530 ) Healing was not working correctly in the distributed mode because errFileVersionNotFound was not properly converted in storage rest client. Besides, fixing the healing delete marker is not working as expected.	2020-09-21 15:16:16 -07:00
Harshavardhana	cd8d511d3d	move versionsOrder struct to xl-storage-utils	2020-09-21 14:24:42 -07:00
Harshavardhana	17e17da00d	add parallel workers to perform replication in parallel (#10525 ) set the concurrency for replication be to runtime.NumCPU()/2	2020-09-21 13:43:29 -07:00
Harshavardhana	a5da9120f3	fix: [fs] an error upon rwPool.Write() just attempt rwPool.Create() (#10533 ) On some NFS clients looks like errno is incorrectly set, which leads to incorrect errors thrown upwards.	2020-09-21 12:54:23 -07:00
poornas	aa12d75d75	fix crawler to detect lifecycle on bucket even if filter nil (#10532 )	2020-09-21 11:41:07 -07:00
Harshavardhana	6fcbdd5607	remove unused putObjectDir code (#10528 )	2020-09-21 09:41:39 -07:00
Harshavardhana	3831cc9e3b	fix: [fs] CompleteMultipart use trie structure for partMatch (#10522 ) performance improves by around 100x or more ``` go test -v -run NONE -bench BenchmarkGetPartFile goos: linux goarch: amd64 pkg: github.com/minio/minio/cmd BenchmarkGetPartFileWithTrie BenchmarkGetPartFileWithTrie-4 1000000000 0.140 ns/op 0 B/op 0 allocs/op PASS ok github.com/minio/minio/cmd 1.737s ``` fixes #10520	2020-09-21 01:18:13 -07:00
Krishna Srinivas	230fc0d186	Support for "directory" objects (#10499 )	2020-09-19 08:39:41 -07:00
Harshavardhana	7f9498f43f	fix: ignore faulty drives and continue (#10511 ) drives might return different types of errors handle them individually, and for some errors just log an error and continue	2020-09-18 12:09:05 -07:00
Harshavardhana	1cf322b7d4	change leader locker only for crawler (#10509 )	2020-09-18 11:15:54 -07:00
Klaus Post	0b1c824618	Fix incorrect request start time (#10516 ) Log request start time BEFORE starting processing the request	2020-09-18 09:30:52 -07:00
Klaus Post	c851e022b7	Tweaks to dynamic locks (#10508 ) * Fix cases where minimum timeout > default timeout. * Add defensive code for too small/negative timeouts. * Never set timeout below the maximum value of a request. * Protect against (unlikely) int64 wraps. * Decrease timeout slower. * Don't re-lock before copying.	2020-09-18 09:18:18 -07:00
Klaus Post	5ad032826a	Add a reasonable default if unable to get total RAM (#10506 ) Though unlikely we shouldn't skip initializing the API if we cannot get RAM. Add 16GiB as a default and log the error.	2020-09-18 02:03:02 -07:00
Harshavardhana	84bf4624a4	fix: make sure to preserve metadata during overwrite in FS mode (#10512 ) This bug was introduced in `14f0047295` almost 3yrs ago, as a side effect of removing stale `fs.json` but we in fact end up removing existing good `fs.json` for an existing object, leading to some form of a data loss. fixes #10496	2020-09-18 00:16:16 -07:00
Harshavardhana	4a36cd7035	fix: improve performance ListObjectParts in FS mode (#10510 ) from 20s for 10000 parts to less than 1sec Without the patch ``` ~ time aws --endpoint-url=http://localhost:9000 --profile minio s3api \ list-parts --bucket testbucket --key test \ --upload-id c1cd1f50-ea9a-4824-881c-63b5de95315a real 0m20.394s user 0m0.589s sys 0m0.174s ``` With the patch ``` ~ time aws --endpoint-url=http://localhost:9000 --profile minio s3api \ list-parts --bucket testbucket --key test \ --upload-id c1cd1f50-ea9a-4824-881c-63b5de95315a real 0m0.891s user 0m0.624s sys 0m0.182s ``` fixes #10503	2020-09-17 18:51:16 -07:00
Klaus Post	03490c811b	Fix obd goroutine leak (#10504 ) The goroutine collecting transfer stats never exits. Add missing channel close.	2020-09-17 10:10:20 -07:00
Harshavardhana	ed78854cea	fix: list across all drives to avoid stale disks	2020-09-16 21:17:10 -07:00
Harshavardhana	e60834838f	fix: background disk heal, to reload format consistently (#10502 ) It was observed in VMware vsphere environment during a pod replacement, `mc admin info` might report incorrect offline nodes for the replaced drive. This issue eventually goes away but requires quite a lot of time for all servers to be in sync. This PR fixes this behavior properly.	2020-09-16 21:14:35 -07:00
Harshavardhana	d616d8a857	serialize replication and feed it through task model (#10500 ) this allows for eventually controlling the concurrency of replication and overall control of throughput	2020-09-16 16:04:55 -07:00
Anis Elleuch	24cab7f9df	ilm: Remove a 'null' version if not latest (#10494 ) If the ILM document requires removing noncurrent versions, the server should be able to remove 'null' versions as well. 'null' versions are created when versioning is not enabled or suspended.	2020-09-16 10:21:50 -07:00
Harshavardhana	02c1a08a5b	fix: make sure to lock CopyObject for in-place updates (#10492 )	2020-09-15 20:44:48 -07:00
Ritesh H Shukla	5c47ce456e	Run replication in the background (#10491 )	2020-09-15 18:44:58 -07:00
Anis Elleuch	8ea55f9dba	obd: Add console log to OBD output (#10372 )	2020-09-15 18:02:54 -07:00
poornas	80e3dce631	azure: update content-md5 to metadata after upload (#10482 ) Fixes #10453	2020-09-15 16:31:47 -07:00
Harshavardhana	80fab03b63	fix: S3 gateway doesn't support full passthrough for encryption (#10484 ) The entire encryption layer is dependent on the fact that KMS should be configured for S3 encryption to work properly and we only support passing the headers as is to the backend for encryption only if KMS is configured. Make sure that this predictability is maintained, currently the code was allowing encryption to go through and fail at later to indicate that KMS was not configured. We should simply reply "NotImplemented" if KMS is not configured, this allows clients to simply proceed with their tests.	2020-09-15 13:57:15 -07:00
Harshavardhana	730d2dc7be	fix: allow CopyObject/PutObjectTags on pre-existing content (#10485 ) fixes #10475	2020-09-15 09:18:41 -07:00
Harshavardhana	0ee9678190	fix: add missing delete marker created filter (#10481 )	2020-09-14 21:32:52 -07:00
Klaus Post	34859c6d4b	Preallocate (safe) slices when we know the size (#10459 )	2020-09-14 20:44:18 -07:00
Klaus Post	b1c99e88ac	reduce CPU usage up to 50% in readdir (#10466 )	2020-09-14 17:19:54 -07:00
Harshavardhana	0104af6bcc	delayed locks until we have started reading the body (#10474 ) This is to ensure that Go contexts work properly, after some interesting experiments I found that Go net/http doesn't cancel the context when Body is non-zero and hasn't been read till EOF. The following gist explains this, this can lead to pile up of go-routines on the server which will never be canceled and will die at a really later point in time, which can simply overwhelm the server. https://gist.github.com/harshavardhana/c51dcfd055780eaeb71db54f9c589150 To avoid this refactor the locking such that we take locks after we have started reading from the body and only take locks when needed. Also, remove contextReader as it's not useful, doesn't work as expected context is not canceled until the body reaches EOF so there is no point in wrapping it with context and putting a `select {` on it which can unnecessarily increase the CPU overhead. We will still use the context to cancel the lockers etc. Additional simplification in the locker code to avoid timers as re-using them is a complicated ordeal avoid them in the hot path, since locking is very common this may avoid lots of allocations.	2020-09-14 15:57:13 -07:00
Harshavardhana	34ea1d2167	fix: return correct error code for MetadataTooLarge (#10470 ) fixes #10469	2020-09-13 21:26:35 -07:00
Harshavardhana	9d95937018	update KMS docs indicating deprecation of AUTO_ENCRYPTION env	2020-09-13 16:23:28 -07:00
Klaus Post	fa01e640f5	Continuous healing: add optional bitrot check (#10417 )	2020-09-12 00:08:12 -07:00
Harshavardhana	f355374962	add support for configurable remote transport deadline (#10447 ) configurable remote transport timeouts for some special cases where this value needs to be bumped to a higher value when transferring large data between federated instances.	2020-09-11 23:03:08 -07:00
Harshavardhana	bda0fe3150	fix: allow LDAP identity to support form body POST (#10468 ) similar to other STS APIs	2020-09-11 23:02:32 -07:00
Harshavardhana	b70995dd60	Revert "ilm: Remove null version if not latest with proper config (#10467 )" This reverts commit `4b6264da7d`.	2020-09-11 18:15:49 -07:00
Anis Elleuch	4b6264da7d	ilm: Remove null version if not latest with proper config (#10467 )	2020-09-11 14:20:09 -07:00
Harshavardhana	48919de301	fix: for defer'ed deleteObject use internal context (#10463 )	2020-09-11 06:39:19 -07:00
Harshavardhana	eb2934f0c1	simplify webhook DNS further generalize for gateway (#10448 ) continuation of the changes from `eaaf05a7cc` this further simplifies, enables this for gateway deployments as well	2020-09-10 14:19:32 -07:00
Klaus Post	b7438fe4e6	Copy metadata before spawning goroutine + prealloc maps (#10458 ) In `(*cacheObjects).GetObjectNInfo` copy the metadata before spawning a goroutine. Clean up a few map[string]string copies as well, reducing allocs and simplifying the code. Fixes #10426	2020-09-10 11:37:22 -07:00
Anis Elleuch	ce6cef6855	erasure: Call Walk() from all disks (#10445 ) It does not make sense to call Walk() in only N/2 disks and then requires N/2 quorum, just keep it N/2+1 The commit fixes this behavior.	2020-09-10 09:27:52 -07:00
Klaus Post	493c714663	Remove erasureSets and erasureObjects from ObjectLayer (#10442 )	2020-09-10 09:18:19 -07:00
Harshavardhana	e959c5d71c	fix: server panic in FS mode (#10455 ) fixes #10454	2020-09-10 09:16:26 -07:00
Harshavardhana	4a2928eb49	generate missing object delete bucket notifications (#10449 ) fixes #10381	2020-09-09 18:23:08 -07:00
Anis Elleuch	af88772a78	lifecycle: NoncurrentVersionExpiration considers noncurrent version age (#10444 ) From https://docs.aws.amazon.com/AmazonS3/latest/dev/intro-lifecycle-rules.html#intro-lifecycle-rules-actions ``` When specifying the number of days in the NoncurrentVersionTransition and NoncurrentVersionExpiration actions in a Lifecycle configuration, note the following: It is the number of days from when the version of the object becomes noncurrent (that is, when the object is overwritten or deleted), that Amazon S3 will perform the action on the specified object or objects. Amazon S3 calculates the time by adding the number of days specified in the rule to the time when the new successor version of the object is created and rounding the resulting time to the next day midnight UTC. For example, in your bucket, suppose that you have a current version of an object that was created at 1/1/2014 10:30 AM UTC. If the new version of the object that replaces the current version is created at 1/15/2014 10:30 AM UTC, and you specify 3 days in a transition rule, the transition date of the object is calculated as 1/19/2014 00:00 UTC. ```	2020-09-09 18:11:24 -07:00
Harshavardhana	9109148474	add support for new UA values for update an check (#10451 )	2020-09-09 17:21:39 -07:00
Nitish Tiwari	eaaf05a7cc	Add Kubernetes operator webhook server as DNS target (#10404 ) This PR adds a DNS target that ensures to update an entry into Kubernetes operator when a bucket is created or deleted. See minio/operator#264 for details. Co-authored-by: Harshavardhana <harsha@minio.io>	2020-09-09 12:20:49 -07:00
Harshavardhana	958661cbb5	skip subdomain from bucket DNS which start with `minio.domain` (#10390 ) extend host matcher to reject the host match	2020-09-09 09:57:37 -07:00
Harshavardhana	6a0372be6c	cleanup tmpDir any older entries automatically just like multipart (#10439 ) also consider multipart uploads, temporary files in `.minio.sys/tmp` as stale beyond 24hrs and clean them up automatically	2020-09-08 15:55:40 -07:00
Harshavardhana	c13afd56e8	Remove MaxConnsPerHost settings to avoid potential hangs (#10438 ) MaxConnsPerHost can potentially hang a call without any way to timeout, we do not need this setting for our proxy and gateway implementations instead IdleConn settings are good enough. Also ensure to use NewRequestWithContext and make sure to take the disks offline only for network errors. Fixes #10304	2020-09-08 14:22:04 -07:00
Harshavardhana	96997d2b21	allow ctrl+c to be consistent at early startup (#10435 ) fixes #10431	2020-09-08 09:10:55 -07:00
Klaus Post	86a3319d41	Ignore config values from unknown subsystems (#10432 )	2020-09-08 08:57:04 -07:00
Harshavardhana	9f60e84ce1	always copy UserDefined metadata map (#10427 ) fixes #10426	2020-09-07 09:25:28 -07:00
Harshavardhana	572b1721b2	set max API requests automatically based on RAM (#10421 )	2020-09-04 19:37:37 -07:00
Harshavardhana	b0e1d4ce78	re-attach offline drive after new drive replacement (#10416 ) inconsistent drive healing when one of the drive is offline while a new drive was replaced, this change is to ensure that we can add the offline drive back into the mix by healing it again.	2020-09-04 17:09:02 -07:00
Harshavardhana	eb19c8af40	Bump response header timeout for proxying list request (#10420 )	2020-09-04 16:07:40 -07:00
Klaus Post	2d58a8d861	Add storage layer contexts (#10321 ) Add context to all (non-trivial) calls to the storage layer. Contexts are propagated through the REST client. - `context.TODO()` is left in place for the places where it needs to be added to the caller. - `endWalkCh` could probably be removed from the walkers, but no changes so far. The "dangerous" part is that now a caller disconnecting will propagate down, so a "delete" operation will now be interrupted. In some cases we might want to disconnect this functionality so the operation completes if it has started, leaving the system in a cleaner state.	2020-09-04 09:45:06 -07:00
poornas	0037951b6e	improve error message when remote target missing (#10412 )	2020-09-04 08:48:38 -07:00
Andreas Auernhammer	fbd1c5f51a	certs: refactor cert manager to support multiple certificates (#10207 ) This commit refactors the certificate management implementation in the `certs` package such that multiple certificates can be specified at the same time. Therefore, the following layout of the `certs/` directory is expected: ``` certs/ │ ├─ public.crt ├─ private.key ├─ CAs/ // CAs directory is ignored │ │ │ ... │ ├─ example.com/ │ │ │ ├─ public.crt │ └─ private.key └─ foobar.org/ │ ├─ public.crt └─ private.key ... ``` However, directory names like `example.com` are just for human readability/organization and don't have any meaning w.r.t whether a particular certificate is served or not. This decision is made based on the SNI sent by the client and the SAN of the certificate. *** The `Manager` will pick a certificate based on the client trying to establish a TLS connection. In particular, it looks at the client hello (i.e. SNI) to determine which host the client tries to access. If the manager can find a certificate that matches the SNI it returns this certificate to the client. However, the client may choose to not send an SNI or tries to access a server directly via IP (`https://<ip>:<port>`). In this case, we cannot use the SNI to determine which certificate to serve. However, we also should not pick "the first" certificate that would be accepted by the client (based on crypto. parameters - like a signature algorithm) because it may be an internal certificate that contains internal hostnames. We would disclose internal infrastructure details doing so. Therefore, the `Manager` returns the "default" certificate when the client does not specify an SNI. The default certificate is the top-level `public.crt` - i.e. `certs/public.crt`. This approach has some consequences: - It's the operator's responsibility to ensure that the top-level `public.crt` does not disclose any information (i.e. hostnames) that are not publicly visible. However, this was the case in the past already. - Any other `public.crt` - except for the top-level one - must not contain any IP SAN. The reason for this restriction is that the Manager cannot match a SNI to an IP b/c the SNI is the server host name. The entire purpose of SNI is to indicate which host the client tries to connect to when multiple hosts run on the same IP. So, a client will not set the SNI to an IP. If we would allow IP SANs in a lower-level `public.crt` a user would expect that it is possible to connect to MinIO directly via IP address and that the MinIO server would pick "the right" certificate. However, the MinIO server cannot determine which certificate to serve, and therefore always picks the "default" one. This may lead to all sorts of confusing errors like: "It works if I use `https://instance.minio.local` but not when I use `https://10.0.2.1`". These consequences/limitations should be pointed out / explained in our docs in an appropriate way. However, the support for multiple certificates should not have any impact on how deployments with a single certificate function today. Co-authored-by: Harshavardhana <harsha@minio.io>	2020-09-03 23:33:37 -07:00
Harshavardhana	1c6781757c	add missing ListBucketVersions from policy actions (#10414 )	2020-09-03 18:25:06 -07:00
Harshavardhana	b4e3956e69	update KES docs to talk about 'mc encrypt' command (#10400 ) add a deprecation notice for KMS_AUTO_ENCRYPTION	2020-09-03 12:43:45 -07:00
Harshavardhana	8a291e1dc0	Cluster healthcheck improvements (#10408 ) - do not fail the healthcheck if heal status was not obtained from one of the nodes, if many nodes fail then report this as a catastrophic error. - add "x-minio-write-quorum" value to match the write tolerance supported by server. - admin info now states if a drive is healing where madmin.Disk.Healing is set to true and madmin.Disk.State is "ok"	2020-09-02 22:54:56 -07:00
Klaus Post	650dccfa9e	cache: Only start at high watermark (#10403 ) Currently, cache purges are triggered as soon as the low watermark is exceeded. To reduce IO this should only be done when reaching the high watermark. This simplifies checks and reduces all calls for a GC to go through `dcache.diskSpaceAvailable(size)`. While a comment claims that `dcache.triggerGC <- struct{}{}` was non-blocking, I don't see how that was possible. Instead, we give the queue channel a size of 1 and use channel semantics to avoid blocking when a GC has already been requested. `bytesToClear` now takes the high watermark into account so it will not request any bytes to be cleared until that is reached.	2020-09-02 17:48:44 -07:00
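The channel technique mentioned in the commit above, a queue of size 1 combined with a non-blocking send, can be sketched as follows. The type and function names are hypothetical and only illustrate the pattern; they are not taken from the cache code.

```go
package main

// gcTrigger is a hypothetical wrapper around a request channel with
// capacity 1, so at most one GC request can be pending at a time.
type gcTrigger struct {
	ch chan struct{}
}

func newGCTrigger() *gcTrigger {
	return &gcTrigger{ch: make(chan struct{}, 1)}
}

// request queues a GC request without blocking. It reports false when a
// request is already pending, in which case nothing is sent.
func (g *gcTrigger) request() bool {
	select {
	case g.ch <- struct{}{}:
		return true
	default:
		return false
	}
}
```

A worker goroutine would receive from `ch` and run one purge per received value; callers can invoke `request` freely because a full channel is simply skipped.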
Andreas Auernhammer	9a703befe6	crypto: reduce retry delay when retrying KES requests (#10394 ) This commit reduces the retry delay when retrying a request to a KES server by: - reducing the max. jitter delay from 3s to 1.5s - skipping the random delay when there are more KES endpoints available. If there are more KES endpoints we can directly retry the request by sending it to the next endpoint - as pointed out by @krishnasrinivas	2020-09-02 11:04:10 -07:00
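The two rules in the commit above can be sketched as a small helper. This is an illustration under assumptions: the function name, the `endpointsLeft` parameter, and the uniform distribution of the jitter are hypothetical; only the 1.5s upper bound and the "no delay while other endpoints remain" rule come from the commit message.

```go
package main

import (
	"math/rand"
	"time"
)

// maxJitter is the upper bound on the random retry delay (1.5s per the
// commit message).
const maxJitter = 1500 * time.Millisecond

// retryDelay returns how long to wait before the next attempt.
// endpointsLeft is the number of endpoints not yet tried in this round.
func retryDelay(endpointsLeft int) time.Duration {
	if endpointsLeft > 0 {
		// Another endpoint is available: retry against it immediately.
		return 0
	}
	// Assumption: jitter is drawn uniformly from [0, maxJitter).
	return time.Duration(rand.Int63n(int64(maxJitter)))
}
```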
Klaus Post	9a1615768d	Fix flaky TestXLStorageVerifyFile (#10398 ) `TestXLStorageVerifyFile` would fail 1 in 256 if the first random character was 'a'. Instead write 256 bytes which has 1 in 256^256 probability.	2020-09-02 09:42:24 -07:00
Harshavardhana	37da0c647e	fix: delete marker compatibility behavior for suspended bucket (#10395 ) - delete-marker should be created on a suspended bucket as `null` - delete-marker should delete any pre-existing `null` versioned object and create an entry `null`	2020-09-02 00:19:03 -07:00
Harshavardhana	2acb530ccd	update rulesguard with new rules (#10392 ) Co-authored-by: Nitish Tiwari <nitish@minio.io> Co-authored-by: Praveen raj Mani <praveen@minio.io>	2020-09-01 16:58:13 -07:00
Klaus Post	3e1fb17b70	heal: Check for truncated files (#10399 ) When checking parts we already do a stat for each part. Since we have the on-disk size, check if it is at least what we expect. When checking metadata, check if metadata is 0 bytes.	2020-09-01 12:06:45 -07:00
Klaus Post	a89d6b8e3d	Fix common Windows failure (#10397 ) The `getNonLoopBackIP` may grab an IP from an interface that doesn't allow binding (on Windows), so this test consistently fails. We exclude that specific error.	2020-09-01 10:11:15 -07:00
Klaus Post	1c085f7d1a	Fix crash on Windows when crawling (#10385 ) * readDirN: Check if file is directory `syscall.FindNextFile` crashes if the handle is a file. `errFileNotFound` matches 'unix' functionality: `d19b434ffc/cmd/os-readdir_unix.go (L106)` Fixes #10384	2020-09-01 09:33:16 -07:00
Harshavardhana	4b6585d249	support 'ldap:user' variable replacement properly (#10391 ) also update `ldap.go` examples with latest minio-go changes Fixes #10367	2020-09-01 12:26:22 +05:30
Harshavardhana	9ffad7fceb	discard empty endpoint in crypto kes introduced in `18725679c4`	2020-08-31 19:35:43 -07:00
Andreas Auernhammer	18725679c4	crypto: allow multiple KES endpoints (#10383 ) This commit addresses a maintenance / automation problem when MinIO-KES is deployed on bare-metal. In orchestrated env. the orchestrator (K8S) will make sure that `n` KES servers (IPs) are available via the same DNS name. There it is sufficient to provide just one endpoint.	2020-08-31 18:10:52 -07:00
Anis Elleuch	ba8a8ad818	ListObjectsV1 requests unnecessarily fail with offline nodes (#10386 ) ListObjectsV1 requests are actually redirected to a specific node, depending on the bucket name. The purpose of this behavior was to optimize listing. However, the current code sends a Bad Gateway error if the target node is offline, which is bad behavior because it means that the list request will fail, although this is unnecessary since we can still use the current node to list as well (the default behavior without the proxying optimization). Currently, mint fails when there is one offline node; after this PR, mint will always succeed.	2020-08-31 12:37:31 -07:00
Harshavardhana	102ad60dee	simplify removing temporary files (#10389 )	2020-08-31 12:35:40 -07:00
Gaige B Paulsen	859ef52886	update for smartos build (solaris too) (#10378 )	2020-08-31 10:19:25 -07:00
Harshavardhana	e730da1438	fix: refresh JWKS public keys upon failure (#10368 ) fixes #10359	2020-08-28 08:15:12 -07:00
Anis Elleuch	46ee8659b4	fix write quorum calculation for bucket operations (#10364 ) When the number of disks is odd, the calculation of quorum for bucket operations was not correct; fix it.	2020-08-27 12:55:32 -07:00
Harshavardhana	a359e36e35	tolerate listing with only readQuorum disks (#10357 ) We can reduce this further in the future, but this is a good value to keep around. With the advent of continuous healing, we can be assured that the namespace will eventually be consistent, so we are okay to avoid the necessity to list across all drives on all sets. Bonus: Pop()'s in parallel seem to have the potential to wait too on large drive setups and cause more slowness instead of gaining any performance; remove it for now. Also, implement load balanced reply for local disks, ensuring that local disks have an affinity for - cleanupStaleMultipartUploads()	2020-08-26 19:29:35 -07:00
Jorge Israel Peña	0a2e6d58a5	hdfs gateway handle listing single files (#10362 )	2020-08-26 16:03:53 -07:00
Klaus Post	1b119557c2	getDisksInfo: Attribute failed disks to correct endpoint (#10360 ) If DiskInfo calls failed the information returned was used anyway resulting in no endpoint being set. This would make the drive be attributed to the local system since `disk.Endpoint == disk.DrivePath` in that case. Instead, if the call fails record the endpoint and the error only.	2020-08-26 10:11:26 -07:00
Harshavardhana	7778fef6bb	update continuous heal metrics appropriately for scanned items (#10352 ) bonus: make sure to ignore objectNotFound and versionNotFound errors properly at all layers, since HealObjects() returns objectNotFound error if the bucket or prefix is empty.	2020-08-26 08:53:33 -07:00
飞雪无情	ea1803417f	Use constants for gateway names to avoid bugs caused by spelling. (#10355 )	2020-08-26 08:52:46 -07:00
Harshavardhana	d19b434ffc	fix: bring back delayed leaf detection in listing (#10346 )	2020-08-25 12:26:48 -07:00
Klaus Post	17a1eda702	Disregard healing disks in crawling (#10349 ) When crawling never use a disk we know is healing. Most of the change involves keeping track of the original endpoint on xlStorage and this also fixes DiskInfo.Endpoint never being populated. Heal master will print `data-crawl: Disk "http://localhost:9001/data/mindev/data2/xl1" is Healing, skipping` once on a cycle (no more often than every 5m).	2020-08-25 10:55:15 -07:00
Daniel Valdivia	7d1734d033	indicate through HTTP header cluster healing in progress (#10342 )	2020-08-24 15:20:50 -07:00
Harshavardhana	03ec6adfd0	fix: KES http2.0 communication support (#10341 )	2020-08-24 14:37:53 -07:00
Harshavardhana	309b10f201	keep crawler cycle at 5 minutes	2020-08-24 14:05:16 -07:00
Klaus Post	c097ce9c32	continuous healing based on crawler (#10103 ) Design: https://gist.github.com/klauspost/792fe25c315caf1dd15c8e79df124914	2020-08-24 13:47:01 -07:00
Harshavardhana	caad314faa	add ruleguard support, fix all the reported issues (#10335 )	2020-08-24 12:11:20 -07:00
Klaus Post	bc2ebe0021	Only enforce quota on success (#10339 ) We should only enforce quotas if no error has been returned. firstErr is safe to access since all goroutines have exited at this point. If `firstErr` hasn't been set by something else return the context error if cancelled.	2020-08-24 10:15:46 -07:00
Harshavardhana	11aa393ba7	Allow region errors to be dynamic (#10323 ) remove other FIXMEs as we are not planning to fix these, instead we will add dynamism on a case-by-case basis. fixes #10250	2020-08-23 22:06:22 -07:00

3564 commits