minio

Author	SHA1	Message	Date
Harshavardhana	2760fc86af	Bump default idleConnsPerHost to control conns in time_wait (#10653 ) This PR fixes a hang which occurs quite commonly at higher concurrency by allowing following changes - allowing lower connections in time_wait allows faster socket open's - lower idle connection timeout to ensure that we let kernel reclaim the time_wait connections quickly - increase somaxconn to 4096 instead of 2048 to allow larger tcp syn backlogs. fixes #10413	2020-10-12 14:19:46 -07:00
P R	abb14aeec1	Header row seperator for console Table (#10651 )	2020-10-12 11:32:43 -07:00
Ritesh H Shukla	8ceb2a93fd	fix: peer replication bandwidth monitoring in distributed setup (#10652 )	2020-10-12 09:04:55 -07:00
Ritesh H Shukla	c2f16ee846	Add basic bandwidth monitoring for replication. (#10501 ) This change tracks bandwidth for a bucket and object - [x] Add Admin API - [x] Add Peer API - [x] Add BW throttling - [x] Admin APIs to set replication limit - [x] Admin APIs for fetch bandwidth	2020-10-09 20:36:00 -07:00
Harshavardhana	a0d0645128	remove safeMode behavior in startup (#10645 ) In almost all scenarios MinIO now is mostly ready for all sub-systems independently, safe-mode is not useful anymore and do not serve its original intended purpose. allow server to be fully functional even with config partially configured, this is to cater for availability of actual I/O v/s manually fixing the server. In k8s like environments it will never make sense to take pod into safe-mode state, because there is no real access to perform any remote operation on them.	2020-10-09 09:59:52 -07:00
Harshavardhana	736e58dd68	fix: handle concurrent lockers with multiple optimizations (#10640 ) - select lockers which are non-local and online to have affinity towards remote servers for lock contention - optimize lock retry interval to avoid sending too many messages during lock contention, reduces average CPU usage as well - if bucket is not set, when deleteObject fails make sure setPutObjHeaders() honors lifecycle only if bucket name is set. - fix top locks to list out always the oldest lockers always, avoid getting bogged down into map's unordered nature.	2020-10-08 12:32:32 -07:00
Poorna Krishnamoorthy	907a171edd	Generalize error messages for remote targets (#10638 ) This is to allow remote targets to be generalized for replication/ILM transition Also adding a field in BucketTarget to identify a remote target with a label.	2020-10-08 10:54:11 -07:00
Harshavardhana	effe131090	fix: allow read unlocks to be defensive about split brains (#10637 )	2020-10-07 09:15:01 -07:00
Shireesh Anjal	f1418a50f0	add NVMe drive info [model num, serial num, drive temp. etc.] (#10613 ) * add NVMe drive info [model num, serial num, drive temp. etc.] * Ignore fuse partitions * Add the nvme logic only for linux * Move smart/nvme structs to a separate file Co-authored-by: wlan0 <sidharthamn@gmail.com>	2020-10-04 10:18:46 -07:00
Harshavardhana	23e8390997	fix: Allow Walk to honor load balanced drives (#10610 )	2020-10-01 20:24:34 -07:00
Harshavardhana	98a08e1644	fix: protect updating latencies/throughput slices in obd (#10611 ) Additionally close the transferChan upon function exit.	2020-10-01 09:50:08 -07:00
Anis Elleuch	0d45c38782	List v1/versions routes based on source IP if found (#10603 ) Routing using on source IP if found. This should distribute the listing load for V1 and versioning on multiple nodes evenly between different clients. If source IP is not found from the http request header, then falls back to bucket name instead.	2020-09-30 13:38:27 -07:00
Shireesh Anjal	6e138f955e	Fix a couple of typos in json config (#10605 ) Vault.Encrypt: encryp -> encrypt SysOBDProcess.Uids: uidsomitempty -> uids,omitempty	2020-09-30 13:08:11 -07:00
Harshavardhana	2b4eb87d77	pick disks which are common maximally used (#10600 ) further optimization to ensure that good disks are always used for listing, other than healing we only use disks that are maximally used.	2020-09-29 22:54:02 -07:00
Harshavardhana	1f9abbee4d	make sure to release locks upon timeout (#10596 ) fixes #10418	2020-09-29 15:18:34 -07:00
Harshavardhana	849fcf0127	block unlocks if there are quorum failures (#10582 ) fixes #10418	2020-09-28 15:39:52 -07:00
Harshavardhana	eafa775952	fix: add lock ownership to expire locks (#10571 ) - Add owner information for expiry, locking, unlocking a resource - TopLocks returns now locks in quorum by default, provides a way to capture stale locks as well with `?stale=true` - Simplify the quorum handling for locks to avoid from storage class, because there were challenges to make it consistent across all situations. - And other tiny simplifications to reset locks.	2020-09-25 19:21:52 -07:00
Harshavardhana	66b4a862e0	fix: network failure err check should ignore context canceled errors (#10567 ) context canceled errors bubbling up from the network layer has the potential to be misconstrued as network errors, taking prematurely a server offline and triggering a health check routine avoid this potential occurrence.	2020-09-25 14:35:47 -07:00
飞雪无情	4de88e87bb	os.SEEK_SET is deprecated,use io.SeekStart. (#10563 )	2020-09-25 03:12:25 -07:00
Praveen raj Mani	b880796aef	Set the maximum open connections limit in PG and MySQL target configs (#10558 ) As the bulk/recursive delete will require multiple connections to open at an instance, The default open connections limit will be reached which results in the following error ```FATAL: sorry, too many clients already``` By setting the open connections to a reasonable value - `2`, We ensure that the max open connections will not be exhausted and lie under bounds. The queries are simple inserts/updates/deletes which is operational and sufficient with the the maximum open connection limit is 2. Fixes #10553 Allow user configuration for MaxOpenConnections	2020-09-24 22:20:30 -07:00
Harshavardhana	0537a21b79	avoid concurrenct use of rand.NewSource (#10543 )	2020-09-22 15:34:27 -07:00
Shireesh Anjal	b17dc81540	Change "disks" node to "drives" in OBD output (#10540 )	2020-09-22 11:53:19 -07:00
Harshavardhana	3831cc9e3b	fix: [fs] CompleteMultipart use trie structure for partMatch (#10522 ) performance improves by around 100x or more ``` go test -v -run NONE -bench BenchmarkGetPartFile goos: linux goarch: amd64 pkg: github.com/minio/minio/cmd BenchmarkGetPartFileWithTrie BenchmarkGetPartFileWithTrie-4 1000000000 0.140 ns/op 0 B/op 0 allocs/op PASS ok github.com/minio/minio/cmd 1.737s ``` fixes #10520	2020-09-21 01:18:13 -07:00
Harshavardhana	1cf322b7d4	change leader locker only for crawler (#10509 )	2020-09-18 11:15:54 -07:00
poornas	00555c747e	Strip standard ports off remote target url (#10498 )	2020-09-17 11:09:50 -07:00
Harshavardhana	d616d8a857	serialize replication and feed it through task model (#10500 ) this allows for eventually controlling the concurrency of replication and overally control of throughput	2020-09-16 16:04:55 -07:00
Anis Elleuch	8ea55f9dba	obd: Add console log to OBD output (#10372 )	2020-09-15 18:02:54 -07:00
Harshavardhana	0ee9678190	fix: add missing delete marker created filter (#10481 )	2020-09-14 21:32:52 -07:00
Harshavardhana	0104af6bcc	delayed locks until we have started reading the body (#10474 ) This is to ensure that Go contexts work properly, after some interesting experiments I found that Go net/http doesn't cancel the context when Body is non-zero and hasn't been read till EOF. The following gist explains this, this can lead to pile up of go-routines on the server which will never be canceled and will die at a really later point in time, which can simply overwhelm the server. https://gist.github.com/harshavardhana/c51dcfd055780eaeb71db54f9c589150 To avoid this refactor the locking such that we take locks after we have started reading from the body and only take locks when needed. Also, remove contextReader as it's not useful, doesn't work as expected context is not canceled until the body reaches EOF so there is no point in wrapping it with context and putting a `select {` on it which can unnecessarily increase the CPU overhead. We will still use the context to cancel the lockers etc. Additional simplification in the locker code to avoid timers as re-using them is a complicated ordeal avoid them in the hot path, since locking is very common this may avoid lots of allocations.	2020-09-14 15:57:13 -07:00
Andreas Auernhammer	224daee391	fix nats TLS unit tests (#10476 ) This commit fixes the nats TLS tests by generating new certificates (root CA, server and client) - each valid for 10y. The new certificates don't have a common name (deprecated by X.509) but SANs instead. Since Go 1.15 the Go `crypto/x509` package rejects certificates that only have a common name and no SAN. See: https://golang.org/doc/go1.15#commonname	2020-09-14 13:19:46 -07:00
Harshavardhana	48919de301	fix: for defer'ed deleteObject use internal context (#10463 )	2020-09-11 06:39:19 -07:00
Anis Elleuch	af88772a78	lifecycle: NoncurrentVersionExpiration considers noncurrent version age (#10444 ) From https://docs.aws.amazon.com/AmazonS3/latest/dev/intro-lifecycle-rules.html#intro-lifecycle-rules-actions ``` When specifying the number of days in the NoncurrentVersionTransition and NoncurrentVersionExpiration actions in a Lifecycle configuration, note the following: It is the number of days from when the version of the object becomes noncurrent (that is, when the object is overwritten or deleted), that Amazon S3 will perform the action on the specified object or objects. Amazon S3 calculates the time by adding the number of days specified in the rule to the time when the new successor version of the object is created and rounding the resulting time to the next day midnight UTC. For example, in your bucket, suppose that you have a current version of an object that was created at 1/1/2014 10:30 AM UTC. If the new version of the object that replaces the current version is created at 1/15/2014 10:30 AM UTC, and you specify 3 days in a transition rule, the transition date of the object is calculated as 1/19/2014 00:00 UTC. ```	2020-09-09 18:11:24 -07:00
Nitish Tiwari	eaaf05a7cc	Add Kubernetes operator webook server as DNS target (#10404 ) This PR adds a DNS target that ensures to update an entry into Kubernetes operator when a bucket is created or deleted. See minio/operator#264 for details. Co-authored-by: Harshavardhana <harsha@minio.io>	2020-09-09 12:20:49 -07:00
Klaus Post	0987069e37	select: Fix integer conversion overflow (#10437 ) Do not convert float value to integer if it will over/underflow. The comparison cannot be `<=` since rounding may overflow it. Fixes #10436	2020-09-08 15:56:11 -07:00
Harshavardhana	c13afd56e8	Remove MaxConnsPerHost settings to avoid potential hangs (#10438 ) MaxConnsPerHost can potentially hang a call without any way to timeout, we do not need this setting for our proxy and gateway implementations instead IdleConn settings are good enough. Also ensure to use NewRequestWithContext and make sure to take the disks offline only for network errors. Fixes #10304	2020-09-08 14:22:04 -07:00
Andreas Auernhammer	fbd1c5f51a	certs: refactor cert manager to support multiple certificates (#10207 ) This commit refactors the certificate management implementation in the `certs` package such that multiple certificates can be specified at the same time. Therefore, the following layout of the `certs/` directory is expected: ``` certs/ │ ├─ public.crt ├─ private.key ├─ CAs/ // CAs directory is ignored │ │ │ ... │ ├─ example.com/ │ │ │ ├─ public.crt │ └─ private.key └─ foobar.org/ │ ├─ public.crt └─ private.key ... ``` However, directory names like `example.com` are just for human readability/organization and don't have any meaning w.r.t whether a particular certificate is served or not. This decision is made based on the SNI sent by the client and the SAN of the certificate. *** The `Manager` will pick a certificate based on the client trying to establish a TLS connection. In particular, it looks at the client hello (i.e. SNI) to determine which host the client tries to access. If the manager can find a certificate that matches the SNI it returns this certificate to the client. However, the client may choose to not send an SNI or tries to access a server directly via IP (`https://<ip>:<port>`). In this case, we cannot use the SNI to determine which certificate to serve. However, we also should not pick "the first" certificate that would be accepted by the client (based on crypto. parameters - like a signature algorithm) because it may be an internal certificate that contains internal hostnames. We would disclose internal infrastructure details doing so. Therefore, the `Manager` returns the "default" certificate when the client does not specify an SNI. The default certificate the top-level `public.crt` - i.e. `certs/public.crt`. This approach has some consequences: - It's the operator's responsibility to ensure that the top-level `public.crt` does not disclose any information (i.e. hostnames) that are not publicly visible. However, this was the case in the past already. - Any other `public.crt` - except for the top-level one - must not contain any IP SAN. The reason for this restriction is that the Manager cannot match a SNI to an IP b/c the SNI is the server host name. The entire purpose of SNI is to indicate which host the client tries to connect to when multiple hosts run on the same IP. So, a client will not set the SNI to an IP. If we would allow IP SANs in a lower-level `public.crt` a user would expect that it is possible to connect to MinIO directly via IP address and that the MinIO server would pick "the right" certificate. However, the MinIO server cannot determine which certificate to serve, and therefore always picks the "default" one. This may lead to all sorts of confusing errors like: "It works if I use `https:instance.minio.local` but not when I use `https://10.0.2.1`. These consequences/limitations should be pointed out / explained in our docs in an appropriate way. However, the support for multiple certificates should not have any impact on how deployment with a single certificate function today. Co-authored-by: Harshavardhana <harsha@minio.io>	2020-09-03 23:33:37 -07:00
Harshavardhana	1c6781757c	add missing ListBucketVersions from policy actions (#10414 )	2020-09-03 18:25:06 -07:00
Harshavardhana	8a291e1dc0	Cluster healthcheck improvements (#10408 ) - do not fail the healthcheck if heal status was not obtained from one of the nodes, if many nodes fail then report this as a catastrophic error. - add "x-minio-write-quorum" value to match the write tolerance supported by server. - admin info now states if a drive is healing where madmin.Disk.Healing is set to true and madmin.Disk.State is "ok"	2020-09-02 22:54:56 -07:00
飞雪无情	2d96940826	fix: adminTrace show any errors when server is shutdown. (#10370 )	2020-08-28 10:04:54 -07:00
Anis Elleuch	9acdeab73d	lifecycle: Accept document without expiration (#10348 )	2020-08-25 12:38:59 -07:00
KevinSmile	5f7bd2b1da	fix: lifecycle-expiration validation bug (#10327 )	2020-08-24 13:56:50 -07:00
Harshavardhana	caad314faa	add ruleguard support, fix all the reported issues (#10335 )	2020-08-24 12:11:20 -07:00
Praveen raj Mani	d0c910a6f3	Support https and basic-auth for elasticsearch notification target (#10332 )	2020-08-23 09:43:48 -07:00
Tobias Nygren	052b5262ff	use statvfs(2) for disk.GetInfo on NetBSD (#10257 )	2020-08-20 20:13:06 -07:00
Krishnan Parthasarathi	ccd967e3be	Add ExpiresAt to LicenseInfo (#10293 )	2020-08-19 19:21:04 -07:00
Harshavardhana	c8b84a0e9e	Add nancy vulnerability scanner (#10289 )	2020-08-19 14:25:21 -07:00
Harshavardhana	74116204ce	handle fresh setup with mixed drives (#10273 ) fresh drive setups when one of the drive is a root drive, we should ignore such a root drive and not proceed to format. This PR handles this properly by marking the disks which are root disk and they are taken offline.	2020-08-18 14:37:26 -07:00
Klaus Post	adca28801d	feat: disable Parquet by default (breaking change) (#9920 ) I have built a fuzz test and it crashes heavily in seconds and will OOM shortly after. It seems like supporting Parquet is basically a completely open way to crash the server if you can upload a file and run s3 select on it. Until Parquet is more hardened it is DISABLED by default since hostile crafted input can easily crash the server. If you are in a controlled environment where it is safe to assume no hostile content can be uploaded to your cluster you can safely enable Parquet. To enable Parquet set the environment variable `MINIO_API_SELECT_PARQUET=on` while starting the MinIO server. Furthermore, we guard parquet by recover functions.	2020-08-18 10:23:28 -07:00
Harshavardhana	ede86845e5	docs: Add policy variables for resource and conditions (#10278 ) Bonus fix adds LDAP policy variable and clarifies the usage of policy variables for temporary credentials. fixes #10197	2020-08-17 17:39:55 -07:00
Harshavardhana	83a82d818e	allow lock tolerance to match storage-class drive tolerance (#10270 )	2020-08-14 18:17:14 -07:00

1 2 3 4 5 ...

1615 commits