Commit graph

173 commits

Author SHA1 Message Date
Hans5958
b21fbff228 Also convert to the new links format, some other edits 2022-04-18 12:22:04 +07:00
AnonymousRandomPerson
ce34865053 Added script to migrate atlas formats 2022-04-17 23:36:14 -04:00
Hans5958
bbf6703df2 Print formatter log every 200 due to slowdown 2022-04-17 14:38:53 +07:00
Hans5958
9b7baca08a Merge pull request #1165 from fabi321/fix-line-end
Fixed platform specific behaviour in format.py
2022-04-17 14:30:28 +07:00
Hans5958
2f417e433a Optimize new center algo a little bit
incl. Don't use modules
2022-04-17 14:12:34 +07:00
Fabian Wunsch
a80f070173 Merge branch 'remaster' into fix-line-end 2022-04-17 09:12:12 +02:00
Hans5958
41e29c1abb Merge branch 'remaster' into new-center-algorithm 2022-04-17 13:39:14 +07:00
Hans5958
03c6748d02 Fix bugs on formatter 2022-04-17 13:36:19 +07:00
Fabian Wunsch
c01610774d Merge branch 'remaster' into new-center-algorithm 2022-04-17 08:31:51 +02:00
Hans5958
49a0c85c27 Fix pattern to compensate R/ on links object 2022-04-17 13:18:43 +07:00
Hans5958
d93c2ed5a5 Use specific pattern for new format subreddits 2022-04-17 13:12:22 +07:00
Hans5958
bb2704c2d2 Formatter support for new format 2022-04-16 22:25:11 +07:00
Hans5958
41ab1f0d20 Use different flair for edit 2022-04-16 22:25:11 +07:00
Hans5958
d9107ac6df Fix merging contributors 2022-04-16 22:25:11 +07:00
Hans5958
6faec6f11d Store and check read edit entry ids 2022-04-16 22:25:11 +07:00
Hans5958
1e1c007d31 Support editing on script, submitted_by to contributors, merge to atlas script 2022-04-16 22:25:10 +07:00
Stefano Haagmans
1828a0ae53 Optimised images and fixed headers 2022-04-15 13:35:00 +02:00
Fabian Wunsch
d32649d5c7 Added comments
and tweaked the threshold a bit
2022-04-12 15:13:39 +02:00
Fabian Wunsch
24bed2a73c Moved log out of loop 2022-04-12 08:15:27 +02:00
Fabian Wunsch
9c5782e6c8 More balanced approach
to centroid vs pole of inaccessibility
2022-04-12 08:15:21 +02:00
Fabian Wunsch
5aa211e120 Much better way of finding the visual center 2022-04-12 08:15:12 +02:00
Fabian Wunsch
132798351d Added typing; added clarification 2022-04-12 08:15:02 +02:00
Fabian Wunsch
95c706f95a Small bugfix 2022-04-12 08:14:56 +02:00
Fabian Wunsch
8fc546885e More descriptive variable names 2022-04-12 08:14:46 +02:00
Fabian Wunsch
302f0fe068 Changed codebase and minor bugfixes
Changed the codebase to https://github.com/Twista/python-polylabel/
and rounded results
2022-04-12 08:14:42 +02:00
Fabian Wunsch
d819cfb313 Changed the algorithm to calculate the center
this is the version from Mapbox, just to see how well it performs. It
definitely needs improvements as of now. I'm going to include the
atlas.json, but it will be excluded once the pr is ready to be merged
2022-04-12 08:14:10 +02:00
ash
8af6b4abaa Merge pull request #1234 from placeAtlas/master
Merge
2022-04-11 14:56:47 +01:00
Fabian Wunsch
ed084ac936 Fixed logical indentation error 2022-04-11 19:51:15 +07:00
Fabian Wunsch
149a1e4209 Changed indentation from spaces to tabs 2022-04-11 19:51:15 +07:00
Fabian Wunsch
37d4325186 Small naming improvement for better clarity 2022-04-11 19:51:15 +07:00
Fabian Wunsch
da58a13058 Small consistency improvement 2022-04-11 19:51:15 +07:00
Fabian Wunsch
2ea0e48a1a Off by one error 2022-04-11 19:51:15 +07:00
Fabian Wunsch
9b44ba2ab8 Changed the print content
Co-authored-by: Hans5958 <hans5958@outlook.com>
2022-04-11 19:51:15 +07:00
Fabian Wunsch
c4716f5f0e Remove duplicate points using format.py
This is actually a very widespread "problem". There are many entries
that have 2 points after each other that are identical. This happens
especially often with identical first and last points. This should not
alter the appearance in any way.
2022-04-11 19:51:15 +07:00
AnonymousRandomPerson
6a220c13c1 Updated Reddit additions 2022-04-11 02:22:55 -04:00
AnonymousRandomPerson
e0728154ad Merge branch 'master' into cleanup 2022-04-10 14:13:14 -04:00
Hans5958
547ffc773e List ids that have already been read in a separate file 2022-04-10 21:34:06 +07:00
Fabian Wunsch
8f095a6ba0 Using LF instead of CRLF 2022-04-10 16:10:40 +02:00
Fabian Wunsch
a052531c13 Even available without import since 3.5 2022-04-09 23:32:17 +02:00
Fabian Wunsch
c91df97e25 More beautiful way of fixing this 2022-04-09 23:30:11 +02:00
Fabian Wunsch
ebcd5ff847 Fixed platform specific behaviour in format.py
Previously this would've used CRLF on Windows and LF on Linux, now it's
using CRLF on all platforms, as it's the current format
2022-04-09 23:24:33 +02:00
Fabian Wunsch
706f2b32b0 Added a cleanup step to recalculate the center
This change calculates the center of each entry and compares it to the
current center. If they are different, the center gets updated.
2022-04-09 21:08:36 +02:00
Hans5958
804159e116 Reduce copyright notice, add 2022 contributors, add missing notice
The lengthy boilerplate is unnecessary, especially since it is already linked. The type of license can stay there instead.
2022-04-09 16:33:10 +07:00
Hans5958
a19ee7d65a Use HTTPS link to license 2022-04-09 16:33:09 +07:00
Hans5958
b5ee700b00 Forgot to also parse subreddits 2022-04-09 16:33:09 +07:00
Hans5958
31101f8211 Forgot to avoid IndexError 2022-04-09 16:33:09 +07:00
Hans5958
cc4bd5b6da Remove entries with less than 3 points 2022-04-09 16:33:09 +07:00
Hans5958
a996899814 Add ensure_ascii=False 2022-04-09 16:33:09 +07:00
Hans5958
7ee2e5a15a Remove atlas-before-ids-migration from formatter 2022-04-09 16:33:09 +07:00
Hans5958
9053d6d106 Add docs on formatter, move path length checker to formatter 2022-04-09 16:33:09 +07:00
Hans5958
9dc9450524 Smart quotes, why? Get lost. 2022-04-09 16:33:09 +07:00
Hans5958
69ecd7351f Move subreddit-formatted websites to subreddit 2022-04-09 16:33:09 +07:00
Hans5958
d5e7e23a33 Move user links to website, small logic fix 2022-04-08 12:15:21 +07:00
Hans5958
4b26b86e18 Forgot to change that 2022-04-08 12:15:21 +07:00
Hans5958
f4de2cbf7e Make validator better represent the JS script, commatization on spaces
Fun fact: The JS script also includes those that have no r/, but I made the validator warn about them so future contributors can confirm and change them into a proper format, or delete them.
2022-04-08 11:48:40 +07:00
Hans5958
1406cc5b7a Forgot to just use strip, also remove trailing comma on subs 2022-04-08 11:37:16 +07:00
Hans5958
cc497d5178 Add forgotten parts on the port, move some parts for better git diff 2022-04-08 11:21:16 +07:00
Hans5958
e9cd995dbd Add semicolons on subreddit formatter 2022-04-08 11:14:17 +07:00
Hans5958
121d8653a5 Update auth setup docs with more clarity
Also adapted from Nick

Co-authored-by: Nicolas Abram <abramlujan@gmail.com>
2022-04-08 11:09:35 +07:00
Hans5958
ad15cefb07 Remove zero width joiner before parsing 2022-04-08 11:03:18 +07:00
Hans5958
f04102f29c Port flair edit, small log change
Logic and port made/adapted from Nick's

Co-authored-by: Nicolas Abram <abramlujan@gmail.com>
2022-04-08 11:03:18 +07:00
Hans5958
4c4fb973f3 Fix trailing comma removal 2022-04-08 11:03:18 +07:00
Hans5958
33bda5f364 Expand error info 2022-04-08 11:03:17 +07:00
Hans5958
8913e781bf Forgot to silent on formatting 2022-04-08 11:03:17 +07:00
Hans5958
cd87520c87 Handle escaping escape characters 2022-04-08 11:03:17 +07:00
Hans5958
1da79b806b Take only the first object, remove escapes, add more info on errors 2022-04-08 11:03:17 +07:00
Hans5958
713ebec844 Somehow commatization gone 2022-04-08 11:03:16 +07:00
Hans5958
796bd09a15 Support some other symbols 2022-04-08 11:03:16 +07:00
Hans5958
a2e483f156 Better URL regex
by stephenhay from https://mathiasbynens.be/demo/url-regex
2022-04-08 11:03:16 +07:00
Hans5958
ad5312e0a3 Support accidental extra slashes 2022-04-08 11:03:16 +07:00
Hans5958
69d836fe4a Add more pseudo-empty strings 2022-04-08 11:03:16 +07:00
Hans5958
ceb8cb4a03 Improve CSTW and whoops 2022-04-08 11:03:15 +07:00
Hans5958
90f9b05092 Add validation 2022-04-08 11:03:15 +07:00
Hans5958
af2b9045d8 I mean, do this instead.
This assumes that it uses Markdown
2022-04-08 11:03:15 +07:00
Hans5958
f942dea58d Nope, don't do that! 2022-04-08 11:03:15 +07:00
Hans5958
4c5711d97d Add support(?) for users 2022-04-08 11:03:14 +07:00
Hans5958
02448d0fad Happy little accidents 3 2022-04-08 11:03:14 +07:00
Hans5958
e113b29939 Add CSTW, clarity fixes 2022-04-08 11:03:14 +07:00
Hans5958
7571a92fd9 Simplify try-catch, assert that path length > 0, separator on fail file
Assertion migrated from ARP

Co-authored-by: Cheng Hann Gan <chenghanngan.us@gmail.com>
2022-04-08 11:03:14 +07:00
Hans5958
2db0dcce6c Optimize and remove redundant code, make some tamer 2022-04-08 11:03:13 +07:00
Hans5958
ede4bc8208 Move things to make it more effective 2022-04-08 11:03:13 +07:00
Hans5958
4acefe3e6d Tiny refactor, remove redundant subreddit link on website
Resolves #707
2022-04-08 11:03:13 +07:00
Hans5958
7abe5cbed6 Happy little accidents 2, convert and on subreddit 2022-04-08 11:03:13 +07:00
Hans5958
b5aa228962 Adapt ARP's url fix
Co-authored-by: Cheng Hann Gan <chenghanngan.us@gmail.com>
2022-04-08 11:03:12 +07:00
Hans5958
3a86822175 Happy little accidents 2022-04-08 11:03:12 +07:00
Hans5958
73ef9e0f27 Remove raw scripts
Maybe yes maybe no eh
2022-04-08 11:03:12 +07:00
Hans5958
5a660759bf Improve and merge scripts, use JSON instead of regex 2022-04-08 11:03:12 +07:00
unknown
93e80dc7f9 New submissions, flair editing script 2022-04-07 17:00:30 -03:00
unknown
c7f4b927f5 Fix missing id in entry 2022-04-06 17:38:25 -03:00
unknown
7ad57eaef9 Make redditcrawl create a valid json for atlas_temp 2022-04-06 16:47:46 -03:00
Stefano
dc9f184e86 Merge branch 'master' into bulk-format-220406 2022-04-06 01:11:31 -07:00
Hans5958
a097844861 Add and improve scripts 2022-04-06 15:08:26 +07:00
unknown
f03a825c5d Fix crawler 2022-04-06 04:48:39 -03:00
Stefano
decde2cf02 Merge pull request #606 from nico-abram/counter
Fix Success counter
2022-04-05 23:23:20 -07:00
unknown
c726458e07 Fix validation script 2022-04-06 01:47:57 -03:00
unknown
443c210299 Fix Success counter 2022-04-05 22:33:57 -03:00
unknown
35672fdb47 Fix reddit crawl newline and quote handling 2022-04-05 19:00:31 -03:00
ash
2848f93f47 Merge branch 'master' into master 2022-04-05 22:38:13 +01:00
Nicolas Abram
2f45690040 Fix reddit crawling file encoding 2022-04-05 18:19:49 -03:00
Stefano Haagman
a820417402 Merge branch 'cleanup' 2022-04-05 22:22:43 +02:00