0
0
Fork 0
mirror of https://github.com/go-gitea/gitea synced 2025-01-14 00:15:32 +01:00
gitea/modules
Bruno Sofiato 900ac62251
Allow code search by filename (#32210)
This is a large and complex PR, so let me explain in detail its changes.

First, I had to create new index mappings for Bleve and ElasticSerach as
the current ones do not support search by filename. This requires Gitea
to recreate the code search indexes (I do not know if this is a breaking
change, but I feel it deserves a heads-up).

I've used [this
approach](https://www.elastic.co/guide/en/elasticsearch/reference/7.17/analysis-pathhierarchy-tokenizer.html)
to model the filename index. It allows us to efficiently search for both
the full path and the name of a file. Bleve, however, does not support
this out-of-box, so I had to code a brand new [token
filter](https://blevesearch.com/docs/Token-Filters/) to generate the
search terms.

I also did an overhaul in the `indexer_test.go` file. It now asserts the
order of the expected results (this is important since matches based on
the name of a file are more relevant than those based on its content).
I've added new test scenarios that deal with searching by filename. They
use a new repo included in the Gitea fixture.

The screenshot below depicts how Gitea shows the search results. It
shows results based on content in the same way as the current version
does. In matches based on the filename, the first seven lines of the
file contents are shown (BTW, this is how GitHub does it).


![image](https://github.com/user-attachments/assets/9d938d86-1a8d-4f89-8644-1921a473e858)

Resolves #32096

---------

Signed-off-by: Bruno Sofiato <bruno.sofiato@gmail.com>
2024-10-11 23:35:04 +00:00
..
actions Fix wrong status of Set up Job when first step is skipped (#32120) 2024-09-24 18:34:08 +00:00
activitypub Remove SHA1 for support for ssh rsa signing (#31857) 2024-09-07 18:05:18 -04:00
analyze
assetfs
auth
avatar
badge
base
cache bump to go 1.23 (#31855) 2024-09-10 02:23:07 +00:00
charset
container
csv
dump
emoji
eventsource
generate
git update git book link to v2 (#32221) 2024-10-09 13:04:34 +08:00
gitgraph
gitrepo
globallock
graceful
hcaptcha
highlight
hostmatcher Support allowed hosts for migrations to work with proxy (#32025) 2024-09-11 05:47:00 +00:00
html
httpcache Fix wrong last modify time (#32102) 2024-09-21 21:56:25 +00:00
httplib Fix wrong last modify time (#32102) 2024-09-21 21:56:25 +00:00
indexer Allow code search by filename (#32210) 2024-10-11 23:35:04 +00:00
issue/template bump to go 1.23 (#31855) 2024-09-10 02:23:07 +00:00
json
label
lfs
lfstransfer Add pure SSH LFS support (#31516) 2024-09-27 10:27:37 -04:00
log
markup Use camo.Always instead of camo.Allways (#32097) 2024-09-21 12:50:54 +03:00
mcaptcha
metrics
migration Support migration from AWS CodeCommit (#31981) 2024-09-11 07:49:42 +08:00
nosql
optional
options
packages Add bin to Composer Metadata (#32099) 2024-09-21 22:42:17 +00:00
paginator
pprof
private
process
proxy
proxyprotocol
public
queue bump to go 1.23 (#31855) 2024-09-10 02:23:07 +00:00
recaptcha
references
regexplru
repository Support repo license (#24872) 2024-10-01 15:25:08 -04:00
secret
session
setting Enhance USER_DISABLED_FEATURES to allow disabling change username or full name (#31959) 2024-10-05 20:41:38 +00:00
sitemap
ssh
storage bump to go 1.23 (#31855) 2024-09-10 02:23:07 +00:00
structs Support repo license (#24872) 2024-10-01 15:25:08 -04:00
svg
sync
system
templates Lazy load avatar images (#32051) 2024-09-17 19:02:48 +00:00
test
testlogger
timeutil
translation
turnstile
typesniffer
updatechecker
uri
user
util
validation
web
webhook
zstd