Skip to content
GitLab
Explore
Sign in
Register
Primary navigation
Search or go to…
Project
P
postgres-lambda-diff
Manage
Activity
Members
Labels
Plan
Issues
Issue boards
Milestones
Wiki
Code
Merge requests
Repository
Branches
Commits
Tags
Repository graph
Compare revisions
Snippets
Build
Pipelines
Jobs
Pipeline schedules
Artifacts
Deploy
Releases
Container Registry
Model registry
Operate
Environments
Monitor
Incidents
Analyze
Value stream analytics
Contributor analytics
CI/CD analytics
Repository analytics
Model experiments
Help
Help
Support
GitLab documentation
Compare GitLab plans
Community forum
Contribute to GitLab
Provide feedback
Keyboard shortcuts
?
Snippets
Groups
Projects
Show more breadcrumbs
Jakob Huber
postgres-lambda-diff
Commits
7953fdcd
Commit
7953fdcd
authored
17 years ago
by
Tom Lane
Browse files
Options
Downloads
Patches
Plain Diff
Add a CaseSensitive parameter to synonym dictionaries.
Simon Riggs
parent
2fc27954
No related branches found
No related tags found
No related merge requests found
Changes
2
Hide whitespace changes
Inline
Side-by-side
Showing
2 changed files
doc/src/sgml/textsearch.sgml
+12
-4
12 additions, 4 deletions
doc/src/sgml/textsearch.sgml
src/backend/tsearch/dict_synonym.c
+22
-4
22 additions, 4 deletions
src/backend/tsearch/dict_synonym.c
with
34 additions
and
8 deletions
doc/src/sgml/textsearch.sgml
+
12
−
4
View file @
7953fdcd
<!-- $PostgreSQL: pgsql/doc/src/sgml/textsearch.sgml,v 1.4
1
2008/03/0
4
03:
17:18 momjian
Exp $ -->
<!-- $PostgreSQL: pgsql/doc/src/sgml/textsearch.sgml,v 1.4
2
2008/03/
1
0 03:
01:28 tgl
Exp $ -->
<chapter id="textsearch">
<title id="textsearch-title">Full Text Search</title>
...
...
@@ -2209,7 +2209,8 @@ SELECT ts_lexize('public.simple_dict','The');
dictionary can be used to overcome linguistic problems, for example, to
prevent an English stemmer dictionary from reducing the word 'Paris' to
'pari'. It is enough to have a <literal>Paris paris</literal> line in the
synonym dictionary and put it before the <literal>english_stem</> dictionary:
synonym dictionary and put it before the <literal>english_stem</>
dictionary. For example:
<programlisting>
SELECT * FROM ts_debug('english', 'Paris');
...
...
@@ -2242,10 +2243,17 @@ SELECT * FROM ts_debug('english', 'Paris');
<productname>PostgreSQL</> installation's shared-data directory).
The file format is just one line
per word to be substituted, with the word followed by its synonym,
separated by white space. Blank lines and trailing spaces are ignored,
and upper case is folded to lower case.
separated by white space. Blank lines and trailing spaces are ignored.
</para>
<para>
The <literal>synonym</> template also has an optional parameter
<literal>CaseSensitive</>, which defaults to <literal>false</>. When
<literal>CaseSensitive</> is <literal>false</>, words in the synonym file
are folded to lower case, as are input tokens. When it is
<literal>true</>, words and tokens are not folded to lower case,
but are compared as-is.
</para>
</sect2>
<sect2 id="textsearch-thesaurus">
...
...
This diff is collapsed.
Click to expand it.
src/backend/tsearch/dict_synonym.c
+
22
−
4
View file @
7953fdcd
...
...
@@ -7,7 +7,7 @@
*
*
* IDENTIFICATION
* $PostgreSQL: pgsql/src/backend/tsearch/dict_synonym.c,v 1.
7
2008/0
1/01 19:45:52 momjian
Exp $
* $PostgreSQL: pgsql/src/backend/tsearch/dict_synonym.c,v 1.
8
2008/0
3/10 03:01:28 tgl
Exp $
*
*-------------------------------------------------------------------------
*/
...
...
@@ -30,6 +30,7 @@ typedef struct
{
int
len
;
/* length of syn array */
Syn
*
syn
;
bool
case_sensitive
;
}
DictSyn
;
/*
...
...
@@ -77,6 +78,7 @@ dsynonym_init(PG_FUNCTION_ARGS)
DictSyn
*
d
;
ListCell
*
l
;
char
*
filename
=
NULL
;
bool
case_sensitive
=
false
;
FILE
*
fin
;
char
*
starti
,
*
starto
,
...
...
@@ -90,6 +92,8 @@ dsynonym_init(PG_FUNCTION_ARGS)
if
(
pg_strcasecmp
(
"Synonyms"
,
defel
->
defname
)
==
0
)
filename
=
defGetString
(
defel
);
else
if
(
pg_strcasecmp
(
"CaseSensitive"
,
defel
->
defname
)
==
0
)
case_sensitive
=
defGetBoolean
(
defel
);
else
ereport
(
ERROR
,
(
errcode
(
ERRCODE_INVALID_PARAMETER_VALUE
),
...
...
@@ -154,8 +158,16 @@ dsynonym_init(PG_FUNCTION_ARGS)
}
}
d
->
syn
[
cur
].
in
=
lowerstr
(
starti
);
d
->
syn
[
cur
].
out
=
lowerstr
(
starto
);
if
(
case_sensitive
)
{
d
->
syn
[
cur
].
in
=
pstrdup
(
starti
);
d
->
syn
[
cur
].
out
=
pstrdup
(
starto
);
}
else
{
d
->
syn
[
cur
].
in
=
lowerstr
(
starti
);
d
->
syn
[
cur
].
out
=
lowerstr
(
starto
);
}
cur
++
;
...
...
@@ -168,6 +180,8 @@ skipline:
d
->
len
=
cur
;
qsort
(
d
->
syn
,
d
->
len
,
sizeof
(
Syn
),
compareSyn
);
d
->
case_sensitive
=
case_sensitive
;
PG_RETURN_POINTER
(
d
);
}
...
...
@@ -185,7 +199,11 @@ dsynonym_lexize(PG_FUNCTION_ARGS)
if
(
len
<=
0
||
d
->
len
<=
0
)
PG_RETURN_POINTER
(
NULL
);
key
.
in
=
lowerstr_with_len
(
in
,
len
);
if
(
d
->
case_sensitive
)
key
.
in
=
pnstrdup
(
in
,
len
);
else
key
.
in
=
lowerstr_with_len
(
in
,
len
);
key
.
out
=
NULL
;
found
=
(
Syn
*
)
bsearch
(
&
key
,
d
->
syn
,
d
->
len
,
sizeof
(
Syn
),
compareSyn
);
...
...
This diff is collapsed.
Click to expand it.
Preview
0%
Loading
Try again
or
attach a new file
.
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Save comment
Cancel
Please
register
or
sign in
to comment