MySQL fulltext index检索中文有哪些注意事项
本篇内容介绍了“MySQL fulltext index检索中文有哪些注意事项”的有关知识,在实际案例的操作过程中,不少人都会遇到这样的困境,接下来就让小编带领大家学习一下如何处理这些情况吧!希望大家仔细阅读,能够学有所成!
1、查看表结构
mysql> show create table product_test02;
| Table | Create Table
| product_test02 | CREATE TABLE `product_test02` (
`product_id` varchar(40) CHARACTER SET utf8 COLLATE utf8_bin NOT NULL,
`artisan_id` varchar(40) CHARACTER SET utf8 COLLATE utf8_bin DEFAULT NULL,
`name` varchar(60) CHARACTER SET utf8 COLLATE utf8_bin NOT NULL DEFAULT '',
`des` varchar(1000) CHARACTER SET utf8mb4 COLLATE utf8mb4_bin NOT NULL DEFAULT '',
`zhima_price` double(11,2) DEFAULT NULL,
`market_price` double(11,2) DEFAULT NULL,
`cover_pic` varchar(100) CHARACTER SET utf8 COLLATE utf8_bin DEFAULT NULL,
`work_time` int(11) DEFAULT NULL,
`comment_count` int(11) DEFAULT NULL,
`like_count` int(11) DEFAULT NULL,
`produt_consist` varchar(255) CHARACTER SET utf8 COLLATE utf8_bin DEFAULT NULL,
`keep_time` int(11) DEFAULT NULL,
`create_at` timestamp NULL DEFAULT NULL,
`fav_count` int(11) DEFAULT NULL,
`width` int(11) DEFAULT NULL,
`height` int(11) DEFAULT NULL,
`is_publish` int(11) DEFAULT NULL,
`is_top` int(11) DEFAULT NULL,
`is_delete` int(11) DEFAULT NULL,
`sell_amount` int(11) DEFAULT '0',
`free_service_time` int(11) DEFAULT NULL,
`update_time` timestamp NULL DEFAULT NULL,
`other_1` varchar(255) CHARACTER SET utf8 COLLATE utf8_bin DEFAULT NULL,
`other_2` varchar(255) CHARACTER SET utf8 COLLATE utf8_bin DEFAULT NULL,
`other_3` varchar(255) CHARACTER SET utf8 COLLATE utf8_bin DEFAULT NULL,
`other_4` varchar(255) CHARACTER SET utf8 COLLATE utf8_bin DEFAULT NULL,
`other_5` varchar(255) CHARACTER SET utf8 COLLATE utf8_bin DEFAULT NULL,
`is_audit` tinyint(1) DEFAULT '0',
`audit_time` timestamp NULL DEFAULT NULL,
`is_undercarriage` tinyint(1) DEFAULT '0',
`undercarriage_time` timestamp NULL DEFAULT NULL,
`category` varchar(30) CHARACTER SET utf8 COLLATE utf8_bin NOT NULL DEFAULT '',
`active_tag_dict_id` bigint(20) DEFAULT NULL,
`active_price` double(11,1) DEFAULT NULL,
`weight` int(11) unsigned DEFAULT '0',
`fit_people` varchar(300) CHARACTER SET utf8 COLLATE utf8_bin DEFAULT NULL,
`matter_attent` varchar(800) CHARACTER SET utf8 COLLATE utf8_bin DEFAULT NULL,
`category_lv2_id` int(11) DEFAULT NULL,
`artisan_visit` int(1) DEFAULT '1',
`customer_visit` int(1) DEFAULT '0',
`customer_zhima_price` double(11,2) DEFAULT NULL,
`customer_market_price` double(11,2) DEFAULT NULL,
`service_sex` int(1) DEFAULT '0',
`service_mode` tinyint(2) DEFAULT '0' ,
`last_update_time` datetime DEFAULT CURRENT_TIMESTAMP ON UPDATE CURRENT_TIMESTAMP,
`sell_price` double(11,2) NOT NULL DEFAULT '0.00',
`is_new` int(1) NOT NULL,
`spu_id` int(11) DEFAULT NULL,
`category_id` int(11) DEFAULT NULL,
`other_info` varchar(100) CHARACTER SET utf8 COLLATE utf8_bin NOT NULL DEFAULT '',
`product_type` int(2) NOT NULL DEFAULT '0' ,
`product_code` varchar(15) CHARACTER SET utf8 COLLATE utf8_bin DEFAULT NULL
) ENGINE=InnoDB DEFAULT CHARSET=utf8;
2、在name字段创建fulltext索引
mysql> alter table product_test02 add FULLTEXT index ft_indx_name (name) WITH PARSER ngram;
Query OK, 0 rows affected, 1 warning (3 min 45.93 sec)
3、利用fulltext index进行检索
mysql> select name from product_test02 where match(name) against ('头部' in boolean mode) limit 1;
+---------------------+
| name |
+---------------------+
| 头部按*摩+拨筋 |
+---------------------+
1 row in set (0.00 sec)
mysql> select name from product_test02 where match(name) against ('头' in boolean mode) limit 1;
Empty set (0.00 sec)
经查询:这个是数据库ft_min_word_len参数有关,默认为4,至少检索4个字符,被检索字符串长度小于4个字符将检索不到。
4、改参数ft_min_word_len = 1并重启实例
mysql> show variables like 'ft%';
+--------------------------+----------------+
| Variable_name | Value |
+--------------------------+----------------+
| ft_boolean_syntax | + -><()~*:""&| |
| ft_max_word_len | 84 |
| ft_min_word_len | 1 |
| ft_query_expansion_limit | 20 |
| ft_stopword_file | (built-in) |
+--------------------------+----------------+
5、再次查询
mysql> select name from product_test02 where match(name) against ('头' in boolean mode) limit 1;
Empty set (0.01 sec)
依然查询不到,原因是ft_min_word_len 参数改完之后,必须重建所有fulltext index
6、重建fulltext index并检索
mysql> select name from product_test02 where match(name) against ('头部' in boolean mode) limit 1;
Empty set (0.00 sec)
mysql> select name from product_test02 where match(name) against ('3' in boolean mode) limit 1;
Empty set (0.00 sec)
经查询,ngram_token_size=2 #用中文检索分词插件ngram之前,先得在MySQL配置文件里面设置他的分词大小
7、更改参数ngram_token_size=1,并重启实例
mysql> show variables like 'ng%';
+------------------+-------+
| Variable_name | Value |
+------------------+-------+
| ngram_token_size | 1 |
+------------------+-------+
1 row in set (0.01 sec)
mysql> select name from product_test02 where match(name) against ('头部' in boolean mode) limit 1;
+---------------------------------------------+
| name |
+---------------------------------------------+
| 【头疼必拍】头部舒压+经络疏通 |
+---------------------------------------------+
1 row in set (0.01 sec)
mysql> select name from product_test02 where match(name) against ('头' in boolean mode) limit 1;
+--------------------------------------------------+
| name |
+--------------------------------------------------+
| 头部拨筋头晕头痛失眠【头部调理】 |
+--------------------------------------------------+
1 row in set (0.01 sec)
可以正常检索。
“MySQL fulltext index检索中文有哪些注意事项”的内容就介绍到这里了,感谢大家的阅读。如果想了解更多行业相关的知识可以关注亿速云网站,小编将为大家输出更多高质量的实用文章!
免责声明:
① 本站未注明“稿件来源”的信息均来自网络整理。其文字、图片和音视频稿件的所属权归原作者所有。本站收集整理出于非商业性的教育和科研之目的,并不意味着本站赞同其观点或证实其内容的真实性。仅作为临时的测试数据,供内部测试之用。本站并未授权任何人以任何方式主动获取本站任何信息。
② 本站未注明“稿件来源”的临时测试数据将在测试完成后最终做删除处理。有问题或投稿请发送至: 邮箱/279061341@qq.com QQ/279061341