微信公众号最佳实践 ( 9.7)智能问答,关键词回复,中文分词

本文主要是介绍微信公众号最佳实践 ( 9.7)智能问答,关键词回复,中文分词,希望对大家解决编程问题提供一定的参考价值,需要的开发者们随着小编来一起学习吧!

智能问答

前面我们使用的都是
  • 基于固定查询指令的回复,这样好处是内容格式统一,方便软件开发人员编写程序做出分析,回复精准的内容给用户
  • 但在生活中,人们问的内容很随意,甚至千差万别,这时,回复内容想要和用户的问题相匹配,就需要更智能的程序了

这里写图片描述

关键词回复

我们需先定一个数组,数组中键为关键词,值为对应的回复,当用户输入的文字能匹配到某个关键词时,则回复该关键词对应的内容,我们定义“电话”,“地址”,”微信”,三个关键词,将其作为数组中的键,然后定义相应的值,当用户发送的内容中包含相应的关键词名称时,则回复对应的内容。

代码如下所示:

 private function receiveText($object){$content = trim($object->Content);$keywords = array("电话" => "0755-87654321","地址" => "广东省深圳市南山区高新南一道飞亚达科技大厦(518057)","A型号手机" => "A产品报价 3000元,基本参数为:****,","B型号手机" => "A产品报价 4000元,基本参数为:****,","C型号手机" => "A产品报价 2000元,基本参数为:****,","微信" => "xxxxx");foreach ($keywords as $key=>$value){$contain = strchr($content, $key);if (!empty($contain)){$reply = $value;break;}}$result = $this->transmitText($object, $reply);return $result;}

中文分词

除了关键词回复之外,还可以使用中文分词来帮助我们实现智能回复。

中文分词指的是将一个汉字序列切分成一个个单独词。
分词就是将连续的字序列按照一定的规范重新组合成词序列的过程。
我们知道在英文的行文中,单词之间是以空格作为自然分隔符的,而中文只有字、句、和段,能通过简单的分界符来简单划分,唯独词没有一个形式上的分界符,虽然英文也同样存在短语的划分问题,不过在词这一层上,中文要比英文复杂的多,困难的多。
中文分词就是要解决句子中词的划分问题,然后可以提取其中的关键词语进行搜索。
打个比方:“深圳天气怎么样”,这句话可分词为“深圳”,“天气”,“怎么样”三部分,因为它们代表不同词语类别的意思。

新浪云SEA提供了中文分词服务seaSegment,上述句子进行中文分词的代码如下所示

<?php$str="深圳天气怎么样";$seg = new SaeSegment();$ret = $seg->segment($str, 1);print_r($ret);
?>

代码运行后,结果如下所示:

Array ( [0] => Array ( [word] => 深圳 [word_tag] => 102 [index] => 0 ) [1] => Array ( [word] => 天气 [word_tag] => 95 [index] => 1 ) [2] => Array ( [word] => 怎么样 [word_tag] => 40 [index] => 2 ) 
)

上述结果中,

  • word表示分词
  • word_tag表示词性(102代表地点名词,95代表名词)
  • index表示词在句子中的位置索引

我们可以把分词功能写成函数,并且将名词作为分类标准,地点作为关键词组合成数组。

相应代码如下:

<?php
function sinasegment($str)
{$seg = new SaeSegment();$ret = $seg->segment($str, 1);if ($ret === false){return;}$category = "";$keyword = "";foreach ($ret as $key => $value) {if ($value["word_tag"] == 95){$category = $value["word"];}if ($value["word_tag"] == 102){$keyword = $value["word"];}}if (!empty($category) && !empty($keyword)){return array('category'=>$category, 'keyword'=>$keyword); }else{return;}
}
?>

根据获取的功能类别及关键字,我们可以做出根据地点来查询其相应的自动回复,比如天气预报和空气质量,在前面的章节中,我们提到了将语音识别结果并入文本消息结果的查询,基于这些,我们可以开发出非常智能的**语音识别搜索**及**文本搜索**功能.

相应代码如下:

 private function receiveVoice($object){$funcFlag = 0;if (isset($object->Recognition) && !empty($object->Recognition)){$content = strval($object->Recognition);include("segment.php");$result = sinasegment($content);if (is_array($result)){switch ($result['category']){case "天气":$url = "http://apix.sinaapp.com/weather/?appkey=trialuser&city=".urlencode($result['keyword']);$output = file_get_contents($url);$replyContent = json_decode($output, true);break;default:$replyContent = "还不支持这一功能:".$result['category'];break;}}else{$replyContent = "不能理解你的内容:".$content;}}else{$replyContent = "未开启语音识别功能";}if (is_array($replyContent)){$resultStr = $this->transmitNews($object, $replyContent);}else{$resultStr = $this->transmitText($object, $replyContent);}return $resultStr;}

segment.php

<?php
function sinasegment($str)
{$seg = new SaeSegment();$ret = $seg->segment($str, 1);if ($ret === false){return;}$category = "";$keyword = "";foreach ($ret as $key => $value) {if ($value["word_tag"] == 95){$category = $value["word"];}if ($value["word_tag"] == 102){$keyword = $value["word"];}}if (!empty($category) && !empty($keyword)){return array('category'=>$category, 'keyword'=>$keyword); }else{return;}
}
?>

关键词index.php运行图片:

这里写图片描述

关键词回复 index.php 整体代码如下:

<?php
/*CopyRight 2018 All Rights Reserved
*/define("TOKEN", "weixin");$wechatObj = new wechatCallbackapiTest();
if (!isset($_GET['echostr'])) {$wechatObj->responseMsg();
}else{$wechatObj->valid();
}class wechatCallbackapiTest
{public function valid(){$echoStr = $_GET["echostr"];if($this->checkSignature()){echo $echoStr;exit;}}private function checkSignature(){$signature = $_GET["signature"];$timestamp = $_GET["timestamp"];$nonce = $_GET["nonce"];$token = TOKEN;$tmpArr = array($token, $timestamp, $nonce);sort($tmpArr, SORT_STRING);$tmpStr = implode( $tmpArr );$tmpStr = sha1( $tmpStr );if( $tmpStr == $signature ){return true;}else{return false;}}public function responseMsg(){$postStr = $GLOBALS["HTTP_RAW_POST_DATA"];if (!empty($postStr)){$this->logger("R ".$postStr);$postObj = simplexml_load_string($postStr, 'SimpleXMLElement', LIBXML_NOCDATA);$RX_TYPE = trim($postObj->MsgType);switch ($RX_TYPE){case "event":$result = $this->receiveEvent($postObj);break;case "text":$result = $this->receiveText($postObj);break;default:$result = "other msg type: ".$RX_TYPE;break;}$this->logger("T ".$result);echo $result;}else {echo "";exit;}}private function receiveEvent($object){$content = "";switch ($object->Event){case "subscribe":$content = "欢迎关注 德强1012 ";$content .= isset($object->EventKey)?("\n来自二维码场景 ".$object->EventKey):"";break;case "unsubscribe":$content = "取消关注";break;}$result = $this->transmitText($object, $content);return $result;}private function receiveText($object){$content = trim($object->Content);$keywords = array("电话" => "0755-87654321","地址" => "广东省深圳市南山区高新南一道飞亚达科技大厦(518057)","A型号手机" => "A产品报价 3000元,基本参数为:****,","B型号手机" => "A产品报价 4000元,基本参数为:****,","C型号手机" => "A产品报价 2000元,基本参数为:****,","微信" => "xxxxx");foreach ($keywords as $key=>$value){$contain = strchr($content, $key);if (!empty($contain)){$reply = $value;break;}}$result = $this->transmitText($object, $reply);return $result;}private function transmitText($object, $content){$textTpl = "<xml><ToUserName><![CDATA[%s]]></ToUserName><FromUserName><![CDATA[%s]]></FromUserName><CreateTime>%s</CreateTime><MsgType><![CDATA[text]]></MsgType><Content><![CDATA[%s]]></Content></xml>";$result = sprintf($textTpl, $object->FromUserName, $object->ToUserName, time(), $content);return $result;}private function transmitNews($object, $arr_item){if(!is_array($arr_item))return;$itemTpl = "    <item><Title><![CDATA[%s]]></Title><Description><![CDATA[%s]]></Description><PicUrl><![CDATA[%s]]></PicUrl><Url><![CDATA[%s]]></Url></item>";$item_str = "";foreach ($arr_item as $item)$item_str .= sprintf($itemTpl, $item['Title'], $item['Description'], $item['PicUrl'], $item['Url']);$newsTpl = "<xml><ToUserName><![CDATA[%s]]></ToUserName><FromUserName><![CDATA[%s]]></FromUserName><CreateTime>%s</CreateTime><MsgType><![CDATA[news]]></MsgType><Content><![CDATA[]]></Content><ArticleCount>%s</ArticleCount><Articles>$item_str</Articles></xml>";$result = sprintf($newsTpl, $object->FromUserName, $object->ToUserName, time(), count($arr_item));return $result;}private function logger($log_content){}
}
?>

中文分词index.php运行图片

这里写图片描述
这里写图片描述

中文分词index.php整体代码如下:

说明:方倍工作室接口100,目前免费暂无
他们解释原因如下:

  • 接口100免费提供这些年,受到很多用户欢迎,每天访问量非常大,但由于极个别用户使用机器程序抓取数据以及故意使用DDOS恶意攻击,导致服务器不堪重负,维护费用成倍增长。我们决定关停这些接口,有需要的用户,可以联系QQ 1354386063,购买我们的接口源码,或者自行使用第三方收费接口。网页地址http://www.cnblogs.com/txw1958/p/weixin-api100.html

语音功能公众号显示开通了,不知道为啥没有用,检测中…..

<?phpdefine("TOKEN", "weixin");$wechatObj = new wechatCallbackapiTest();
if (!isset($_GET['echostr'])) {$wechatObj->responseMsg();
}else{$wechatObj->valid();
}class wechatCallbackapiTest
{public function valid(){$echoStr = $_GET["echostr"];if($this->checkSignature()){echo $echoStr;exit;}}private function checkSignature(){$signature = $_GET["signature"];$timestamp = $_GET["timestamp"];$nonce = $_GET["nonce"];$token = TOKEN;$tmpArr = array($token, $timestamp, $nonce);sort($tmpArr);$tmpStr = implode( $tmpArr );$tmpStr = sha1( $tmpStr );if( $tmpStr == $signature ){return true;}else{return false;}}public function responseMsg(){$postStr = $GLOBALS["HTTP_RAW_POST_DATA"];if (!empty($postStr)){$postObj = simplexml_load_string($postStr, 'SimpleXMLElement', LIBXML_NOCDATA);$RX_TYPE = trim($postObj->MsgType);switch ($RX_TYPE){case "text":$resultStr = $this->receiveText($postObj);break;case "event":$resultStr = $this->receiveEvent($postObj);break;case "voice":$resultStr = $this->receiveVoice($postObj);break;default:$resultStr = "";break;}echo $resultStr;}else {echo "";exit;}}private function receiveText($object){if (isset($object->Recognition)){$content = strval($object->Recognition);$mediaid = trim($object->MediaID);}else{$content = trim($object->Content);}if (isset($_SERVER['HTTP_APPNAME'])){include("segment.php");$result = sinasegment($content);if (is_array($result)){switch ($result['category']){case "天气"://$url = "http://api100.duapp.com/weather/?appkey=book97&city=".urlencode($result['keyword']);//$output = file_get_contents($url);//$replyContent = json_decode($output, true);//$replyContent = $result['keyword'];$city = $result['keyword'];include("weather2.php");$content = getWeatherInfo($city);$result = $this->transmitNews($object, $content);return $result;break;case "空气"://$url = "http://api100.duapp.com/airquality/?appkey=book97&city=".urlencode($result['keyword']);//$output = file_get_contents($url);//$replyContent = json_decode($output, true);$keyword =  $result['keyword'];include("airquality.php");$content = getAirQualityChina($keyword);if(is_array($content)){$result = $this->transmitNews($object, $content);}else{$result = $this->transmitText($object, $content);}return $result;default:$replyContent = "还不支持这一功能:".$result['category'];break;}}else{$replyContent = "不能理解你的内容:".$content;}}else{$replyContent = "只能运行在SAE环境!";}if (is_array($replyContent)){$resultStr = $this->transmitNews($object, $replyContent);}else{$resultStr = $this->transmitText($object, $replyContent);}return $resultStr;}private function receiveEvent($object){$replyContent = "";switch ($object->Event){case "subscribe":$replyContent = "欢迎关注 德强1012";case "unsubscribe":break;default:break;      }if (is_array($replyContent)){$resultStr = $this->transmitNews($object, $replyContent);}else{$resultStr = $this->transmitText($object, $replyContent);}return $resultStr;}private function receiveVoice($object){$funcFlag = 0;if (isset($object->Recognition) && !empty($object->Recognition)){$content = strval($object->Recognition);include("segment.php");$result = sinasegment($content);if (is_array($result)){switch ($result['category']){case "天气":// $url = "http://apix.sinaapp.com/weather/?appkey=trialuser&city=".urlencode($result['keyword']);//$output = file_get_contents($url);//$replyContent = json_decode($output, true);$city = $result['keyword'];include("weather2.php");$content = getWeatherInfo($city);$result = $this->transmitNews($object, $content);return $result;//$replyContent = $result['keyword'];break;default:$replyContent = "还不支持这一功能:".$result['category'];break;}}else{$replyContent = "不能理解你的内容:".$content;}}else{$replyContent = "未开启语音识别功能";}if (is_array($replyContent)){$resultStr = $this->transmitNews($object, $replyContent);}else{$resultStr = $this->transmitText($object, $replyContent);}return $resultStr;}private function transmitText($object, $content, $flag = 0){$textTpl = "<xml><ToUserName><![CDATA[%s]]></ToUserName><FromUserName><![CDATA[%s]]></FromUserName><CreateTime>%s</CreateTime><MsgType><![CDATA[text]]></MsgType><Content><![CDATA[%s]]></Content><FuncFlag>%d</FuncFlag></xml>";$resultStr = sprintf($textTpl, $object->FromUserName, $object->ToUserName, time(), $content, $flag);return $resultStr;}private function transmitNews($object, $arr_item){if(!is_array($arr_item))return;$itemTpl = "    <item><Title><![CDATA[%s]]></Title><Description><![CDATA[%s]]></Description><PicUrl><![CDATA[%s]]></PicUrl><Url><![CDATA[%s]]></Url></item>";$item_str = "";foreach ($arr_item as $item)$item_str .= sprintf($itemTpl, $item['Title'], $item['Description'], $item['PicUrl'], $item['Url']);$newsTpl = "<xml><ToUserName><![CDATA[%s]]></ToUserName><FromUserName><![CDATA[%s]]></FromUserName><CreateTime>%s</CreateTime><MsgType><![CDATA[news]]></MsgType><Content><![CDATA[]]></Content><ArticleCount>%s</ArticleCount><Articles>$item_str</Articles></xml>";$result = sprintf($newsTpl, $object->FromUserName, $object->ToUserName, time(), count($arr_item));return $result;}
}
?>

空气质量airquality.php整体代码如下:

<?phpfunction getAirQualityChina($city)
{//http://www.pm25.in/api/querys/aqi_details.json?stations=no&city=%E6%B7%B1%E5%9C%B3&token=5j1znBVAsnSf5xQyNQyq$url = "http://www.pm25.in/api/querys/aqi_details.json?stations=no&city=".urlencode($city)."&token=5j1znBVAsnSf5xQyNQyq";$ch = curl_init();curl_setopt($ch, CURLOPT_URL, $url);curl_setopt($ch, CURLOPT_RETURNTRANSFER, 1);$output = curl_exec($ch);curl_close($ch);$cityAir = json_decode($output, true);if (isset($cityAir['error'])){return $cityAir['error'];}else{$result = "空气质量指数(AQI):".$cityAir[0]['aqi']."\n"."空气质量等级:".$cityAir[0]['quality']."\n"."细颗粒物(PM2.5):".$cityAir[0]['pm2_5']."\n"."可吸入颗粒物(PM10):".$cityAir[0]['pm10']."\n"."一氧化碳(CO):".$cityAir[0]['co']."\n"."二氧化氮(NO2):".$cityAir[0]['no2']."\n"."二氧化硫(SO2):".$cityAir[0]['so2']."\n"."臭氧(O3):".$cityAir[0]['o3']."\n"."更新时间:".preg_replace("/([a-zA-Z])/i", " ", $cityAir[0]['time_point']);$aqiArray = array(); $aqiArray[] = array("Title" =>$cityAir[0]['area']."空气质量", "Description" =>$result, "PicUrl" =>"", "Url" =>"");return $aqiArray;}
}?>

天气查询weather2.php整体代码如下:

<?php//var_dump(getWeatherInfo("桃江"));function getWeatherInfo($cityName)
{if ($cityName == "" || (strstr($cityName, "+"))){return "发送天气加城市,例如'天气深圳'";}//1a3cde429f38434f1811a75e1a90310c$ak = '1a3cde429f38434f1811a75e1a90310c';$sk = '';$url = 'http://api.map.baidu.com/telematics/v3/weather?ak=%s&location=%s&output=%s&sn=%s';$uri = '/telematics/v3/weather';$location = $cityName;$output = 'json';$querystring_arrays = array('ak' => $ak,'location' => $location,'output' => $output);$querystring = http_build_query($querystring_arrays);$sn = md5(urlencode($uri.'?'.$querystring.$sk));$targetUrl = sprintf($url, $ak, urlencode($location), $output, $sn);// var_dump($targetUrl);$ch = curl_init();curl_setopt($ch, CURLOPT_URL, $targetUrl);curl_setopt($ch, CURLOPT_RETURNTRANSFER, 1);$result = curl_exec($ch);curl_close($ch);$result = json_decode($result, true);if ($result["error"] != 0){return $result["status"];}$curHour = (int)date('H',time());$weather = $result["results"][0];$weatherArray[] = array("Title" =>$weather['currentCity']."天气预报", "Description" =>"", "PicUrl" =>"", "Url" =>"");for ($i = 0; $i < count($weather["weather_data"]); $i++) {$weatherArray[] = array("Title"=>$weather["weather_data"][$i]["date"]."\n".$weather["weather_data"][$i]["weather"]." ".$weather["weather_data"][$i]["wind"]." ".$weather["weather_data"][$i]["temperature"],"Description"=>"", "PicUrl"=>(($curHour >= 6) && ($curHour < 18))?$weather["weather_data"][$i]["dayPictureUrl"]:$weather["weather_data"][$i]["nightPictureUrl"], "Url"=>"");}return $weatherArray;
}?>

分词功能函数segment.php整体代码如下:

<?php// $str="深圳天气怎么样";
//$result = sinasegment($str);  
// print_r($result) ;
function sinasegment($str)
{$seg = new SaeSegment();$ret = $seg->segment($str, 1);if ($ret === false){return;}$category = "";$keyword = "";foreach ($ret as $key => $value) {if ($value["word_tag"] == 95){$category = $value["word"];}if ($value["word_tag"] == 102){$keyword = $value["word"];}}if (!empty($category) && !empty($keyword)){return array('category'=>$category, 'keyword'=>$keyword); }else{return;}
}?>

这篇关于微信公众号最佳实践 ( 9.7)智能问答,关键词回复,中文分词的文章就介绍到这儿,希望我们推荐的文章对编程师们有所帮助!



http://www.chinasem.cn/article/289510

相关文章

全面掌握 SQL 中的 DATEDIFF函数及用法最佳实践

《全面掌握SQL中的DATEDIFF函数及用法最佳实践》本文解析DATEDIFF在不同数据库中的差异,强调其边界计算原理,探讨应用场景及陷阱,推荐根据需求选择TIMESTAMPDIFF或inte... 目录1. 核心概念:DATEDIFF 究竟在计算什么?2. 主流数据库中的 DATEDIFF 实现2.1

Python办公自动化实战之打造智能邮件发送工具

《Python办公自动化实战之打造智能邮件发送工具》在数字化办公场景中,邮件自动化是提升工作效率的关键技能,本文将演示如何使用Python的smtplib和email库构建一个支持图文混排,多附件,多... 目录前言一、基础配置:搭建邮件发送框架1.1 邮箱服务准备1.2 核心库导入1.3 基础发送函数二、

Spring WebFlux 与 WebClient 使用指南及最佳实践

《SpringWebFlux与WebClient使用指南及最佳实践》WebClient是SpringWebFlux模块提供的非阻塞、响应式HTTP客户端,基于ProjectReactor实现,... 目录Spring WebFlux 与 WebClient 使用指南1. WebClient 概述2. 核心依

MyBatis-Plus 中 nested() 与 and() 方法详解(最佳实践场景)

《MyBatis-Plus中nested()与and()方法详解(最佳实践场景)》在MyBatis-Plus的条件构造器中,nested()和and()都是用于构建复杂查询条件的关键方法,但... 目录MyBATis-Plus 中nested()与and()方法详解一、核心区别对比二、方法详解1.and()

Spring Boot @RestControllerAdvice全局异常处理最佳实践

《SpringBoot@RestControllerAdvice全局异常处理最佳实践》本文详解SpringBoot中通过@RestControllerAdvice实现全局异常处理,强调代码复用、统... 目录前言一、为什么要使用全局异常处理?二、核心注解解析1. @RestControllerAdvice2

Spring事务传播机制最佳实践

《Spring事务传播机制最佳实践》Spring的事务传播机制为我们提供了优雅的解决方案,本文将带您深入理解这一机制,掌握不同场景下的最佳实践,感兴趣的朋友一起看看吧... 目录1. 什么是事务传播行为2. Spring支持的七种事务传播行为2.1 REQUIRED(默认)2.2 SUPPORTS2

Java中的雪花算法Snowflake解析与实践技巧

《Java中的雪花算法Snowflake解析与实践技巧》本文解析了雪花算法的原理、Java实现及生产实践,涵盖ID结构、位运算技巧、时钟回拨处理、WorkerId分配等关键点,并探讨了百度UidGen... 目录一、雪花算法核心原理1.1 算法起源1.2 ID结构详解1.3 核心特性二、Java实现解析2.

Redis出现中文乱码的问题及解决

《Redis出现中文乱码的问题及解决》:本文主要介绍Redis出现中文乱码的问题及解决,具有很好的参考价值,希望对大家有所帮助,如有错误或未考虑完全的地方,望不吝赐教... 目录1. 问题的产生2China编程. 问题的解决redihttp://www.chinasem.cns数据进制问题的解决中文乱码问题解决总结

MySQL 中 ROW_NUMBER() 函数最佳实践

《MySQL中ROW_NUMBER()函数最佳实践》MySQL中ROW_NUMBER()函数,作为窗口函数为每行分配唯一连续序号,区别于RANK()和DENSE_RANK(),特别适合分页、去重... 目录mysql 中 ROW_NUMBER() 函数详解一、基础语法二、核心特点三、典型应用场景1. 数据分

java向微信服务号发送消息的完整步骤实例

《java向微信服务号发送消息的完整步骤实例》:本文主要介绍java向微信服务号发送消息的相关资料,包括申请测试号获取appID/appsecret、关注公众号获取openID、配置消息模板及代码... 目录步骤1. 申请测试系统2. 公众号账号信息3. 关注测试号二维码4. 消息模板接口5. Java测试