一种无线传感网络占空比自适应调整方法,申请号CN201510227130.3-传众专利搜索

发明名称	一种无线传感网络占空比自适应调整方法
摘要	本发明提出了一种无线传感网络占空比自适应调整方法。首先，针对自适应占空比调整方法中，线性调整占空比无法实现占空比与流量快速最佳匹配的问题，本发明将采用基于强化学的预测方法，直接预测得出与节点流量相匹配的最佳占空比。其次，为了尽量减少簇头节点和普通节点间的通信，本发明将强化学方法用于普通节点，由普通节点直接根据自身的流量情况预测得出下一周期的最佳占空比。
申请公布号	CN104955107B	申请公布日期	2017.02.22
申请号	CN201510227130.3	申请日期	2015.05.06
申请人	南昌大学;北京邮电大学	发明人	肖霖;张天魁;王松;杨鼎成;罗欢
分类号	H04W28/18(2009.01)I;H04W52/02(2009.01)I;H04W56/00(2009.01)I	主分类号	H04W28/18(2009.01)I
代理机构	北京卓特专利代理事务所(普通合伙) 11572	代理人	托娅
主权项	一种无线传感网络占空比自适应调整方法，其特征在于，通过网络分簇算法将网络中各局部的节点分簇，使得整个网络形成由一个或者一个以上簇组成的树形拓扑，所述网络中的各簇是一个周边局域网络；其中，所述占空比自适应调整方法按照周期性方式进行，所述各簇内节点的信标间隔周期长度固定；每个周边局域网络子网由一个作为簇头节点的网络协调器负责管理，各簇之间的数据流量由各簇的簇头节点负责传递；各簇内其他传感器节点为普通节点，所述普通节点的一个周期分为预测、同步、数据和睡眠四个阶段，其中所述预测和同步阶段的时间长度固定，所述数据和睡眠阶段的长度可调整；所述普通节点在第<i>k</i>个周期内的上述预测、同步、数据和睡眠四个阶段，具体为：在预测阶段，普通节点<img file="302313dest_path_image001.GIF" wi="14" he="18" />采用基于Q‑learning算法的预测方法预测得到数据阶段的最佳发送数据包个数<img file="321085dest_path_image002.GIF" wi="18" he="27" />；在同步阶段，普通节点接收簇头节点广播的信标帧以更新本地时间，保持同步；在数据阶段，普通节点采用CSMA/CA方式竞争接入信道与簇头节点通信，各个普通节点与簇头节点间的通信时间分别由各自在预测阶段得到的最佳发送数据包的个数决定；在睡眠阶段，普通节点进入睡眠状态以降低功耗；所述簇头节点还用于收集对应簇内普通节点采集的数据并上传网络，同时维持网内节点同步；其中所述基于Q‑learning算法的预测方法包括：节点簇对应强化学习中的环境，节点对应能够感知节点簇状态的智能代理，节点与节点簇之间的相互作用可通过如下的三元组来表征：<img file="468032dest_path_image003.GIF" wi="179" he="36" />其中<img file="230452dest_path_image004.GIF" wi="20" he="35" />表示节点<img file="880876dest_path_image005.GIF" wi="17" he="21" />当前所处的状态，即节点<img file="804970dest_path_image006.GIF" wi="14" he="15" />在第<i>k</i>个周期内，节点<img file="439213dest_path_image007.GIF" wi="18" he="22" />缓存中所存储的数据包的个数，表征节点<img file="739745dest_path_image008.GIF" wi="18" he="19" />当前的流量情况，节点<img file="775834dest_path_image009.GIF" wi="17" he="20" />所处的所有可能的状态构成了一个离散状态集；<img file="870829dest_path_image010.GIF" wi="22" he="30" />表示节点<img file="992369dest_path_image011.GIF" wi="15" he="19" />在当前状态<img file="96591dest_path_image012.GIF" wi="20" he="38" />根据某种策略<img file="721607dest_path_image013.GIF" wi="69" he="25" />所采取的动作，即在第<i>k</i>个周期内，节点<img file="253083dest_path_image014.GIF" wi="16" he="25" />发送的数据包的个数，节点<img file="829295dest_path_image014.GIF" wi="16" he="25" />在当前状态<img file="737208dest_path_image015.GIF" wi="20" he="33" />可采取的所有可能的动作构成了一个离散动作集；<img file="482310dest_path_image016.GIF" wi="22" he="31" />表示对于节点<img file="919108dest_path_image017.GIF" wi="19" he="21" />在当前状态<img file="749661dest_path_image018.GIF" wi="22" he="35" />下根据某种策略<img file="195686dest_path_image019.GIF" wi="16" he="15" />，采取动作<img file="795294dest_path_image020.GIF" wi="22" he="35" />后收到节点簇所给出的立即损耗反馈；<img file="668572dest_path_image021.GIF" wi="26" he="34" />具体定义如下：<img file="986421dest_path_image022.GIF" wi="337" he="39" />其中<img file="970558dest_path_image023.GIF" wi="58" he="33" />、<img file="424673dest_path_image024.GIF" wi="57" he="35" />、<img file="468852dest_path_image025.GIF" wi="54" he="31" />和<img file="273997dest_path_image026.GIF" wi="50" he="36" />分别指节点<img file="560360dest_path_image027.GIF" wi="20" he="30" />与簇头节点进行一定数目数据包通信时所需要的发送能量、接收能量、空闲侦听能量和端到端时延，<img file="134561dest_path_image028.GIF" wi="17" he="20" />和<img file="349641dest_path_image029.GIF" wi="17" he="24" />分别指对能效和端到端时延赋予的权重；策略<img file="376503dest_path_image030.GIF" wi="15" he="25" />定义为状态<i>S</i>到动作<i>A</i>的映射<img file="968021dest_path_image031.GIF" wi="51" he="23" />，表示节点<img file="396729dest_path_image032.GIF" wi="15" he="21" />在状态<img file="517131dest_path_image033.GIF" wi="21" he="31" />下应当发送的数据包数目；对于某一给定的策略<img file="296869dest_path_image034.GIF" wi="67" he="23" />，节点<img file="426499dest_path_image035.GIF" wi="15" he="27" />在当前状态<img file="709712dest_path_image036.GIF" wi="19" he="35" />，根据策略<img file="532175dest_path_image037.GIF" wi="16" he="23" />采取动作<img file="533629dest_path_image038.GIF" wi="21" he="30" />到达状态<img file="201371dest_path_image039.GIF" wi="31" he="30" />后，所得到的总的折扣损耗的期望值如<img file="339091dest_path_image040.GIF" wi="22" he="24" />下式所示：<img file="830990dest_path_image041.GIF" wi="226" he="37" />其中是指折扣因子，其确定了延迟奖赏与立即奖赏的相对比例；为了得到最优策略<img file="319740dest_path_image042.GIF" wi="19" he="27" />，使节点<img file="791173dest_path_image043.GIF" wi="19" he="30" />在状态<img file="48979dest_path_image044.GIF" wi="23" he="29" />下，根据该最优策略<img file="213244dest_path_image045.GIF" wi="21" he="23" />采取动作，最终使得节点总的折扣损耗的期望值最小，其表示如下所示：<img file="923711dest_path_image046.GIF" wi="261" he="39" />其中所采用的最优策略<img file="198834dest_path_image047.GIF" wi="20" he="26" />是：<img file="576726dest_path_image048.GIF" wi="211" he="41" />所述普通节点根据所述基于Q‑learning算法的预测方法预测得到第<i>k</i>个周期内节点<img file="380734dest_path_image049.GIF" wi="19" he="21" />的最佳发送数据包个数<img file="844076dest_path_image050.GIF" wi="22" he="31" />；所述普通节点根据前述<img file="657312dest_path_image051.GIF" wi="20" he="29" />利用如下公式得到超帧指数SO：<img file="889710dest_path_image052.GIF" wi="334" he="69" />其中，<img file="130198dest_path_image053.GIF" wi="24" he="32" />是每个数据包的传输时间，η是吞吐量限制因子，<img file="80837dest_path_image054.GIF" wi="15" he="23" />和<img file="697763dest_path_image055.GIF" wi="36" he="29" />分别是ACK数据包的等待时间和传输时间，<img file="283203dest_path_image056.GIF" wi="32" he="27" />是信标的传输时间；所述普通节点根据前述超帧指数SO自适应调整占空比DC值，其中占空比DC定义为信标间隔周期中活跃期长度SD与当前整个信标间隔周期长度BI的比值，即，<img file="960172dest_path_image057.GIF" wi="111" he="42" />，其中，<img file="132527dest_path_image058.GIF" wi="56" he="23" />，<img file="287565dest_path_image059.GIF" wi="57" he="30" />，SO和BO分别是超帧指数和信标指数；所述普通节点根据前述占空比DC值调整数据阶段时间长度，在此时间内可以向簇头节点发送<i>f</i>个数据包。
地址	330031 江西省南昌市红谷滩新区学府大道999号