JasonLe's TechBlog

Calling a C++ DLL from Python

January 28th, 2018 by JasonLe's Tech 2,542 views

update 2018-2-4

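A while ago I wrote an app with PyQt, and recently I needed to add a license verification module to it. That module is written in C++, and all I was given were the DLL, the .lib and the header. After some searching I found two approaches: load the DLL with ctypes, or write a Python extension (functions that take and return PyObject) and compile it into a .pyd module that Python can import.

Here I start with the first approach. The module I was given exports its functions as __stdcall, so I wrapped the DLL in another layer that re-exports them as __cdecl (ctypes.CDLL assumes the cdecl calling convention; WinDLL is its stdcall counterpart).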

#ifdef LICENSEDLL_EXPORTS
#define LICENSEDLL_API extern "C" __declspec(dllexport)
#else
#define LICENSEDLL_API extern "C" __declspec(dllimport)
#endif

LICENSEDLL_API bool  verify(char *module, char *errorMsg);
.....

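I then built this project into LicenseDLL.dll. The rest happens on the Python side: import ctypes, load LicenseDLL.dll with CDLL, and use dll.verify.argtypes to declare that verify takes two parameters. pinfo is a 100-byte array type, and an instance of it (ErrorInfo) is passed to verify, which writes the error message into it before returning. The buffer contents are then converted to a QString for display, which completes the call into the DLL.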

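import os
from ctypes import CDLL, c_bool, c_char, c_char_p

def CheckLicenseInfo():
    # Load the __cdecl wrapper DLL that sits next to the application.
    dll = CDLL(os.path.join(app.g_pwd, 'LicenseDLL.dll'))
    dll.verify.argtypes = [c_char_p, c_char_p]
    dll.verify.restype = c_bool  # ctypes assumes an int return type by default
    # 100-byte writable buffer that verify() fills with the error message.
    pinfo = c_char * 100
    ErrorInfo = pinfo()

    if not dll.verify(b'DESIGNAPP', ErrorInfo):
        # .value stops at the first NUL byte; .raw would include the padding.
        QMessageBox.critical(None, 'Error', QString(str(ErrorInfo.value)))
        exit()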
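The 100-byte buffer can be read back in two ways, and the difference matters when the text is shown to a user. The sketch below is a stand-in that needs no DLL: it uses memmove to simulate a native function writing a message into the buffer. The message text is made up for illustration.

```python
from ctypes import c_char, memmove

# Same buffer type as in CheckLicenseInfo: 100 zero-initialised bytes.
pinfo = c_char * 100
error_info = pinfo()

# Stand-in for the DLL call: copy a made-up message into the buffer.
message = b"license expired"
memmove(error_info, message, len(message))

raw_bytes = error_info.raw      # all 100 bytes, trailing NULs included
text_bytes = error_info.value   # only the bytes before the first NUL

print(len(raw_bytes))    # 100
print(len(text_bytes))   # 15
```

For display purposes .value is usually what you want, since .raw drags the unused NUL padding along with it.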

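The biggest problem with this approach is that loading a DLL directly exposes you to its whole dependency chain. I ran into Windows errors 126 and 127 along the way. Error 126 most likely means a dependent DLL is missing, which you can track down with Dependency Walker. Error 127 may be a problem with the DLL itself, such as a function symbol that was never exported. 127 is the trickier one, and it left me quite baffled at times.

Writing a Python extension is the other workable approach: it compiles into a .pyd that Python can import directly. Since Python is calling into a C++ module, this involves passing Python objects to C++ by value and by address, and passing C++ return values back to Python. Let's walk through the source step by step.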
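When a load fails, it helps to turn the bare error number into a hint. The following is a minimal sketch, not part of the original app: explain_load_error and load_dll are hypothetical helper names, and the hint texts paraphrase the standard meanings of Windows errors 126 (module not found) and 127 (procedure not found).

```python
import ctypes

# Hints for the two Windows loader error codes discussed above.
HINTS = {
    126: "module not found: the DLL or one of its dependencies is missing",
    127: "procedure not found: a DLL in the chain lacks a function that is imported from it",
}

def explain_load_error(winerror):
    """Return a readable hint for a Windows loader error code."""
    return HINTS.get(winerror, "unrecognised loader error %r" % (winerror,))

def load_dll(path):
    """Load a DLL with CDLL and re-raise failures with a hint attached."""
    try:
        return ctypes.CDLL(path)
    except OSError as exc:
        # The 'winerror' attribute only exists on Windows; elsewhere we get None.
        code = getattr(exc, "winerror", None)
        raise OSError("%s: %s" % (path, explain_load_error(code)))
```

Calling load_dll with the path to LicenseDLL.dll instead of CDLL directly keeps the rest of the code unchanged while making the failure message more useful.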

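To write the code, first bring in Python.h and Python.lib (the default installer does not ship the debug library python_d.lib), and of course the header of the library to be wrapped. I adapted the official SpamError example. The overall skeleton consists of PyMODINIT_FUNC initXXXXX(void), static PyMethodDef LicenseMethods[]={…}, and the usual wrapper functions of the form static PyObject *xxxxxxxxxx(PyObject *self, PyObject *args). PyMODINIT_FUNC initXXXXX(void) is a fixed format: replace XXXXX with the name of the module you are writing.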

#include "Python.h"
#include <string>
#include "LicenseProve.h"

static PyObject *LicenseError;

static PyObject *isHavingLicense(PyObject *self, PyObject *args)
{
    const char *name;
    const char *err = "";   // second argument is optional, so keep a valid default
    if (!PyArg_ParseTuple(args, "s|s", &name, &err))
        return NULL;

    std::string modelName(name);
    std::string errorMsg(err);

    bool ret = Allone::License::CLicenseProve::getInstance()->isHavingLicense(modelName, errorMsg);

    //if(ret == false)
    //	return Py_False;
    //return Py_True;
    // Return (status, module name, error message) as one tuple.
    return Py_BuildValue("(iss)", (int)ret, modelName.c_str(), errorMsg.c_str());
}

static PyObject *getLicenseType(PyObject *self, PyObject *args)
{
    size_t ret = Allone::License::CLicenseProve::getInstance()->getLicenseType();
    return PyInt_FromSize_t(ret);
}

static PyObject *getLicenseInfo(PyObject *self, PyObject *args){...}
static PyObject *getRegisterInfo(PyObject *self, PyObject *args){...}
static PyObject *getAppendInfo(PyObject *self, PyObject *args){...}

static PyMethodDef LicenseMethods[] = {
    {"isHavingLicense",  isHavingLicense, METH_VARARGS,"Having License in this Computer?"},
	{"getLicenseType", getLicenseType, METH_NOARGS,"Get License type"},
	{"getLicenseInfo", getLicenseInfo, METH_NOARGS,"Get License Info"},
	{"getRegisterInfo", getRegisterInfo, METH_NOARGS,"Get License Register Info"},
	{"getAppendInfo", getAppendInfo, METH_NOARGS,"Get License Append Info"},
    {NULL, NULL, 0, NULL}        /* Sentinel */
};

PyMODINIT_FUNC initLicense(void)
{
    PyObject *m;

    m = Py_InitModule("License", LicenseMethods);
    if (m == NULL)
        return;

    LicenseError = PyErr_NewException("License.error", NULL, NULL);
    Py_INCREF(LicenseError);
    PyModule_AddObject(m, "error", LicenseError);
}

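Every function we write must be added to LicenseMethods. In each entry, the string in double quotes is the name Python calls, the second field is the C++ function that implements it, and the third declares how arguments are passed: METH_VARARGS means the function receives its positional arguments as a tuple, and METH_NOARGS means it takes no arguments. The table is essentially the link between the Python side and the C++ implementation. Every C++ implementation has the form static PyObject *xxxxxxxxxx(PyObject *self, PyObject *args). The arguments arrive in args as a tuple, and PyArg_ParseTuple(args, "s|s", &name, &err) extracts them. "s|s" gives the type of each element of the tuple, and anything after the | is optional. The common format codes are listed in the official documentation, for example s for a string and i for an integer. The parsed values are stored in name and err. In C++ a function can hand results back through reference parameters, but a string passed in from Python cannot be modified in place, so I pack the modified values into a tuple and return it to Python with Py_BuildValue. Its arguments mirror those of PyArg_ParseTuple: a format string first, then the values. When the return value is simple we can also return Py_False and the like directly (via Py_RETURN_FALSE, so that the reference count is incremented).

PyMODINIT_FUNC initLicense(void) roughly consists of module initialization with Py_InitModule and registration of an exception type with PyErr_NewException. I have not looked closely at the exception handling, but it should be the channel for reporting errors that occur on the C++ side. Python code does not manage memory by hand, but C++ code must, so Py_INCREF is used to maintain the reference count: PyModule_AddObject takes over one reference to LicenseError, and the extra one keeps the static pointer valid. That last call attaches the exception to the registered module under the name error.

In Python, import License is all that is needed, as the following code shows: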

import License

def CheckLicenseInfo2():
    error = ""
    # print(License.isHavingLicense('DESIGNAPP',error))
    ret = License.isHavingLicense('DESIGNAPP',error)
    err = ret[2]
    if not ret[0]:
        QMessageBox.critical(None, 'Error', QString(str(err)))
        exit()

    print(License.getLicenseType())
    print(License.getLicenseInfo())
    print(License.getRegisterInfo())
    print(License.getAppendInfo())

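Because isHavingLicense returns a tuple, we only need to index it to get the item we want. For more parameter types, look in C:\Python27\include, whose headers declare the functions and macros for passing values in and out.

A .pyd built this way is essentially a DLL too, but it is more compatible than loading a DLL directly through ctypes. In production I recommend calling C++ functions through a .pyd.

References: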
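The "return everything in one tuple" convention can be tried without building the extension. The sketch below is a pure-Python stand-in for License.isHavingLicense: the function name, the set of licensed modules and the message text are all hypothetical, and only the shape of the result, (int, str, str) as produced by Py_BuildValue("(iss)", ...), follows the wrapper above.

```python
def is_having_license(module_name, error_msg=""):
    """Pure-Python stand-in for License.isHavingLicense (made-up logic)."""
    licensed = {"DESIGNAPP"}            # hypothetical set of licensed modules
    ok = module_name in licensed
    if not ok:
        error_msg = "no license for %s" % module_name
    # Same shape as Py_BuildValue("(iss)", ...): (status, module, message)
    return (int(ok), module_name, error_msg)

# Tuple unpacking names each field instead of relying on ret[0] and ret[2].
ok, module, err = is_having_license("DESIGNAPP")
print(ok, module, repr(err))

ok, module, err = is_having_license("OTHERAPP")
print(ok, module, repr(err))
```

Unpacking into named variables reads more clearly than indexing, and it fails loudly if the wrapper ever changes the number of returned items.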

http://www.cnblogs.com/night-ride-depart/p/4907613.html

https://docs.python.org/2/library/ctypes.html

http://wolfprojects.altervista.org/dllforpyinc.php

http://icejoywoo.github.io/2015/10/30/intro-of-extending-cpython.html

https://docs.python.org/2/c-api/arg.html

https://docs.python.org/2/extending/extending.html?highlight=meth_varargs

http://blog.csdn.net/mkc1989/article/details/38943927

Posted in Python

Setting up a Go IDE environment

January 20th, 2018 by JasonLe's Tech 1,412 views

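Go and blockchain have taken off recently, and I felt it was time to jump on the bandwagon myself.

I'll skip Go's background and its pros and cons; there is plenty of material online, and anyone with a C/C++ foundation can become a competent Go programmer quickly. I will only cover the setup process and the pitfalls I hit while debugging. Installing Go is very simple: unpack it into a directory of your choice and set GOROOT and GOPATH. GOROOT is the directory of the Go installation itself (toolchain and standard library), while GOPATH, as I understand it, is the workspace where Go projects live; everything we download with go get ends up in $GOPATH/src.

As I was just getting started, I set up the Go environment by hand and ran hello world with go run, but I could not debug Go programs. After searching for a debugger for quite a while I found that Delve (dlv) can debug Go programs. I am used to installing software with git plus make install, so after building it I placed the dlv binary in Go's bin directory to make it easy to invoke from the command line.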
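To make the $GOPATH/src rule concrete, here is a small sketch that computes where go get would place a package's sources in GOPATH mode. The helper name go_get_source_dir and the import path github.com/foo/bar are placeholders chosen for illustration.

```python
import os

def go_get_source_dir(import_path, env=None):
    """Directory where `go get` places the sources for import_path (GOPATH mode)."""
    env = os.environ if env is None else env
    # Go falls back to $HOME/go when GOPATH is unset (Go 1.8 and later).
    gopath = env.get("GOPATH") or os.path.join(os.path.expanduser("~"), "go")
    # GOPATH may list several directories; `go get` downloads into the first one.
    first = gopath.split(os.pathsep)[0]
    return os.path.join(first, "src", *import_path.split("/"))

# Placeholder import path, with the GOPATH used later in this post.
print(go_get_source_dir("github.com/foo/bar", {"GOPATH": "/home/spider/go"}))
```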

spider@ubuntu:/usr/lib/go-1.9.2$ ls
api  AUTHORS  bin  blog  CONTRIBUTING.md  CONTRIBUTORS  doc  favicon.ico  lib  LICENSE  misc  PATENTS  pkg  README.md  robots.txt  src  test  VERSION
spider@ubuntu:/usr/lib/go-1.9.2$ cd bin/
spider@ubuntu:/usr/lib/go-1.9.2/bin$ ls
dlv  go  godoc  gofmt

For anyone who has used gdb this is very straightforward: break, continue, print and so on, as shown below:

spider@ubuntu:/tmp$ dlv debug test.go
Type 'help' for list of commands.
(dlv) l
> _rt0_amd64_linux() /usr/lib/go-1.9.2/src/runtime/rt0_linux_amd64.s:8 (PC: 0x4571b0)
Warning: debugging optimized function
     3:	// license that can be found in the LICENSE file.
     4:	
     5:	#include "textflag.h"
     6:	
     7:	TEXT _rt0_amd64_linux(SB),NOSPLIT,$-8
=>   8:		LEAQ	8(SP), SI // argv
     9:		MOVQ	0(SP), DI // argc
    10:		MOVQ	$main(SB), AX
    11:		JMP	AX
    12:	
    13:	// When building with -buildmode=c-shared, this symbol is called when the shared
(dlv) b main.main
Breakpoint 1 set at 0x4a08d8 for main.main() ./test.go:12
(dlv) c
> main.main() ./test.go:12 (hits goroutine(1):1 total:1) (PC: 0x4a08d8)
Warning: debugging optimized function
     7:	   return n
     8:	  }
     9:	  return fibonacci(n-2) + fibonacci(n-1)
    10:	}
    11:	
=>  12:	func main() {
    13:	    var i int
    14:	    for i = 0; i < 10; i++ {
    15:	       fmt.Printf("%d\t", fibonacci(i))
    16:	    }
    17:	}
(dlv) n
> main.main() ./test.go:13 (PC: 0x4a08ef)
Warning: debugging optimized function
     8:	  }
     9:	  return fibonacci(n-2) + fibonacci(n-1)
    10:	}
    11:	
    12:	func main() {
=>  13:	    var i int
    14:	    for i = 0; i < 10; i++ {
    15:	       fmt.Printf("%d\t", fibonacci(i))
    16:	    }
    17:	}
(dlv)
> main.main() ./test.go:14 (PC: 0x4a08f8)
Warning: debugging optimized function
     9:	  return fibonacci(n-2) + fibonacci(n-1)
    10:	}
    11:	
    12:	func main() {
    13:	    var i int
=>  14:	    for i = 0; i < 10; i++ {
    15:	       fmt.Printf("%d\t", fibonacci(i))
    16:	    }
    17:	}
(dlv)
> main.main() ./test.go:15 (PC: 0x4a0913)
Warning: debugging optimized function
    10:	}
    11:	
    12:	func main() {
    13:	    var i int
    14:	    for i = 0; i < 10; i++ {
=>  15:	       fmt.Printf("%d\t", fibonacci(i))
    16:	    }
    17:	}
(dlv) p i
0
(dlv) n
0	> main.main() ./test.go:14 (PC: 0x4a0a02)
Warning: debugging optimized function
     9:	  return fibonacci(n-2) + fibonacci(n-1)
    10:	}
    11:	
    12:	func main() {
    13:	    var i int
=>  14:	    for i = 0; i < 10; i++ {
    15:	       fmt.Printf("%d\t", fibonacci(i))
    16:	    }
    17:	}
(dlv)
> main.main() ./test.go:15 (PC: 0x4a0913)
Warning: debugging optimized function
    10:	}
    11:	
    12:	func main() {
    13:	    var i int
    14:	    for i = 0; i < 10; i++ {
=>  15:	       fmt.Printf("%d\t", fibonacci(i))
    16:	    }
    17:	}
(dlv) p i
1

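We can also debug with dlv attach pid, provided /proc/sys/kernel/yama/ptrace_scope is set to 0. But however powerful gdb and dlv are, they are still not as convenient as an IDE, especially for breakpoint debugging. From what I found, VSCode, Gogland and Atom all meet the need; since I have always developed with JetBrains products, I naturally chose Gogland as my IDE. I want to focus on its debugging. When you debug a program, Gogland prints the following: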

GOROOT=/usr/lib/go-1.9.2 #gosetup
GOPATH=/home/spider/go #gosetup
/usr/lib/go-1.9.2/bin/go build -o /tmp/___go_build_main_go -gcflags "-N -l" -a /home/spider/go/src/awesomeProject/main.go #gosetup
/opt/GoLand/plugins/intellij-go-plugin/lib/dlv/linux/dlv --listen=localhost:41151 --headless=true --api-version=2 --backend=default exec /tmp/___go_build_main_go -- #gosetup

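The first and second lines print GOROOT and GOPATH. The third line is the build; -gcflags "-N -l" is added to turn off compiler optimizations and inlining so that debugging is easier, and when it finishes the program is written to /tmp/___go_build_main_go. The fourth line starts the debug server of the dlv bundled with Gogland, and the program it runs is the /tmp/___go_build_main_go binary that was just built.

References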
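The same two steps can be reproduced outside the IDE. This is a rough sketch rather than what Gogland actually runs internally: the helper names, the output path and the port are placeholders, while the flags are copied from the log above.

```python
import subprocess

def debug_build_cmd(go_bin, source, output):
    """go build with optimisations (-N) and inlining (-l) disabled."""
    return [go_bin, "build", "-o", output, "-gcflags", "-N -l", "-a", source]

def headless_dlv_cmd(dlv_bin, binary, port):
    """Start Delve as a debug server that a front end can connect to."""
    return [dlv_bin, "--listen=localhost:%d" % port, "--headless=true",
            "--api-version=2", "exec", binary]

def run_debug_server(source, output="/tmp/debug_main", port=41151):
    # Assumes `go` and `dlv` are on PATH; output path and port are placeholders.
    subprocess.check_call(debug_build_cmd("go", source, output))
    return subprocess.Popen(headless_dlv_cmd("dlv", output, port))
```

Passing the arguments as a list avoids shell quoting problems with the space inside "-N -l".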

https://golang.org/doc/editors.html

Posted in Go

A summary of Python package management tools

December 4th, 2017 by JasonLe's Tech 1,391 views

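I have been using elasticsearch-py to work with the database lately. At first I cloned its official repository and installed it with python setup.py install. It worked, and calling the package caused no problems, but PyCharm kept showing red underlines, and when I hit an argument error I could not jump into the interface to read the implementation, which was maddening.

Given how many small problems I had run into configuring Python packages before, I decided to pay off this technical debt in one go.

From what I have read, distutils, setuptools, distribute, distutils2, distlib and pip appeared roughly in this order:

distutils was the first installation tool. It is part of the Python standard library, and the setup.py in a Python project is built on it. It works in a simple way, but its features are limited.
setuptools was created to improve on distutils, and it includes the easy_install tool. ez_setup.py is the installer for setuptools; ez is short for easy. We can install a package with easy_install http://example.com/Package-1.2.3.tgz, or from an .egg file.
distribute is a fork of setuptools that has since been merged back into setuptools, so they are essentially the same thing. If you check the version of easy_install, what you see is essentially distribute.
distutils2 is a new distutils library that started as a fork of the distutils code base.
distlib grew out of parts of distutils2.
pip is currently the de facto standard for Python package management and was released in 2008. It was designed to replace easy_install, but much of its functionality is still built on setuptools components.

Of these tools, distutils, setuptools, distribute and pip are the mainstream ones; distutils2 and distlib remain to be seen.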

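eggs vs. whl

Egg is a file format introduced by setuptools. It uses the .egg extension and is used to install Python modules; setuptools recognizes the format, parses it and installs it.

A wheel is essentially a zip archive with the .whl extension, also used to install Python modules. It was introduced to replace eggs.

Eggs and wheels are both essentially archives, so we can rename the extension and unpack them to get at the contents. pip, however, does not recommend installing from eggs: installing an egg just drops the .egg archive into dist-packages, whereas installing a wheel unpacks the package's files into dist-packages together with a matching .dist-info directory that describes the package. That is why we can browse the interfaces when calling the package, while to the IDE a zipped egg is just a blob of binary data.

Take the elasticsearch-py source as an example. After cloning the latest source from GitHub, python setup.py install installs an egg straight into /usr/local/lib/python2.7/dist-packages, while the source it was built from stays in the build directory of the elasticsearch-py checkout. Alternatively, python setup.py sdist packs the source into an archive under dist, which can then be installed with pip. The source can also be packaged as an rpm with python setup.py bdist_rpm, or as a Windows installer (.exe) with python setup.py bdist_wininst.

Still, I recommend building a wheel with python setup.py bdist_wheel. Installing that wheel places the source files in dist-packages.

Dependency installation: the article comparing setup.py and requirements.txt mainly explains how the two files differ. In short, setup.py declares dependencies abstractly and is not the place to pin exact versions, while requirements.txt can pin the concrete version of each package, and the two can be used together.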
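The claim that these packages are ordinary archives is easy to check with the standard zipfile module. The sketch below builds a tiny wheel-like archive in memory so that it runs anywhere; the package name demo_pkg and its contents are invented for illustration and do not form a complete, installable wheel.

```python
import io
import zipfile

# Build a tiny wheel-like archive in memory (file names are illustrative only).
buf = io.BytesIO()
with zipfile.ZipFile(buf, "w") as zf:
    zf.writestr("demo_pkg/__init__.py", "VERSION = '0.1'\n")
    zf.writestr("demo_pkg-0.1.dist-info/METADATA", "Name: demo-pkg\nVersion: 0.1\n")

# A real .whl or zipped .egg can be opened the same way by passing its path.
buf.seek(0)
with zipfile.ZipFile(buf) as zf:
    names = zf.namelist()
    source = zf.read("demo_pkg/__init__.py").decode("utf-8")

print(names)
print(source)
```

The listing shows plain source files next to the metadata directory, which is what lets an IDE index the package once the archive has been unpacked.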

requirements.txt:

--index https://pypi.python.org/simple/
-e https://github.com/foo/bar.git#egg=bar
-e .

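For example, pip install -r requirements.txt works as usual: it first installs the bar package from the location given in requirements.txt, then goes on to resolve the abstract dependencies, turns them into concrete ones using the --index option, and installs them.

This approach covers situations such as two or more packages that are developed together but released separately, or an unreleased package that has been split into several parts. As long as the top-level package still depends on the others only by name, we can use requirements.txt to install the development versions of those dependencies.

References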

https://docs.python.org/3/distutils/introduction.html?highlight=distutils#a-simple-example

http://blog.csdn.net/lynn_kong/article/details/17540207

https://stackoverflow.com/questions/6344076/differences-between-distribute-distutils-setuptools-and-distutils2/14753678#14753678

Posted in Python

Problems encountered while configuring Elasticsearch

October 7th, 2017 by JasonLe's Tech 1,390 views

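To store massive amounts of data for later analysis, I spent some time evaluating options, and Elasticsearch turned out to be a good choice.
Elasticsearch is a distributed data store built for search, and it requires JDK 1.8.0_73 or later. Unlike relational databases such as MySQL, Elasticsearch exposes a RESTful web interface, so data is handled with POST/GET/DELETE/PUT. I use the elasticsearch-py client for CRUD operations. When starting the node, however, I ran into the following problem: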

[2017-12-27T21:07:13,695][INFO ][o.e.t.TransportService   ] [node-1] publish_address {192.168.228.134:9300}, bound_addresses {[::]:9300}
[2017-12-27T21:07:14,005][INFO ][o.e.b.BootstrapChecks    ] [node-1] bound or publishing to a non-loopback or non-link-local address, enforcing bootstrap checks
ERROR: [1] bootstrap checks failed
[1]: max virtual memory areas vm.max_map_count [65530] is too low, increase to at least [262144]
[2017-12-27T21:07:14,155][INFO ][o.e.n.Node               ] [node-1] stopping ...
[2017-12-27T21:07:14,468][INFO ][o.e.n.Node               ] [node-1] stopped
[2017-12-27T21:07:14,482][INFO ][o.e.n.Node               ] [node-1] closing ...
[2017-12-27T21:07:14,764][INFO ][o.e.n.Node               ] [node-1] closed

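At this point the kernel limit on memory-mapped areas needs to be raised: sysctl -w vm.max_map_count=262144 (add vm.max_map_count=262144 to /etc/sysctl.conf to keep the setting across reboots).

In addition, from version 5.x on, elasticsearch-head needs a Node.js server to run. Setting this up is fairly tedious, since nodejs, npm and grunt have to be configured first.

Installing npm: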
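The bootstrap check can be anticipated before starting the node. A minimal sketch, assuming a Linux host where the value is exposed under /proc/sys/vm/max_map_count; the helper names are hypothetical and the threshold is the one reported in the log above.

```python
import os

REQUIRED_MAX_MAP_COUNT = 262144   # minimum reported by the bootstrap check above
SYSCTL_PATH = "/proc/sys/vm/max_map_count"

def needs_increase(current, required=REQUIRED_MAX_MAP_COUNT):
    """True if the current limit is below what Elasticsearch asks for."""
    return current < required

def read_max_map_count(path=SYSCTL_PATH):
    """Return the current limit, or None where the file does not exist."""
    if not os.path.exists(path):
        return None
    with open(path) as fh:
        return int(fh.read().strip())

current = read_max_map_count()
if current is None:
    print("vm.max_map_count is not available on this system")
elif needs_increase(current):
    print("too low (%d); run: sysctl -w vm.max_map_count=%d"
          % (current, REQUIRED_MAX_MAP_COUNT))
else:
    print("ok (%d)" % current)
```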

curl https://npmjs.org/install.sh | sh
sh install.sh

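If you see "npm cannot be installed without Node.js. Install Node.js first, and then try again.", Node.js has to be installed first.

apt-get install nodejs; if the Node.js version is too old, it needs to be upgraded:

# Step 1: install the n module
npm install -g n
# Step 2: upgrade Node.js to the latest stable release
n stable

Once the Node environment is ready, install the modules elasticsearch-head needs:

git clone https://github.com/mobz/elasticsearch-head.git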
cd elasticsearch-head
npm install
npm run start

After setting up elasticsearch-head, uncomment the following settings in ./config/elasticsearch.yml, then restart Elasticsearch.

http.port: 9200
# CORS (cross-origin requests)
http.cors.enabled: true
http.cors.allow-origin: "*"

References:
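With the node listening on http.port 9200, the REST verbs mentioned at the start of this post map onto document operations roughly as sketched below. This uses only the Python 3 standard library and builds the requests without sending them. The index, type and document are hypothetical, the host is assumed to be localhost, and the /index/type/id URL layout is the one used by Elasticsearch 5.x (later versions use /index/_doc/id).

```python
import json
from urllib.request import Request

BASE_URL = "http://localhost:9200"   # port from http.port above; host is assumed

def build_request(method, index, doc_type, doc_id, body=None):
    """Build (but do not send) an HTTP request addressing one document."""
    url = "%s/%s/%s/%s" % (BASE_URL, index, doc_type, doc_id)
    data = json.dumps(body).encode("utf-8") if body is not None else None
    headers = {"Content-Type": "application/json"} if data else {}
    return Request(url, data=data, headers=headers, method=method)

# Hypothetical index, type and document, purely for illustration.
create = build_request("PUT", "flights", "route", "1", {"origin": "PEK", "dest": "SHA"})
read = build_request("GET", "flights", "route", "1")
delete = build_request("DELETE", "flights", "route", "1")

for req in (create, read, delete):
    print(req.get_method(), req.full_url)
# Sending a request is one more call: urllib.request.urlopen(create)
```

In day-to-day code the elasticsearch-py client hides these details, but seeing the raw verbs makes the log output and the elasticsearch-head traffic easier to read.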

https://my.oschina.net/kittyMan/blog/387512?p=1
http://orchome.com/489

Posted in Code Miscellany

A first taste of basemap

September 28th, 2017 by JasonLe's Tech 1,325 views

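Last year I used gnuplot for the statistics in my graduation thesis. Recently, while doing data mining and visualization, I found that matplotlib is a fairly active Python plotting library. With matplotlib you can draw scatter plots, bar charts and line charts, and combined with sklearn these cover classic data mining work.

My boss recently gave me some GIS data, and I wanted to explore its distribution first in preparation for clustering later. At first I simply scattered the GIS longitudes and latitudes onto a 2D coordinate system, but without a map underneath the distribution was very abstract. Then I learned that basemap, a matplotlib toolkit, is up to the job.

basemap can draw several kinds of geographic maps, including Blue Marble imagery on a globe, pie-shaped maps and flat 2D maps. I mainly draw polylines here, so plot is all I need.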

import numpy as np
from mpl_toolkits.basemap import Basemap
import matplotlib.pyplot as plt
from datetime import datetime

map = Basemap(projection='mill',lon_0=180)
map.drawcoastlines()
map.drawparallels(np.arange(-90,90,30),labels=[1,0,0,0])
map.drawmeridians(np.arange(map.lonmin,map.lonmax+30,60),labels=[0,0,0,1])
map.drawmapboundary(fill_color='aqua')
map.fillcontinents(color='coral',lake_color='aqua')

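All you need is to construct a Basemap and state the projection. I found the projections quite confusing, but if you download basemap there are examples in it: you can run run_all.py and look for the projection you want. You do need a basic grasp of longitude and latitude, of course, otherwise you cannot select the fixed region you are after. merc (Mercator) is a projection suited to drawing just part of the map: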

m = Basemap(llcrnrlon=-100.,llcrnrlat=20.,urcrnrlon=20.,urcrnrlat=60.,\
            rsphere=(6378137.00,6356752.3142),\
            resolution='l',projection='merc',\
            lat_0=40.,lon_0=-20.,lat_ts=20.)

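Here llcrnrlon and llcrnrlat are the longitude and latitude of the lower-left corner, and urcrnrlon and urcrnrlat are those of the upper-right corner. These two diagonal corners determine a unique viewport.

Here is the route map I made:

With matplotlib, remembering how to draw scatter points (scatter) and lines (plot) is basically enough for our needs.

References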
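A quick way to sanity-check the corner values before plotting is to test which data points fall inside the box they span. The following sketch uses the corners from the Basemap call above; the helper in_viewport is hypothetical, the city coordinates are approximate sample points, and the check assumes the box does not cross the antimeridian.

```python
def in_viewport(lon, lat, llcrnrlon, llcrnrlat, urcrnrlon, urcrnrlat):
    """True if (lon, lat) lies inside the box spanned by the two corners."""
    # Assumes llcrnrlon <= urcrnrlon, i.e. the box does not wrap around 180 degrees.
    return llcrnrlon <= lon <= urcrnrlon and llcrnrlat <= lat <= urcrnrlat

# Corner values taken from the Basemap call above.
box = dict(llcrnrlon=-100.0, llcrnrlat=20.0, urcrnrlon=20.0, urcrnrlat=60.0)

# Approximate (lon, lat) pairs, used only as sample points.
points = {"New York": (-74.0, 40.7), "London": (-0.1, 51.5), "Beijing": (116.4, 39.9)}

visible = sorted(name for name, (lon, lat) in points.items()
                 if in_viewport(lon, lat, **box))
print(visible)  # ['London', 'New York']
```

Filtering points this way before calling plot or scatter avoids wondering why part of a route is missing from the rendered map.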

http://matplotlib.org/basemap/users/examples.html

Posted in Code Miscellany
