Code for Converting Chinese Characters to Pinyin


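'***************************************************************************
'* MODULE NAME:   HzToPy
'* AUTHOR & DATE: tt.t
'*                03 April 2007
'* DESCRIPTION:   Converts a Chinese string to pinyin, and that is all.
'*   Getting pinyin from Chinese characters is not something I care much
'*   about in itself. I simply noticed that the publicly available methods
'*   have serious flaws while WORD does it very well, so I tried to solve
'*   the problem.
'*   The process was far more tortuous than I expected, mainly because VBA
'*   is a very restricted language. Fortunately, with Google and OllyDbg,
'*   the only difficulty was finding ways around the restrictions, and
'*   everything was finally done within 5 hours.
'*   That is much longer than I had estimated, because I really do not know
'*   VBA and am not very familiar with OLE either :\
'*   Still, everything got solved in the end, and I have grown a little
'*   beyond being a VBA novice. VBA is actually quite powerful.
'*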

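'* Theory: Enough rambling; here is how it works, even though not everyone
'*   cares.
'*   WORD's Phonetic Guide can turn Chinese characters into pinyin only
'*   because it relies on Microsoft Pinyin. Microsoft Pinyin 2.0 and later
'*   all provide Chinese-to-pinyin conversion.
'*   The IFELanguage interface of the MSIME.China class implements the
'*   conversion. However, MSIME.China does not expose IDispatch, and VBA's
'*   CreateObject cannot call such a class, so we have to call it by hand.
'*   CoCreateInstance can create the class and obtain the IFELanguage
'*   interface, but we still cannot call it directly, because VBA does not
'*   know how to invoke the methods of IFELanguage. This stumped me for a
'*   long time. I had hoped to declare the interface layout as in other
'*   languages, but VBA does not support that. With no other option, I
'*   searched the OLE-related DLLs for a proxy function that calls interface
'*   methods indirectly, and eventually found DispCallFunc in OLEAUT32.
'*   A quick Google search confirmed it was what I needed.
'*   With the interface known and the calling mechanism clear, the remaining
'*   problem was retrieving the result. IFELanguage.GetMorphResult stores
'*   the converted result in a structure named tagMORRSLT and returns a
'*   pointer to that structure.
'*   That raised a new problem: VBA has no pointers... sigh. Why is
'*   something so easy in other languages so tedious in VBA? Fortunately,
'*   VBA's restriction on reading memory is easy to get around: just call
'*   RtlMoveMemory in ntdll.
'*   With every restriction lifted, HzToPy finally works.
'*   Told this way it all sounds straightforward, but finding the solutions
'*   was painful. At least my VBA experience grew a lot.
'*   Now let the code speak.
'*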

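'* Memo: Converted to a class; added options to insert a separator between
'*   pinyin syllables and to strip the tone marks. See the example in
'*   "模块1" (Module1); it is very easy to use :)
'*   Fixed a bug: I had the default lower bound of VBA arrays wrong in a
'*   ReDim.
'*
'***************************************************************************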

Option Explicit

Public Enum PhoneticNotation
    pnDefault = 0
    pnNoNotation = 1
End Enum

Private Type GUID
    Data1 As Long
    Data2 As Integer
    Data3 As Integer
    Data4(0 To 7) As Byte
End Type

Private Type TinyMORRSLT
    dwSize As Long
    pwchOutput As Long
    cchOutput As Integer
End Type

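' The quoted library names are missing from the source text. They are restored
' here as assumptions: "ntdll" and "oleaut32" follow the header notes, and
' "ole32" is the DLL that exports CoCreateInstance and CoTaskMemFree.
Private Declare Sub MoveMemory Lib "ntdll" Alias "RtlMoveMemory" _
    (Destination As Any, Source As Any, ByVal Length As Long)

Private Declare Function CoCreateInstance Lib "ole32" _
    (rclsid As GUID, ByVal pUnkOuter As Long, _
    ByVal dwClsContext As Long, riid As GUID, _
    ByRef ppv As Long) As Long

Private Declare Function DispCallFunc Lib "oleaut32" _
    (ByVal pvInstance As Long, ByVal oVft As Long, _
    ByVal cc As Long, ByVal vtReturn As Integer, _
    ByVal cActuals As Long, prgvt As Integer, _
    prgpvarg As Long, pvargResult As Variant) As Long

' The parameter list is also missing from the source; this one follows the
' Windows API signature void CoTaskMemFree(LPVOID pv).
Private Declare Sub CoTaskMemFree Lib "ole32" (ByVal pv As Long)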

Dim MSIME_GUID As GUID          'MSIME's GUID
Dim IFELanguage_GUID As GUID    'IFELanguage's GUID
Dim IFELanguage As Long         'Pointer to IFELanguage interface

Dim sNotation1
Dim sNotation2
Dim dNotation

Dim pvSeperator As String
Dim pvUseSeperator As Boolean
Dim pvInitialOnly As Boolean
Dim pvOnlyOneChar As Boolean

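Private Sub InitalArray()
    ' The string literals of these three arrays did not survive in the source
    ' text. Only fragments of sNotation1 are legible: tone-marked letters such
    ' as "ü", "á", "à", "í", "ì", "ú", "ù", "ā" and "ɡ".
    sNotation1 = Array(…)
    sNotation2 = Array(…)
    dNotation = Array(…)
End Sub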

Private Sub GenGUID()
    InitalArray

    'MSIME.China GUID = {E4288337-873B-11D1-BAA0-00AA00BBB8C0}
    With MSIME_GUID
        .Data1 = &HE4288337
        .Data2 = &H873B
        .Data3 = &H11D1
        .Data4(0) = &HBA
        .Data4(1) = &HA0
        .Data4(2) = &H0
        .Data4(3) = &HAA
        .Data4(4) = &H0
        .Data4(5) = &HBB
        .Data4(6) = &HB8
        .Data4(7) = &HC0
    End With

    'IFELanguage GUID = {019F7152-E6DB-11D0-83C3-00C04FDDB82E}
    With IFELanguage_GUID
        .Data1 = &H19F7152
        .Data2 = &HE6DB
        .Data3 = &H11D0
        .Data4(0) = &H83
        .Data4(1) = &HC3
        .Data4(2) = &H0
        .Data4(3) = &HC0
        .Data4(4) = &H4F
        .Data4(5) = &HDD
        .Data4(6) = &HB8
        .Data4(7) = &H2E
    End With
End Sub

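Private Sub IFELanguage_Open()
    Dim ret As Variant

    ' The second argument is a byte offset into the interface's vtable
    ' (4-byte pointers): 4 = IUnknown::AddRef, 12 = IFELanguage::Open.
    ' The third argument, 4, is the calling convention CC_STDCALL.
    DispCallFunc IFELanguage, 4, 4, vbLong, 0, 0, 0, ret
    DispCallFunc IFELanguage, 12, 4, vbLong, 0, 0, 0, ret
End Sub

Private Sub IFELanguage_Close()
    Dim ret As Variant

    If IFELanguage = 0 Then Exit Sub

    ' Vtable byte offsets: 8 = IUnknown::Release, 16 = IFELanguage::Close.
    DispCallFunc IFELanguage, 8, 4, vbLong, 0, 0, 0, ret
    DispCallFunc IFELanguage, 16, 4, vbLong, 0, 0, 0, ret
End Sub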
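
' A minimal sketch, not the author's code. The header notes say the conversion
' result comes back as a pointer to a tagMORRSLT structure and has to be read
' with RtlMoveMemory; this shows one way that step could look.
' ReadMorphOutput is a hypothetical helper name. It assumes 32-bit VBA and
' relies on this module's MoveMemory declaration and TinyMORRSLT type.
Private Function ReadMorphOutput(ByVal pResult As Long) As String
    Dim r As TinyMORRSLT
    Dim s As String

    If pResult = 0 Then Exit Function

    ' Copy the leading fields (dwSize, pwchOutput, cchOutput) into r.
    MoveMemory r, ByVal pResult, LenB(r)
    If r.pwchOutput = 0 Or r.cchOutput <= 0 Then Exit Function

    ' Allocate cchOutput UTF-16 characters, then overwrite them with the
    ' characters that pwchOutput points to (2 bytes per character).
    s = String$(r.cchOutput, vbNullChar)
    MoveMemory ByVal StrPtr(s), ByVal r.pwchOutput, CLng(r.cchOutput) * 2
    ReadMorphOutput = s
End Function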

''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''
''' Subroutine: GetPinYin
'''
''' Purpose:    Returns the pinyin of a Chinese string
'''
''' Arguments:  HzStr - the Chinese string to convert
'''
''' Date            Developer   Action
''' --------------------------------------------------------------------------
''' 02 April 2007   tt.t        Fixed the error in ReDim Py
'''
Private Function IFELanguage_GetMorphResult(HzStr As String) As String
    Dim ret As Variant
    Dim pArgs(0 To 5) As Long
