圖像分割英文資料及翻譯

上傳人：e*** IP屬地：湖北上傳時間：2021-11-11 格式：DOC 頁數(shù)：11 大小：106KB 積分：16 舉報 版權申訴

已閱讀5頁，還剩6頁未讀，繼續(xù)免費閱讀

版權說明：本文檔由用戶提供并上傳，收益歸屬內(nèi)容提供方，若內(nèi)容存在侵權，請進行舉報或認領

文檔簡介

1、一種在線圖像編碼識別系統(tǒng)的設計摘要：本文介紹了在線圖像編碼字符識別系統(tǒng)的設計與實現(xiàn)過程，對其中重點環(huán)節(jié)進行了分析與研究，給出了主要環(huán)節(jié)問題的解決方法，在識別算法上，結合模板匹配與特征識別，提出了基于特征加權的模板匹配算法，該算法對提高字符識別率提到了較好的作用。關鍵詞：圖像處理;模式識別;特征加權;軟件設計0引言圖像編碼字符識別的研究目前仍是國內(nèi)外一個重點研究課題，它具有廣泛的應用背景，比如車牌號碼自動識別、郵政編碼的自動識別、試卷自動閱讀、報表自動處理等，由于這種在線圖像編碼字符的識別都具有一些共性，本文結合在線輪胎編碼字符識別系統(tǒng)的設計，對一般圖像編碼字符識別系統(tǒng)進行了闡述，對關鍵環(huán)節(jié)進行

2、了研究與分析，該方法對其它在線圖像編碼字符系統(tǒng)的開發(fā)具有一定指導意義。1在線圖像編碼識別系統(tǒng)流程在線圖像編碼字符識別系統(tǒng)主要包括數(shù)字圖像的采集、存儲、圖像預處理、編碼圖像提取、編碼特征提取、編碼識別和后續(xù)處理等一些環(huán)節(jié)，其流程圖如圖1所示。圖像預處理后續(xù)處理編碼特征提取與識別編碼圖像的采集與存儲圖1 在線圖像編碼字符識別系統(tǒng)流程圖在線輪胎圖像編碼字符識別系統(tǒng)要求對通過生產(chǎn)流水線上每一個輪胎采集含有輪胎編碼的圖像，然后通過對圖像的處理，提取出輪胎編碼特征，采用合適的識別算法將每一位編碼字符進行識別。由于輪胎編碼字符在輪胎上有一定變形，且攝像角度不同，得到的編碼圖像差異也很大，規(guī)律性差，所以編碼

3、圖像的預處理和識別算法的選取顯得尤為重要。2圖像采集與存儲在線編碼圖像通常使用數(shù)碼攝像機、數(shù)碼照相機、數(shù)碼攝像頭等設備采集并輸入計算機進行處理，本系統(tǒng)采用QuickCamPro4000數(shù)碼攝像頭采集輪胎編碼圖像,直接按JPG格式存儲。編碼圖像一般都要先轉成BMP圖像格式，因為BMP格式己經(jīng)成為PC領域事實上的標準幾乎所有為Windows操作系統(tǒng)設計的圖像處理軟件都支持這種格式的圖像。BMP是Windows的原始位圖格式，它可以用于保存任意類型的位圖數(shù)據(jù)，可以支持所有的屏幕分辨率和Windows所支持的顏色組合。一般情況下，為了保證顯示的高效率，它對圖像數(shù)據(jù)沒有任何的壓縮，所以一幅很小的位圖就可

4、能占據(jù)相當大的空間。BMP位圖文件包括位圖文件頭、位圖信息頭、調(diào)色板、位圖數(shù)據(jù)區(qū)四個部分，位圖文件頭由14個字節(jié)構成，位圖信息頭由40個字節(jié)構成，調(diào)色板的大小取決于色彩數(shù)，單色圖像調(diào)色板占8個字節(jié)，16色圖像調(diào)色板占64個字節(jié)，256色圖像調(diào)色板占1024個字節(jié)，224色圖像沒有調(diào)色板，位圖數(shù)據(jù)區(qū)內(nèi)數(shù)據(jù)按行順序自下而上、自左而右排列。3圖像預處理圖像預處理主要包括有：圖像灰度化、圖像降噪與增強、編碼區(qū)邊緣檢測、圖像幾何校正、編碼區(qū)圖像提取、編碼圖像二值化、字符分割、字符歸一化等。下面介紹幾個關鍵環(huán)節(jié)的處理過程。3.1 圖像灰度化處理編碼圖像通常是彩色的，實際識別用的圖像是灰度圖，所在需要先將彩

5、色編碼圖像轉換為灰度圖像。在RGB顏色模型中，如果R=G=B，則顏色（R，G，B）表示一種黒白顏色，其中R=G=B的值叫灰度值，灰度化處理就是使彩色的R、G、B分量值相等的過程。常用灰度化處理方法是加權平均值法，即R=G=B=(WRR+WGG+WBB)/3其中，WR、WG、WB分別是R、G、B的權值，實驗和理論證明，當WR=0.3, WG=0.59, WB=0.11時，即當R=G=B=0.30R+0.59G+0.11B時，能得到最合理的灰度圖像。3.2 圖像增強處理3.2.1 直接灰度變換線性灰度變換：假設圖像灰度是線性變化的，如原圖像f(x,y)灰度范圍為a,b，要求變換后圖像灰度范圍達到c

6、,d，根據(jù)線性規(guī)律，則變換后圖像g(x,y)為：（1）非線性變換對數(shù)變換和指數(shù)變換。當需要擴展低灰度區(qū)、壓縮高灰度區(qū)時使用對數(shù)變換，當需要擴展高灰度區(qū)時使用指數(shù)變換。3.2.2 平滑濾波降噪由于噪聲對應圖像中的區(qū)域邊緣等灰度值具有較大較快變化的部分，屬高頻分量，所以使用低通濾波器（即平滑濾波器）降噪。同時平滑還可以使圖像模糊，有利于在提取較大的目標前去除較小的細節(jié)或將目標內(nèi)的小間斷連接起來。平滑降噪的方法是使用模板對圖像進行卷積運算，線性平滑濾波器最常用的模板是如圖2所示的3×3模板，將此模板與圖像中像素按如下方法進行卷積運算，可得到平滑降噪的圖像。將模板在圖中漫游，并將模板中心與

7、圖中每個像素位置重合；將模板上系數(shù)與模板下對應像素相乘；將所有乘積相加；將和賦給圖中對應模板中心位置的像素。非線性平滑濾波器最常用的是中值濾波器，它將區(qū)域中所有的值按大小進行排序，將排序后位于中間的像素值賦予中心像素。中值濾波可有效地去除隨機噪聲，能得到較好的視覺效果。3.3 編碼區(qū)邊緣檢測邊緣是灰度值不連續(xù)的結果，可利用求一階和二階導數(shù)的方法檢測到。因為在邊緣地帶導數(shù)值大，而非邊緣的地方導數(shù)值小。由于數(shù)字圖像是離散的，不能求導數(shù)，可以通過卷積的方法用差分近似代替微分。-101-202-101121000-1-2-1效果較好的邊緣檢測算法是Sobel算子。Sobel算子是一種梯度幅值，分別利用

8、垂直算子Sx、水平算子Sy來獲取編碼區(qū)垂直邊緣和水平邊緣，即在水平和垂直方向上使用如圖3所示的兩個不同的卷積模板，得到如圖4所示的邊緣檢測結果。圖2 平滑濾波器模板圖3 Sobel邊緣檢測模板圖4 Sobel算子邊緣檢測結果3.4 圖像幾何校正Hough變換可以檢測出編碼區(qū)圖像傾斜角度，根據(jù)此角度進行旋轉變換可使編碼區(qū)圖像得到校正。Hough變換可以將圖像空間XY中的直線（y=px+q）檢測問題轉換到參數(shù)空間PQ中點的檢測問題，在參數(shù)空間PQ里，建立一個累加數(shù)組Sum(p,q)，對每一個圖像空間中給定邊緣點，讓p取遍所有可能值，根據(jù)直線方程q=-xp+y計算出對應的q，對Sum(p,q)進

9、行累加，得到Sum(p,q)的值就是在(p,q)處共線的點的個數(shù)，(p,q)的值就是圖像空間中直線的斜率和截距，由斜率得到圖像編碼區(qū)水平邊緣角度。3.5 字符切割通過對編碼字符區(qū)直接進行水平掃描，由字符間距一般可以將字符區(qū)域分割出來。也可以通過對編碼字符區(qū)做垂直方向投影運算，根據(jù)字符大致寬度與字符總數(shù)，對字符進行切割。如圖5所示是編碼字符區(qū)及對應垂直投影圖。圖5 編碼字符及對應垂直投影圖6 線性插值示意圖3.6 字符歸一化處理對分割出的字符從四個方向掃描，確定字符邊界，然后采用線性插值方法對每個字符作歸一化處理，使每個字符歸一為32×16點陣。圖6為線性插值示意圖，根據(jù)線性原理，

10、f(x1)可由公式（2）計算：（2）4識別算法設計字符識別一般采取特征判別或模板匹配的方法，特征判別是根據(jù)特征抽取的程度分階段的、用結構分析的辦法完成字符的識別。模板匹配即是根據(jù)字符的知識采取按形匹配的方法，模板匹配一般分為兩類：一類是直接利用輸入的二維平面圖像與字典中記憶的圖形進行匹配；另一類是抽出部分特征與字典進行匹配。輪胎編碼圖像中字符僅涉及部分英文字符和10個阿拉伯數(shù)字，字符較少，結構相對簡單，因此具體識別時，既可以采用圖形匹配的方法，也可以采用結構分析的方法。但由于輪胎上編碼字符有一定變形，且有斷裂現(xiàn)象，所以直接模板匹配與直接特征抽取方法識別率都不理想，本系統(tǒng)使用了模板匹配與特征識

11、別相結合的基于特征加權的模板匹配識別算法，其字符識別率比簡單模板匹配算法和特征識別算法識別率都有不同程度的提高?；谔卣骷訖嗟哪０迤ヅ渥R別算法基本思路是：給模板中有字符筆畫的點分配不同的權重，位于筆畫中心的點權重最高，位于筆畫邊緣的點權重最低，然后將樣本模板與標準模板逐點模糊匹配，按模糊識別規(guī)則識別。 5結論本文結合輪胎編碼識別系統(tǒng)的實現(xiàn)對在線圖像字符編碼識別系統(tǒng)的設計進行了闡述，提出了一種模板匹配與特征匹配相結合的識別算法，該方法對傳統(tǒng)的模板匹配算法進行了改進，提高了變形、斷裂等字符的識別率。這種方法在試驗中得到了驗證，取得了令人滿意的效果。 The Development of A Kin

12、d of Online Image Code Recognition SystemAbstract: This paper describes the design and the implement of online image coding char recognition system. It analyses and researches the important contents about the system. Then it provides the solutions of main problems. In recognition algorithm, combinin

13、g template matching with feature recognition, it put forword an improved template matching algorithm based on feature weights. The algorithm can obviously improve the char recognition ratio.Keyword: image processing; pattern recognition; feature weights; software design0 IntroductionCharacter recogn

14、ition of image coding is still the subject of intense study at home and abroad, it has broad applications, such as Automatic number plate recognition, postal code of the automatic identification, automatic reading papers, reports, automatic processing, because of this online image coded character re

15、cognition has some common, this paper online tire coding character recognition system for the general image coding character recognition system has been elaborated on the key link of the research and analysis, the method of the other online image coded character system Development of guiding signifi

16、cance.1 An online image coding identification system processesOnline image coding character recognition system includes digital image capture, storage, image preprocessing, encoding the image extraction, feature extraction coding, coding identification and follow-up treatment of some aspects of its

17、flow chart shown in Figure 1.Image preprocessingFollow-up treatmentFeature Extraction and Identification CodeCoded image capture and storage Figure 1-line character recognition image coding system flowchartOnline tire image coding character recognition system requires the production pipeline through

18、 the acquisition of each tire with tire encoded image, and then through image processing, coding to extract features of the tire, using the appropriate recognition algorithm to identify each coded character. Tire coding characters as a certain deformation in the tires, and different camera angles, a

19、re also great differences in the coding images, regularity is poor, so coded image preprocessing and recognition algorithms of selection is very important.2 Image Acquisition and StorageLine coding commonly used digital camera images, digital cameras, digital video cameras capture and processed in c

20、omputer, the system uses QuickCamPro4000 tire coding digital camera image capture, directly from JPG format.Coded images generally must first convert BMP image format, because the BMP format has become the de facto standard PC in the field - almost all of the Windows operating system designed for im

21、age processing software to support this format of the image. BMP is the original Windows bitmap format, which can be used to save any type of digital map data, can support all Windows supported screen resolution and color combination. Under normal circumstances, in order to ensure the display of hig

22、h efficiency, it does not have any compressed image data, so a small bitmap may occupy considerable space.BMP bitmap file includes the bitmap file header, bitmap information header, palette, bitmap data area of four parts, bitmap file header from 14 bytes constitute the bitmap header from 40 bytes c

23、omposition, tone color palette depends on the number of monochrome color images.Board accounted for 8 bytes, 16-color palette images accounted for 64 bytes, 256-color palette image 1024 bytes total, 224-color images without color palette, the bitmap data from the region under the order of the data b

24、y row and on the arrangement from left to right.3 PreprocessingImage preprocessing includes are: gray image, image noise reduction and enhancement, coding, edge detection, image geometry correction, image coding region of extraction, encoding image binarization, character segmentation, character nor

25、malization and so on. Here are some key aspects of the process.3.1 gray image processingImages are usually color coded, the actual identification with the image is grayscale, where the need to convert first color-coded images to grayscale. In the RGB color model, if R = G = B, then color (R, G, B) i

26、ndicates a Black white color, in which R = G = B is called the value of gray value, gray level processing is to make the color of the R , G, B component value equal to the process. Gray-scale processing methods are commonly used weighted average method, that is,R = G = B = (WRR + WGG + WBB) / 3Which

27、, WR, WG, WB are the R, G, B the weight of experimental and theoretical proof, when WR = 0.3, WG = 0.59, WB = 0.11, that is when R = G = B = 0.30R +0.59 G +0.11 B, can be the most reasonable grayscale.3.2 image enhancement processing3.2.1 Direct gray-scale transformation linear gray level transforma

28、tion: if the image gray scale is linear, as in the original image f (x, y) gray-scale range of a, b, asked the transformed image intensity range of up to c, d, According to the linear law, the transformed image g (x, y) as: (1) nonlinear transformation - log transformation and exponential transforma

29、tion:When the need to expand low gray zone, gray zone of high compression used on the log transformation, when the need to expand the use of high gray area index transformation.3.2.2 smoothing filter - Noise ReductionAs the noise in the area corresponding to the edge of the image gray value of such

30、rapid change with a larger part is a high frequency, so the use of low-pass filter (ie, smoothing filter) noise. At the same time can make the image fuzzy smoothing is beneficial to the larger goal of the extraction prior to removal of the smaller details or to target the small interruption link.Smo

31、othing noise reduction method is to use the template on the image convolution operation, linear smoothing filter is the most commonly used template is shown in Figure 2 of the 3 × 3 template, this template and image in pixels by the following method of convolution , get smooth image noise reduc

32、tion. In the figure, roaming the template and the template center and map location of each pixel overlap; the template on the coefficient multiplied with the template under the corresponding pixel; add all the product; It will assign the figure corresponds to the template and the center of the pixel

33、.The most commonly used non-linear smoothing filter is median filter, it will all of the values of the region are sorted according to size, will be sorted in the middle of the pixel values given to the center pixel. Median filter can effectively remove the random noise, can get a better visual effec

34、t.3.3 Edge detection codingEdge is the result of discrete gray value can be used to request the first and second derivative method to detect. Because the derivative of the edge of a large area, rather than the local derivative of the edge of the small. As the digital image is discrete, not the deriv

35、ative, convolution method can replace the differential with the differential approximation.Is better Sobel edge detection algorithm is operator. Sobel operator is a gradient amplitude, respectively, using vertical operator Sx, Sy operator to obtain the level of the coding region of the vertical edge

36、s and horizontal edges, that is, the horizontal and vertical directions as shown in Figure 3 using two different volumes product template, get the edge as shown in Figure 4 results.-101-202-101121000-1-2-1Figure 2 smoothing filter template Figure 3 Sobel edge detection templateFigure 4 Sobeledgedete

37、ction operator3.4 Image RectificationHough transform can detect the coding region of the image angle, the angle of rotation according to the coding region of the image transformation can be corrected.Hough transform to the image space XY of the line (y = px + q) parameter space detection problem is

38、transformed into the mid-point of detection PQ, PQ in the parameter space, the establishment of a cumulative array Sum (p, q), for each given the edge in image space, let p taken over all possible values, according to linear equation q =- xp + y to calculate the corresponding q, on the Sum (p, q) to

39、 accumulate, by Sum (p, q) the value of the is the (p, q) point total of the number line, (p, q) is the image space in the value of the slope and intercept, obtained by the slope angle of the edge image coding standard.3.5 Character CuttingCoded character area on the level of scanning directly from

40、the character spacing can generally be out of character segmentation. Can also be done by coded character area vertical projection operation, according to the character width and character less the total number of characters to be cut. Figure 5 is a coded character areas and the corresponding vertic

41、al projection.Figure 5 encoded characters and the corresponding Figure 6 Schematic diagram of vertical projection linear interpolation3.6 Character normalizationThe character of the segmented into four scans to determine the character boundaries, and then use linear interpolation for each character

42、for normalized so that each character is normalized to 32 × 16 lattice. Figure 6 Schematic diagram of linear interpolation, according to linear theory, f (x1) by the formula (2) Calculation: （2）4 Identification algorithmTo determine the general characteristics of character recognition or templa

43、te matching method, Feature identification is based on the degree of feature extraction stages, complete with a structural analysis approach to character recognition. Template matching that is based on knowledge of the characters take shape matching method according to the template matching is generally divided into two categories: direct use of the imported two-dimensional plane images and dictionary matching graphics memory; the other is out of some feature match with the dictionary.Tire coding image only some of the characters and English characters

人人文庫> 全部分類> 行業(yè)資料 > 信息產(chǎn)業(yè)

溫馨提示

1. 本站所有資源如無特殊說明，都需要本地電腦安裝OFFICE2007和PDF閱讀器。圖紙軟件為CAD,CAXA,PROE,UG,SolidWorks等.壓縮文件請下載最新的WinRAR軟件解壓。
2. 本站的文檔不包含任何第三方提供的附件圖紙等，如果需要附件，請聯(lián)系上傳者。文件的所有權益歸上傳用戶所有。
3. 本站RAR壓縮包中若帶圖紙，網(wǎng)頁內(nèi)容里面會有圖紙預覽，若沒有圖紙預覽就沒有圖紙。
4. 未經(jīng)權益所有人同意不得將文件中的內(nèi)容挪作商業(yè)或盈利用途。
5. 人人文庫網(wǎng)僅提供信息存儲空間，僅對用戶上傳內(nèi)容的表現(xiàn)方式做保護處理，對用戶上傳分享的文檔內(nèi)容本身不做任何修改或編輯，并不能對任何下載內(nèi)容負責。
6. 下載文件中如有侵權或不適當內(nèi)容，請與我們聯(lián)系，我們立即糾正。
7. 本站不保證下載資源的準確性、安全性和完整性, 同時也不承擔用戶因使用這些下載資源對自己和他人造成任何形式的傷害或損失。

圖像分割英文資料及翻譯

文檔簡介

溫馨提示

最新文檔

評論

圖像分割英文資料及翻譯

文檔簡介

溫馨提示

最新文檔

評論

相關文檔