Engaged in a lot of days of tesseract, in fact, I feel that the harvest is not very big, I want to look at the source code, I want to figure out, to facilitate the modification of the source code in the future, but. . but. . but. .
So today I plan to port this software to Zynq. I have successfully installed tesseract on the PC before, and I tried it with the test image that comes with it, no problem. Very easy. But transplanted to ZYnq, I have a dim sum, guilty is definitely a reason, my knowledge is not prepared enough, not much understanding of Linux, understanding of cross-compilation, etc., library, header file, configure, make, Make install and so on are all well-understood, and you need to know a bit about the parameters of configure. Ok, don't talk nonsense, I will record my own transplant process.
Known by tesseract, this software requires the support of leptonica. This thing is actually another software. It implements a lot of image processing functions, similar to OPENCV. This installation on the PC is very simple, .confiugre.make, make install is done, the default installation should be under .usr/local/. Of course, I need cross-compilation to generate a library that can be used on Zynq, header files or something. Naturally, you need to use cross-compilation tools, called arm-xilinx-linux-gunbeabi-gcc and so on, so long. . . However, the meaning is very obvious, indicating that the execution environment compiled by this compilation tool is the arm related product of xilinx company. For me, there is only one piece of ZED board in hand, but I feel that this board is still good. Haha. A lot of nonsense. . Tell me how to compile leptonica. . Let's talk nonsense again. The understanding of cross-compilation is more good. Otherwise, it doesn't mean to follow it. It is omnipotent to understand. . I am a newbie, I don't know anyone about Linux. Of course I will learn some knowledge games first. . /7082590, this blog is still ok, give me a reference and some knowledge, thank you, or I really don't know how to start, there is no one to guide, I don't know when I am, but fortunately have these The selfless dedication of netizens. .
The first step: set the relevant environment variables:
Export PATH=$PATH: (that is the arm/xilinx-linux-gnueabi-gcc tool directory /bin) / / specify the path of the cross compiler, this step has actually been done, the development board has been in this for many days The first experiment that can be considered as learning is the construction of the cross environment, remember it. .
Export CC= arm-xilinx-linux-gnueabi-gcc //Specify the C file compilation tool
Export CXX=arm-xilinx-linux-gnueabi-g++ //Specify the compiler tool for c++ files
What I want to say about this part is that I refer to that blog. In fact, these things should be studied with configure-help. Look at the parameters of this configuration. I looked at it, except for the two mentioned above. Others include some library paths, source file paths, and what are actually important. The household will execute
./configure --build=i686-pc-linux-gnu --host=arm-xilinx-linux-gnueabi --target=arm-xilinx-linux-gnueabi CFLAGS=-I/home/gzw/CodeSourcery/Sourcery_CodeBench_Lite_for_Xilinx_GNU_Linux/ LDFLAGS =-L/home/gzw/CodeSourcery/Sourcery_CodeBench_Lite_for_Xilinx_GNU_Linux/lib --prefix=/home/gzw/install/leptonica-1.69
This latter bunch is actually writing the parameters of configure, function, look at help to know.
Then make, make install, because we specified the installation path is /home/gzw/install/leptonica-1.69 when executing configure, so the installation is below this.
At this point, leptonica is installed.
Next, I will talk about the installation of tesseract, which is similar to the above, so the above is successful, and the following is like a duck. . . . .
I said earlier that tesseract needs to use leoptonica, so I will copy the installed /include and /lib to the path of the cross tool, and then specify the required header file when configuring, the library path. The class is fine. as follows:
./autogen.sh
This part is indispensable, because the makeup file needs the makefile.in file, in fact, the file to see the principle of the makefile that is automatically produced in Linux will probably know what to do with these files.
./configure --build=i686-pc-linux-gnu --host=arm-xilinx-linux-gnueabi --target=arm-xilinx-linux-gnueabi CFLAGS=-I/home/gzw/CodeSourcery/Sourcery_CodeBench_Lite_for_Xilinx_GNU_Linux/ LDFLAGS =-L/home/gzw/CodeSourcery/Sourcery_CodeBench_Lite_for_Xilinx_GNU_Linux/lib --prefix=/home/gzw/install/tesseract-3.01
After the make, make make install installation is successful, the path is below the specified /home/gzw/install/tesseract-3.01, this stuff should be in the East of Zynq needs to be used. Ok, let's do it first. Continue the experimental experiment. .
2013.4.2 I originally wanted to copy the relevant library file bin file to the zynq to run, but when copying, I said that there is not enough space. I don't know why. I can't execute the tesseract command under the mount directory, thinking that I can't find it. To the library file, even if I specify the LD_LIBRARY macro, it is not allowed, so I tried to get the relevant lib and bin into the root file system, and then made the image file, replacing the ramdisk8M, but it could not be executed at all. . This is a pinch. . Record it. .
/usr/local/share # cp /mnt/tesseract-3.01/bin/* /usr/local/bin/
/usr/local/share # cp /mnt/tesseract-3.01/lib/* /usr/local/lib/
Cp: ​​write error: No space left on device
Cp: ​​write error: No space left on device
/usr/local/share #
----------------------------------Dividing line-------------- -------------------------
I recorded what I did, and I have already cross-compiled leptonica and tesseract, and I am waiting to run it on the board. It is not right. It turned out that when I found the relevant documents on the board, I was told that there was not enough space. I said that I was a newbie. After all, I still didn’t understand enough, but fortunately, other people in the group understood it because it was made by the root file system. At that time, there was only 8M of space, and those library files were not small, so they couldn’t be played, so the group’s Yang Ping students made a 1G image file. The boot started from the root file system of ramdisk8M. The front is almost the same. The number of devices, when the kernel is correctly booted to execute the contents of the rcS file, the code is added, meaning that the 1G file system is mounted, and then the root directory is switched, which is equivalent to the entire running space being 1G space, not the one. 8M space is up. But why not just start the ramdisk8M directly, just use this 1G directly, because we do not know how to produce the correct U-boot! The boot work is handed over to u-boot. As for the boot.bin, it only contains the configuration of PS and PL. The second experiment like the book attachment CD only provides boot.bin, which is not a big reference, because everyone's PS, PL needs different configurations. All of the above are the work done by Yang Ping, haha
Now that the space is enough, first mount the SD card under /mnt/sdcard/ in the virtual machine: mount -t vboxsf SDCARD /mnt/sdcard
Then mount the 1G image in the sd card: mount /mnt/sdcard/disk1G.img /mnt/disk1g
Then copy the relevant files compiled with the cross tool on the virtual, in fact, copy the bin and lib files can be executed. Of course, you need to include the leptonica library file in order to properly call the relevant image processing library functions.
Cp /home/gzw/install/tesseract-3.01/lib/* /mnt/disk1g/usr/local/lib/
Cp /home/gzw/install/tesseract-3.01/bin/* /mnt/disk1g/usr/local/bin/
Also need to copy eng.traineddata to /mnt/disk1g/usr//local/share/tessdata/
Don't think that copying these is enough, not enough. In the implementation of the error message that did not find the libstdc++.so library or something, I found the location of this file, found in our cross-compilation tool arm-xilin-linux -gnueabi/libc /usr/lib Below, the following things in this folder are not copyable when making the root file system, so you need to copy them here.
After the copy is completed, insert the SD card into the ZED. After the boot is successful, switch to /usr/local/bin under the execution directory, then export LD_LIBRARAY_PATH=/usr/local/bin to specify the library file, export TESSDATA_PREFIX= /usr/local/share/ Specify the database file, which is actually the font library. The English font used for my experiment, the image to be recognized is also a picture that is all in English.
Then execute ./tesseract phototest.bmp phototest.txt -l eng, no prompt error indicates successful output, and generate a txt file in the current directory, which is the result of the recognition, use cat to display the file content, as follows:
This is a lot of 12 point text to test the
Ocr code and see if it works on all types
Of file format.
The quick brown dog jumped over the
Lazy fox. The quick brown dog jumped
Over the lazy fox. The quick brown dog
Jumped over the lazy fox. The quick
Brown dog jumped over the lazy fox.
The original picture is the above picture in English.
Description: When I execute the program, I often find that I can't find related image processing functions. These functions should need some image processing libraries. I only copied the leptonica library and the tesseract library, and the tesseract bin, and finally found only Can identify and process bmp images. To process other images, you only need to copy those image processing libraries. Although leptonica should contain a lot of image processing, it should be enough, and the results are not enough. Hehe, the amount of Oh.
Just perform the output, mark it. . . .
[ 0.800000] xusbps-ehci xusbps-ehci.0: irq 53, io mem 0x00000000
[ 0.830000] xusbps-ehci xusbps-ehci.0: USB 2.0 started, EHCI 1.00
[ 0.830000] hub 1-0:1.0: USB hub found
[ 0.830000] hub 1-0: 1.0: 1 port detected
[ 0.840000] IniTIalizing USB Mass Storage driver...
[ 0.840000] usbcore: registered new interface driver usb-storage
[ 0.850000] USB Mass Storage support registered.
[ 0.850000] Xilinx PS USB Device Controller driver (Apr 01, 2011)
[ 0.860000] mousedev: PS/2 mouse device common for all mice
[ 0.860000] Linux video capture interface: v2.00
[ 0.870000] gspca_main: v2.14.0 registered
[ 0.870000] usbcore: registered new interface driver uvcvideo
[ 0.870000] USB Video Class driver (1.1.1)
[ 0.880000] WDT OF probe
[ 0.880000] xwdtps f8005000.swdt: Xilinx Watchdog TImer at 0xe081c000 with TI
Meout 10 seconds
[ 0.890000] sdhci: Secure Digital Host Controller Interface driver
[ 0.890000] sdhci: Copyright(c) Pierre Ossman
[ 0.900000] sdhci-pltfm: SDHCI platform and OF driver helper
[ 0.900000] mmc0: Invalid maximum block size, assuming 512 bytes
[ 0.910000] mmc0: SDHCI controller on e0100000.sdhci [e0100000.sdhci] using A
DMA
[ 0.910000] usbcore: registered new interface driver usbhid
[ 0.920000] usbhid: USB HID core driver
[ 0.920000] No attached reported connected with modes
[ 0.930000] adv7511 0-0039: Failed to add route DAI IN->TMDS
[ 0.940000] [drm] Cannot find any crtc or sizes - going 1024x768
[ 0.940000] drivers/gpu/drm/analog/analog_drm_fbdev.c:analog_drm_fbdev_probe[
241]
[ 0.960000] asoc: adv7511 <-> 75c00000.axi-spdif-tx mapping ok
[ 0.970000] axi-spdif 75c00000.axi-spdif-tx: Failed to set DAI format: -22
[ 0.970000] Console: switching to colour frame buffer device 128x48
[ 0.980000] ALSA device list:
[ 0.980000] #0: HDMI monitor
[ 0.980000] TCP cubic registered
[ 0.980000] NET: Registered protocol family 17
[ 0.980000] VFP support v0.3: implementor 41 architecture 3 part 30 variant 9
Rev 4
[ 0.980000] Registering SWP/SWPB emulaTIon handler
[ 0.980000] registered taskstats version 1
[ 0.980000] drivers/rtc/hctosys.c: unable to open rtc device (rtc0)
[ 1.000000] mmc0: new high speed SDHC card at address b368
[ 1.000000] mmcblk0: mmc0:b368 F0F0F 3.71 GiB
[ 1.030000] mmcblk0: p1
[ 1.030000] fb0: frame buffer device
[ 1.030000] drm: registered panic notifier
[ 1.030000] [drm] Initialized analog_drm 1.0.0 20110530 on minor 0
[ 1.050000] RAMDISK: gzip image found at block 0
[ 1.480000] EXT4-fs (ram0): warning: mounting unchecked fs, running e2fsck is
Recommended
[ 1.490000] EXT4-fs (ram0): mounted filesystem without journal. Opts: (null)
[ 1.500000] VFS: Mounted root (ext4 filesystem) on device 1:0.
[ 1.500000] Freeing init memory: 152K
Starting rcS...
++ Mounting filesystem (0)
++ Setting up mdev (0)
++ Changing root filesystem
[1.830000] EXT4-fs (loop0): mounting ext2 file system using the ext4 subsyst
Em
[1.850000] EXT4-fs (loop0): warning: mounting unchecked fs, running e2fsck i
s recommended
[1.850000] EXT4-fs (loop0): mounted filesystem without journal. Opts: (null)
++ Mounting filesystem (1)
++ Setting up mdev (1)
++ Starting telnet daemon
++ Starting http daemon
++ Starting ftp daemon
++ Starting dropbear (ssh) daemon
/bin/sh: can't access tty; job control turned off
/ # export LD_LIBRARY_PATH=/usr/local/lib
/ # export TESSDATA_PREFIX=/usr/local/share/
/ # cd /usr/local/bin
/usr/local/bin # ls
1.out.txt phototest.GIF tesseract
Cntraining phototest.JPG unicharset_extractor
Combine_tessdata phototest.PNG wordlist2dawg
Eng.traineddata phototest.bmp
Mftraining phototest.tif
/usr/local/bin # ./tesseract phototest.bmp phototest.txt -l eng
Tesseract Open Source OCR Engine v3.01 with Leptonica
/usr/local/bin # cat phototest.txt
Cat: can't open 'phototest.txt': No such file or directory
/usr/local/bin # ls
1.out.txt phototest.GIF phototest.txt.txt
Cntraining phototest.JPG tesseract
Combine_tessdata phototest.PNG unicharset_extractor
Eng.traineddata phototest.bmp wordlist2dawg
Mftraining phototest.tif
/usr/local/bin # cat phototest.txt.txt
This is a lot of 12 point text to test the
Ocr code and see if it works on all types
Of file format.
The quick brown dog jumped over the
Lazy fox. The quick brown dog jumped
Over the lazy fox. The quick brown dog
Jumped over the lazy fox. The quick
Brown dog jumped over the lazy fox.
ac ev charger,ev charging station,dc ev charger,22kw ev charger,portable ev charger station
Shenzhen Jiesai Electric Co.,Ltd , https://www.gootuenergy.com