AYUDA EN LÍNEA
 WINDEVWEBDEV Y WINDEV MOBILE

Este contenido se ha traducido automáticamente.  Haga clic aquí  para ver la versión en inglés.
Ayuda / WLanguage / Funciones WLanguage / Funciones estándar / Funciones PDF
  • Conversion from PDF to text
  • Special cases
WINDEV
WindowsLinuxUniversal Windows 10 AppJavaReportes y ConsultasCódigo de Usuario (UMC)
WEBDEV
WindowsLinuxPHPWEBDEV - Código Navegador
WINDEV Mobile
AndroidWidget Android iPhone/iPadIOS WidgetApple WatchMac CatalystUniversal Windows 10 App
Otros
Procedimientos almacenados
Extract the text contained in an Document PDF.
Ejemplo
MaChaîne is string
MaChaîne = PDFToText("C:\Temp\MonDocument.pdf")
// Affichage dans un champ de saisie multiligne
SAI_SaisieTexteMulti = MaChaîne
Nueva funcionalidad versión 2024
WINDEV
MonPDF is pdfDocument = PDFOpen("test.pdf")
MaChaîne = PDFToText(MonPDF, "1-2")
// Affichage dans un champ de saisie multiligne
SAI_SaisieTexteMulti = MaChaîne
Sintaxis

Extract the content of a PDF file identified by its path Ocultar los detalles

<Result> = PDFToText(<PDF file> [, <Pages to extract> [, <Password> [, <Options>]]])
<Result>: Character string
Text of PDF file.
<PDF file>: Character string
Name and path of PDF file to analyze.
<Pages to extract>: Optional character string
Range of pages the text must be extracted form. The format used is identical to the one used in the standard printout boxes: individual page numbers of range of pages separated by semi-colons. For example, "1;3;4;6-10;12" means that pages 1, 3, 4, 6 to 10, and 12 will be processed.
If this parameter is not specified or if it corresponds to an empty string (""), all the pages are extracted.
<Password>: Optional character string
Password required to open the file if the PDF file is password protected.
<Options>: Integer constant
Text splitting mode:
pttCompatibleSplit PDF text using the algorithm from versions 24 and earlier.
pttDefault
(Default value)
Split PDF text using an optimized algorithm. This splitting may be different from previous versions.
Observaciones

Conversion from PDF to text

  • The formatting of the document is lost when the conversion is performed from PDF to text.
  • The text is extracted in the order or appearance of the PDF commands and it is sequentially written into the result string. The organization of the text in paragraphs and in blocks is kept (as well as the CR characters).
  • The Unicode characters are not returned.
  • The data found in a PDF form is not extracted (this data is not stored in the PDF file).

Special cases

  • PDFIsProtected is used to find out whether a password is required to open a PDF file.
  • PDFNumberOfPages returns the total number of pages found in a PDF file.
Clasificación Lógica de negocio / UI: Lógica de negocio
Componente: wd290wdpdf.dll
Versión mínima requerida
  • Versión 14
Esta página también está disponible para…
Comentarios
Haga clic en [Agregar] para publicar un comentario

Última modificación: 16/12/2023

Señalar un error o enviar una sugerencia | Ayuda local