Class TurFileUtils
Object
TurFileUtils
Field Summary
Fields
CONNECTION_TIMEOUT_MILLIS
HEAD
PDF_DOC_INFO_TITLE
TMP
Method Summary
Modifier and Type    Method    Description
static TurFileAttributes    documentToText(MultipartFile multipartFile)
static Optional<String>    getFileContent(InputStream stream, BodyContentHandler handlerInner, AutoDetectParser parserInner, Metadata metadataInner, ParseContext parseContextInner)
parseDocument(InputStream stream)
static TurTikaFileAttributes    parseFile
static TurTikaFileAttributes    parseFile(MultipartFile multipartFile)
static TurTikaFileAttributes    readFile
static TurTikaFileAttributes    readFile
static TurFileAttributes    urlContentToText(URL url)
Field Details
PDF_DOC_INFO_TITLE
CONNECTION_TIMEOUT_MILLIS
public static final int CONNECTION_TIMEOUT_MILLIS
TMP
HEAD
Method Details
readFile
readFile
parseFile
parseFile
documentToText
urlContentToText
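As an illustration of how a method like urlContentToText(URL url) might bound its network wait, the sketch below opens a URL with explicit connect and read timeouts and returns the body as text. It is an assumption that CONNECTION_TIMEOUT_MILLIS is used this way; the class name UrlTextSketch, the helpers fetchText and roundTrip, and the 5000 ms value are hypothetical and not part of TurFileUtils. The example reads a temporary file through a file: URL so it runs without network access.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.net.URL;
import java.net.URLConnection;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

public class UrlTextSketch {

    // Assumed value for illustration only; the real constant's value is not shown on this page.
    static final int TIMEOUT_MILLIS = 5000;

    // Opens the URL with bounded connect and read timeouts and decodes the body as UTF-8.
    static String fetchText(URL url) throws IOException {
        URLConnection connection = url.openConnection();
        connection.setConnectTimeout(TIMEOUT_MILLIS);
        connection.setReadTimeout(TIMEOUT_MILLIS);
        try (InputStream in = connection.getInputStream()) {
            return new String(in.readAllBytes(), StandardCharsets.UTF_8);
        }
    }

    // Writes a temporary file and reads it back through a file: URL, so no network is needed.
    static String roundTrip(String content) {
        try {
            Path tmp = Files.createTempFile("url-text-sketch", ".txt");
            try {
                Files.write(tmp, content.getBytes(StandardCharsets.UTF_8));
                return fetchText(tmp.toUri().toURL());
            } finally {
                Files.deleteIfExists(tmp);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(roundTrip("sample text"));
    }
}
```

Setting both timeouts keeps a slow or unresponsive host from blocking the caller indefinitely; for an http or https URL the same two calls apply unchanged.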
parseDocument
Throws:
IOException
getFileContent
public static Optional<String> getFileContent(InputStream stream, BodyContentHandler handlerInner, AutoDetectParser parserInner, Metadata metadataInner, ParseContext parseContextInner) throws IOException
Throws:
IOException
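Because getFileContent returns an Optional<String> and declares IOException, a caller has to handle both an absent result and a read failure. The following is a minimal sketch of that calling pattern only. The class FileContentSketch and the methods readText and describe are hypothetical; readText is a stand-in that merely decodes UTF-8 text, and it leaves out the BodyContentHandler, AutoDetectParser, Metadata and ParseContext arguments so that the example runs with the JDK alone. It does not reproduce the TurFileUtils implementation.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.Optional;

public class FileContentSketch {

    // Hypothetical stand-in for getFileContent: same return type and checked
    // exception, but it only decodes UTF-8 text instead of parsing documents.
    static Optional<String> readText(InputStream stream) throws IOException {
        String text = new String(stream.readAllBytes(), StandardCharsets.UTF_8).trim();
        return text.isEmpty() ? Optional.empty() : Optional.of(text);
    }

    // Caller side: handle the empty Optional and the IOException explicitly.
    static String describe(byte[] data) {
        try (InputStream stream = new ByteArrayInputStream(data)) {
            return readText(stream)
                    .map(text -> "content: " + text)
                    .orElse("no content");
        } catch (IOException e) {
            return "read failed: " + e.getMessage();
        }
    }

    public static void main(String[] args) {
        System.out.println(describe("hello".getBytes(StandardCharsets.UTF_8)));
        System.out.println(describe(new byte[0]));
    }
}
```

Mapping the Optional instead of calling get() keeps the empty case explicit, and the try-with-resources block closes the stream whether or not reading succeeds.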