{"_id":"pdf-parse","_rev":"3794419","name":"pdf-parse","description":"Pure javascript cross-platform module to extract text from PDFs.","dist-tags":{"latest":"1.1.1"},"maintainers":[{"name":"mehmet.kozan","email":""}],"time":{"modified":"2025-06-05T10:28:51.000Z","created":"2018-01-07T09:25:31.378Z","1.1.1":"2018-10-24T15:28:45.246Z","1.1.0":"2018-10-24T07:46:39.958Z","1.0.9":"2018-10-21T20:58:09.618Z","1.0.8":"2018-01-11T08:30:11.859Z","1.0.7":"2018-01-09T17:12:41.633Z","1.0.6":"2018-01-09T16:49:56.084Z","1.0.5":"2018-01-08T14:41:01.912Z","1.0.3":"2018-01-08T09:23:01.112Z","1.0.2":"2018-01-08T09:00:40.388Z","1.0.1":"2018-01-07T16:35:46.345Z","0.0.1":"2018-01-07T09:25:31.378Z"},"users":{},"author":{"name":"Mehmet Kozan","email":"mehmet.kozan@live.com"},"repository":{"type":"git","url":"git+https://gitlab.com/autokent/pdf-parse.git"},"versions":{"1.1.1":{"name":"pdf-parse","version":"1.1.1","description":"Pure javascript cross-platform module to extract text from PDFs.","main":"index.js","keywords":["pdf-parse","pdf-crawler","xpdf","pdf.js","pdfreader","pdf-extractor","pdf2json","j-pdfjson","pdf-parser","pdf-extract","pdf-extractor","pdf-to-text","pdf-text-extract","pdfjs","server side PDF parsing","pdf metadata"],"dependencies":{"debug":"^3.1.0","node-ensure":"^0.0.0"},"devDependencies":{"mocha":"^4.0.1"},"scripts":{"test":"node node_modules/mocha/bin/_mocha --recursive --slow 10000","start":"node index.js"},"homepage":"https://gitlab.com/autokent/pdf-parse","bugs":{"url":"https://gitlab.com/autokent/pdf-parse/issues","email":"mehmet.kozan@live.com"},"repository":{"type":"git","url":"git+https://gitlab.com/autokent/pdf-parse.git"},"author":{"name":"Mehmet Kozan","email":"mehmet.kozan@live.com"},"license":"MIT","engines":{"node":">=6.8.1"},"gitHead":"17da5108eda19454dd8a990f58cb6b4efa2b1bde","_id":"pdf-parse@1.1.1","_npmVersion":"5.5.1","_nodeVersion":"8.9.3","_npmUser":{"name":"mehmet.kozan","email":"mehmet.kozan@live.com"},"dist":{"shasum":"745e07408679548b3995ff896fd38e96e19d14a7","size":11852761,"noattachment":false,"key":"/pdf-parse/-/pdf-parse-1.1.1.tgz","tarball":"http://registry.cnpm.dingdandao.com/pdf-parse/download/pdf-parse-1.1.1.tgz"},"maintainers":[{"name":"mehmet.kozan","email":""}],"directories":{},"_npmOperationalInternal":{"host":"s3://npm-registry-packages","tmp":"tmp/pdf-parse_1.1.1_1540394924756_0.7319540936027915"},"_hasShrinkwrap":false,"publish_time":1540394925246,"_cnpm_publish_time":1540394925246,"_cnpmcore_publish_time":"2021-12-16T22:59:25.179Z"},"1.1.0":{"name":"pdf-parse","version":"1.1.0","description":"Pure javascript cross-platform module to extract text from PDFs.","main":"index.js","keywords":["pdf-parse","pdf-crawler","xpdf","pdf.js","pdfreader","pdf-extractor","pdf2json","j-pdfjson","pdf-parser","pdf-extract","pdf-extractor","pdf-to-text","pdf-text-extract","pdfjs","server side PDF parsing","pdf metadata"],"dependencies":{"debug":"^3.1.0","node-ensure":"^0.0.0"},"devDependencies":{"mocha":"^4.0.1"},"scripts":{"test":"node node_modules/mocha/bin/_mocha --recursive --slow 10000","start":"node index.js"},"homepage":"https://gitlab.com/autokent/pdf-parse","bugs":{"url":"https://gitlab.com/autokent/pdf-parse/issues","email":"mehmet.kozan@live.com"},"repository":{"type":"git","url":"git+https://gitlab.com/autokent/pdf-parse.git"},"author":{"name":"Mehmet Kozan","email":"mehmet.kozan@live.com"},"license":"MIT","engines":{"node":">=6.8.1"},"gitHead":"8c4661b32721c4fe1ef25cf5bc7b6a674dde975d","_id":"pdf-parse@1.1.0","_npmVersion":"5.5.1","_nodeVersion":"8.9.3","_npmUser":{"name":"mehmet.kozan","email":"mehmet.kozan@live.com"},"dist":{"shasum":"e888e6fd787f726e6259bcd4befdc547c63c70aa","size":11075575,"noattachment":false,"key":"/pdf-parse/-/pdf-parse-1.1.0.tgz","tarball":"http://registry.cnpm.dingdandao.com/pdf-parse/download/pdf-parse-1.1.0.tgz"},"maintainers":[{"name":"mehmet.kozan","email":""}],"directories":{},"_npmOperationalInternal":{"host":"s3://npm-registry-packages","tmp":"tmp/pdf-parse_1.1.0_1540367199482_0.18957200870792335"},"_hasShrinkwrap":false,"publish_time":1540367199958,"_cnpm_publish_time":1540367199958,"_cnpmcore_publish_time":"2021-12-16T22:59:50.016Z"},"1.0.9":{"name":"pdf-parse","version":"1.0.9","description":"Pure javascript cross-platform module to extract text from PDFs.","main":"index.js","keywords":["pdf-parse","pdf-crawler","xpdf","pdf.js","pdfreader","pdf-extractor","pdf2json","j-pdfjson","pdf-parser","pdf-extract","pdf-extractor","pdf-to-text","pdf-text-extract","pdfjs","server side PDF parsing","pdf metadata"],"dependencies":{"debug":"^3.1.0","node-ensure":"^0.0.0"},"devDependencies":{"mocha":"^4.0.1"},"scripts":{"test":"node node_modules/mocha/bin/_mocha --recursive --slow 10000","start":"node index.js"},"homepage":"https://gitlab.com/autokent/pdf-parse","bugs":{"url":"https://gitlab.com/autokent/pdf-parse/issues","email":"mehmet.kozan@live.com"},"repository":{"type":"git","url":"git+https://gitlab.com/autokent/pdf-parse.git"},"author":{"name":"Mehmet Kozan","email":"mehmet.kozan@live.com"},"license":"MIT","engines":{"node":">=6.8.1"},"gitHead":"58202bf00d6254d845708fc533dc18a973010b86","_id":"pdf-parse@1.0.9","_npmVersion":"5.5.1","_nodeVersion":"8.9.3","_npmUser":{"name":"mehmet.kozan","email":"mehmet.kozan@live.com"},"dist":{"shasum":"fdd56f159695cf6edfdb07fa67f714460b606d4e","size":11075541,"noattachment":false,"key":"/pdf-parse/-/pdf-parse-1.0.9.tgz","tarball":"http://registry.cnpm.dingdandao.com/pdf-parse/download/pdf-parse-1.0.9.tgz"},"maintainers":[{"name":"mehmet.kozan","email":""}],"directories":{},"_npmOperationalInternal":{"host":"s3://npm-registry-packages","tmp":"tmp/pdf-parse_1.0.9_1540155489195_0.5379410971895571"},"_hasShrinkwrap":false,"publish_time":1540155489618,"_cnpm_publish_time":1540155489618,"_cnpmcore_publish_time":"2021-12-16T23:00:12.466Z"},"1.0.8":{"name":"pdf-parse","version":"1.0.8","description":"Pure javascript cross-platform module to extract text from PDFs.","main":"index.js","keywords":["pdf-parse","pdf-crawler","xpdf","pdf.js","pdfreader","pdf-extractor","pdf2json","j-pdfjson","pdf-parser","pdf-extract","pdf-extractor","pdf-to-text","pdf-text-extract","pdfjs","server side PDF parsing","pdf metadata"],"dependencies":{"debug":"^3.1.0","node-ensure":"^0.0.0"},"devDependencies":{"mocha":"^4.0.1"},"scripts":{"test":"node node_modules/mocha/bin/_mocha --recursive --slow 10000","start":"node index.js"},"homepage":"https://gitlab.com/autokent/pdf-parse","bugs":{"url":"https://gitlab.com/autokent/pdf-parse/issues","email":"mehmet.kozan@live.com"},"repository":{"type":"git","url":"git+https://gitlab.com/autokent/pdf-parse.git"},"author":{"name":"Mehmet Kozan","email":"mehmet.kozan@live.com"},"license":"MIT","engines":{"node":">=6.8.1"},"gitHead":"8db3df41725e4e92ffdd1c8f3a4c6c91d18da2bd","_id":"pdf-parse@1.0.8","_npmVersion":"5.5.1","_nodeVersion":"8.9.3","_npmUser":{"name":"mehmet.kozan","email":"mehmet.kozan@live.com"},"dist":{"shasum":"da50ad4a1eed17486c7ab304ab8b7c60b5788e39","size":10897636,"noattachment":false,"key":"/pdf-parse/-/pdf-parse-1.0.8.tgz","tarball":"http://registry.cnpm.dingdandao.com/pdf-parse/download/pdf-parse-1.0.8.tgz"},"maintainers":[{"name":"mehmet.kozan","email":""}],"_npmOperationalInternal":{"host":"s3://npm-registry-packages","tmp":"tmp/pdf-parse-1.0.8.tgz_1515659407572_0.10363834723830223"},"directories":{},"publish_time":1515659411859,"_hasShrinkwrap":false,"_cnpm_publish_time":1515659411859,"_cnpmcore_publish_time":"2021-12-16T23:00:37.158Z"},"1.0.7":{"name":"pdf-parse","version":"1.0.7","description":"Pure javascript cross-platform module to extract text from PDFs.","main":"index.js","keywords":["pdf-parse","pdf-crawler","xpdf","pdf.js","pdfreader","pdf-extractor","pdf2json","j-pdfjson","pdf-parser","pdf-extract","pdf-extractor","pdf-to-text","pdf-text-extract","pdfjs","server side PDF parsing","pdf metadata"],"dependencies":{"debug":"^3.1.0","node-ensure":"^0.0.0","co":"^4.6.0"},"devDependencies":{"mocha":"^4.0.1"},"scripts":{"test":"node node_modules/mocha/bin/_mocha --recursive --slow 10000","start":"node index.js"},"homepage":"https://gitlab.com/autokent/pdf-parse","bugs":{"url":"https://gitlab.com/autokent/pdf-parse/issues","email":"mehmet.kozan@live.com"},"repository":{"type":"git","url":"git+https://gitlab.com/autokent/pdf-parse.git"},"author":{"name":"Mehmet Kozan","email":"mehmet.kozan@live.com"},"license":"MIT","engines":{"node":">=6.8.1"},"gitHead":"43e7d698826fa2c66531fb1be79d61307907c56f","_id":"pdf-parse@1.0.7","_npmVersion":"5.5.1","_nodeVersion":"8.9.3","_npmUser":{"name":"mehmet.kozan","email":"mehmet.kozan@live.com"},"dist":{"shasum":"32919661595e4f06fad81843b918164dbe0d17e1","size":10897672,"noattachment":false,"key":"/pdf-parse/-/pdf-parse-1.0.7.tgz","tarball":"http://registry.cnpm.dingdandao.com/pdf-parse/download/pdf-parse-1.0.7.tgz"},"maintainers":[{"name":"mehmet.kozan","email":""}],"_npmOperationalInternal":{"host":"s3://npm-registry-packages","tmp":"tmp/pdf-parse-1.0.7.tgz_1515517958613_0.7706344500184059"},"directories":{},"publish_time":1515517961633,"_hasShrinkwrap":false,"_cnpm_publish_time":1515517961633,"_cnpmcore_publish_time":"2021-12-16T23:00:58.911Z"},"1.0.6":{"name":"pdf-parse","version":"1.0.6","description":"Pure javascript cross-platform module to extract text from PDFs.","main":"index.js","keywords":["pdf-parse","pdf-crawler","xpdf","pdf.js","pdfreader","pdf-extractor","pdf2json","j-pdfjson","pdf-parser","pdf-extract","pdf-extractor","pdf-to-text","pdf-text-extract","pdfjs","server side PDF parsing","pdf metadata"],"dependencies":{"debug":"^3.1.0","node-ensure":"^0.0.0","co":"^4.6.0"},"devDependencies":{"mocha":"^4.0.1"},"scripts":{"test":"node node_modules/mocha/bin/_mocha --recursive --slow 10000","start":"node index.js"},"homepage":"https://gitlab.com/autokent/pdf-parse","bugs":{"url":"https://gitlab.com/autokent/pdf-parse/issues","email":"mehmet.kozan@live.com"},"repository":{"type":"git","url":"git+https://gitlab.com/autokent/pdf-parse.git"},"author":{"name":"Mehmet Kozan","email":"mehmet.kozan@live.com"},"license":"MIT","engines":{"node":">=6.8.1"},"gitHead":"dc27acd4cfe16078de93896fd398bdb58f4fd7f9","_id":"pdf-parse@1.0.6","_npmVersion":"5.5.1","_nodeVersion":"8.9.3","_npmUser":{"name":"mehmet.kozan","email":"mehmet.kozan@live.com"},"dist":{"shasum":"0fefb41a686ed0f3d90bac79f292b361fdc1c6e5","size":10897624,"noattachment":false,"key":"/pdf-parse/-/pdf-parse-1.0.6.tgz","tarball":"http://registry.cnpm.dingdandao.com/pdf-parse/download/pdf-parse-1.0.6.tgz"},"maintainers":[{"name":"mehmet.kozan","email":""}],"_npmOperationalInternal":{"host":"s3://npm-registry-packages","tmp":"tmp/pdf-parse-1.0.6.tgz_1515516593792_0.02539448789320886"},"directories":{},"publish_time":1515516596084,"_hasShrinkwrap":false,"_cnpm_publish_time":1515516596084,"_cnpmcore_publish_time":"2021-12-16T23:01:20.032Z"},"1.0.5":{"name":"pdf-parse","version":"1.0.5","description":"Pure javascript cross-platform module to extract text from PDFs.","main":"index.js","keywords":["pdf-parse","pdf-crawler","xpdf","pdf.js","pdfreader","pdf-extractor","pdf2json","j-pdfjson","pdf-parser","pdf-extract","pdf-extractor","pdf-to-text","pdf-text-extract","pdfjs","server side PDF parsing","pdf metadata"],"dependencies":{"debug":"^3.1.0","node-ensure":"^0.0.0","co":"^4.6.0"},"devDependencies":{"mocha":"^4.0.1"},"scripts":{"test":"node node_modules/mocha/bin/_mocha --recursive --slow 10000","start":"node index.js"},"homepage":"https://gitlab.com/autokent/pdf-parse","bugs":{"url":"https://gitlab.com/autokent/pdf-parse/issues","email":"mehmet.kozan@live.com"},"repository":{"type":"git","url":"git+https://gitlab.com/autokent/pdf-parse.git"},"author":{"name":"Mehmet Kozan","email":"mehmet.kozan@live.com"},"license":"MIT","engines":{"node":">=6.8.1"},"gitHead":"64403027d3c2911da9f5f1fec66b6a4027dfacfa","_id":"pdf-parse@1.0.5","_npmVersion":"5.5.1","_nodeVersion":"8.9.3","_npmUser":{"name":"mehmet.kozan","email":"mehmet.kozan@live.com"},"dist":{"shasum":"dae935b965abc84e1965403073cfb67a1e0ef2f8","size":10897644,"noattachment":false,"key":"/pdf-parse/-/pdf-parse-1.0.5.tgz","tarball":"http://registry.cnpm.dingdandao.com/pdf-parse/download/pdf-parse-1.0.5.tgz"},"maintainers":[{"name":"mehmet.kozan","email":""}],"_npmOperationalInternal":{"host":"s3://npm-registry-packages","tmp":"tmp/pdf-parse-1.0.5.tgz_1515422461027_0.1595281409099698"},"directories":{},"publish_time":1515422461912,"_hasShrinkwrap":false,"_cnpm_publish_time":1515422461912,"_cnpmcore_publish_time":"2021-12-16T23:01:38.126Z"},"1.0.3":{"name":"pdf-parse","version":"1.0.3","description":"Pure javascript cross-platform module to extract text from PDFs.","main":"index.js","keywords":["pdf-parse","pdf-crawler","xpdf","pdf.js","pdfreader","pdf-extractor","pdf2json","j-pdfjson","pdf-parser","pdf-extract","pdf-extractor","pdf-to-text","pdf-text-extract","pdfjs","server side PDF parsing","pdf metadata"],"dependencies":{"debug":"^3.1.0","node-ensure":"^0.0.0","co":"^4.6.0"},"devDependencies":{"mocha":"^4.0.1"},"scripts":{"test":"node node_modules/mocha/bin/_mocha --recursive --slow 10000","start":"node index.js"},"homepage":"https://gitlab.com/autokent/pdf-parse","bugs":{"url":"https://gitlab.com/autokent/pdf-parse/issues","email":"mehmet.kozan@live.com"},"repository":{"type":"git","url":"git+https://gitlab.com/autokent/pdf-parse.git"},"author":{"name":"Mehmet Kozan","email":"mehmet.kozan@live.com"},"license":"MIT","engines":{"node":">=6.8.1"},"gitHead":"84e8dac627381b8ab6838f5f2443ba05317ef8e6","_id":"pdf-parse@1.0.3","_npmVersion":"5.5.1","_nodeVersion":"8.9.3","_npmUser":{"name":"mehmet.kozan","email":"mehmet.kozan@live.com"},"dist":{"shasum":"461e55bf16dc77cb51cdbb3009f6ff751610287b","size":10824115,"noattachment":false,"key":"/pdf-parse/-/pdf-parse-1.0.3.tgz","tarball":"http://registry.cnpm.dingdandao.com/pdf-parse/download/pdf-parse-1.0.3.tgz"},"maintainers":[{"name":"mehmet.kozan","email":""}],"_npmOperationalInternal":{"host":"s3://npm-registry-packages","tmp":"tmp/pdf-parse-1.0.3.tgz_1515403378192_0.47081224201247096"},"directories":{},"publish_time":1515403381112,"_hasShrinkwrap":false,"_cnpm_publish_time":1515403381112,"_cnpmcore_publish_time":"2021-12-16T23:02:00.481Z"},"1.0.2":{"name":"pdf-parse","version":"1.0.2","description":"Pure javascript cross-platform module to extract text from PDFs.","main":"index.js","keywords":["pdf-parse","pdf-crawler","xpdf","pdf.js","pdfreader","pdf-extractor","pdf2json","j-pdfjson","pdf-parser","pdf-extract","pdf-extractor","pdf-to-text","pdf-text-extract","pdfjs","server side PDF parsing","pdf metadata"],"dependencies":{"debug":"^3.1.0","node-ensure":"^0.0.0","co":"^4.6.0"},"devDependencies":{"mocha":"^4.0.1"},"scripts":{"test":"node node_modules/mocha/bin/_mocha --recursive --slow 10000","start":"node index.js"},"homepage":"https://gitlab.com/autokent/pdf-parse","bugs":{"url":"https://gitlab.com/autokent/pdf-parse/issues","email":"mehmet.kozan@live.com"},"repository":{"type":"git","url":"git+https://gitlab.com/autokent/pdf-parse.git"},"author":{"name":"Mehmet Kozan","email":"mehmet.kozan@live.com"},"license":"MIT","engines":{"node":">=6.8.1"},"gitHead":"90904458e74b4408458a4bc152af3577401ba007","_id":"pdf-parse@1.0.2","_npmVersion":"5.5.1","_nodeVersion":"8.9.3","_npmUser":{"name":"mehmet.kozan","email":"mehmet.kozan@live.com"},"dist":{"shasum":"ac03604dd2f1fee39952d6725303d3b256d136c4","size":10824002,"noattachment":false,"key":"/pdf-parse/-/pdf-parse-1.0.2.tgz","tarball":"http://registry.cnpm.dingdandao.com/pdf-parse/download/pdf-parse-1.0.2.tgz"},"maintainers":[{"name":"mehmet.kozan","email":""}],"_npmOperationalInternal":{"host":"s3://npm-registry-packages","tmp":"tmp/pdf-parse-1.0.2.tgz_1515402037757_0.056794481351971626"},"directories":{},"publish_time":1515402040388,"_hasShrinkwrap":false,"_cnpm_publish_time":1515402040388,"_cnpmcore_publish_time":"2021-12-16T23:02:24.719Z"},"1.0.1":{"name":"pdf-parse","version":"1.0.1","description":"Pure javascript cross-platform PDF parser.","main":"index.js","keywords":["pdf-parse","pdf-crawler","xpdf","pdf.js","pdfreader","pdf-extractor","pdf2json","j-pdfjson","pdf-parser","pdf-extract","pdf-extractor","pdf-to-text","pdf-text-extract","pdfjs","server side PDF parsing","pdf metadata"],"dependencies":{"debug":"^3.1.0","node-ensure":"^0.0.0","co":"^4.6.0"},"devDependencies":{"mocha":"^4.0.1"},"scripts":{"test":"node node_modules/mocha/bin/_mocha --recursive --slow 10000","start":"node index.js"},"homepage":"https://gitlab.com/autokent/pdf-parse","bugs":{"url":"https://gitlab.com/autokent/pdf-parse/issues","email":"mehmet.kozan@live.com"},"repository":{"type":"git","url":"git+https://gitlab.com/autokent/pdf-parse.git"},"author":{"name":"Mehmet Kozan","email":"mehmet.kozan@live.com"},"license":"MIT","engines":{"node":">=6.8.1"},"gitHead":"2ff57b51b605cd666d5fed3b32ad11916f4c28ca","_id":"pdf-parse@1.0.1","_npmVersion":"5.5.1","_nodeVersion":"8.9.3","_npmUser":{"name":"mehmet.kozan","email":"mehmet.kozan@live.com"},"dist":{"shasum":"e5b25a7c303d5ea4ced13e6aabeb5fc9a857d3fa","size":10823784,"noattachment":false,"key":"/pdf-parse/-/pdf-parse-1.0.1.tgz","tarball":"http://registry.cnpm.dingdandao.com/pdf-parse/download/pdf-parse-1.0.1.tgz"},"maintainers":[{"name":"mehmet.kozan","email":""}],"_npmOperationalInternal":{"host":"s3://npm-registry-packages","tmp":"tmp/pdf-parse-1.0.1.tgz_1515342946011_0.09028392238542438"},"directories":{},"publish_time":1515342946345,"_hasShrinkwrap":false,"_cnpm_publish_time":1515342946345,"_cnpmcore_publish_time":"2021-12-16T23:02:44.519Z"},"0.0.1":{"name":"pdf-parse","version":"0.0.1","description":"Pure javascript PDF parser.","main":"index.js","keywords":["pdf-parse","pdf-crawler"],"dependencies":{"debug":"^3.1.0"},"devDependencies":{"mocha":"^4.0.1"},"scripts":{"test":"node node_modules/mocha/bin/_mocha --recursive --slow 10000","start":"node index.js"},"homepage":"https://gitlab.com/autokent/pdf-parse","bugs":{"url":"https://gitlab.com/autokent/pdf-parse/issues","email":"mehmet.kozan@live.com"},"repository":{"type":"git","url":"git+https://gitlab.com/autokent/pdf-parse.git"},"author":{"name":"Mehmet Kozan","email":"mehmet.kozan@live.com"},"license":"MIT","engines":{"node":">=6.8.1"},"gitHead":"a7348e4baf057e0034f7fdb87225d1331a018909","_id":"pdf-parse@0.0.1","_npmVersion":"5.5.1","_nodeVersion":"8.9.3","_npmUser":{"name":"mehmet.kozan","email":"mehmet.kozan@live.com"},"dist":{"shasum":"da639bc96dd202eaced21380fc920fc29d574f9a","size":7360610,"noattachment":false,"key":"/pdf-parse/-/pdf-parse-0.0.1.tgz","tarball":"http://registry.cnpm.dingdandao.com/pdf-parse/download/pdf-parse-0.0.1.tgz"},"maintainers":[{"name":"mehmet.kozan","email":""}],"_npmOperationalInternal":{"host":"s3://npm-registry-packages","tmp":"tmp/pdf-parse-0.0.1.tgz_1515317128829_0.7713571263011545"},"directories":{},"publish_time":1515317131378,"_hasShrinkwrap":false,"_cnpm_publish_time":1515317131378,"_cnpmcore_publish_time":"2021-12-16T23:02:54.959Z"}},"readme":"# pdf-parse\n\n**Pure javascript cross-platform module to extract texts from PDFs.**\n\n[![version](https://img.shields.io/npm/v/pdf-parse.svg)](https://www.npmjs.org/package/pdf-parse)\n[![downloads](https://img.shields.io/npm/dt/pdf-parse.svg)](https://www.npmjs.org/package/pdf-parse)\n[![node](https://img.shields.io/node/v/pdf-parse.svg)](https://nodejs.org/)\n[![status](https://gitlab.com/autokent/pdf-parse/badges/master/pipeline.svg)](https://gitlab.com/autokent/pdf-parse/pipelines)\n\n## Similar Packages\n* [pdf2json](https://www.npmjs.com/package/pdf2json) buggy, no support anymore, memory leak, throws non-catchable fatal errors\n* [j-pdfjson](https://www.npmjs.com/package/j-pdfjson) fork of pdf2json\n* [pdf-parser](https://github.com/dunso/pdf-parse) buggy, no tests\n* [pdfreader](https://www.npmjs.com/package/pdfreader) using pdf2json\n* [pdf-extract](https://www.npmjs.com/package/pdf-extract) not cross-platform using xpdf\n\n## Installation\n`npm install pdf-parse`\n \n## Basic Usage - Local Files\n\n```js\nconst fs = require('fs');\nconst pdf = require('pdf-parse');\n\nlet dataBuffer = fs.readFileSync('path to PDF file...');\n\npdf(dataBuffer).then(function(data) {\n\n\t// number of pages\n\tconsole.log(data.numpages);\n\t// number of rendered pages\n\tconsole.log(data.numrender);\n\t// PDF info\n\tconsole.log(data.info);\n\t// PDF metadata\n\tconsole.log(data.metadata); \n\t// PDF.js version\n\t// check https://mozilla.github.io/pdf.js/getting_started/\n\tconsole.log(data.version);\n\t// PDF text\n\tconsole.log(data.text); \n        \n});\n```\n\n## Basic Usage - HTTP\nYou can use [crawler-request](https://www.npmjs.com/package/crawler-request) which uses the `pdf-parse`\n\n## Exception Handling\n\n```js\nconst fs = require('fs');\nconst pdf = require('pdf-parse');\n\nlet dataBuffer = fs.readFileSync('path to PDF file...');\n\npdf(dataBuffer).then(function(data) {\n\t// use data\n})\n.catch(function(error){\n\t// handle exceptions\n})\n```\n\n## Extend\n* v1.0.9 and above break pagerender callback [changelog](https://gitlab.com/autokent/pdf-parse/blob/master/CHANGELOG)\n* If you need another format like json, you can change page render behaviour with a callback\n* Check out https://mozilla.github.io/pdf.js/\n\n```js\n// default render callback\nfunction render_page(pageData) {\n    //check documents https://mozilla.github.io/pdf.js/\n    let render_options = {\n        //replaces all occurrences of whitespace with standard spaces (0x20). The default value is `false`.\n        normalizeWhitespace: false,\n        //do not attempt to combine same line TextItem's. The default value is `false`.\n        disableCombineTextItems: false\n    }\n\n    return pageData.getTextContent(render_options)\n\t.then(function(textContent) {\n\t\tlet lastY, text = '';\n\t\tfor (let item of textContent.items) {\n\t\t\tif (lastY == item.transform[5] || !lastY){\n\t\t\t\ttext += item.str;\n\t\t\t}  \n\t\t\telse{\n\t\t\t\ttext += '\\n' + item.str;\n\t\t\t}    \n\t\t\tlastY = item.transform[5];\n\t\t}\n\t\treturn text;\n\t});\n}\n\nlet options = {\n    pagerender: render_page\n}\n\nlet dataBuffer = fs.readFileSync('path to PDF file...');\n\npdf(dataBuffer,options).then(function(data) {\n\t//use new format\n});\n```\n\n## Options\n\n```js\nconst DEFAULT_OPTIONS = {\n\t// internal page parser callback\n\t// you can set this option, if you need another format except raw text\n\tpagerender: render_page,\n\t// max page number to parse\n    max: 0,\n    //check https://mozilla.github.io/pdf.js/getting_started/\n    version: 'v1.10.100'\n}\n```\n### *pagerender* (callback)\nIf you need another format except raw text.  \n\n### *max* (number)\nMax number of page to parse. If the value is less than or equal to 0, parser renders all pages.  \n\n### *version* (string, pdf.js version)\ncheck [pdf.js](https://mozilla.github.io/pdf.js/getting_started/)\n\n* `'default'`\n* `'v1.9.426'`\n* `'v1.10.100'`\n* `'v1.10.88'`\n* `'v2.0.550'`\n\n>*default* uses version *v1.10.100*   \n>[mozilla.github.io/pdf.js](https://mozilla.github.io/pdf.js/getting_started/#download)\n\n## Test\n* `mocha` or `npm test`\n* Check [test folder](https://gitlab.com/autokent/pdf-parse/tree/master/test) and [quickstart.js](https://gitlab.com/autokent/pdf-parse/blob/master/quickstart.js) for extra usages.\n\n## Support\nI use this package actively myself, so it has my top priority. You can chat on WhatsApp about any infos, ideas and suggestions.\n\n[![WhatsApp](https://img.shields.io/badge/style-chat-green.svg?style=flat&label=whatsapp)](https://api.whatsapp.com/send?phone=905063042480&text=Hi%2C%0ALet%27s%20talk%20about%20pdf-parse)\n\n### Submitting an Issue\nIf you find a bug or a mistake, you can help by submitting an issue to [GitLab Repository](https://gitlab.com/autokent/pdf-parse/issues)\n\n### Creating a Merge Request\nGitLab calls it merge request instead of pull request.  \n\n* [A Guide for First-Timers](https://about.gitlab.com/2016/06/16/fearless-contribution-a-guide-for-first-timers/)\n* [How to create a merge request](https://docs.gitlab.com/ee/gitlab-basics/add-merge-request.html)\n* Check [Contributing Guide](https://gitlab.com/autokent/pdf-parse/blob/master/CONTRIBUTING.md) \n\n## License\n[MIT licensed](https://gitlab.com/autokent/pdf-parse/blob/master/LICENSE) and all it's dependencies are MIT or BSD licensed.\n","_attachments":{},"homepage":"https://gitlab.com/autokent/pdf-parse","bugs":{"url":"https://gitlab.com/autokent/pdf-parse/issues","email":"mehmet.kozan@live.com"},"license":"MIT"}