For 0.8
- Parameters for tar medium version.
- Use a page selector to compute the top HTML pages and detect whether the selection is the same as before.
- Fix multiword search.
- Set up fixed versions of CDPedia.
- Merged enhance-home branch.
- Merged very_important_articles branch in.
- Merged lp:~ccanepacc/cdpedia/win-exe-generator-variants2 but changing it to keep only the pyinstaller 2 stuff.
- Fix the generated config to be more multiplatform regarding paths.
- Merged lp:~ccanepacc/cdpedia/table-resouces
- Removed two symlinks for the project, to make it more windows-development friendly.
- A couple of fixes for serving web pages.
- Avoid double decoding.
- Make the process nicer.
- Merged lp:~facundo/cdpedia/other-image-url
- New utility to generate a sample of the dump.
- Get the in-namespaces name correctly.
- Merged dump-support-art-namespaces branch.
- Better string.
- Merged the clean-scrap branch.
- Merged lp:~dmascialino/cdpedia/fix-scraper_pagination_99
- Reverted Diego's changes.
- Syntax error.
- Some optimizations and cleanups.
- Modified the scraper to take pagination in Categories into account. Fixed #99
- Use Twisted for the scraper
- Better logging and whitespace.
- Fix for hidden user in commits bug.
- 1st. If we don't find a good enough version of the article, we use the first version in its history; 2nd. we download only 6 versions of the history and, if none of them is good, we download its last 100.
- This uses the MediaWiki API to check the article's history.
- We log
- Fixed bug introduced on r439
- Moved python tutorial to a compressed scheme, fixed #110 and #137
- Ticket 142: remove the "not last version" message.
- Fixed error message, and added a timeout.
- Oops
- Print the article, not its repr()
- Add a little retrying to the fetch.
- Support no history and small cleanups.
- Issue #124, merge from: https://cdpedia.googlecode.com/svn/branches/issue124
- Finished the 'extraer' refactor.
- Half refactor of image extractor.
- Support re-requesting the search results after the search itself has ended.
- Fixed bug introduced on r421 (ExtraerContenido moved to scraper)
- Added license header and some cleanup to web_app
- Full search implemented.
- #91. Moved the ExtraerContenido preprocessor to the scraper
- Verify some details on the downloaded page.
- Fixed header
- We need AUTHORS at the project root.
- Change directory so we can run it from anywhere.
- Patched the HTTPServer to have shutdown in old Python versions.
- Simple search (without JavaScript) working. TODO: add support for the index-not-ready case and add pagination features.
- Fixed unittest discovery support of index_tests
- Fixed some broken urls from last commit.
- Reordered static files to a simple structure.
- Cache the search.
- First steps of search implementation in the web_app
- Restructured cdpedia.py and added command line options for daemon mode, portname, hostname and verbose
- Added verbose attribute to Destacados
- Fixed bug introduced on r405
- Portals on the home page even when there are no featured articles
- More consistent unicode handling in web_app.
- Grouped in searcher.
- Updated to 0.7
- Searcher
- Some tests for the watchdog
- Browser WatchDog working
- Wrap the institutional pages #136
- The suffix tree error message now only appears during generation #130
- Disable the browser watchdog with BROWSER_WD_SECONDS = 0
- The port and hostname can be chosen in config.py #98
- cdpedia.py working again
- Add debugger to the web_app
- Bump werkzeug version to 0.7.1
- Docs, process
- More explicit option in tar, and correct XO config.
- Remove the symlink after building the tarball
- Update real values for XO and CD.
- New options for generar.py
- Don't put all redirects in the blocks, just the useful ones.
- Merged the web-werkzeug branch into trunk
- Added from_path method to to3dirs.
- Both compressed_index and easy_index are now tested automatically.
- Moved the to3dirs tests to the /tests directory and added some more tests.
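
The history-lookup entry above (download 6 versions, then the last 100, then fall back to the first version in the history) can be sketched roughly as follows. This is only an illustration, not the project's code: `pick_revision`, `fetch_history` and `is_good_enough` are hypothetical names, and "first version" is assumed to mean the first entry of the fetched history.

```python
def pick_revision(fetch_history, is_good_enough, limits=(6, 100)):
    """Return an acceptable revision, widening the history window if needed.

    fetch_history(limit) -> list of revisions (hypothetical callable)
    is_good_enough(revision) -> bool (hypothetical callable)
    """
    history = []
    for limit in limits:
        # Ask for a small window first; only ask for the big one if needed.
        history = fetch_history(limit)
        for revision in history:
            if is_good_enough(revision):
                return revision
    # Nothing acceptable: fall back to the first entry of the last fetch
    # (assumption about what "first version in its history" means).
    return history[0] if history else None
```

Passing the two callables in keeps the selection logic independent of how the history is actually downloaded, so it can be exercised without network access.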