sayli2212 committed May 7, 2019
1 parent 448b591 commit 455f81e
Showing 108 changed files with 524,219 additions and 0 deletions.
11 changes: 11 additions & 0 deletions Chapter04/00_data/README.md
@@ -0,0 +1,11 @@
## Engineering alpha factors

Building on a conceptual understanding of the key factor categories, their rationale, and popular metrics, a key task is to engineer new factors that may better capture the risks embodied by the return drivers laid out previously, or to identify new return drivers.

### Useful pandas and NumPy methods

NumPy and pandas are the key tools for custom factor computations. The notebook [feature_engineering](feature_engineering.ipynb) contains examples of how to create various factors.
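As a rough illustration of the kind of computation involved, the sketch below derives a momentum factor, a rolling volatility factor, and a cross-sectional z-score from a wide price table. It is not taken from the notebook: the tickers, the synthetic prices, and the 21-day window are assumptions chosen only to keep the example self-contained.

```python
import numpy as np
import pandas as pd

# Synthetic daily closing prices for three hypothetical tickers
# (illustrative data only, not the datasets used in this repo).
rng = np.random.default_rng(seed=42)
dates = pd.bdate_range("2018-01-01", periods=300)
tickers = ["AAA", "BBB", "CCC"]
daily_log_returns = rng.normal(loc=0.0005, scale=0.01, size=(len(dates), len(tickers)))
prices = pd.DataFrame(
    100 * np.exp(np.cumsum(daily_log_returns, axis=0)),
    index=dates,
    columns=tickers,
)

window = 21  # assumed lookback of roughly one trading month

# Momentum: simple return over the trailing window.
momentum = prices / prices.shift(window) - 1

# Volatility: rolling standard deviation of daily log returns.
log_returns = np.log(prices).diff()
volatility = log_returns.rolling(window=window).std()

# Cross-sectional z-score: normalize momentum across tickers on each date.
zscore = momentum.sub(momentum.mean(axis=1), axis=0).div(momentum.std(axis=1), axis=0)

print(zscore.dropna().tail())
```

Keeping dates in the index and tickers in the columns lets `shift` and `rolling` operate along time while `mean(axis=1)` and `std(axis=1)` operate across assets, which keeps both time-series and cross-sectional steps vectorized.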

The notebook uses data generated by the [create_datasets](../../data/create_datasets.ipynb) notebook in the data folder at the root of this GitHub repo. The data is stored in HDF5 format for faster access.

See the notebook [storage_benchmark](../../02_market_and_fundamental_data/04_storage_benchmark/storage_benchmark.ipynb) in the directory for Chapter 2, Market and Fundamental Data, for a comparison of the Parquet, HDF5, and CSV storage formats for pandas DataFrames.
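For a quick sense of what such a comparison involves, here is a minimal sketch that times a write-and-read round trip for each format on a small random DataFrame. It is not the benchmark from the linked notebook: the DataFrame size and file names are assumptions, and HDF5 and Parquet are skipped when their optional dependencies (PyTables, and pyarrow or fastparquet) are not installed.

```python
import tempfile
import time
from pathlib import Path

import numpy as np
import pandas as pd

# Small random DataFrame as a stand-in for real market data (assumed size).
df = pd.DataFrame(
    np.random.default_rng(0).normal(size=(10_000, 10)),
    columns=[f"col_{i}" for i in range(10)],
)

# Each format maps to a (write, read) pair operating on a file path.
formats = {
    "csv": (lambda p: df.to_csv(p), lambda p: pd.read_csv(p, index_col=0)),
    "hdf5": (lambda p: df.to_hdf(p, key="df", mode="w"), lambda p: pd.read_hdf(p, key="df")),
    "parquet": (lambda p: df.to_parquet(p), lambda p: pd.read_parquet(p)),
}

results = {}
with tempfile.TemporaryDirectory() as tmp_dir:
    for name, (write, read) in formats.items():
        path = Path(tmp_dir) / f"data.{name}"
        try:
            start = time.perf_counter()
            write(path)
            loaded = read(path)
            elapsed = time.perf_counter() - start
        except ImportError:
            # Optional dependency for this format is missing; skip it.
            continue
        if loaded.shape == df.shape:
            results[name] = elapsed

for name, seconds in results.items():
    print(f"{name}: {seconds:.4f}s round trip")
```

Timings from a single run on a small frame are noisy, so treat the output as indicative only; repeated runs on data of realistic size give a fairer picture.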