Reading Materials

This course has both required and optional readings (see Course Schedule). The IEEE and ACM sites may require you to log in with your Georgia Tech username and password before you can view a paper's PDF.

In general, the required Szeliski textbook readings provide a more comprehensive and formal background to the material that is presented in the course lectures. You do not need to spend too much time trying to understand all the formulas and equations that are in these readings, but use the textbook to help with any additional details of the lecture material.

You will also find PDF versions of the Szeliski (2010) and Forsyth & Ponce (2012) textbooks on Canvas → Files → Books.

Author(s) Book Links
Szeliski (2010) Szeliski, R. (2010). Computer Vision: Algorithms and Applications. Springer.
Forsyth & Ponce (2012) Forsyth, D., & Ponce, J. (2012). Computer Vision: A Modern Approach. Pearson.
Hecht (2016) Hecht, E. (2016). Optics, 5th Edition. Addison-Wesley.
London, et al. (2017) London, B., Stone, J., & Upton, J. (2017). Photography, 12th Edition. Pearson.
Smith (1998) Smith, S. (1998). The Scientist and Engineer’s Guide to Digital Signal Processing.
White (2007) White, R. (2007). How Digital Photography Works, Second Edition. Que Publishers.
Author(s) Paper Links
Adelson & Bergen (1991) Adelson, E. & Bergen, J. (1991). The Plenoptic Function and the Elements of Early Vision. Computational Models of Visual Processing (pp. 3-20).
Adelson & Wang (1992) Adelson, E. & Wang, J. (1992). Single Lens Stereo with a Plenoptic Camera. IEEE Transactions on Pattern Analysis and Machine Intelligence, 14(2).
Agarwala, et al. (2004) Agarwala, A., Dontcheva, M., Agrawala, M., Drucker, S., Colburn, A., Curless, B., Salesin, D., & Cohen, M. (2004). Interactive Digital Photomontage. ACM Transactions on Graphics (Proc. SIGGRAPH), 23(3), (pp. 294-302).
Agarwala, et al. (2005) Agarwala, A., Zheng, C., Pal, C., Agrawala, M., Cohen, M., Curless, B., Salesin, D., & Szeliski, R. (2005). Panoramic Video Textures. ACM Transactions on Graphics (Proc. SIGGRAPH), 24(3), (pp. 821-827).
Avidan & Shamir (2007) Avidan, S., & Shamir, A. (2007). Seam Carving for Content-Aware Image Resizing. ACM Transactions on Graphics (Proc. SIGGRAPH), 26(3), (pp. 10-es).
Bai, et al. (2012) Bai, J., Agarwala, A., Agrawala, M., & Ramamoorthi, R. (2012). Selectively De-Animating Video. ACM Transactions on Graphics, 31(4).
Baker, et al. (2010) Baker, S., Bennett, E., Kang, S., & Szeliski, R. (2010). Removing Rolling Shutter Wobble. 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
Banterle, et al. (2018) Banterle, F., Artusi, A., Debattista, K., & Chalmers, A. (2018). Advanced High Dynamic Range Imaging. CRC Press.
Barnes, et al. (2009) Barnes, C., Shechtman, E., Finkelstein, A., & Goldman, D.B. (2009). PatchMatch: A Randomized Correspondence Algorithm for Structural Image Editing. ACM Transactions on Graphics (Proc. SIGGRAPH), 28(3).
Beier & Neely (1992) Beier, T., & Neely, S. (1992). Feature-based Image Metamorphosis. In Proceedings of the 19th Annual Conference on Computer Graphics and Interactive Techniques (pp. 35-42). ACM.
Boykov & Jolly (2001) Boykov, Y., & Jolly, M. (2001). Interactive Graph Cuts for Optimal Boundary & Region Segmentation of Objects in N-D Images. In Proceedings of the Eighth IEEE International Conference on Computer Vision. ACM.
Brown & Lowe (2003) Brown, M., & Lowe, D. G. (2003). Recognising Panoramas. In Proceedings of the Ninth IEEE International Conference on Computer Vision. ACM.
Burt & Adelson (1983a) Burt, P., & Adelson, E. (1983a). The Laplacian Pyramid as a Compact Image Code. In IEEE Transactions on Communications, 31(4), (pp. 532-540).
Burt & Adelson (1983b) Burt, P., & Adelson, E. (1983b). A Multiresolution Spline With Application to Image Mosaics. In ACM Transactions on Graphics, 2(4).
Criminisi, et al. (2004) Criminisi, A., Pérez, P., & Toyama, K. (2004). Region Filling and Object Removal by Exemplar-Based Image Inpainting. IEEE Transactions on Image Processing, 13(9).
Davis (1998) Davis, J. (1998). Mosaics of Scenes with Moving Objects. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
Debevec (2012) Debevec, P. (2012). The Light Stages and Their Applications to Photoreal Digital Actors. SIGGRAPH Asia 2012 Technical Briefs.
Debevec & Malik (1997) Debevec, P., & Malik, J. (1997). Recovering High Dynamic Range Radiance Maps from Photographs. In Proceedings of the 24th Annual Conference on Computer Graphics and Interactive Techniques (pp. 369-378).
Durand & Dorsey (2002) Durand, F., & Dorsey, J. (2002). Fast Bilateral Filtering for the Display of High-Dynamic-Range Images. ACM Transactions on Graphics.
Efros & Freeman (2001) Efros, A., & Freeman, W. (2001). Image Quilting for Texture Synthesis and Transfer. In Proceedings of the 28th Annual Conference on Computer Graphics and Interactive Techniques (pp. 341-346).
Forssén & Ringaby (2010) Forssén, P., & Ringaby, E. (2010). Rectifying Rolling Shutter Video from Hand-held Devices. 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
Gortler, et al. (1996) Gortler, S., Grzeszczuk, R., Szeliski, R., & Cohen, M. (1996). The Lumigraph. In Proceedings of the 23rd Annual Conference on Computer Graphics and Interactive Techniques (pp. 43-54).
Grossberg & Nayar (2003) Grossberg, M.D., & Nayar, S.K. (2003). Determining the Camera Response from Images: What is Knowable? IEEE Transactions on Pattern Analysis and Machine Intelligence, 2003.
Grundmann, et al. (2011) Grundmann, M., Kwatra, V., & Essa, I. (2011). Auto-Directed Video Stabilization with Robust L1 Optimal Camera Paths. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2011.
Grundmann, et al. (2012) Grundmann, M., Kwatra, V., Castro, D., & Essa, I. (2012). Calibration-Free Rolling Shutter Removal. In Proceedings of IEEE Conference on Computational Photography, 2012.
Harris & Stephens (1988) Harris, C., & Stephens, M. (1988). A Combined Corner and Edge Detector. Proceedings of the 4th Alvey Vision Conference, 1988.
Jia, et al. (2006) Jia, J., Sun, J., Tang, C.-K., & Shum, H.-Y. (2006). Drag-and-Drop Pasting. ACM SIGGRAPH 2006 Papers. (pp. 631-637).
Jones, et al. (2014) Jones, B., Sodhi, R., Murdock, M., Mehra, R., Benko, H., Wilson, A., Ofek, E., MacIntyre, B., Raghuvanshi, N. & Shapira, L. (2014). RoomAlive: Magical Experiences Enabled by Scalable, Adaptive Projector-Camera Units. In Proceedings of the 27th Annual Symposium on User Interface Software and Technology (pp. 637-644).
Joshi, et al. (2012) Joshi, N., Mehta, S., Drucker, S., Stollnitz, E., Hoppe, H., Uyttendaele, M., & Cohen, M. (2012). Cliplets: Juxtaposing Still and Dynamic Imagery. In Proceedings of the 25th Annual ACM Symposium on User Interface Software and Technology (pp. 251-260).
Kushal, et al. (2012) Kushal, A., Self, B., Furukawa, Y., Gallup, D., Hernandez, C., Curless, B., & Seitz, S. (2012). Photo Tours. 2012 Second International Conference on 3D Imaging, Modeling, Processing, Visualization & Transmission.
Kwatra, et al. (2003) Kwatra, V., Schödl, A., Essa, I., Turk, G., & Bobick, A. (2003). Graphcut Textures: Image and Video Synthesis Using Graph Cuts. ACM Transactions on Graphics, 22(3).
Lee, et al. (2004) Lee, J.C., Dietz, P.H., Aminzade, D., Raskar, R., & Hudson, S. (2004). Automatic Projector Calibration using Embedded Light Sensors. ACM UIST 2004.
Levin, et al. (2007) Levin, A., Fergus, R., Durand, F., & Freeman, W. (2007). Image and Depth from a Conventional Camera with a Coded Aperture. ACM Transactions on Graphics, 26(3).
Levoy & Hanrahan (1996) Levoy, M., & Hanrahan, P. (1996). Light Field Rendering. In Proceedings of the 23rd Annual Conference on Computer Graphics and Interactive Techniques (pp. 31-42).
Liu, et al. (2005) Liu, C., Torralba, A., Freeman, W.T., Durand, F., & Adelson, E. (2005). Motion Magnification. ACM SIGGRAPH 2005 Papers. (pp. 519-526).
Lowe (2004) Lowe, D. (2004). Distinctive Image Features from Scale-Invariant Keypoints. International Journal of Computer Vision, 60(2). (pp. 91-110).
Marner, et al. (2014) Marner, M., Smith, R., Walsh, J., & Thomas, B. (2014). Spatial User Interfaces for Large Scale Projector-Based Augmented Reality. IEEE Computer Graphics and Applications (pp. 74-82).
McMillan & Gortler (1999) McMillan, L., & Gortler, S. (1999). Image-Based Rendering: A New Interface Between Computer Vision and Computer Graphics. ACM SIGGRAPH Computer Graphics, 33(2).
Mikolajczyk & Schmid (2001) Mikolajczyk, K., & Schmid, C. (2001). Indexing Based on Scale Invariant Interest Points. In Proceedings of the Eighth IEEE International Conference on Computer Vision. ACM.
Ng, et al. (2005) Ng, R., Levoy, M., Brédif, M., Duval, G., Horowitz, M., & Hanrahan, P. (2005). Light Field Photography with a Hand-held Plenoptic Camera. Stanford Tech Report CTSR 2005-02, 2005.
Pérez, et al. (2003) Pérez, P., Gangnet, M., & Blake, A. (2003). Poisson Image Editing. In ACM Transactions on Graphics, 22(3).
Raskar, et al. (2006) Raskar, R., Agrawal, A., & Tumblin, J. (2006). Coded Exposure Photography: Motion Deblurring using Fluttered Shutter. ACM Transactions on Graphics.
Raskar (2009) Raskar, R. (2009). Computational Photography: Epsilon to Coded Photography. Emerging Trends in Visual Computing, Springer 2009.
Reinhard, et al. (2002) Reinhard, E., Stark, M., Shirley, P., & Ferwerda, J. (2002). Photographic Tone Reproduction for Digital Images. ACM Transactions on Graphics.
Rubinstein, et al. (2008) Rubinstein, M., Shamir, A., & Avidan, S. (2008). Improved Seam Carving for Video Retargeting. ACM Transactions on Graphics (Proc. SIGGRAPH), 27(3).
Schödl, et al. (2000) Schödl, A., Szeliski, R., Salesin, D., & Essa, I. (2000). Video Textures. In Proceedings of the 27th Annual Conference on Computer Graphics and Interactive Techniques (pp. 489-498). ACM.
Schödl & Essa (2002) Schödl, A., & Essa, I. (2002). Controlled Animation of Video Sprites. In Proceedings of the 2002 ACM SIGGRAPH/Eurographics Symposium on Computer Animation. ACM.
Shah & Kwatra (2012) Shah, R., & Kwatra, V. (2012). All Smiles: Automatic Photo Enhancement by Facial Expression Analysis. In Proceedings of the 9th European Conference on Visual Media Production. (pp. 1-10).
Snavely, et al. (2006) Snavely, N., Seitz, S., & Szeliski, R. (2006). Photo Tourism: Exploring photo collections in 3D. ACM Transactions on Graphics (SIGGRAPH Proceedings), 25(3), (pp. 835-846).
Snavely, et al. (2007) Snavely, N., Seitz, S., & Szeliski, R. (2007). Modeling the World from Internet Photo Collections. International Journal of Computer Vision.
Summet, et al. (2007) Summet, J., Flagg, M., Cham, T., Rehg, J., & Sukthankar, R. (2007). Shadow Elimination and Blinding Light Suppression for Interactive Projected Displays. IEEE Transactions on Visualization and Computer Graphics, 13(3).
Tamburo, et al. (2014) Tamburo, R., Nurvitadhi, E., Chugh, A., Chen, M., Rowe, A., Kanade, T., & Narasimhan, S. (2014). Programmable Automotive Headlights. ECCV 2014.
Torralba & Freeman (2012) Torralba, A., & Freeman, W. (2012). Accidental Pinhole and Pinspeck Cameras: Revealing the scene outside the picture. Proceedings of 25th IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2012).
Wang & Ward (2006) Wang, Q. & Ward, R. (2006). Fast Image/Video Contrast Enhancement Based on WTHE. 2006 IEEE Workshop on Multimedia Signal Processing, (pp. 338-343).
Ward (2001) Ward, G. (2001). High Dynamic Range Imaging. Proceedings of the Ninth Color Imaging Conference, November 2001.
Wu, et al. (2012) Wu, H., Rubinstein, M., Shih, E., Guttag, J., Durand, F., & Freeman, W.T. (2012). Eulerian Video Magnification for Revealing Subtle Changes in the World. ACM Transactions on Graphics (Proc. SIGGRAPH), 31(4).