Обсуждение: In reference to gsoc

Поиск
Список
Период
Сортировка

In reference to gsoc

От
Tanvi Dadu
Дата:
Hello !
I am a sophomore pursuing Computer Engineering in NSIT affiliated with Delhi University.  I came across your organisation through gsoc portal and am very interested to work on project PostgreSQL sorting algorithm benchmark and implementation. 
My skill set include android, python , java , c++/c, git/github. I  have good knowledge of data structure and algorithms. I am an hard working person and am willing to learn and research on new algorithms and data structures need for this particular project. I believe that this project fits my skill set.
It would be helpful if someone direct me to the repo I will be working and sorting techniques used.  Please give me a brief about the data structures used and also the starting point for getting familiar repository and work that needs to be done in this project.
Awaiting your reply :)  
Regards
Tanvi 


Re: In reference to gsoc

От
Robert Haas
Дата:
On Thu, Mar 1, 2018 at 7:50 AM, Tanvi Dadu <tanvidadu1998@gmail.com> wrote:
> It would be helpful if someone direct me to the repo I will be working and
> sorting techniques used.  Please give me a brief about the data structures
> used and also the starting point for getting familiar repository and work
> that needs to be done in this project.

You're probably not going to have a ton of luck working the PostgreSQL
community unless you can figure out things like this without help from
other people. Most people here are quite busy and don't have time to
help you figure out things that you should be able to figure out
yourself with 10 minutes and a search engine, and the location of the
source code is one of those things.

But see:

https://www.postgresql.org/docs/current/static/sourcerepo.html
https://www.postgresql.org/docs/current/static/git.html

I would start by reading the code and comments in
src/backend/utils/sort/tuplesort.c

See also https://wiki.postgresql.org/wiki/Development_information

I don't really know what work needs to be done in terms of sorting at
this point.  Quite a few improvements have been made over the last few
years and I'm not sure exactly what could profitably be done next.
Andres Freund mentioned to me yesterday that there might be some work
around sorting + JIT once he gets the basic JIT stuff committed; and
it would certainly be cool to be able to use the parallel sort stuff
we added in commit 9da0cc35284bdbe8d442d732963303ff0e0a40bc for
parallel query as well as for parallel index builds.  However, I don't
have an exact description of what work might need to be done there in
either case, and neither constitutes an improvement to the sorting
code per se.

-- 
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company