Skip to content

Race condition in multithreading environment due to usage of os.chdir #1052

Open
@rcarpa

Description

@rcarpa

Hi,

I recently ran into a race condition when using two different instances of git.Repo objects in two different threads.

Inside repo.index.add(), some functions are decorated with the git_working_dir decorator. This decorator changes the process working directory to the one of repository, performs some system calls, then returns the working dir back to the initial value.

If two threads perform repo.index.add(), even on two independent Repo() objects, the working directory of the entire process could be permanently modified. Moreover, system calls done by index.add() may fail. For example lstat() will be executed with a relative file name assuming a wrong absolute path.

For example:

  1. working dir (wd) is /home/user
  2. git_working_dir is entered in thread 1. (/home/user is registered, wd changed to /repo1)
  3. git_working_dir is entered in thread 2. (/repo1 is registered, wd changed to /repo2)
  4. git_working_dir is exited in thread 1. (wd restored to /home/user)
  5. git_working_dir is exited in thread 2. (wd restored to /repo1)
    now the process wd is /repo1

I believe I could work around this issue by creating git object directly rather than adding to the index then committing, but I thought you may be interested to know about the issue.

Radu

Activity

Byron

Byron commented on Sep 2, 2020

@Byron
Member

Thanks for the detailed report. I am leaving it open for others to find, despite having no hopes of this being tackled.

Generally, GitPython is not thread-safe, but if desired, certain types of actions cold be made to allow multiple threads under the condition that these operate on their own repositories.

rcarpa

rcarpa commented on Sep 2, 2020

@rcarpa
Author

Seems like the mentioned decorator is only used to protect calls to _store_path. Would it be difficult to ensure that this function does system calls only on absolute paths and performs conversion to relative names (to add to git) by path manipulation (prefix stripping)?
Curious to know why it is complicated.

Byron

Byron commented on Sep 2, 2020

@Byron
Member

That should definitely be possible. The reason it is currently implemented like this is simplicity: delegating all path handling to the operating system (the devil is in the details especially when windows is involved).

rcarpa

rcarpa commented on Sep 2, 2020

@rcarpa
Author

Ah ... didn't think about multi-platforming at all :( . Thanks for the explanation !

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

      Development

      No branches or pull requests

        Participants

        @Byron@rcarpa

        Issue actions

          Race condition in multithreading environment due to usage of os.chdir · Issue #1052 · gitpython-developers/GitPython